Lessons on Parameter Sharing across Layers in Transformers

Sho Takase, Shun Kiyono


Anthology ID:
2023.sustainlp-1.5
Volume:
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing (SustaiNLP)
Month:
July
Year:
2023
Address:
Toronto, Canada (Hybrid)
Editors:
Nafise Sadat Moosavi, Iryna Gurevych, Yufang Hou, Gyuwan Kim, Young Jin Kim, Tal Schuster, Ameeta Agrawal
Venue:
sustainlp
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
78–90
Language:
URL:
https://aclanthology.org/2023.sustainlp-1.5
DOI:
10.18653/v1/2023.sustainlp-1.5
Bibkey:
Cite (ACL):
Sho Takase and Shun Kiyono. 2023. Lessons on Parameter Sharing across Layers in Transformers. In Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing (SustaiNLP), pages 78–90, Toronto, Canada (Hybrid). Association for Computational Linguistics.
Cite (Informal):
Lessons on Parameter Sharing across Layers in Transformers (Takase & Kiyono, sustainlp 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.sustainlp-1.5.pdf
Video:
 https://aclanthology.org/2023.sustainlp-1.5.mp4