- Anthology ID:
- 2023.conll-babylm.2
- Volume:
- Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning
- Month:
- December
- Year:
- 2023
- Address:
- Singapore
- Editors:
- Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen, Ryan Cotterell
- Venue:
- CoNLL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 35–46
- Language:
- URL:
- https://aclanthology.org/2023.conll-babylm.2
- DOI:
- 10.18653/v1/2023.conll-babylm.2
- Bibkey:
- Cite (ACL):
- Bastian Bunzeck and Sina Zarrieß. 2023. GPT-wee: How Small Can a Small Language Model Really Get?. In Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, pages 35–46, Singapore. Association for Computational Linguistics.
- Cite (Informal):
- GPT-wee: How Small Can a Small Language Model Really Get? (Bunzeck & Zarrieß, CoNLL 2023)
- Copy Citation:
- PDF:
- https://aclanthology.org/2023.conll-babylm.2.pdf
Export citation
@inproceedings{bunzeck-zarriess-2023-gpt, title = "{GPT}-wee: How Small Can a Small Language Model Really Get?", author = "Bunzeck, Bastian and Zarrie{\ss}, Sina", editor = "Warstadt, Alex and Mueller, Aaron and Choshen, Leshem and Wilcox, Ethan and Zhuang, Chengxu and Ciro, Juan and Mosquera, Rafael and Paranjabe, Bhargavi and Williams, Adina and Linzen, Tal and Cotterell, Ryan", booktitle = "Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2023.conll-babylm.2", doi = "10.18653/v1/2023.conll-babylm.2", pages = "35--46", }
<?xml version="1.0" encoding="UTF-8"?> <modsCollection xmlns="http://www.loc.gov/mods/v3"> <mods ID="bunzeck-zarriess-2023-gpt"> <titleInfo> <title>GPT-wee: How Small Can a Small Language Model Really Get?</title> </titleInfo> <name type="personal"> <namePart type="given">Bastian</namePart> <namePart type="family">Bunzeck</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Sina</namePart> <namePart type="family">Zarrieß</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <originInfo> <dateIssued>2023-12</dateIssued> </originInfo> <typeOfResource>text</typeOfResource> <relatedItem type="host"> <titleInfo> <title>Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning</title> </titleInfo> <name type="personal"> <namePart type="given">Alex</namePart> <namePart type="family">Warstadt</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Aaron</namePart> <namePart type="family">Mueller</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Leshem</namePart> <namePart type="family">Choshen</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ethan</namePart> <namePart type="family">Wilcox</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Chengxu</namePart> <namePart type="family">Zhuang</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Juan</namePart> <namePart type="family">Ciro</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Rafael</namePart> <namePart type="family">Mosquera</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Bhargavi</namePart> <namePart type="family">Paranjabe</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Adina</namePart> <namePart type="family">Williams</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Tal</namePart> <namePart type="family">Linzen</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ryan</namePart> <namePart type="family">Cotterell</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <originInfo> <publisher>Association for Computational Linguistics</publisher> <place> <placeTerm type="text">Singapore</placeTerm> </place> </originInfo> <genre authority="marcgt">conference publication</genre> </relatedItem> <identifier type="citekey">bunzeck-zarriess-2023-gpt</identifier> <identifier type="doi">10.18653/v1/2023.conll-babylm.2</identifier> <location> <url>https://aclanthology.org/2023.conll-babylm.2</url> </location> <part> <date>2023-12</date> <extent unit="page"> <start>35</start> <end>46</end> </extent> </part> </mods> </modsCollection>
%0 Conference Proceedings %T GPT-wee: How Small Can a Small Language Model Really Get? %A Bunzeck, Bastian %A Zarrieß, Sina %Y Warstadt, Alex %Y Mueller, Aaron %Y Choshen, Leshem %Y Wilcox, Ethan %Y Zhuang, Chengxu %Y Ciro, Juan %Y Mosquera, Rafael %Y Paranjabe, Bhargavi %Y Williams, Adina %Y Linzen, Tal %Y Cotterell, Ryan %S Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning %D 2023 %8 December %I Association for Computational Linguistics %C Singapore %F bunzeck-zarriess-2023-gpt %R 10.18653/v1/2023.conll-babylm.2 %U https://aclanthology.org/2023.conll-babylm.2 %U https://doi.org/10.18653/v1/2023.conll-babylm.2 %P 35-46
Markdown (Informal)
[GPT-wee: How Small Can a Small Language Model Really Get?](https://aclanthology.org/2023.conll-babylm.2) (Bunzeck & Zarrieß, CoNLL 2023)
- GPT-wee: How Small Can a Small Language Model Really Get? (Bunzeck & Zarrieß, CoNLL 2023)
ACL
- Bastian Bunzeck and Sina Zarrieß. 2023. GPT-wee: How Small Can a Small Language Model Really Get?. In Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, pages 35–46, Singapore. Association for Computational Linguistics.