Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen, Ryan Cotterell
- Anthology ID:
- 2023.conll-babylm.1
- Volume:
- Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning
- Month:
- December
- Year:
- 2023
- Address:
- Singapore
- Editors:
- Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen, Ryan Cotterell
- Venue:
- CoNLL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1–34
- Language:
- URL:
- https://aclanthology.org/2023.conll-babylm.1
- DOI:
- 10.18653/v1/2023.conll-babylm.1
- Bibkey:
- Cite (ACL):
- Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen, and Ryan Cotterell. 2023. Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora. In Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, pages 1–34, Singapore. Association for Computational Linguistics.
- Cite (Informal):
- Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora (Warstadt et al., CoNLL 2023)
- Copy Citation:
- PDF:
- https://aclanthology.org/2023.conll-babylm.1.pdf
Export citation
@inproceedings{warstadt-etal-2023-findings, title = "Findings of the {B}aby{LM} Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora", author = "Warstadt, Alex and Mueller, Aaron and Choshen, Leshem and Wilcox, Ethan and Zhuang, Chengxu and Ciro, Juan and Mosquera, Rafael and Paranjabe, Bhargavi and Williams, Adina and Linzen, Tal and Cotterell, Ryan", editor = "Warstadt, Alex and Mueller, Aaron and Choshen, Leshem and Wilcox, Ethan and Zhuang, Chengxu and Ciro, Juan and Mosquera, Rafael and Paranjabe, Bhargavi and Williams, Adina and Linzen, Tal and Cotterell, Ryan", booktitle = "Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2023.conll-babylm.1", doi = "10.18653/v1/2023.conll-babylm.1", pages = "1--34", }
<?xml version="1.0" encoding="UTF-8"?> <modsCollection xmlns="http://www.loc.gov/mods/v3"> <mods ID="warstadt-etal-2023-findings"> <titleInfo> <title>Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora</title> </titleInfo> <name type="personal"> <namePart type="given">Alex</namePart> <namePart type="family">Warstadt</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Aaron</namePart> <namePart type="family">Mueller</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Leshem</namePart> <namePart type="family">Choshen</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ethan</namePart> <namePart type="family">Wilcox</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Chengxu</namePart> <namePart type="family">Zhuang</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Juan</namePart> <namePart type="family">Ciro</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Rafael</namePart> <namePart type="family">Mosquera</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Bhargavi</namePart> <namePart type="family">Paranjabe</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Adina</namePart> <namePart type="family">Williams</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Tal</namePart> <namePart type="family">Linzen</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ryan</namePart> <namePart type="family">Cotterell</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <originInfo> <dateIssued>2023-12</dateIssued> </originInfo> <typeOfResource>text</typeOfResource> <relatedItem type="host"> <titleInfo> <title>Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning</title> </titleInfo> <name type="personal"> <namePart type="given">Alex</namePart> <namePart type="family">Warstadt</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Aaron</namePart> <namePart type="family">Mueller</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Leshem</namePart> <namePart type="family">Choshen</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ethan</namePart> <namePart type="family">Wilcox</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Chengxu</namePart> <namePart type="family">Zhuang</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Juan</namePart> <namePart type="family">Ciro</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Rafael</namePart> <namePart type="family">Mosquera</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Bhargavi</namePart> <namePart type="family">Paranjabe</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Adina</namePart> <namePart type="family">Williams</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Tal</namePart> <namePart type="family">Linzen</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ryan</namePart> <namePart type="family">Cotterell</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <originInfo> <publisher>Association for Computational Linguistics</publisher> <place> <placeTerm type="text">Singapore</placeTerm> </place> </originInfo> <genre authority="marcgt">conference publication</genre> </relatedItem> <identifier type="citekey">warstadt-etal-2023-findings</identifier> <identifier type="doi">10.18653/v1/2023.conll-babylm.1</identifier> <location> <url>https://aclanthology.org/2023.conll-babylm.1</url> </location> <part> <date>2023-12</date> <extent unit="page"> <start>1</start> <end>34</end> </extent> </part> </mods> </modsCollection>
%0 Conference Proceedings %T Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora %A Warstadt, Alex %A Mueller, Aaron %A Choshen, Leshem %A Wilcox, Ethan %A Zhuang, Chengxu %A Ciro, Juan %A Mosquera, Rafael %A Paranjabe, Bhargavi %A Williams, Adina %A Linzen, Tal %A Cotterell, Ryan %Y Warstadt, Alex %Y Mueller, Aaron %Y Choshen, Leshem %Y Wilcox, Ethan %Y Zhuang, Chengxu %Y Ciro, Juan %Y Mosquera, Rafael %Y Paranjabe, Bhargavi %Y Williams, Adina %Y Linzen, Tal %Y Cotterell, Ryan %S Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning %D 2023 %8 December %I Association for Computational Linguistics %C Singapore %F warstadt-etal-2023-findings %R 10.18653/v1/2023.conll-babylm.1 %U https://aclanthology.org/2023.conll-babylm.1 %U https://doi.org/10.18653/v1/2023.conll-babylm.1 %P 1-34
Markdown (Informal)
[Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora](https://aclanthology.org/2023.conll-babylm.1) (Warstadt et al., CoNLL 2023)
- Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora (Warstadt et al., CoNLL 2023)
ACL
- Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen, and Ryan Cotterell. 2023. Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora. In Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, pages 1–34, Singapore. Association for Computational Linguistics.