WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words
Lukas Wolf, Klemen Kotar, Greta Tuckute, Eghbal Hosseini, Tamar I. Regev, Ethan Gotlieb Wilcox, Alexander Scott Warstadt
- Anthology ID:
- 2023.conll-babylm.21
- Volume:
- Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning
- Month:
- December
- Year:
- 2023
- Address:
- Singapore
- Editors:
- Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen, Ryan Cotterell
- Venue:
- CoNLL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 253–258
- Language:
- URL:
- https://aclanthology.org/2023.conll-babylm.21
- DOI:
- 10.18653/v1/2023.conll-babylm.21
- Bibkey:
- Cite (ACL):
- Lukas Wolf, Klemen Kotar, Greta Tuckute, Eghbal Hosseini, Tamar I. Regev, Ethan Gotlieb Wilcox, and Alexander Scott Warstadt. 2023. WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words. In Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, pages 253–258, Singapore. Association for Computational Linguistics.
- Cite (Informal):
- WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words (Wolf et al., CoNLL 2023)
- Copy Citation:
- PDF:
- https://aclanthology.org/2023.conll-babylm.21.pdf
Export citation
@inproceedings{wolf-etal-2023-whisbert, title = "{W}his{BERT}: Multimodal Text-Audio Language Modeling on 100{M} Words", author = "Wolf, Lukas and Kotar, Klemen and Tuckute, Greta and Hosseini, Eghbal and I. Regev, Tamar and Gotlieb Wilcox, Ethan and Warstadt, Alexander Scott", editor = "Warstadt, Alex and Mueller, Aaron and Choshen, Leshem and Wilcox, Ethan and Zhuang, Chengxu and Ciro, Juan and Mosquera, Rafael and Paranjabe, Bhargavi and Williams, Adina and Linzen, Tal and Cotterell, Ryan", booktitle = "Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2023.conll-babylm.21", doi = "10.18653/v1/2023.conll-babylm.21", pages = "253--258", }
<?xml version="1.0" encoding="UTF-8"?> <modsCollection xmlns="http://www.loc.gov/mods/v3"> <mods ID="wolf-etal-2023-whisbert"> <titleInfo> <title>WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words</title> </titleInfo> <name type="personal"> <namePart type="given">Lukas</namePart> <namePart type="family">Wolf</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Klemen</namePart> <namePart type="family">Kotar</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Greta</namePart> <namePart type="family">Tuckute</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Eghbal</namePart> <namePart type="family">Hosseini</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Tamar</namePart> <namePart type="family">I. Regev</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ethan</namePart> <namePart type="family">Gotlieb Wilcox</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Alexander</namePart> <namePart type="given">Scott</namePart> <namePart type="family">Warstadt</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <originInfo> <dateIssued>2023-12</dateIssued> </originInfo> <typeOfResource>text</typeOfResource> <relatedItem type="host"> <titleInfo> <title>Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning</title> </titleInfo> <name type="personal"> <namePart type="given">Alex</namePart> <namePart type="family">Warstadt</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Aaron</namePart> <namePart type="family">Mueller</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Leshem</namePart> <namePart type="family">Choshen</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ethan</namePart> <namePart type="family">Wilcox</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Chengxu</namePart> <namePart type="family">Zhuang</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Juan</namePart> <namePart type="family">Ciro</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Rafael</namePart> <namePart type="family">Mosquera</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Bhargavi</namePart> <namePart type="family">Paranjabe</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Adina</namePart> <namePart type="family">Williams</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Tal</namePart> <namePart type="family">Linzen</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Ryan</namePart> <namePart type="family">Cotterell</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <originInfo> <publisher>Association for Computational Linguistics</publisher> <place> <placeTerm type="text">Singapore</placeTerm> </place> </originInfo> <genre authority="marcgt">conference publication</genre> </relatedItem> <identifier type="citekey">wolf-etal-2023-whisbert</identifier> <identifier type="doi">10.18653/v1/2023.conll-babylm.21</identifier> <location> <url>https://aclanthology.org/2023.conll-babylm.21</url> </location> <part> <date>2023-12</date> <extent unit="page"> <start>253</start> <end>258</end> </extent> </part> </mods> </modsCollection>
%0 Conference Proceedings %T WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words %A Wolf, Lukas %A Kotar, Klemen %A Tuckute, Greta %A Hosseini, Eghbal %A I. Regev, Tamar %A Gotlieb Wilcox, Ethan %A Warstadt, Alexander Scott %Y Warstadt, Alex %Y Mueller, Aaron %Y Choshen, Leshem %Y Wilcox, Ethan %Y Zhuang, Chengxu %Y Ciro, Juan %Y Mosquera, Rafael %Y Paranjabe, Bhargavi %Y Williams, Adina %Y Linzen, Tal %Y Cotterell, Ryan %S Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning %D 2023 %8 December %I Association for Computational Linguistics %C Singapore %F wolf-etal-2023-whisbert %R 10.18653/v1/2023.conll-babylm.21 %U https://aclanthology.org/2023.conll-babylm.21 %U https://doi.org/10.18653/v1/2023.conll-babylm.21 %P 253-258
Markdown (Informal)
[WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words](https://aclanthology.org/2023.conll-babylm.21) (Wolf et al., CoNLL 2023)
- WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words (Wolf et al., CoNLL 2023)
ACL
- Lukas Wolf, Klemen Kotar, Greta Tuckute, Eghbal Hosseini, Tamar I. Regev, Ethan Gotlieb Wilcox, and Alexander Scott Warstadt. 2023. WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words. In Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, pages 253–258, Singapore. Association for Computational Linguistics.