Swiss AI Initiative - Collecting Large Amounts of High-Quality Data for Training Large Language Models

Jan Deriu, Maud Ehrmann, Emanuela Boros, Maximilian Böther, Christiane Sibille, Ihor Protsenko, Marta Brucka, Imanol Schlag, Elliott Ash


Anthology ID:
2024.swisstext-1.40
Volume:
Proceedings of the 9th edition of the Swiss Text Analytics Conference
Month:
June
Year:
2024
Address:
Chur, Switzerland
Editors:
Capol Corsin, Cieliebak Mark, Weichselbraun Albert, Musat Claudiu, Maier Elisabeth, Zimmermann Lucas
Venue:
SwissText
SIG:
SIGSEM
Publisher:
Association for Computational Linguistics
Note:
Pages:
188
Language:
URL:
https://aclanthology.org/2024.swisstext-1.40/
DOI:
Bibkey:
Cite (ACL):
Jan Deriu, Maud Ehrmann, Emanuela Boros, Maximilian Böther, Christiane Sibille, Ihor Protsenko, Marta Brucka, Imanol Schlag, and Elliott Ash. 2024. Swiss AI Initiative - Collecting Large Amounts of High-Quality Data for Training Large Language Models. In Proceedings of the 9th edition of the Swiss Text Analytics Conference, pages 188–188, Chur, Switzerland. Association for Computational Linguistics.
Cite (Informal):
Swiss AI Initiative - Collecting Large Amounts of High-Quality Data for Training Large Language Models (Deriu et al., SwissText 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.swisstext-1.40.pdf