Nishant Subramani
2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Luca Soldaini
|
Rodney Kinney
|
Akshita Bhagia
|
Dustin Schwenk
|
David Atkinson
|
Russell Authur
|
Ben Bogin
|
Khyathi Chandu
|
Jennifer Dumas
|
Yanai Elazar
|
Valentin Hofmann
|
Ananya Jha
|
Sachin Kumar
|
Li Lucy
|
Xinxi Lyu
|
Nathan Lambert
|
Ian Magnusson
|
Jacob Morrison
|
Niklas Muennighoff
|
Aakanksha Naik
|
Crystal Nam
|
Matthew Peters
|
Abhilasha Ravichander
|
Kyle Richardson
|
Zejiang Shen
|
Emma Strubell
|
Nishant Subramani
|
Oyvind Tafjord
|
Evan Walsh
|
Luke Zettlemoyer
|
Noah Smith
|
Hannaneh Hajishirzi
|
Iz Beltagy
|
Dirk Groeneveld
|
Jesse Dodge
|
Kyle Lo
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
OLMo: Accelerating the Science of Language Models
Dirk Groeneveld
|
Iz Beltagy
|
Evan Walsh
|
Akshita Bhagia
|
Rodney Kinney
|
Oyvind Tafjord
|
Ananya Jha
|
Hamish Ivison
|
Ian Magnusson
|
Yizhong Wang
|
Shane Arora
|
David Atkinson
|
Russell Authur
|
Khyathi Chandu
|
Arman Cohan
|
Jennifer Dumas
|
Yanai Elazar
|
Yuling Gu
|
Jack Hessel
|
Tushar Khot
|
William Merrill
|
Jacob Morrison
|
Niklas Muennighoff
|
Aakanksha Naik
|
Crystal Nam
|
Matthew Peters
|
Valentina Pyatkin
|
Abhilasha Ravichander
|
Dustin Schwenk
|
Saurabh Shah
|
William Smith
|
Emma Strubell
|
Nishant Subramani
|
Mitchell Wortsman
|
Pradeep Dasigi
|
Nathan Lambert
|
Kyle Richardson
|
Luke Zettlemoyer
|
Jesse Dodge
|
Kyle Lo
|
Luca Soldaini
|
Noah Smith
|
Hannaneh Hajishirzi
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2023
Detecting Personal Information in Training Corpora: an Analysis
Nishant Subramani
|
Sasha Luccioni
|
Jesse Dodge
|
Margaret Mitchell
Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023)
2022
Don’t Say What You Don’t Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search
Daniel King
|
Zejiang Shen
|
Nishant Subramani
|
Daniel S. Weld
|
Iz Beltagy
|
Doug Downey
Proceedings of the 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Sebastian Gehrmann
|
Abhik Bhattacharjee
|
Abinaya Mahendiran
|
Alex Wang
|
Alexandros Papangelis
|
Aman Madaan
|
Angelina Mcmillan-major
|
Anna Shvets
|
Ashish Upadhyay
|
Bernd Bohnet
|
Bingsheng Yao
|
Bryan Wilie
|
Chandra Bhagavatula
|
Chaobin You
|
Craig Thomson
|
Cristina Garbacea
|
Dakuo Wang
|
Daniel Deutsch
|
Deyi Xiong
|
Di Jin
|
Dimitra Gkatzia
|
Dragomir Radev
|
Elizabeth Clark
|
Esin Durmus
|
Faisal Ladhak
|
Filip Ginter
|
Genta Indra Winata
|
Hendrik Strobelt
|
Hiroaki Hayashi
|
Jekaterina Novikova
|
Jenna Kanerva
|
Jenny Chim
|
Jiawei Zhou
|
Jordan Clive
|
Joshua Maynez
|
João Sedoc
|
Juraj Juraska
|
Kaustubh Dhole
|
Khyathi Raghavi Chandu
|
Laura Perez Beltrachini
|
Leonardo F . R. Ribeiro
|
Lewis Tunstall
|
Li Zhang
|
Mahim Pushkarna
|
Mathias Creutz
|
Michael White
|
Mihir Sanjay Kale
|
Moussa Kamal Eddine
|
Nico Daheim
|
Nishant Subramani
|
Ondrej Dusek
|
Paul Pu Liang
|
Pawan Sasanka Ammanamanchi
|
Qi Zhu
|
Ratish Puduppully
|
Reno Kriz
|
Rifat Shahriyar
|
Ronald Cardenas
|
Saad Mahamood
|
Salomey Osei
|
Samuel Cahyawijaya
|
Sanja Štajner
|
Sebastien Montella
|
Shailza Jolly
|
Simon Mille
|
Tahmid Hasan
|
Tianhao Shen
|
Tosin Adewumi
|
Vikas Raunak
|
Vipul Raheja
|
Vitaly Nikolaev
|
Vivian Tsai
|
Yacine Jernite
|
Ying Xu
|
Yisi Sang
|
Yixin Liu
|
Yufang Hou
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
Extracting Latent Steering Vectors from Pretrained Language Models
Nishant Subramani
|
Nivedita Suresh
|
Matthew Peters
Findings of the Association for Computational Linguistics: ACL 2022
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Julia Kreutzer
|
Isaac Caswell
|
Lisa Wang
|
Ahsan Wahab
|
Daan van Esch
|
Nasanbayar Ulzii-Orshikh
|
Allahsera Tapo
|
Nishant Subramani
|
Artem Sokolov
|
Claytone Sikasote
|
Monang Setyawan
|
Supheakmungkol Sarin
|
Sokhar Samb
|
Benoît Sagot
|
Clara Rivera
|
Annette Rios
|
Isabel Papadimitriou
|
Salomey Osei
|
Pedro Ortiz Suarez
|
Iroro Orife
|
Kelechi Ogueji
|
Andre Niyongabo Rubungo
|
Toan Q. Nguyen
|
Mathias Müller
|
André Müller
|
Shamsuddeen Hassan Muhammad
|
Nanda Muhammad
|
Ayanda Mnyakeni
|
Jamshidbek Mirzakhalov
|
Tapiwanashe Matangira
|
Colin Leong
|
Nze Lawson
|
Sneha Kudugunta
|
Yacine Jernite
|
Mathias Jenny
|
Orhan Firat
|
Bonaventure F. P. Dossou
|
Sakhile Dlamini
|
Nisansa de Silva
|
Sakine Çabuk Ballı
|
Stella Biderman
|
Alessia Battisti
|
Ahmed Baruwa
|
Ankur Bapna
|
Pallavi Baljekar
|
Israel Abebe Azime
|
Ayodele Awokoya
|
Duygu Ataman
|
Orevaoghene Ahia
|
Oghenefego Ahia
|
Sweta Agrawal
|
Mofetoluwa Adeyemi
Transactions of the Association for Computational Linguistics, Volume 10
2021
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Sebastian Gehrmann
|
Tosin Adewumi
|
Karmanya Aggarwal
|
Pawan Sasanka Ammanamanchi
|
Anuoluwapo Aremu
|
Antoine Bosselut
|
Khyathi Raghavi Chandu
|
Miruna-Adriana Clinciu
|
Dipanjan Das
|
Kaustubh Dhole
|
Wanyu Du
|
Esin Durmus
|
Ondřej Dušek
|
Chris Chinenye Emezue
|
Varun Gangal
|
Cristina Garbacea
|
Tatsunori Hashimoto
|
Yufang Hou
|
Yacine Jernite
|
Harsh Jhamtani
|
Yangfeng Ji
|
Shailza Jolly
|
Mihir Kale
|
Dhruv Kumar
|
Faisal Ladhak
|
Aman Madaan
|
Mounica Maddela
|
Khyati Mahajan
|
Saad Mahamood
|
Bodhisattwa Prasad Majumder
|
Pedro Henrique Martins
|
Angelina McMillan-Major
|
Simon Mille
|
Emiel van Miltenburg
|
Moin Nadeem
|
Shashi Narayan
|
Vitaly Nikolaev
|
Andre Niyongabo Rubungo
|
Salomey Osei
|
Ankur Parikh
|
Laura Perez-Beltrachini
|
Niranjan Ramesh Rao
|
Vikas Raunak
|
Juan Diego Rodriguez
|
Sashank Santhanam
|
João Sedoc
|
Thibault Sellam
|
Samira Shaikh
|
Anastasia Shimorina
|
Marco Antonio Sobrevilla Cabezudo
|
Hendrik Strobelt
|
Nishant Subramani
|
Wei Xu
|
Diyi Yang
|
Akhila Yerukola
|
Jiawei Zhou
Proceedings of the 1st Workshop on Natural Language Generation, Evaluation, and Metrics (GEM 2021)
Co-authors
- Iz Beltagy 3
- Salomey Osei 3
- Yacine Jernite 3
- Matthew E. Peters 3
- Jesse Dodge 3
- show all...
- Zejiang Shen 2
- Sebastian Gehrmann 2
- Aman Madaan 2
- Angelina Mcmillan-major 2
- Cristina Garbacea 2
- Esin Durmus 2
- Faisal Ladhak 2
- Hendrik Strobelt 2
- Jiawei Zhou 2
- João Sedoc 2
- Kaustubh Dhole 2
- Khyathi Raghavi Chandu 2
- Laura Perez-Beltrachini 2
- Ondřej Dušek 2
- Pawan Sasanka Ammanamanchi 2
- Saad Mahamood 2
- Shailza Jolly 2
- Simon Mille 2
- Tosin Adewumi 2
- Vikas Raunak 2
- Vitaly Nikolaev 2
- Yufang Hou 2
- Andre Niyongabo Rubungo 2
- Luca Soldaini 2
- Rodney Kinney 2
- Akshita Bhagia 2
- Dustin Schwenk 2
- David Atkinson 2
- Russell Authur 2
- Khyathi Chandu 2
- Jennifer Dumas 2
- Yanai Elazar 2
- Ananya Jha 2
- Nathan Lambert 2
- Ian Magnusson 2
- Jacob Morrison 2
- Niklas Muennighoff 2
- Aakanksha Naik 2
- Crystal Nam 2
- Abhilasha Ravichander 2
- Kyle Richardson 2
- Emma Strubell 2
- Oyvind Tafjord 2
- Evan Walsh 2
- Luke Zettlemoyer 2
- Noah A. Smith 2
- Hannaneh Hajishirzi 2
- Dirk Groeneveld 2
- Kyle Lo 2
- Daniel King 1
- Daniel S. Weld 1
- Doug Downey 1
- Abhik Bhattacharjee 1
- Abinaya Mahendiran 1
- Alex Wang 1
- Alexandros Papangelis 1
- Anna Shvets 1
- Ashish Upadhyay 1
- Bernd Bohnet 1
- Bingsheng Yao 1
- Bryan Wilie 1
- Chandra Bhagavatula 1
- Chaobin You 1
- Craig Thomson 1
- Dakuo Wang 1
- Daniel Deutsch 1
- Deyi Xiong 1
- Di Jin 1
- Dimitra Gkatzia 1
- Dragomir Radev 1
- Elizabeth Clark 1
- Filip Ginter 1
- Genta Indra Winata 1
- Hiroaki Hayashi 1
- Jekaterina Novikova 1
- Jenna Kanerva 1
- Jenny Chim 1
- Jordan Clive 1
- Joshua Maynez 1
- Juraj Juraska 1
- Leonardo F . R. Ribeiro 1
- Lewis Tunstall 1
- Li Zhang 1
- Mahim Pushkarna 1
- Mathias Creutz 1
- Michael White 1
- Mihir Sanjay Kale 1
- Moussa Kamal Eddine 1
- Nico Daheim 1
- Paul Pu Liang 1
- Qi Zhu 1
- Ratish Puduppully 1
- Reno Kriz 1
- Rifat Shahriyar 1
- Ronald Cardenas 1
- Samuel Cahyawijaya 1
- Sanja Štajner 1
- Sebastien Montella 1
- Tahmid Hasan 1
- Tianhao Shen 1
- Vipul Raheja 1
- Vivian Tsai 1
- Ying Xu 1
- Yisi Sang 1
- Yixin Liu 1
- Nivedita Suresh 1
- Sasha Luccioni 1
- Margaret Mitchell 1
- Karmanya Aggarwal 1
- Anuoluwapo Aremu 1
- Antoine Bosselut 1
- Miruna Clinciu 1
- Dipanjan Das 1
- Wanyu Du 1
- Chris Chinenye Emezue 1
- Varun Gangal 1
- Tatsunori B. Hashimoto 1
- Harsh Jhamtani 1
- Yangfeng Ji 1
- Mihir Kale 1
- Dhruv Kumar 1
- Mounica Maddela 1
- Khyati Mahajan 1
- Bodhisattwa Prasad Majumder 1
- Pedro Henrique Martins 1
- Emiel Van Miltenburg 1
- Moin Nadeem 1
- Shashi Narayan 1
- Ankur Parikh 1
- Niranjan Ramesh Rao 1
- Juan Diego Rodriguez 1
- Sashank Santhanam 1
- Thibault Sellam 1
- Samira Shaikh 1
- Anastasia Shimorina 1
- Marco Antonio Sobrevilla Cabezudo 1
- Wei Xu 1
- Diyi Yang 1
- Akhila Yerukola 1
- Julia Kreutzer 1
- Isaac Caswell 1
- Lisa Wang 1
- Ahsan Wahab 1
- Daan van Esch 1
- Nasanbayar Ulzii-Orshikh 1
- Allahsera Tapo 1
- Artem Sokolov 1
- Claytone Sikasote 1
- Monang Setyawan 1
- Supheakmungkol Sarin 1
- Sokhar Samb 1
- Benoît Sagot 1
- Clara Rivera 1
- Annette Rios Gonzales 1
- Isabel Papadimitriou 1
- Pedro Ortiz Suarez 1
- Iroro Orife 1
- Kelechi Ogueji 1
- Toan Q. Nguyen 1
- Mathias Müller 1
- André Müller 1
- Shamsuddeen Hassan Muhammad 1
- Nanda Muhammad 1
- Ayanda Mnyakeni 1
- Jamshidbek Mirzakhalov 1
- Tapiwanashe Matangira 1
- Colin Leong 1
- Nze Lawson 1
- Sneha Kudugunta 1
- Mathias Jenny 1
- Orhan Firat 1
- Bonaventure F. P. Dossou 1
- Sakhile Dlamini 1
- Nisansa De Silva 1
- Sakine Çabuk Ballı 1
- Stella Biderman 1
- Alessia Battisti 1
- Ahmed Baruwa 1
- Ankur Bapna 1
- Pallavi Baljekar 1
- Israel Abebe Azime 1
- Ayodele Awokoya 1
- Duygu Ataman 1
- Orevaoghene Ahia 1
- Oghenefego Ahia 1
- Sweta Agrawal 1
- Mofetoluwa Adeyemi 1
- Ben Bogin 1
- Valentin Hofmann 1
- Sachin Kumar 1
- Li Lucy 1
- Xinxi Lyu 1
- Hamish Ivison 1
- Yizhong Wang 1
- Shane Arora 1
- Arman Cohan 1
- Yuling Gu 1
- Jack Hessel 1
- Tushar Khot 1
- William Merrill 1
- Valentina Pyatkin 1
- Saurabh Shah 1
- William Smith 1
- Mitchell Wortsman 1
- Pradeep Dasigi 1