SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories Ben Bogin author Kejuan Yang author Shashank Gupta author Kyle Richardson author Erin Bransom author Peter Clark author Ashish Sabharwal author Tushar Khot author 2024-11 text Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing Yaser Al-Onaizan editor Mohit Bansal editor Yun-Nung Chen editor Association for Computational Linguistics Miami, Florida, USA conference publication bogin-etal-2024-super 10.18653/v1/2024.emnlp-main.702 https://aclanthology.org/2024.emnlp-main.702/ 2024-11 12622 12645