INF-rsrs at SemEval-2026 Task 1: Is the best really better? The limits of creative work in the era of LLMs

Guilherme Bazzo; Eduardo Faé; Júlia Junqueira; Higor Moreira; Lucas Pessutto

INF-rsrs at SemEval-2026 Task 1: Is the best really better? The limits of creative work in the era of LLMs

Guilherme Bazzo, Eduardo Faé, Júlia Junqueira, Higor Moreira, Lucas Rafael Costella Pessutto

Abstract

Generating humor is a complex and challenging task for Large Language Models (LLMs), requiring both linguistic creativity and strict adherence to constraints. This paper presents INF-rsrs, our solution for SemEval 2026 Task~1: Humor Generation, which tasks models with creating jokes from headlines and word pairs without labeled data. We propose a two-stage framework: a production stage and a selection stage. The production stage employs diverse model families and hyperparameter configurations to generate a wide range of candidate jokes, with each candidate generated by an LLM prompted in the role of a comedian under structured constraints to ensure relevance and humor. Our system was designed to substantiate our claim that the direct use of LLMs in creative works, such as humor generation, hits a hard ceiling that is inescapable through simple prompting. Our proposed system tied in first place in the task ranking, obtaining a top-tier performance.

Anthology ID:: 2026.semeval-1.396
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3156–3164
Language:
URL:: https://aclanthology.org/2026.semeval-1.396/
DOI:
Bibkey:
Cite (ACL):: Guilherme Bazzo, Eduardo Faé, Júlia Junqueira, Higor Moreira, and Lucas Rafael Costella Pessutto. 2026. INF-rsrs at SemEval-2026 Task 1: Is the best really better? The limits of creative work in the era of LLMs. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 3156–3164, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: INF-rsrs at SemEval-2026 Task 1: Is the best really better? The limits of creative work in the era of LLMs (Bazzo et al., SemEval 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.semeval-1.396.pdf

PDF Cite Search Fix data