Vanessa Bonato
2026
A Domain-Specific Curated Benchmark for Entity and Document-Level Relation Extraction
Marco Martinelli | Stefano Marchesin | Vanessa Bonato | Giorgio Di Nunzio | Nicola Ferro | Ornella Irrera | Laura Menotti | Federica Vezzani | Gianmaria Silvello
Findings of the Association for Computational Linguistics: EACL 2026
Marco Martinelli | Stefano Marchesin | Vanessa Bonato | Giorgio Di Nunzio | Nicola Ferro | Ornella Irrera | Laura Menotti | Federica Vezzani | Gianmaria Silvello
Findings of the Association for Computational Linguistics: EACL 2026
Information Extraction (IE), encompassing Named Entity Recognition (NER), Named Entity Linking (NEL), and Relation Extraction (RE), is critical for transforming the rapidly growing volume of scientific publications into structured, actionable knowledge. This need is especially evident in fast-evolving biomedical fields such as the gut-brain axis, where research investigates complex interactions between the gut microbiota and brain-related disorders. Existing biomedical IE benchmarks, however, are often narrow in scope and rely heavily on distantly supervised or automatically generated annotations, limiting their utility for advancing robust IE methods. We introduce GutBrainIE, a benchmark based on more than 1,600 PubMed abstracts, manually annotated by biomedical and terminological experts with fine-grained entities, concept-level links, and relations. While grounded in the gut-brain axis, the benchmark’s rich schema, multiple tasks, and combination of highly curated and weakly supervised data make it broadly applicable to the development and evaluation of biomedical IE systems across domains.
2024
Findings of the WMT 2024 Biomedical Translation Shared Task: Test Sets on Abstract Level
Mariana Neves | Cristian Grozea | Philippe Thomas | Roland Roller | Rachel Bawden | Aurélie Névéol | Steffen Castle | Vanessa Bonato | Giorgio Maria Di Nunzio | Federica Vezzani | Maika Vicente Navarro | Lana Yeganova | Antonio Jimeno Yepes
Proceedings of the Ninth Conference on Machine Translation
Mariana Neves | Cristian Grozea | Philippe Thomas | Roland Roller | Rachel Bawden | Aurélie Névéol | Steffen Castle | Vanessa Bonato | Giorgio Maria Di Nunzio | Federica Vezzani | Maika Vicente Navarro | Lana Yeganova | Antonio Jimeno Yepes
Proceedings of the Ninth Conference on Machine Translation
We present the results of the ninth edition of the Biomedical Translation Task at WMT’24. We released test sets for six language pairs, namely, French, German, Italian, Portuguese, Russian, and Spanish, from and into English. Eachtest set consists of 50 abstracts from PubMed. Differently from previous years, we did not split abstracts into sentences. We received submissions from five teams, and for almost all language directions. We used a baseline/comparison system based on Llama 3.1 and share the source code at https://github.com/cgrozea/wmt24biomed-ref.