Norton Trevisan Roman

Also published as: Norton T. Roman, Norton Trevisan Roman


2025

pdf bib
It’s about What and How you say it: A Corpus with Stance and Sentiment Annotation for COVID-19 Vaccines Posts on X/Twitter by Brazilian Political Elites
Lorena Barberia | Pedro Schmalz | Norton Trevisan Roman | Belinda Lombard | Tatiane Moraes de Sousa
Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities

This paper details the development of a corpus with posts in Brazilian Portuguese published by Brazilian political elites on X (formerly Twitter) regarding COVID-19 vaccines. The corpus consists of 9,045 posts annotated for relevance, stance and sentiment towards COVID-19 vaccines and vaccination during the first three years of the COVID-19 pandemic (2020-2022).Nine annotators, working in three groups, classified relevance, stance, and sentiment in messages posted between 2020 and 2022 by local political elites. The annotators underwent extensive training, and weekly meetings were conducted to ensure intra-group annotation consistency. The analysis revealed fair to moderate inter-annotator agreement (Average Krippendorf’s alpha of 0.94 for relevance, 0,67 for sentiment and 0,70 for stance). This work makes four significant contributions to the literature. First, it addresses the scarcity of corpora in Brazilian Portuguese, particularly on COVID-19 or vaccines in general. Second, it provides a reliable annotation scheme for sentiment and stance classification, distinguishing both tasks, thereby improving classification precision. Third, it offers a corpus annotated with stance and sentiment according to this scheme, demonstrating how these tasks differ and how conflating them may lead to inconsistencies in corpus construction, as a results of confounding these phenomena — a recurring issue in NLP research beyond studies focusing on vaccines. And fourth, this annotated corpus may serve as the gold standard for fine-tuning and evaluating supervised machine learning models for relevance, sentiment and stance analysis of X posts on similar domains.

2024

pdf bib
Bringing Pragmatics to Porttinari - Adding Speech Acts to News Texts
Nataly L. Patti da Silva | Norton Trevisan Roman | Ariani Di Felippo
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1

pdf bib
A Corpus of Stock Market Tweets Annotated with Named Entities
Michel Monteiro Zerbinati | Norton Trevisan Roman | Ariani Di Felippo
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1

2022

pdf bib
Proceedings of the Universal Dependencies Brazilian Festival
Thiago Alexandre Salgueiro Pardo | Ariani Di-Felippo | Norton Trevisan Roman
Proceedings of the Universal Dependencies Brazilian Festival

2015

pdf bib
Squibs: Spelling Error Patterns in Brazilian Portuguese
Priscila A. Gimenes | Norton T. Roman | Ariadne M. B. R. Carvalho
Computational Linguistics, Volume 41, Issue 1 - March 2015

pdf bib
An Annotated Corpus for Sentiment Analysis in Political News
Gabriel Domingos de Arruda | Norton Trevisan Roman | Ana Maria Monteiro
Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology

2013

pdf bib
Introducing a Corpus of Human-Authored Dialogue Summaries in Portuguese
Norton Trevisan Roman | Paul Piwek | Ariadne M. B. Rizzoni Carvalho | Alexandre Rossi Alvares
Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013

pdf bib
AgreeCalc: Uma Ferramenta para Análise da Concordância entre Múltiplos Anotadores (AgreeCalc: A Tool for the Analysis of Agreement Between Multiple Annotators) [in Portuguese]
Alexandre Rossi Alvares | Norton Trevisan Roman
Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology

pdf bib
MetaAnn: Um Gerador de Ferramentas para Anotação de Textos (MetaAnn: a Generator of Text Annotation Tools) [in Portuguese]
Tiago Emanuel Infante Missão | Norton Trevisan Roman
Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology

pdf bib
JWN-Br - Uma API Java para a WordNet.Br (JWN-Br - an Java API for Wordnet.Br) [in Portuguese]
Vitor Machado Oliveira | Norton Trevisan Roman
Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology