Hidelberg O. Albuquerque
Also published as: Hidelberg O. Albuquerque
2026
The PROPOR Ecosystem: Structure, Roles, and Evolution of Portuguese-Language NLP
Rafael O. Nunes | Gustavo L. Tamiosso | Pedro L. C. de Andrade | Matheus S. de Aguiar | Rafael P. de Gouveia | Higor Moreira | Bruno Tavares | Laura P. de Gouveia | Felipe S. F. Paula | Andre Spritzer | Hidelberg O. Albuquerque | Nádia F. F. da Silva | Ellen P. R. S. Pereira | Dennis G. Balreira | Joel L. Carbonera
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Rafael O. Nunes | Gustavo L. Tamiosso | Pedro L. C. de Andrade | Matheus S. de Aguiar | Rafael P. de Gouveia | Higor Moreira | Bruno Tavares | Laura P. de Gouveia | Felipe S. F. Paula | Andre Spritzer | Hidelberg O. Albuquerque | Nádia F. F. da Silva | Ellen P. R. S. Pereira | Dennis G. Balreira | Joel L. Carbonera
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
The PROPOR conference has been the main venue for Portuguese language Natural Language Processing (NLP) research for over two decades. This paper presents a longitudinal bibliometric analysis of PROPOR from 2003 to 2024, examining thematic evolution, community structure, and scientific impact. We identify a shift from speech-oriented research toward text-based tasks, alongside the sustained importance of resources and linguistic theory. The community exhibits a stable structure, with complementary leadership models centered on institutional hubs and brokerage roles. Scientific impact is highly concentrated, following a long tail distribution, and distinguishes between cumulative productivity-driven impact and rapidly accelerating citation uptake in recent editions. These findings characterize PROPOR as a resilient regional linguistic ecosystem evolving in dialogue with broader NLP paradigms.
UlyssesLegalNER-Br: from Legislative to Legal, a comprehensive corpus of Brazilian legal documents for Named Entity Recognition
Hidelberg O. Albuquerque | Ellen Souza | Danilo C. G. Lucena | Héldon J. O. Albuquerque | Nádia F. F. da Silva | Márcio de S. Dias | Rafael O. Nunes | Adriano L. I. Oliveira | André C. P. L. F. de Carvalho
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Hidelberg O. Albuquerque | Ellen Souza | Danilo C. G. Lucena | Héldon J. O. Albuquerque | Nádia F. F. da Silva | Márcio de S. Dias | Rafael O. Nunes | Adriano L. I. Oliveira | André C. P. L. F. de Carvalho
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
The legal domain presents several challenges for Natural Language Processing (NLP), particularly due to its linguistic complexity and lack of public datasets. Named Entity Recognition (NER), a subarea of NLP, has been successfully used to extract useful knowledge from legal texts. Its widespread use is limited by the lack of legal text corpora. This paper introduces UlyssesLegalNER-Br, a comprehensive corpus of Brazilian legal documents for NER, covering bills, case laws and laws, including the first NER corpus based exclusively on Brazilian laws. This research expand the UlyssesNER-Br corpus, previously focused only on the Brazilian legislative domain. The proposed corpus has 560 public documents annotated using a hybrid approach, organized in 9 categories and 23 fine-grained types, experimentally evaluated with the CRF, BiLSTM, and BERTimbau architectures. The corpus was experimentally evaluated regarding predictive performance, computational cost and label-level results. The best micro F1 96.18% was achieved by BERTimbau on the unified corpus, providing a strong baseline for Brazilian legal NER. At the label level, six categories and seven types presented a F1-score above 95%, while the lowest were distributed in the interval 71-82%.
2024
UlyssesNERQ: Expanding Queries from Brazilian Portuguese Legislative Documents through Named Entity Recognition
Hidelberg O. Albuquerque | Ellen Souza | Tainan Silva | Rafael P. Gouveia | Flavio Junior | Douglas Vitório | Nádia F. F. da Silva | André C.P.L.F. de Carvalho | Adriano L.I. Oliveira | Francisco Edmundo de Andrade
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1
Hidelberg O. Albuquerque | Ellen Souza | Tainan Silva | Rafael P. Gouveia | Flavio Junior | Douglas Vitório | Nádia F. F. da Silva | André C.P.L.F. de Carvalho | Adriano L.I. Oliveira | Francisco Edmundo de Andrade
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1
RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese
Eduardo Garcia | Nadia Silva | Felipe Siqueira | Juliana Gomes | Hidelberg O. Albuquerque | Ellen Souza | Eliomar Lima | André de Carvalho
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1
Eduardo Garcia | Nadia Silva | Felipe Siqueira | Juliana Gomes | Hidelberg O. Albuquerque | Ellen Souza | Eliomar Lima | André de Carvalho
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1
Search
Fix author
Co-authors
- Ellen Souza 3
- Nádia F. F. da Silva 3
- André C. P. L. F. de Carvalho 2
- Rafael O. Nunes 2
- Adriano L. I. Oliveira 2
- Matheus S. de Aguiar 1
- Héldon J. O. Albuquerque 1
- Pedro L. C. de Andrade 1
- Dennis G. Balreira 1
- Joel L. Carbonera 1
- Márcio de S. Dias 1
- Eduardo Garcia 1
- Juliana Gomes 1
- Rafael P. de Gouveia 1
- Laura P. de Gouveia 1
- Rafael P. Gouveia 1
- Flavio Junior 1
- Eliomar Lima 1
- Danilo C. G. Lucena 1
- Higor Moreira 1
- Felipe S. F. Paula 1
- Ellen P. R. S. Pereira 1
- Tainan Silva 1
- Nádia Silva 1
- Felipe Siqueira 1
- André Spritzer 1
- Gustavo L. Tamiosso 1
- Bruno Tavares 1
- Douglas Vitório 1
- Francisco Edmundo de Andrade 1
- André de Carvalho 1