Vládia Pinheiro
Also published as: Vladia Pinheiro
2026
Portuguese Sentiment Analysis with Open-Source LLMs: Models, Prompts, and Efficient Deployment
João V R J Lima | Vládia Pinheiro | Carlos Caminha
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
João V R J Lima | Vládia Pinheiro | Carlos Caminha
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Robust sentiment analysis in Portuguese is central to applications across Lusophone contexts, yet systematic evaluations still focus predominantly on English and proprietary systems. This paper presents a comparative study of 29 open-source Large Language Models (LLMs) and two proprietary models on Portuguese sentiment classification under four prompting strategies: Zero-Shot, Few-Shot, Chain-of-Thought (CoT), and CoT with Few-Shot (CoT+FS). Experiments on a unified three-class benchmark built from three public review corpora (about 3,000 instances) comprise roughly 372,000 inferences, totaling approximately 150M input tokens and 65M output tokens. Results show that CoT+FS generally yields the best performance for larger models, while several compact open-source models obtain competitive F1-scores with substantially lower computational cost, making them suitable for real-world deployments. We identify concrete teacher–student configurations tailored for knowledge distillation in Portuguese sentiment analysis.
Causal_QA.PT: A Human–LLM Co-Curated Benchmark for Causal Question Answering in Portuguese Language
Lia Furtado | Cíntia Araripe | Jocelani Castilhos | Lucas Holanda | Vladia Pinheiro
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Lia Furtado | Cíntia Araripe | Jocelani Castilhos | Lucas Holanda | Vladia Pinheiro
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
We present Causal_QA.PT, a human–LLM co-curated benchmark for causal question answering in Portuguese, addressing the lack of high-quality evaluation resources for causal reasoning in non-English languages. The dataset is developed through a hybrid human–LLM process with targeted generation, validation, and evaluation procedures, and is organized according to the PEARL causal typology. Using this resource, we evaluate the ability of Large Language Models to answer causal questions in Portuguese and examine the role of explicitly providing causal class information in prompt design. Our findings show that current LLMs are capable of producing high-quality causal responses in Portuguese, with GPT-5 Mini in particular demonstrating strong performance in judgment-based evaluation. Explicit causal class information yields model- and question-dependent benefits, particularly for interventional and counterfactual questions. Finally, we observe that human reference answers are not always superior, underscoring the importance of careful benchmark curation and robust evaluation for underrepresented languages.
2025
CaLQuest.PT: Towards the Collection and Evaluation of Natural Causal Ladder Questions in Portuguese for AI Agents
Uriel Anderson Lasheras | Vladia Pinheiro
Proceedings of the First Workshop on Language Models for Low-Resource Languages
Uriel Anderson Lasheras | Vladia Pinheiro
Proceedings of the First Workshop on Language Models for Low-Resource Languages
Large Language Models (LLMs) are increasingly central to the development of generative AI across diverse fields. While some anticipate these models may mark a step toward artificial general intelligence, their ability to handle complex causal reasoning remains unproven. Causal reasoning, particularly at Pearl’s interventional and counterfactual levels, is essential for true general intelligence. In this work, we introduce CaLQuest.PT, a dataset of over 8,000 natural causal questions in Portuguese, collected from real human interactions. Built upon a novel three-axis taxonomy, CaLQuest.PT categorizes questions by causal intent, action requirements, and the level of causal reasoning needed (associational, interventional, or counterfactual). Our findings from evaluating CaLQuest.PT’s seed questions with GPT-4o reveal that this LLM face challenges in handling interventional and relation-seeking causal queries. These results suggest limitations in using GPT-4o for extending causal question annotations and highlight the need for improved LLM strategies in causal reasoning. CaLQuest.PT provides a foundation for advancing LLM capabilities in causal understanding, particularly for the Portuguese-speaking world.
2024
CLSJUR.BR - A Model for Abstractive Summarization of Legal Documents in Portuguese Language based on Contrastive Learning
Alex Aguiar Lins | Cecilia Silvestre Carvalho | Francisco Das Chagas Jucá Bomfim | Daniel de Carvalho Bentes | Vládia Pinheiro
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1
Alex Aguiar Lins | Cecilia Silvestre Carvalho | Francisco Das Chagas Jucá Bomfim | Daniel de Carvalho Bentes | Vládia Pinheiro
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1
2023
CDJUR-BR - Uma Coleão Dourada do Judiciario Brasileiro com Entidades Nomeadas Refinadas
Mauricio Brito | Vladia Pinheiro | Vasco Furtado | João Monteiro Neto | Francisco Bomfim | Andre da Costa | Raquel Silveira
Proceedings of the 14th Brazilian Symposium in Information and Human Language Technology
Mauricio Brito | Vladia Pinheiro | Vasco Furtado | João Monteiro Neto | Francisco Bomfim | Andre da Costa | Raquel Silveira
Proceedings of the 14th Brazilian Symposium in Information and Human Language Technology
2017
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology
Gustavo Henrique Paetzold | Vládia Pinheiro
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology
Gustavo Henrique Paetzold | Vládia Pinheiro
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology
Análise de Medidas de Similaridade Semântica na Tarefa de Reconhecimento de Implicação Textual (Analysis of Semantic Similarity Measures in the Recognition of Textual Entailment Task)[In Portuguese]
David Feitosa | Vládia Pinheiro
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology
David Feitosa | Vládia Pinheiro
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology
FrameFOR – Uma Base de Conhecimento de Frames Semânticos para Perícias de Informática (FrameFOR - a Knowledge Base of Semantic Frames for Digital Forensics)[In Portuguese]
Ravi Barreira | Vládia Pinheiro | Vasco Furtado
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology
Ravi Barreira | Vládia Pinheiro | Vasco Furtado
Proceedings of the 11th Brazilian Symposium in Information and Human Language Technology
2015
RePort - Um Sistema de Extração de Informações Aberta para Língua Portuguesa (Report - An Open Information Extraction System for Portuguese Language)
Victor Pereira | Vládia Pinheiro
Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology
Victor Pereira | Vládia Pinheiro
Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology
2011
Search
Fix author
Co-authors
- Vasco Furtado 3
- Cíntia Araripe 1
- Ravi Barreira 1
- Francisco Das Chagas Jucá Bomfim 1
- Francisco Bomfim 1
- Mauricio Brito 1
- Carlos Caminha 1
- Cecilia Silvestre Carvalho 1
- Jocelani Castilhos 1
- Andre da Costa 1
- David Feitosa 1
- Wellington Franco 1
- Lia Furtado 1
- Lucas Holanda 1
- Uriel Anderson Lasheras 1
- João V R J Lima 1
- Alex Aguiar Lins 1
- João Monteiro Neto 1
- Gustavo Paetzold 1
- Tarcísio Pequeno 1
- Victor Pereira 1
- Raquel Silveira 1
- Daniel de Carvalho Bentes 1