Lucas Nildaimon dos Santos Silva

Also published as: Lucas Nildaimon dos Santos Silva


2024

pdf bib
Brazilian Portuguese Product Reviews Moderation with AutoML
Lucas Nildaimon dos Santos Silva | Livy Real | Fernando Rezende Zagatti | Ana Claudia Bianchini Zandavalle | Tatiana da Silva Gama | Carolina Francisco Gadelha Rodrigues
Proceedings of the 16th International Conference on Computational Processing of Portuguese

pdf bib
RePro: a benchmark for Opinion Mining for Brazilian Portuguese
Lucas Nildaimon dos Santos Silva | Livy Real | Ana Claudia Bianchini Zandavalle | Carolina Francisco Gadelha Rodrigues | Tatiana da Silva Gama | Fernando Guedes Souza | Phillipe Derwich Silva Zaidan
Proceedings of the 16th International Conference on Computational Processing of Portuguese

2022

pdf bib
mwetoolkit-lib: Adaptation of the mwetoolkit as a Python Library and an Application to MWE-based Document Clustering
Fernando Zagatti | Paulo Augusto de Lima Medeiros | Esther da Cunha Soares | Lucas Nildaimon dos Santos Silva | Carlos Ramisch | Livy Real
Proceedings of the 18th Workshop on Multiword Expressions @LREC2022

This paper introduces the mwetoolkit-lib, an adaptation of the mwetoolkit as a python library. The original toolkit performs the extraction and identification of multiword expressions (MWEs) in large text bases through the command line. One of the contributions of our work is the adaptation of the MWE extraction pipeline from the mwetoolkit, allowing its usage in python development environments and integration in larger pipelines. The other contribution is the execution of a pilot experiment aiming to show the impact of MWE discovery in data professionals’ work. This experiment found that the addition of MWE knowledge to the Term Frequency-Inverse Document Frequency (TF-IDF) vectorization altered the word relevance order, improving the linguistic quality of the clusters returned by k-means method.