Pablo Picasso Feliciano de Faria
2019
The Role of Utterance Boundaries and Word Frequencies for Part-of-speech Learning in Brazilian Portuguese Through Distributional Analysis
Pablo Picasso Feliciano de Faria
Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics
In this study, we address the problem of part-of-speech (or syntactic category) learning during language acquisition through distributional analysis of utterances. A model based on Redington et al.’s (1998) distributional learner is used to investigate the informativeness of distributional information in Brazilian Portuguese (BP). The data provided to the learner come from two publicly available corpora of child-directed speech. We present preliminary results from two experiments. The first investigates the effects of different assumptions about utterance boundaries when presenting the input data to the learner. The second compares the learner’s performance when counting contextual words’ frequencies versus merely recording whether they co-occur with a given target word. In general, our results indicate that explicit boundaries are more informative, that frequencies are important, and that distributional information is useful to the child as a source of categorial information. These results are in accordance with Redington et al.’s findings for English.
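The two manipulations described in the abstract (explicit versus absent utterance boundaries, and frequency counts versus binary co-occurrence) can be illustrated with a minimal sketch. This is not the paper's implementation: the one-word context window, the `<utt>` boundary token, and the function name `context_vectors` are assumptions made here for illustration only.

```python
from collections import Counter

BOUNDARY = "<utt>"  # hypothetical boundary token, not taken from the paper


def context_vectors(utterances, targets, context_words,
                    explicit_boundaries=True, use_frequencies=True):
    """Build one context vector per target word.

    Each vector has two slots per context word: occurrences immediately
    before the target ("prev") and immediately after it ("next").
    """
    counts = {t: Counter() for t in targets}

    if explicit_boundaries:
        # Each utterance is wrapped in boundary tokens and processed alone.
        streams = [[BOUNDARY] + list(u) + [BOUNDARY] for u in utterances]
    else:
        # No boundaries: the input is one continuous stream of words.
        streams = [[w for u in utterances for w in u]]

    for stream in streams:
        for i, word in enumerate(stream):
            if word not in counts:
                continue
            if i > 0:
                counts[word][("prev", stream[i - 1])] += 1
            if i + 1 < len(stream):
                counts[word][("next", stream[i + 1])] += 1

    features = [(pos, c) for pos in ("prev", "next") for c in context_words]
    vectors = {}
    for t in targets:
        row = [counts[t][f] for f in features]
        if not use_frequencies:
            # Binary variant: only record whether co-occurrence happened.
            row = [1 if v > 0 else 0 for v in row]
        vectors[t] = row
    return features, vectors


if __name__ == "__main__":
    utts = [["o", "gato", "dorme"], ["o", "gato", "come"],
            ["a", "menina", "dorme"]]
    ctx = ["o", "a", "dorme", "come", BOUNDARY]
    _, vecs = context_vectors(utts, ["gato", "dorme"], ctx)
    for word, vec in vecs.items():
        print(word, vec)
```

In a learner of this kind, target words with similar vectors would then be grouped by a similarity measure and a clustering step, which this sketch leaves out.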
2010
An Integrated Tool for Annotating Historical Corpora
Pablo Picasso Feliciano de Faria | Fabio Natanael Kepler | Maria Clara Paixão de Sousa
Proceedings of the Fourth Linguistic Annotation Workshop