Nyoman Juniarta
2022
Organizing and Improving a Database of French Word Formation Using Formal Concept Analysis
Nyoman Juniarta
|
Olivier Bonami
|
Nabil Hathout
|
Fiammetta Namer
|
Yannick Toussaint
Proceedings of the Thirteenth Language Resources and Evaluation Conference
We apply Formal Concept Analysis (FCA) to organize and to improve the quality of Démonette2, a French derivational database, through a detection of both missing and spurious derivations in the database. We represent each derivational family as a graph. Given that the subgraph relation exists among derivational families, FCA can group families and represent them in a partially ordered set (poset). This poset is also useful for improving the database. A family is regarded as a possible anomaly (meaning that it may have missing and/or spurious derivations) if its derivational graph is almost, but not completely identical to a large number of other families.