Clustering Similar Amendments at the Italian Senate

Tommaso Agnoloni, Carlo Marchetti, Roberto Battistoni, Giuseppe Briotti


Abstract
In this paper we describe an experiment for the application of text clustering techniques to dossiers of amendments to proposed legislation discussed in the Italian Senate. The aim is to assist the Senate staff in the detection of groups of amendments similar in their textual formulation in order to schedule their simultaneous voting. Experiments show that the exploitation (extraction, annotation and normalization) of domain features is crucial to improve the clustering performance in many problematic cases not properly dealt with by standard approaches. The similarity engine was implemented and integrated as an experimental feature in the internal application used for the management of amendments in the Senate Assembly and Committees. Thanks to the Open Data strategy pursued by the Senate for several years, all documents and data produced by the institution are publicly available for reuse in open formats.
Anthology ID:
2022.parlaclarin-1.7
Volume:
Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Darja Fišer, Maria Eskevich, Jakob Lenardič, Franciska de Jong
Venue:
ParlaCLARIN
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
39–46
Language:
URL:
https://aclanthology.org/2022.parlaclarin-1.7
DOI:
Bibkey:
Cite (ACL):
Tommaso Agnoloni, Carlo Marchetti, Roberto Battistoni, and Giuseppe Briotti. 2022. Clustering Similar Amendments at the Italian Senate. In Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference, pages 39–46, Marseille, France. European Language Resources Association.
Cite (Informal):
Clustering Similar Amendments at the Italian Senate (Agnoloni et al., ParlaCLARIN 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.parlaclarin-1.7.pdf
Code
 senatodellarepubblica/similis