Mitigating Silence in Compliance Terminology during Parsing of Utterances

Esme Manandise, Conrad de Peuter


Abstract
This paper reports on an approach to increase multi-token-term recall in a parsing task. We use a compliance-domain parser to extract, during the process of parsing raw text, terms that are unlisted in the terminology. The parser uses a similarity measure (Generalized Dice Coefficient) between listed terms and unlisted term candidates to (i) determine term status, (ii) serve putative terms to the parser, (iii) decrease parsing complexity by glomming multi-tokens as lexical singletons, and (iv) automatically augment the terminology after parsing of an utterance completes. We illustrate a small experiment with examples from the tax-and-regulations domain. Bootstrapping the parsing process to detect out- of-vocabulary terms at runtime increases parsing accuracy in addition to producing other benefits to a natural-language-processing pipeline, which translates arithmetic calculations written in English into computer-executable operations.
Anthology ID:
2020.fnp-1.33
Volume:
Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Dr Mahmoud El-Haj, Dr Vasiliki Athanasakou, Dr Sira Ferradans, Dr Catherine Salzedo, Dr Ans Elhag, Dr Houda Bouamor, Dr Marina Litvak, Dr Paul Rayson, Dr George Giannakopoulos, Nikiforos Pittaras
Venue:
FNP
SIG:
Publisher:
COLING
Note:
Pages:
204–212
Language:
URL:
https://aclanthology.org/2020.fnp-1.33
DOI:
Bibkey:
Cite (ACL):
Esme Manandise and Conrad de Peuter. 2020. Mitigating Silence in Compliance Terminology during Parsing of Utterances. In Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation, pages 204–212, Barcelona, Spain (Online). COLING.
Cite (Informal):
Mitigating Silence in Compliance Terminology during Parsing of Utterances (Manandise & de Peuter, FNP 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.fnp-1.33.pdf