How’s Business Going Worldwide ? A Multilingual Annotated Corpus for Business Relation Extraction

Hadjer Khaldi, Farah Benamara, Camille Pradel, Grégoire Sigel, Nathalie Aussenac-Gilles


Abstract
The business world has changed due to the 21st century economy, where borders have melted and trades became free. Nowadays,competition is no longer only at the local market level but also at the global level. In this context, the World Wide Web has become a major source of information for companies and professionals to keep track of their complex, rapidly changing, and competitive business environment. A lot of effort is nonetheless needed to collect and analyze this information due to information overload problem and the huge number of web pages to process and analyze. In this paper, we propose the BizRel resource, the first multilingual (French,English, Spanish, and Chinese) dataset for automatic extraction of binary business relations involving organizations from the web. This dataset is used to train several monolingual and cross-lingual deep learning models to detect these relations in texts. Our results are encouraging, demonstrating the effectiveness of such a resource for both research and business communities. In particular, we believe multilingual business relation extraction systems are crucial tools for decision makers to identify links between specific market stakeholders and build business networks which enable to anticipate changes and discover new threats or opportunities. Our work is therefore an important direction toward such tools.
Anthology ID:
2022.lrec-1.394
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
3696–3705
Language:
URL:
https://aclanthology.org/2022.lrec-1.394
DOI:
Bibkey:
Cite (ACL):
Hadjer Khaldi, Farah Benamara, Camille Pradel, Grégoire Sigel, and Nathalie Aussenac-Gilles. 2022. How’s Business Going Worldwide ? A Multilingual Annotated Corpus for Business Relation Extraction. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 3696–3705, Marseille, France. European Language Resources Association.
Cite (Informal):
How’s Business Going Worldwide ? A Multilingual Annotated Corpus for Business Relation Extraction (Khaldi et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.394.pdf