Automatic Term and Sentence Classification Via Augmented Term and Pre-trained language model in ESG Taxonomy texts

Ke Tian, Zepeng Zhang, Hua Chen


Abstract
In this paper, we present our solutions to the FinSim4 Shared Task which is co-located with the FinNLP workshop at IJCAI-2022. This new edition of FinSim4-ESG is extended to the “Environment, Social and Governance (ESG)” related issues in the financial domain. There are two sub-tasks in the FinSim4 shared task. The goal of sub-task1 is to develop a model to predict correctly a list of given terms from ESG taxonomy domain into the most relevant concepts. The aim of subtask2 is to design a system that can automatically classify the ESG Taxonomy text sentence into sustainable or unsustainable class. We have developed different classifiers to automatically classify the terms and sentences with augmented term and pre-trained language models: tf-idf vector, word2vec, Bert, Distill-Bert, Albert, Roberta. The result dashboard shows that our proposed methods yield a significant performance improvement compared to the baseline which ranked 1st in the subtask2 and 2rd of mean rank in the subtask1.
Anthology ID:
2022.finnlp-1.30
Volume:
Proceedings of the Fourth Workshop on Financial Technology and Natural Language Processing (FinNLP)
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates (Hybrid)
Editors:
Chung-Chi Chen, Hen-Hsen Huang, Hiroya Takamura, Hsin-Hsi Chen
Venue:
FinNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
224–227
Language:
URL:
https://aclanthology.org/2022.finnlp-1.30
DOI:
10.18653/v1/2022.finnlp-1.30
Bibkey:
Cite (ACL):
Ke Tian, Zepeng Zhang, and Hua Chen. 2022. Automatic Term and Sentence Classification Via Augmented Term and Pre-trained language model in ESG Taxonomy texts. In Proceedings of the Fourth Workshop on Financial Technology and Natural Language Processing (FinNLP), pages 224–227, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
Cite (Informal):
Automatic Term and Sentence Classification Via Augmented Term and Pre-trained language model in ESG Taxonomy texts (Tian et al., FinNLP 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.finnlp-1.30.pdf