More than Text: Multi-modal Chinese Word Segmentation

Dong Zhang; Zheng Hu; Shoushan Li (李寿山); Hanqian Wu; Qiaoming Zhu (朱巧明); Guodong Zhou (周国栋)

doi:10.18653/v1/2021.acl-short.70

More than Text: Multi-modal Chinese Word Segmentation

Dong Zhang, Zheng Hu, Shoushan Li, Hanqian Wu, Qiaoming Zhu, Guodong Zhou

Abstract

Chinese word segmentation (CWS) is undoubtedly an important basic task in natural language processing. Previous works only focus on the textual modality, but there are often audio and video utterances (such as news broadcast and face-to-face dialogues), where textual, acoustic and visual modalities normally exist. To this end, we attempt to combine the multi-modality (mainly the converted text and actual voice information) to perform CWS. In this paper, we annotate a new dataset for CWS containing text and audio. Moreover, we propose a time-dependent multi-modal interactive model based on Transformer framework to integrate multi-modal information for word sequence labeling. The experimental results on three different training sets show the effectiveness of our approach with fusing text and audio.

Anthology ID:: 2021.acl-short.70
Volume:: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
Month:: August
Year:: 2021
Address:: Online
Editors:: Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli
Venues:: ACL | IJCNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 550–557
Language:
URL:: https://aclanthology.org/2021.acl-short.70/
DOI:: 10.18653/v1/2021.acl-short.70
Bibkey:
Cite (ACL):: Dong Zhang, Zheng Hu, Shoushan Li, Hanqian Wu, Qiaoming Zhu, and Guodong Zhou. 2021. More than Text: Multi-modal Chinese Word Segmentation. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pages 550–557, Online. Association for Computational Linguistics.
Cite (Informal):: More than Text: Multi-modal Chinese Word Segmentation (Zhang et al., ACL-IJCNLP 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.acl-short.70.pdf
Video:: https://aclanthology.org/2021.acl-short.70.mp4

PDF Cite Search Video Fix data