Sentence Boundary Detection on Line Breaks in Japanese
Yuta Hayashibe | Kensuke Mitsuzawa
Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020)
For NLP, sentence boundary detection (SBD) is an essential task to decompose a text into sentences. Most of the previous studies have used a simple rule that uses only typical characters as sentence boundaries. However, some characters may or may not be sentence boundaries depending on the context. We focused on line breaks in them. We newly constructed annotated corpora, implemented sentence boundary detectors, and analyzed performance of SBD in several settings.
NAIST at 2013 CoNLL Grammatical Error Correction Shared Task
Ippei Yoshimoto | Tomoya Kose | Kensuke Mitsuzawa | Keisuke Sakaguchi | Tomoya Mizumoto | Yuta Hayashibe | Mamoru Komachi | Yuji Matsumoto
Proceedings of the Seventeenth Conference on Computational Natural Language Learning: Shared Task
- Yuta Hayashibe 2
- Ippei Yoshimoto 1
- Tomoya Kose 1
- Keisuke Sakaguchi 1
- Tomoya Mizumoto 1
- show all...