David Stéphane Belemkoabga


pdf bib
Neural Network-Based Generation of Sport Summaries: A Preliminary Study
David Stéphane Belemkoabga | Aurélien Bossard | Abdallah Essa | Christophe Rodrigues | Kévin Sylla
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)

This paper presents a global summarization method for live sport commentaries for which we have a human-written summary available. This method is based on a neural generative summarizer. The amount of data available for training is limited compared to corpora commonly used by neural summarizers. We propose to help the summarizer to learn from a limited amount of data by limiting the entropy of the input texts. This step is performed by a classification into categories derived by a detailed analysis of the human-written summaries. We show that the filtering helps the summarization system to overcome the lack of resources. However, several improving points have emerged from this preliminary study, that we discuss and plan to implement in future work.