Neural Network-Based Generation of Sport Summaries: A Preliminary Study

David Stéphane Belemkoabga, Aurélien Bossard, Abdallah Essa, Christophe Rodrigues, Kévin Sylla


Abstract
This paper presents a global summarization method for live sport commentaries for which we have a human-written summary available. This method is based on a neural generative summarizer. The amount of data available for training is limited compared to corpora commonly used by neural summarizers. We propose to help the summarizer to learn from a limited amount of data by limiting the entropy of the input texts. This step is performed by a classification into categories derived by a detailed analysis of the human-written summaries. We show that the filtering helps the summarization system to overcome the lack of resources. However, several improving points have emerged from this preliminary study, that we discuss and plan to implement in future work.
Anthology ID:
2021.ranlp-1.18
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)
Month:
September
Year:
2021
Address:
Held Online
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
147–154
Language:
URL:
https://aclanthology.org/2021.ranlp-1.18
DOI:
Bibkey:
Cite (ACL):
David Stéphane Belemkoabga, Aurélien Bossard, Abdallah Essa, Christophe Rodrigues, and Kévin Sylla. 2021. Neural Network-Based Generation of Sport Summaries: A Preliminary Study. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 147–154, Held Online. INCOMA Ltd..
Cite (Informal):
Neural Network-Based Generation of Sport Summaries: A Preliminary Study (Belemkoabga et al., RANLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.ranlp-1.18.pdf