Proceedings of the 16th International Natural Language Generation Conference: System Demonstrations
C. Maria Keet, Hung-Yi Lee, Sina Zarrieß (Editors)
- Anthology ID:
- 2023.inlg-demos
- Month:
- September
- Year:
- 2023
- Address:
- Prague, Czechia
- Venues:
- INLG | SIGDIAL
- SIG:
- SIGGEN
- Publisher:
- Association for Computational Linguistics
- URL:
- https://aclanthology.org/2023.inlg-demos
- DOI:
- PDF:
- https://aclanthology.org/2023.inlg-demos.pdf
Proceedings of the 16th International Natural Language Generation Conference: System Demonstrations
C. Maria Keet
|
Hung-Yi Lee
|
Sina Zarrieß
Overview of MiReportor: Generating Reports for Multimodal Medical Images
Xuwen Wang
|
Hetong Ma
|
Zhen Guo
|
Jiao Li
This demo paper presents a brief introduction of MiReportor, a computer-aided medical imaging report generator, which leverages a unified framework of medical image understanding and generation to predict readable descriptions for medical images, and assists radiologists in imaging reports writing.
enunlg: a Python library for reproducible neural data-to-text experimentation
David M. Howcroft
|
Dimitra Gkatzia
Over the past decade, a variety of neural architectures for data-to-text generation (NLG) have been proposed. However, each system typically has its own approach to pre- and post-processing and other implementation details. Diversity in implementations is desirable, but it also confounds attempts to compare model performance: are the differences due to the proposed architectures or are they a byproduct of the libraries used or a result of pre- and post-processing decisions made? To improve reproducibility, we re-implement several pre-Transformer neural models for data-to-text NLG within a single framework to facilitate direct comparisons of the models themselves and better understand the contributions of other design choices. We release our library at https://github.com/NapierNLP/enunlg to serve as a baseline for ongoing work in this area including research on NLG for low-resource languages where transformers might not be optimal.
VisuaLLM: Easy Web-based Visualization for Neural Language Generation
František Trebuňa
|
Ondrej Dusek
VisuaLLM is a Python library that enables interactive visualization of common tasks in natural language generation with pretrained language models (using HuggingFace’s model API), with tight integration of benchmark datasets and fine-grained generation control. The system runs as a local generation backend server and features a web-based frontend, allowing simple interface configuration by minimal Python code. The currently implemented views include data visualization, next-token prediction with probability distributions, and decoding parameter control, with simple extension to additional tasks.
Audio Commentary System for Real-Time Racing Game Play
Tatsuya Ishigaki
|
Goran Topić
|
Yumi Hamazono
|
Ichiro Kobayashi
|
Yusuke Miyao
|
Hiroya Takamura
Live commentaries are essential for enhancing spectators’ enjoyment and understanding during sports events or e-sports streams. We introduce a live audio commentator system designed specifically for a racing game, driven by the high demand in the e-sports field. While a player is playing a racing game, our system tracks real-time user play data including speed and steer rotations, and generates commentary to accompany the live stream. Human evaluation suggested that generated commentary enhances enjoyment and understanding of races compared to streams without commentary. Incorporating additional modules to improve diversity and detect irregular events, such as course-outs and collisions, further increases the preference for the output commentaries.