The Badalona Corpus - An Audio, Video and Neuro-Physiological Conversational Dataset

Philippe Blache, Salomé Antoine, Dorina De Jong, Lena-Marie Huttner, Emilia Kerr, Thierry Legou, Eliot Maës, Clément François


Abstract
We present in this paper the first natural conversation corpus recorded with all modalities and neuro-physiological signals. 5 dyads (10 participants) have been recorded three times, during three sessions (30mns each) with 4 days interval. During each session, audio and video are captured as well as the neural signal (EEG with Emotiv-EPOC) and the electro-physiological one (with Empatica-E4). This resource original in several respects. Technically, it is the first one gathering all these types of data in a natural conversation situation. Moreover, the recording of the same dyads at different periods opens the door to new longitudinal investigations such as the evolution of interlocutors’ alignment during the time. The paper situates this new type of resources with in the literature, presents the experimental setup and describes different annotations enriching the corpus.
Anthology ID:
2022.lrec-1.554
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5170–5177
Language:
URL:
https://aclanthology.org/2022.lrec-1.554
DOI:
Bibkey:
Cite (ACL):
Philippe Blache, Salomé Antoine, Dorina De Jong, Lena-Marie Huttner, Emilia Kerr, Thierry Legou, Eliot Maës, and Clément François. 2022. The Badalona Corpus - An Audio, Video and Neuro-Physiological Conversational Dataset. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 5170–5177, Marseille, France. European Language Resources Association.
Cite (Informal):
The Badalona Corpus - An Audio, Video and Neuro-Physiological Conversational Dataset (Blache et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.554.pdf
Data
AMIGOSK-EmoCon