Hagen Wierstorf
2022
A Comparative Cross Language View On Acted Databases Portraying Basic Emotions Utilising Machine Learning
Felix Burkhardt | Anabell Hacker | Uwe Reichel | Hagen Wierstorf | Florian Eyben | Björn Schuller
Proceedings of the Thirteenth Language Resources and Evaluation Conference
For several decades, emotional databases have been recorded by various laboratories. Many of them contain acted portrayals of Darwin’s famous “big four” basic emotions. In this paper, we investigate to what extent a selection of them are comparable, using two approaches: on the one hand, modeling similarity as performance in cross-database machine learning experiments, and on the other, analyzing a manually picked set of four acoustic features that represent different phonetic areas. Some databases (we added a synthetic one) perform well as training sets for others, while some do not. Generally speaking, we found indications of both similarity and specificity across languages.
Nkululeko: A Tool For Rapid Speaker Characteristics Detection
Felix Burkhardt | Johannes Wagner | Hagen Wierstorf | Florian Eyben | Björn Schuller
Proceedings of the Thirteenth Language Resources and Evaluation Conference
We present advancements in a software tool called Nkululeko, which lets users perform (semi-)supervised machine learning experiments in the speaker characteristics domain. It is based on audformat, a format for describing speech database metadata. Thanks to an interface based on configurable templates, it supports best practice and very fast setup of experiments without requiring proficiency in the underlying language, Python. The paper explains how to use Nkululeko and presents two typical experiments: comparing expert acoustic features with artificial neural network embeddings for emotion classification and for speaker age regression.
Co-authors
- Felix Burkhardt 2
- Florian Eyben 2
- Björn Schuller 2
- Anabell Hacker 1
- Uwe Reichel 1
Venues
- LREC 2