Juhan Nam
2024
K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling
Haven Kim
|
Jongmin Jung
|
Dasaem Jeong
|
Juhan Nam
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Lyric translation, a field studied for over a century, is now attracting computational linguistics researchers. We identified two limitations in previous studies. Firstly, lyric translation studies have predominantly focused on Western genres and languages, with no previous study centering on K-pop despite its popularity. Second, the field of lyric translation suffers from a lack of publicly available datasets; to the best of our knowledge, no such dataset exists. To broaden the scope of genres and languages in lyric translation studies, we introduce a novel singable lyric translation dataset, approximately 89% of which consists of K-pop song lyrics. This dataset aligns Korean and English lyrics line-by-line and section-by-section. We leveraged this dataset to unveil unique characteristics of K-pop lyric translation, distinguishing it from other extensively studied genres, and to construct a neural lyric translation model, thereby underscoring the importance of a dedicated dataset for singable lyric translations.
PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text
Hayeon Bang
|
Eunjin Choi
|
Megan Finch
|
Seungheon Doh
|
Seolhee Lee
|
Gyeong-Hoon Lee
|
Juhan Nam
Proceedings of the 3rd Workshop on NLP for Music and Audio (NLP4MusA)
While piano music has become a significant area of study in Music Information Retrieval (MIR), there is a notable lack of datasets for piano solo music with text labels. To address this gap, we present PIAST (PIano dataset with Audio, Symbolic, and Text), a piano music dataset. Utilizing a piano-specific taxonomy of semantic tags, we collected 9,673 tracks from YouTube and added human annotations for 2,023 tracks by music experts, resulting in two subsets: PIAST-YT and PIAST-AT. Both include audio, text, tag annotations, and transcribed MIDI utilizing state-of-the-art piano transcription and beat tracking models. Among many possible tasks with the multimodal dataset, we conduct music tagging and retrieval using both audio and MIDI data and report baseline performances to demonstrate its potential as a valuable resource for MIR research.
2021
Music Playlist Title Generation: A Machine-Translation Approach
Seungheon Doh
|
Junwon Lee
|
Juhan Nam
Proceedings of the 2nd Workshop on NLP for Music and Spoken Audio (NLP4MusA)
Search
Fix data
Co-authors
- Seungheon Doh 2
- Hayeon Bang 1
- Eunjin Choi 1
- Megan Finch 1
- Dasaem Jeong 1
- show all...