Ken Yano
2025
ELAINE-medLLM: Lightweight English Japanese Chinese Trilingual Large Language Model for Bio-medical Domain
Ken Yano
|
Zheheng Luo
|
Jimin Huang
|
Qianqian Xie
|
Masaki Asada
|
Chenhan Yuan
|
Kailai Yang
|
Makoto Miwa
|
Sophia Ananiadou
|
Jun’ichi Tsujii
Proceedings of the 31st International Conference on Computational Linguistics
We propose ELAINE (EngLish-jApanese-chINesE)-medLLM, a trilingual (English, Japanese, Chinese) large language model adapted for the bio-medical domain based on Llama-3-8B. The training dataset was carefully curated in terms of volume and diversity to adapt to the biomedical domain and endow trilingual capability while preserving the knowledge and abilities of the base model. The training follows 2-stage paths: continued pre-training and supervised fine-tuning (SFT). Our results demonstrate that ELAINE-medLLM exhibits superior trilingual capabilities compared to existing bilingual or multilingual medical LLMs without severely sacrificing the base model’s capability.
2023
DISTANT: Distantly Supervised Entity Span Detection and Classification
Ken Yano
|
Makoto Miwa
|
Sophia Ananiadou
The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks
We propose a distantly supervised pipeline NER which executes entity span detection and entity classification in sequence named DISTANT (DIstantly Supervised enTity spAN deTection and classification).The former entity span detector extracts possible entity mention spans by the distant supervision. Then the later entity classifier assigns each entity span to one of the positive entity types or none by employing a positive and unlabeled (PU) learning framework. Two models were built based on the pre-trained SciBERT model and fine-tuned with the silver corpus generated by the distant supervision. Experimental results on BC5CDR and NCBI-Disease datasets show that our method outperforms the end-to-end NER baselines without PU learning by a large margin. In particular, it increases the recall score effectively.
2021
Pipeline Signed Japanese Translation Focusing on a Post-positional Particle Complement and Conjugation in a Low-resource Setting
Ken Yano
|
Akira Utsumi
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Search
Fix data
Co-authors
- Sophia Ananiadou 2
- Makoto Miwa 2
- Masaki Asada 1
- Jimin Huang 1
- Zheheng Luo 1
- show all...