Gaussian Distributed Prototypical Network for Few-shot Genomic Variant Detection

Jiarun Cao, Niels Peek, Andrew Renehan, Sophia Ananiadou


Abstract
Automatically identifying genetic mutations with text mining has become an important way to study the vast body of cancer medical literature. However, new knowledge about genetic variants proliferates rapidly, and current supervised learning models struggle to discover these previously unseen entity types. Few-shot learning allows a model to generalize effectively to new entity types, but it has not yet been explored for cancer mutation detection. This paper addresses cancer mutation detection tasks with few-shot learning paradigms. We propose the Gaussian Distributed Prototypical Network (GDPN) framework, which models the label dependency from the training examples in the support set and approximates the transition scores via a Gaussian distribution. Experiments on three benchmark cancer mutation datasets show the effectiveness of our proposed model.
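To make the prototypical-network idea named in the title concrete, here is a minimal sketch of the generic technique, not the authors' implementation: each label's prototype is the mean of its support-set token embeddings, and a query token takes the label of the nearest prototype. The toy 2-d embeddings, the label names `MUTATION` and `O`, and the function names are all assumptions made for illustration. The Gaussian modeling of transition scores is not sketched, because the abstract gives no details of it.

```python
from collections import defaultdict
from math import fsum


def build_prototypes(support):
    """Average the embeddings of each label's support tokens."""
    grouped = defaultdict(list)
    for embedding, label in support:
        grouped[label].append(embedding)
    return {
        label: [fsum(dim) / len(vectors) for dim in zip(*vectors)]
        for label, vectors in grouped.items()
    }


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return fsum((x - y) ** 2 for x, y in zip(a, b))


def classify(query, prototypes):
    """Return the label whose prototype lies closest to the query embedding."""
    return min(
        prototypes,
        key=lambda label: squared_distance(query, prototypes[label]),
    )


# Hypothetical support set: (token embedding, label) pairs.
support = [
    ([1.0, 0.0], "MUTATION"),
    ([0.8, 0.2], "MUTATION"),
    ([0.0, 1.0], "O"),
    ([0.2, 0.8], "O"),
]

prototypes = build_prototypes(support)
prediction = classify([0.9, 0.1], prototypes)
```

In a real few-shot tagger the embeddings would come from a pretrained encoder, and a sequence-level decoder would combine these per-token distances with transition scores between labels.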
Anthology ID:
2023.bionlp-1.2
Volume:
The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Kevin Cohen
Venue:
BioNLP
Publisher:
Association for Computational Linguistics
Pages:
26–36
URL:
https://aclanthology.org/2023.bionlp-1.2
DOI:
10.18653/v1/2023.bionlp-1.2
Cite (ACL):
Jiarun Cao, Niels Peek, Andrew Renehan, and Sophia Ananiadou. 2023. Gaussian Distributed Prototypical Network for Few-shot Genomic Variant Detection. In The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, pages 26–36, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Gaussian Distributed Prototypical Network for Few-shot Genomic Variant Detection (Cao et al., BioNLP 2023)
PDF:
https://aclanthology.org/2023.bionlp-1.2.pdf