Abdul Moeed
2020
An Evaluation of Progressive Neural Networksfor Transfer Learning in Natural Language Processing
Abdul Moeed
|
Gerhard Hagerer
|
Sumit Dugar
|
Sarthak Gupta
|
Mainak Ghosh
|
Hannah Danner
|
Oliver Mitevski
|
Andreas Nawroth
|
Georg Groh
Proceedings of the Twelfth Language Resources and Evaluation Conference
A major challenge in modern neural networks is the utilization of previous knowledge for new tasks in an effective manner, otherwise known as transfer learning. Fine-tuning, the most widely used method for achieving this, suffers from catastrophic forgetting. The problem is often exacerbated in natural language processing (NLP). In this work, we assess progressive neural networks (PNNs) as an alternative to fine-tuning. The evaluation is based on common NLP tasks such as sequence labeling and text classification. By gauging PNNs across a range of architectures, datasets, and tasks, we observe improvements over the baselines throughout all experiments.
Evaluation Metrics for Headline Generation Using Deep Pre-Trained Embeddings
Abdul Moeed
|
Yang An
|
Gerhard Hagerer
|
Georg Groh
Proceedings of the Twelfth Language Resources and Evaluation Conference
With the explosive growth in textual data, it is becoming increasingly important to summarize text automatically. Recently, generative language models have shown promise in abstractive text summarization tasks. Since these models rephrase text and thus use similar but different words as found in the summarized text, existing metrics such as ROUGE that use n-gram overlap may not be optimal. Therefore we evaluate two embedding-based evaluation metrics that are applicable to abstractive summarization: Fr ́echet embedding distance, which has been introduced recently, and angular embedding similarity, which is our proposed metric. To demonstrate the utility of both metrics, we analyze the headline generation capacity of two state-of-the-art language models: GPT-2 and ULMFiT. In particular, our proposed metric shows close relation with human judgments in our experiments and has overall better correlations with them. To provide reproducibility, the source code plus human assessments of our experiments is available on GitHub.
Search
Co-authors
- Gerhard Hagerer 2
- Georg Groh 2
- Sumit Dugar 1
- Sarthak Gupta 1
- Mainak Ghosh 1
- show all...
Venues
- lrec2