Pranav Nair
2023
Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization
Pranav Nair
|
Sukomal Pal
|
Pradeepika Verma
Findings of the Association for Computational Linguistics: ACL 2023
Domain generalization is hitherto an underexplored area applied in abstractive summarization. Moreover, most existing works on domain generalization have sophisticated training algorithms. In this paper, we propose a lightweight, weight averaging based, Domain Aligned Prefix Averaging approach to domain generalization for abstractive summarization. Given a number of source domains, our method first trains a prefix for each one of them. These source prefixes generate summaries for a small number of target domain documents. The similarity of the generated summaries to their corresponding source documents is used for calculating weights required to average source prefixes. In DAPA, prefix tuning allows for lightweight finetuning, and weight averaging allows for the computationally efficient addition of new source domains. When evaluated on four diverse summarization domains, DAPA shows comparable or better performance against the baselines demonstrating the effectiveness of its prefix averaging scheme.
The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing
Debayan Banerjee
|
Pranav Nair
|
Ricardo Usbeck
|
Chris Biemann
Findings of the Association for Computational Linguistics: ACL 2023
In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs) are pre-dominantly trained for human language tasks, and hence, if the query vocabulary is replaced with a vocabulary more attuned to the LM tokenizer, the performance of models may improve. We carry out carefully selected vocabulary substitutions on the queries and find absolute gains in the range of 17% on the GrailQA dataset.
2021
On Reducing Repetition in Abstractive Summarization
Pranav Nair
|
Anil Kumar Singh
Proceedings of the Student Research Workshop Associated with RANLP 2021
Repetition in natural language generation reduces the informativeness of text and makes it less appealing. Various techniques have been proposed to alleviate it. In this work, we explore and propose techniques to reduce repetition in abstractive summarization. First, we explore the application of unlikelihood training and embedding matrix regularizers from previous work on language modeling to abstractive summarization. Next, we extend the coverage and temporal attention mechanisms to the token level to reduce repetition. In our experiments on the CNN/Daily Mail dataset, we observe that these techniques reduce the amount of repetition and increase the informativeness of the summaries, which we confirm via human evaluation.
Improving Abstractive Summarization with Commonsense Knowledge
Pranav Nair
|
Anil Kumar Singh
Proceedings of the Student Research Workshop Associated with RANLP 2021
Large scale pretrained models have demonstrated strong performances on several natural language generation and understanding benchmarks. However, introducing commonsense into them to generate more realistic text remains a challenge. Inspired from previous work on commonsense knowledge generation and generative commonsense reasoning, we introduce two methods to add commonsense reasoning skills and knowledge into abstractive summarization models. Both methods beat the baseline on ROUGE scores, demonstrating the superiority of our models over the baseline. Human evaluation results suggest that summaries generated by our methods are more realistic and have fewer commonsensical errors.
Search
Fix data
Co-authors
- Anil Kumar Singh 2
- Debayan Banerjee 1
- Chris Biemann 1
- Sukomal Pal 1
- Ricardo Usbeck 1
- show all...