Ser-Nam Lim
2021
Cross-Modal Retrieval Augmentation for Multi-Modal Classification
Shir Gur | Natalia Neverova | Chris Stauffer | Ser-Nam Lim | Douwe Kiela | Austin Reiter
Findings of the Association for Computational Linguistics: EMNLP 2021
Recent advances in using retrieval components over external knowledge sources have shown impressive results for a variety of downstream tasks in natural language processing. Here, we explore the use of unstructured external knowledge sources of images and their corresponding captions for improving visual question answering (VQA). First, we train a novel alignment model for embedding images and captions in the same space, which substantially improves image-caption retrieval performance relative to similar methods. Second, we show that retrieval-augmented multi-modal transformers using the trained alignment model improve results on VQA over strong baselines. We further conduct extensive experiments to establish the promise of this approach, and examine novel inference-time applications such as hot-swapping indices.
When in Doubt: Improving Classification Performance with Alternating Normalization
Menglin Jia | Austin Reiter | Ser-Nam Lim | Yoav Artzi | Claire Cardie
Findings of the Association for Computational Linguistics: EMNLP 2021
We introduce Classification with Alternating Normalization (CAN), a non-parametric post-processing step for classification. CAN improves classification accuracy for challenging examples by re-adjusting their predicted class probability distribution using the predicted class distributions of high-confidence validation examples. CAN is easily applicable to any probabilistic classifier, with minimal computation overhead. We analyze the properties of CAN using simulated experiments, and empirically demonstrate its effectiveness across a diverse set of classification tasks.
Co-authors
- Austin Reiter 2
- Shir Gur 1
- Natalia Neverova 1
- Chris Stauffer 1
- Douwe Kiela 1