Detecting Visually Relevant Sentences for Fine-Grained Classification
Olivia Winn | Madhavan Kavanur Kidambi | Smaranda Muresan
Proceedings of the 5th Workshop on Vision and Language