Seeing Cause and Time: A Visually Grounded Evaluation of Multimodal Models
Salvatore Ergoli | Alessandro Bondielli | Alessandro Lenci
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)