"Full Paper","Proceedings of the Sixth Workshop on Vision and Language","Anya Belz;Erkut Erdem;Katerina Pastra;Krystian Mikolajczyk;","acl@aclweb.org","2000" "Full Paper","The BURCHAK corpus: a Challenge Data Set for Interactive Learning of Visually Grounded Word Meanings","Yanchao Yu;Arash Eshghi;Gregory Mills;Oliver Lemon;","acl@aclweb.org","2001" "Full Paper","The Use of Object Labels and Spatial Prepositions as Keywords in a Web-Retrieval-Based Image Caption Generation System","Brandon Birmingham;Adrian Muscat;","acl@aclweb.org","2002" "Full Paper","Learning to Recognize Animals by Watching Documentaries: Using Subtitles as Weak Supervision","Aparna Nurani Venkitasubramanian;Tinne Tuytelaars;Marie-Francine Moens;","acl@aclweb.org","2003" "Full Paper","Human Evaluation of Multi-modal Neural Machine Translation: A Case-Study on E-Commerce Listing Titles","Iacer Calixto;Daniel Stein;Evgeny Matusov;Sheila Castilho;Andy Way;","acl@aclweb.org","2004" "Full Paper","The BreakingNews Dataset","Arnau Ramisa;Fei Yan;Francesc Moreno-Noguer;Krystian Mikolajczyk;","acl@aclweb.org","2005" "Full Paper","Automatic identification of head movements in video-recorded conversations: can words help?","Patrizia Paggio;Costanza Navarretta;Bart Jongejan;","acl@aclweb.org","2006" "Full Paper","Multi-Modal Fashion Product Retrieval","Antonio Rubio Romano;LongLong Yu;Edgar Simo-Serra;Francesc Moreno-Noguer;","acl@aclweb.org","2007"