BUCC 2014
Seventh Workshop on Building and Using
Comparable Corpora
Proceedings of the Workshop
27 May 2014
[Co-located with LREC 2014]
Table of contents
Keynote
Crowdsourcing translation [abstract]
Chris Callison-Burch…………1
Building corpora
Construction of a French-LSF corpus
Michael Filhol & Xavier Tannier……….2-5
Liling Tan, Marcos Zampieri, Nikola Ljubešić, Jörg Tiedemann………..6-10
Building comparable corpora from social networks
Maroua Trabelsi, Malek Hajjem, Chiraz Latiri………..11-16
Twitter as a comparable corpus to build multilingual affective lexicons
Amel Fraisse & Patrick Paroubek………..17-21
Machine translation
Multimodal comparable corpora for machine translation
Haithem Afli, Loïc Barrault, Holger Schwenk………..22-27
Extended translation memories for multilingual document authoring
Jen-Luc Meunier, & Marc Dymetman………….28-37
Using partly multilingual patents to support research on multilingual IR by building translation memories and MT systems [abstract for defective PDF]
Lingxiao Wang, Christian Boitet, Mathieu Mangeot
Comparability of corpora in human and machine translation
Ekaterina Lapshinova-Koltunski & Santanu Pal………….42-48
Terminology
Identifying Japanese-Chinese bilingual synonymous technical terms from patent families
Zi Long, Lijuan Dong, Takehito Utsuro, Tomohara Mitsuhashi, Mikio Yamamoto……….49-54
Revisiting comparable corpora in connected space
Pierre Zweigenbaum…………55-64