International Conference on Language Resources and Evaluation (2018)

Volumes

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) 729 papers

pdf bib

Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
Ali Can Kocabiyikoglu | Laurent Besacier | Olivier Kraif

pdf bib

Evaluating Domain Adaptation for Machine Translation Across Scenarios
Thierry Etchegoyhen | Anna Fernández Torné | Andoni Azpeitia | Eva Martínez Garcia | Anna Matamala

pdf bib

Upping the Ante: Towards a Better Benchmark for Chinese-to-English Machine Translation
Christian Hadiwinoto | Hwee Tou Ng

pdf bib

ESCAPE: a Large-scale Synthetic Corpus for Automatic Post-Editing
Matteo Negri | Marco Turchi | Rajen Chatterjee | Nicola Bertoldi

pdf bib

Evaluating Machine Translation Performance on Chinese Idioms with a Blacklist Method
Yutong Shao | Rico Sennrich | Bonnie Webber | Federico Fancellu

pdf bib

Network Features Based Co-hyponymy Detection
Abhik Jana | Pawan Goyal

pdf bib

Cross-Lingual Generation and Evaluation of a Wide-Coverage Lexical Semantic Resource
Attila Novák | Borbála Novák

pdf bib

Advances in Pre-Training Distributed Word Representations
Tomas Mikolov | Edouard Grave | Piotr Bojanowski | Christian Puhrsch | Armand Joulin

pdf bib

Integrating Generative Lexicon Event Structures into VerbNet
Susan Windisch Brown | James Pustejovsky | Annie Zaenen | Martha Palmer

pdf bib

FontLex: A Typographical Lexicon based on Affective Associations
Tugba Kulahcioglu | Gerard de Melo

pdf bib

Multi-layer Annotation of the Rigveda
Oliver Hellwig | Heinrich Hettrich | Ashutosh Modi | Manfred Pinkal

pdf bib

Semi-automatic Korean FrameNet Annotation over KAIST Treebank
Younggyun Hahm | Jiseong Kim | Sunggoo Kwon | Key-Sun Choi

pdf bib

A Corpus for Modeling Word Importance in Spoken Dialogue Transcripts
Sushant Kafle | Matt Huenerfauth

pdf bib

Effects of Gender Stereotypes on Trust and Likability in Spoken Human-Robot Interaction
Matthias Kraus | Johannes Kraus | Martin Baumann | Wolfgang Minker

pdf bib

Improving Dialogue Act Classification for Spontaneous Arabic Speech and Instant Messages at Utterance Level
AbdelRahim Elmadany | Sherif Abdou | Mervat Gheith

pdf bib

Data Management Plan (DMP) for Language Data under the New General Data Protection Regulation (GDPR)
Pawel Kamocki | Valérie Mapelli | Khalid Choukri

pdf bib

We Are Depleting Our Research Subject as We Are Investigating It: In Language Technology, more Replication and Diversity Are Needed
António Branco

pdf bib

Lessons Learned: On the Challenges of Migrating a Research Data Repository from a Research Institution to a University Library.
Thorsten Trippel | Claus Zinn

pdf bib

Introducing NIEUW: Novel Incentives and Workflows for Eliciting Linguistic Data
Christopher Cieri | James Fiumara | Mark Liberman | Chris Callison-Burch | Jonathan Wright

pdf bib

Content-Based Conflict of Interest Detection on Wikipedia
Udochukwu Orizu | Yulan He

pdf bib

Word Affect Intensities
Saif Mohammad

pdf bib

Representation Mapping: A Novel Approach to Generate High-Quality Multi-Lingual Emotion Lexicons
Sven Buechel | Udo Hahn

pdf bib

Unfolding the External Behavior and Inner Affective State of Teammates through Ensemble Learning: Experimental Evidence from a Dyadic Team Corpus
Aggeliki Vlachostergiou | Mark Dennison | Catherine Neubauer | Stefan Scherer | Peter Khooshabeh | Andre Harrison

pdf bib

Understanding Emotions: A Dataset of Tweets to Study Interactions between Affect Categories
Saif Mohammad | Svetlana Kiritchenko

pdf bib

When ACE met KBP: End-to-End Evaluation of Knowledge Base Population with Component-level Annotation
Bonan Min | Marjorie Freedman | Roger Bock | Ralph Weischedel

pdf bib

Simple Large-scale Relation Extraction from Unstructured Text
Christos Christodoulopoulos | Arpit Mittal

pdf bib

Joint Learning of Sense and Word Embeddings
Mohammed Alsuhaibani | Danushka Bollegala

pdf bib

Comparing Pretrained Multilingual Word Embeddings on an Ontology Alignment Task
Dagmar Gromann | Thierry Declerck

pdf bib

A Large Resource of Patterns for Verbal Paraphrases
Octavian Popescu | Ngoc Phuoc An Vo | Vadim Sheinin

pdf bib

Building Parallel Monolingual Gan Chinese Dialects Corpus
Fan Xu | Mingwen Wang | Maoxi Li

pdf bib

Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging
Kyoko Sugisaki | Nicolas Wiedmer | Heiko Hausendorf

pdf bib

A Lexical Tool for Academic Writing in Spanish based on Expert and Novice Corpora
Marcos García Salido | Marcos García | Milka Villayandre-Llamazares | Margarita Alonso-Ramos

pdf bib

Framing Named Entity Linking Error Types
Adrian Braşoveanu | Giuseppe Rizzo | Philipp Kuntschik | Albert Weichselbraun | Lyndon J.B. Nixon

pdf bib

A FrameNet for Cancer Information in Clinical Narratives: Schema and Annotation
Kirk Roberts | Yuqi Si | Anshul Gandhi | Elmer Bernstam

pdf bib

Parallel Corpora for the Biomedical Domain
Aurélie Névéol | Antonio Jimeno Yepes | Mariana Neves | Karin Verspoor

pdf bib

Word Embedding Approach for Synonym Extraction of Multi-Word Terms
Amir Hazem | Béatrice Daille

pdf bib

A Large Automatically-Acquired All-Words List of Multiword Expressions Scored for Compositionality
Will Roberts | Markus Egg

pdf bib

A Hybrid Approach for Automatic Extraction of Bilingual Multiword Expressions from Parallel Corpora
Nasredine Semmar

pdf bib

No more beating about the bush : A Step towards Idiom Handling for Indian Language NLP
Ruchit Agrawal | Vighnesh Chenthil Kumar | Vigneshwaran Muralidharan | Dipti Sharma

pdf bib

Sentence Level Temporality Detection using an Implicit Time-sensed Resource
Sabyasachi Kamila | Asif Ekbal | Pushpak Bhattacharyya

pdf bib

Comprehensive Annotation of Various Types of Temporal Information on the Time Axis
Tomohiro Sakaguchi | Daisuke Kawahara | Sadao Kurohashi

pdf bib

Systems’ Agreements and Disagreements in Temporal Processing: An Extensive Error Analysis of the TempEval-3 Task
Tommaso Caselli | Roser Morante

pdf bib

Annotating Temporally-Anchored Spatial Knowledge by Leveraging Syntactic Dependencies
Alakananda Vempala | Eduardo Blanco

pdf bib

Contextualized Usage-Based Material Selection
Dirk De Hertog | Piet Desmet

pdf bib

CBFC: a parallel L2 speech corpus for Korean and French learners
Hiyon Yoo | Inyoung Kim

pdf bib

SW4ALL: a CEFR Classified and Aligned Corpus for Language Learning
Rodrigo Wilkens | Leonardo Zilio | Cédrick Fairon

pdf bib

Towards a Diagnosis of Textual Difficulties for Children with Dyslexia
Solen Quiniou | Béatrice Daille

pdf bib

Coreference Resolution in FreeLing 4.0
Montserrat Marimon | Lluís Padró | Jordi Turmo

pdf bib

BASHI: A Corpus of Wall Street Journal Articles Annotated with Bridging Links
Ina Rösiger

pdf bib

SACR: A Drag-and-Drop Based Tool for Coreference Annotation
Bruno Oberle

pdf bib

Deep Neural Networks for Coreference Resolution for Polish
Bartłomiej Nitoń | Paweł Morawiecki | Maciej Ogrodniczuk

pdf bib

SzegedKoref: A Hungarian Coreference Corpus
Veronika Vincze | Klára Hegedűs | Alex Sliz-Nagy | Richárd Farkas

pdf bib

A Corpus to Learn Refer-to-as Relations for Nominals
Wasi Ahmad | Kai-Wei Chang

pdf bib

ANCOR-AS: Enriching the ANCOR Corpus with Syntactic Annotations
Loïc Grobol | Isabelle Tellier | Éric de la Clergerie | Marco Dinarelli | Frédéric Landragin

pdf bib

ParCorFull: a Parallel Corpus Annotated with Full Coreference
Ekaterina Lapshinova-Koltunski | Christian Hardmeier | Pauline Krielke

pdf bib

An Application for Building a Polish Telephone Speech Corpus
Bartosz Ziółko | Piotr Żelasko | Ireneusz Gawlik | Tomasz Pędzimąż | Tomasz Jadczyk

pdf bib

CPJD Corpus: Crowdsourced Parallel Speech Corpus of Japanese Dialects
Shinnosuke Takamichi | Hiroshi Saruwatari

pdf bib

Korean L2 Vocabulary Prediction: Can a Large Annotated Corpus be Used to Train Better Models for Predicting Unknown Words?
Kevin Yancey | Yves Lepage

pdf bib

FEIDEGGER: A Multi-modal Corpus of Fashion Images and Descriptions in German
Leonidas Lefakis | Alan Akbik | Roland Vollgraf

pdf bib

Toward a Lightweight Solution for Less-resourced Languages: Creating a POS Tagger for Alsatian Using Voluntary Crowdsourcing
Alice Millour | Karën Fort

pdf bib

Crowdsourced Corpus of Sentence Simplification with Core Vocabulary
Akihiro Katsuta | Kazuhide Yamamoto

pdf bib

Using Crowd Agreement for Wordnet Localization
Amarsanaa Ganbold | Altangerel Chagnaa | Gábor Bella

pdf bib

Building an English Vocabulary Knowledge Dataset of Japanese English-as-a-Second-Language Learners Using Crowdsourcing
Yo Ehara

pdf bib

Chinese Relation Classification using Long Short Term Memory Networks
Linrui Zhang | Dan Moldovan

pdf bib

Annotating Spin in Biomedical Scientific Publications : the case of Random Controlled Trials (RCTs)
Anna Koroleva | Patrick Paroubek

pdf bib

Visualization of the occurrence trend of infectious diseases using Twitter
Ryusei Matsumoto | Minoru Yoshida | Kazuyuki Matsumoto | Hironobu Matsuda | Kenji Kita

pdf bib

Reusable workflows for gender prediction
Matej Martinc | Senja Pollak

pdf bib

Knowing the Author by the Company His Words Keep
Armin Hoenen | Niko Schenk

pdf bib

Towards a Gold Standard Corpus for Variable Detection and Linking in Social Science Publications
Andrea Zielinski | Peter Mutschke

pdf bib

KRAUTS: A German Temporally Annotated News Corpus
Jannik Strötgen | Anne-Lyse Minard | Lukas Lange | Manuela Speranza | Bernardo Magnini

pdf bib

A Framework for the Needs of Different Types of Users in Multilingual Semantic Enrichment
Jan Nehring | Felix Sasaki

pdf bib

The LREC Workshops Map
Roberto Bartolini | Sara Goggi | Monica Monachini | Gabriella Pardelli

pdf bib

Preserving Workflow Reproducibility: The RePlay-DH Client as a Tool for Process Documentation
Markus Gärtner | Uli Hahn | Sibylle Hermann

pdf bib

The ACoLi CoNLL Libraries: Beyond Tab-Separated Values
Christian Chiarcos | Niko Schenk

pdf bib

What’s Wrong, Python? – A Visual Differ and Graph Library for NLP in Python
Balázs Indig | András Simonyi | Noémi Ligeti-Nagy

pdf bib

ScholarGraph: a Chinese Knowledge Graph of Chinese Scholars
Shuo Wang | Zehui Hao | Xiaofeng Meng | Qiuyue Wang

pdf bib

Enriching Frame Representations with Distributionally Induced Senses
Stefano Faralli | Alexander Panchenko | Chris Biemann | Simone Paolo Ponzetto

pdf bib

An Integrated Formal Representation for Terminological and Lexical Data included in Classification Schemes
Thierry Declerck | Kseniya Egorova | Eileen Schnur

pdf bib

One event, many representations. Mapping action concepts through visual features.
Alessandro Panunzi | Lorenzo Gregori | Andrea Amelio Ravelli

pdf bib

Tel(s)-Telle(s)-Signs: Highly Accurate Automatic Crosslingual Hypernym Discovery
Ada Wan

pdf bib

Disambiguation of Verbal Shifters
Michael Wiegand | Sylvette Loda | Josef Ruppenhofer

pdf bib

Bootstrapping Polar-Opposite Emotion Dimensions from Online Reviews
Luwen Huangfu | Mihai Surdeanu

pdf bib

Sentiment-Stance-Specificity (SSS) Dataset: Identifying Support-based Entailment among Opinions.
Pavithra Rajendran | Danushka Bollegala | Simon Parsons

pdf bib

Resource Creation Towards Automated Sentiment Analysis in Telugu (a low resource language) and Integrating Multiple Domain Sources to Enhance Sentiment Prediction
Rama Rohit Reddy Gangula | Radhika Mamidi

pdf bib

Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks
Mohammed Attia | Younes Samih | Ali Elkahky | Laura Kallmeyer

pdf bib

A Large Self-Annotated Corpus for Sarcasm
Mikhail Khodak | Nikunj Saunshi | Kiran Vodrahalli

pdf bib

MultiBooked: A Corpus of Basque and Catalan Hotel Reviews Annotated for Aspect-level Sentiment Classification
Jeremy Barnes | Toni Badia | Patrik Lambert

pdf bib

BlogSet-BR: A Brazilian Portuguese Blog Corpus
Henrique Santos | Vinicius Woloszyn | Renata Vieira

pdf bib

SoMeWeTa: A Part-of-Speech Tagger for German Social Media and Web Texts
Thomas Proisl

pdf bib

Collecting Code-Switched Data from Social Media
Gideon Mendels | Victor Soto | Aaron Jaech | Julia Hirschberg

pdf bib

Classifying the Informative Behaviour of Emoji in Microblogs
Giulia Donato | Patrizia Paggio

pdf bib

A Taxonomy for In-depth Evaluation of Normalization for User Generated Content
Rob van der Goot | Rik van Noord | Gertjan van Noord

pdf bib

Gaining and Losing Influence in Online Conversation
Arun Sharma | Tomek Strzalkowski

pdf bib

Arap-Tweet: A Large Multi-Dialect Twitter Corpus for Gender, Age and Language Variety Identification
Wajdi Zaghouani | Anis Charfi

pdf bib

Transc&Anno: A Graphical Tool for the Transcription and On-the-Fly Annotation of Handwritten Documents
Nadezda Okinina | Lionel Nicolas | Verena Lyding

pdf bib

Correction of OCR Word Segmentation Errors in Articles from the ACL Collection through Neural Machine Translation Methods
Vivi Nastase | Julian Hitschler

pdf bib

From Manuscripts to Archetypes through Iterative Clustering
Armin Hoenen

pdf bib

Building A Handwritten Cuneiform Character Imageset
Kenji Yamauchi | Hajime Yamamoto | Wakaha Mori

pdf bib

PDF-to-Text Reanalysis for Linguistic Data Mining
Michael Wayne Goodman | Ryan Georgi | Fei Xia

pdf bib

Crowdsourced Multimodal Corpora Collection Tool
Patrik Jonell | Catharine Oertel | Dimosthenis Kontogiorgos | Jonas Beskow | Joakim Gustafson

pdf bib

JAIST Annotated Corpus of Free Conversation
Kiyoaki Shirai | Tomotaka Fukuoka

pdf bib

Towards Continuous Dialogue Corpus Creation: writing to corpus and generating from it
Andrei Malchanau | Volha Petukhova | Harry Bunt

pdf bib

MYCanCor: A Video Corpus of spoken Malaysian Cantonese
Andreas Liesenfeld

pdf bib

KTH Tangrams: A Dataset for Research on Alignment and Conceptual Pacts in Task-Oriented Dialogue
Todd Shore | Theofronia Androulakaki | Gabriel Skantze

pdf bib

On the Vector Representation of Utterances in Dialogue Context
Louisa Pragst | Niklas Rach | Wolfgang Minker | Stefan Ultes

pdf bib

ES-Port: a Spontaneous Spoken Human-Human Technical Support Corpus for Dialogue Research in Spanish
Laura García-Sardiña | Manex Serras | Arantza del Pozo

pdf bib

From analysis to modeling of engagement as sequences of multimodal behaviors
Soumia Dermouche | Catherine Pelachaud

pdf bib

A corpus of German political speeches from the 21st century
Adrien Barbaresi

pdf bib

Building Literary Corpora for Computational Literary Analysis - A Prototype to Bridge the Gap between CL and DH
Andrew Frank | Christine Ivanovic

pdf bib

Towards faithfully visualizing global linguistic diversity
Garland McNew | Curdin Derungs | Steven Moran

pdf bib

The GermaParl Corpus of Parliamentary Protocols
Andreas Blätte | Andre Blessing

pdf bib

Word Embedding Evaluation Datasets and Wikipedia Title Embedding for Chinese
Chi-Yen Chen | Wei-Yun Ma

pdf bib

An Automatic Learning of an Algerian Dialect Lexicon by using Multilingual Word Embeddings
Abidi Karima | Kamel Smaïli

pdf bib

Candidate Ranking for Maintenance of an Online Dictionary
Claire Broad | Helen Langone | David Guy Brizan

pdf bib

Language adaptation experiments via cross-lingual embeddings for related languages
Serge Sharoff

pdf bib

Tools for Building an Interlinked Synonym Lexicon Network
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič

pdf bib

Very Large-Scale Lexical Resources to Enhance Chinese and Japanese Machine Translation
Jack Halpern

pdf bib

Combining Concepts and Their Translations from Structured Dictionaries of Uralic Minority Languages
Mika Hämäläinen | Liisa Lotta Tarvainen | Jack Rueter

pdf bib

Transfer of Frames from English FrameNet to Construct Chinese FrameNet: A Bilingual Corpus-Based Approach
Tsung-Han Yang | Hen-Hsen Huang | An-Zi Yen | Hsin-Hsi Chen

pdf bib

EFLLex: A Graded Lexical Resource for Learners of English as a Foreign Language
Luise Dürlich | Thomas François

pdf bib

English-Basque Statistical and Neural Machine Translation
Inigo Jauregi Unanue | Lierni Garmendia Arratibel | Ehsan Zare Borzeshi | Massimo Piccardi

pdf bib

TQ-AutoTest – An Automated Test Suite for (Machine) Translation Quality
Vivien Macketanz | Renlong Ai | Aljoscha Burchardt | Hans Uszkoreit

pdf bib

Exploiting Pre-Ordering for Neural Machine Translation
Yang Zhao | Jiajun Zhang | Chengqing Zong

pdf bib

Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages
Gyu-Hyeon Choi | Jong-Hun Shin | Young-Kil Kim

pdf bib

Dynamic Oracle for Neural Machine Translation in Decoding Phase
Zi-Yi Dou | Hao Zhou | Shu-Jian Huang | Xin-Yu Dai | Jia-Jun Chen

pdf bib

One Sentence One Model for Neural Machine Translation
Xiaoqing Li | Jiajun Zhang | Chengqing Zong

pdf bib

A Parallel Corpus of Arabic-Japanese News Articles
Go Inoue | Nizar Habash | Yuji Matsumoto | Hiroyuki Aoyama

pdf bib

Examining the Tip of the Iceberg: A Data Set for Idiom Translation
Marzieh Fadaee | Arianna Bisazza | Christof Monz

pdf bib

Automatic Enrichment of Terminological Resources: the IATE RDF Example
Mihael Arcan | Elena Montiel-Ponsoda | John P. McCrae | Paul Buitelaar

pdf bib

A Comparative Study of Extremely Low-Resource Transliteration of the World’s Languages
Winston Wu | David Yarowsky

pdf bib

Translating Web Search Queries into Natural Language Questions
Adarsh Kumar | Sandipan Dandapat | Sushil Chordia

pdf bib

Construction of a Japanese Word Similarity Dataset
Yuya Sakaizawa | Mamoru Komachi

pdf bib

Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering
Olga Majewska | Diana McCarthy | Ivan Vulić | Anna Korhonen

pdf bib

Constructing High Quality Sense-specific Corpus and Word Embedding via Unsupervised Elimination of Pseudo Multi-sense
Haoyue Shi | Xihao Wang | Yuqi Sun | Junfeng Hu

pdf bib

Urdu Word Embeddings
Samar Haider

pdf bib

Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation
Mika Hasegawa | Tetsunori Kobayashi | Yoshihiko Hayashi

pdf bib

Towards AMR-BR: A SemBank for Brazilian Portuguese Language
Rafael Anchiêta | Thiago Pardo

pdf bib

Towards a Welsh Semantic Annotation System
Scott Piao | Paul Rayson | Dawn Knight | Gareth Watkins

pdf bib

Semantic Frame Parsing for Information Extraction : the CALOR corpus
Gabriel Marzinotto | Jeremy Auguste | Frederic Bechet | Geraldine Damnati | Alexis Nasr

pdf bib

Using a Corpus of English and Chinese Political Speeches for Metaphor Analysis
Kathleen Ahrens | Huiheng Zeng | Shun-han Rebekah Wong

pdf bib

A Multi- versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language
João Sequeira | Teresa Gonçalves | Paulo Quaresma | Amália Mendes | Iris Hendrickx

pdf bib

All-words Word Sense Disambiguation Using Concept Embeddings
Rui Suzuki | Kanako Komiya | Masayuki Asahara | Minoru Sasaki | Hiroyuki Shinnou

pdf bib

Enhancing Modern Supervised Word Sense Disambiguation Models by Semantic Lexical Resources
Stefano Melacci | Achille Globo | Leonardo Rigutini

pdf bib

Unsupervised Korean Word Sense Disambiguation using CoreNet
Kijong Han | Sangha Nam | Jiseong Kim | Younggyun Hahm | Key-Sun Choi

pdf bib

UFSAC: Unification of Sense Annotated Corpora and Tools
Loïc Vial | Benjamin Lecouteux | Didier Schwab

pdf bib

Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities
Steffen Remus | Chris Biemann

pdf bib

FastSense: An Efficient Word Sense Disambiguation Classifier
Tolga Uslu | Alexander Mehler | Daniel Baumartz | Wahed Hemati

pdf bib

Text Annotation Graphs: Annotating Complex Natural Language Phenomena
Angus Forbes | Kristine Lee | Gus Hahn-Powell | Marco A. Valenzuela-Escárcega | Mihai Surdeanu

pdf bib

Manzanilla: An Image Annotation Tool for TKB Building
Arianne Reimerink | Pilar León-Araúz

pdf bib

Tools for The Production of Analogical Grids and a Resource of N-gram Analogical Grids in 11 Languages
Rashel Fam | Yves Lepage

pdf bib

The Automatic Annotation of the Semiotic Type of Hand Gestures in Obama’s Humorous Speeches
Costanza Navarretta

pdf bib

WASA: A Web Application for Sequence Annotation
Fahad AlGhamdi | Mona Diab

pdf bib

Annotation and Quantitative Analysis of Speaker Information in Novel Conversation Sentences in Japanese
Makoto Yamazaki | Yumi Miyazaki | Wakako Kashino

pdf bib

PDFAnno: a Web-based Linguistic Annotation Tool for PDF Documents
Hiroyuki Shindo | Yohei Munesada | Yuji Matsumoto

pdf bib

A Lightweight Modeling Middleware for Corpus Processing
Markus Gärtner | Jonas Kuhn

pdf bib

An Annotation Language for Semantic Search of Legal Sources
Adeline Nazarenko | François Levy | Adam Wyner

pdf bib

Resource Interoperability for Sustainable Benchmarking: The Case of Events
Chantal van Son | Oana Inel | Roser Morante | Lora Aroyo | Piek Vossen

pdf bib

Parsivar: A Language Processing Toolkit for Persian
Salar Mohtaj | Behnam Roshanfekr | Atefeh Zafarian | Habibollah Asghari

pdf bib

Multilingual Word Segmentation: Training Many Language-Specific Tokenizers Smoothly Thanks to the Universal Dependencies Corpus
Erwan Moreau | Carl Vogel

pdf bib

Build Fast and Accurate Lemmatization for Arabic
Hamdy Mubarak

pdf bib

JESC: Japanese-English Subtitle Corpus
Reid Pryzant | Youngjoo Chung | Dan Jurafsky | Denny Britz

pdf bib

Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters
Marijn Schraagen | Feike Dietz | Marjo van Koppen

pdf bib

Simplified Corpus with Core Vocabulary
Takumi Maruyama | Kazuhide Yamamoto

pdf bib

A Pragmatic Approach for Classical Chinese Word Segmentation
Shilei Huang | Jiangqin Wu

pdf bib

ASAP++: Enriching the ASAP Automated Essay Grading Dataset with Essay Attribute Scores
Sandeep Mathias | Pushpak Bhattacharyya

pdf bib

MirasText: An Automatically Generated Text Corpus for Persian
Behnam Sabeti | Hossein Abedi Firouzjaee | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Amir Vaheb

pdf bib

The Reference Corpus of the Contemporary Romanian Language (CoRoLa)
Verginica Barbu Mititelu | Dan Tufiș | Elena Irimia

pdf bib

A Corpus of Drug Usage Guidelines Annotated with Type of Advice
Sarah Masud Preum | Md. Rizwan Parvez | Kai-Wei Chang | John Stankovic

pdf bib

BioRo: The Biomedical Corpus for the Romanian Language
Maria Mitrofan | Dan Tufiş

pdf bib

A Comparison Of Emotion Annotation Schemes And A New Annotated Data Set
Ian D. Wood | John P. McCrae | Vladimir Andryushechkin | Paul Buitelaar

pdf bib

Humor Detection in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System
Ankush Khandelwal | Sahil Swami | Syed S. Akhtar | Manish Shrivastava

pdf bib

SentiArabic: A Sentiment Analyzer for Standard Arabic
Ramy Eskander

pdf bib

Contextual Dependencies in Time-Continuous Multidimensional Affect Recognition
Dmitrii Fedotov | Denis Ivanko | Maxim Sidorov | Wolfgang Minker

pdf bib

WikiArt Emotions: An Annotated Dataset of Emotions Evoked by Art
Saif Mohammad | Svetlana Kiritchenko

pdf bib

Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction
Paul Rodrigues | Valerie Novak | C. Anton Rytting | Julie Yelle | Jennifer Boutz

pdf bib

Sentence and Clause Level Emotion Annotation, Detection, and Classification in a Multi-Genre Corpus
Shabnam Tafreshi | Mona Diab

pdf bib

A Swedish Cookie-Theft Corpus
Dimitrios Kokkinakis | Kristina Lundholm Fors | Kathleen Fraser | Arto Nordlund

pdf bib

Sharing Copies of Synthetic Clinical Corpora without Physical Distribution — A Case Study to Get Around IPRs and Privacy Constraints Featuring the German JSYNCC Corpus
Christina Lohr | Sven Buechel | Udo Hahn

pdf bib

A Legal Perspective on Training Models for Natural Language Processing
Richard Eckart de Castilho | Giulia Dore | Thomas Margoni | Penny Labropoulou | Iryna Gurevych

pdf bib

LREMap, a Song of Resources and Evaluation
Riccardo Del Gratta | Sara Goggi | Gabriella Pardelli | Nicoletta Calzolari

pdf bib

Metadata Collection Records for Language Resources
Henk van den Heuvel | Erwin Komen | Nelleke Oostdijk

pdf bib

Managing Public Sector Data for Multilingual Applications Development
Stelios Piperidis | Penny Labropoulou | Miltos Deligiannis | Maria Giagkou

pdf bib

Fluid Annotation: A Granularity-aware Annotation Tool for Chinese Word Fluidity
Shu-Kai Hsieh | Yu-Hsiang Tseng | Chih-Yao Lee | Chiung-Yu Chiang

pdf bib

CLARIN’s Key Resource Families
Darja Fišer | Jakob Lenardič | Tomaž Erjavec

pdf bib

A UIMA Database Interface for Managing NLP-related Text Annotations
Giuseppe Abrami | Alexander Mehler

pdf bib

Improving homograph disambiguation with supervised machine learning
Kyle Gorman | Gleb Mazovetskiy | Vitaly Nikolaev

pdf bib

DeModify: A Dataset for Analyzing Contextual Constraints on Modifier Deletion
Vivi Nastase | Devon Fritz | Anette Frank

pdf bib

Open Subtitles Paraphrase Corpus for Six Languages
Mathias Creutz

pdf bib

Fine-grained Semantic Textual Similarity for Serbian
Vuk Batanović | Miloš Cvetanović | Boško Nikolić

pdf bib

SPADE: Evaluation Dataset for Monolingual Phrase Alignment
Yuki Arase | Junichi Tsujii

pdf bib

ETPC - A Paraphrase Identification Corpus Annotated with Extended Paraphrase Typology and Negation
Venelin Kovatchev | M. Antònia Martí | Maria Salamó

pdf bib

Introducing a Lexicon of Verbal Polarity Shifters for English
Marc Schulder | Michael Wiegand | Josef Ruppenhofer | Stephanie Köser

pdf bib

JFCKB: Japanese Feature Change Knowledge Base
Tetsuaki Nakamura | Daisuke Kawahara

pdf bib

Quantifying Qualitative Data for Understanding Controversial Issues
Michael Wojatzki | Saif Mohammad | Torsten Zesch | Svetlana Kiritchenko

pdf bib

Distribution of Emotional Reactions to News Articles in Twitter
Omar Juárez Gambino | Hiram Calvo | Consuelo-Varinia García-Mendoza

pdf bib

Aggression-annotated Corpus of Hindi-English Code-mixed Data
Ritesh Kumar | Aishwarya N. Reganti | Akshit Bhatia | Tushar Maheshwari

pdf bib

Creating a Verb Synonym Lexicon Based on a Parallel Corpus
Zdeňka Urešová | Eva Fučíková | Eva Hajičová | Jan Hajič

pdf bib

Evaluation of Domain-specific Word Embeddings using Knowledge Resources
Farhad Nooralahzadeh | Lilja Øvrelid | Jan Tore Lønning

pdf bib

Automatic Thesaurus Construction for Modern Hebrew
Chaya Liebeskind | Ido Dagan | Jonathan Schler

pdf bib

Automatic Wordnet Mapping: from CoreNet to Princeton WordNet
Jiseong Kim | Younggyun Hahm | Sunggoo Kwon | Key-Sun Choi

pdf bib

The Boarnsterhim Corpus: A Bilingual Frisian-Dutch Panel and Trend Study
Marjoleine Sloos | Eduard Drenth | Wilbert Heeringa

pdf bib

The French-Algerian Code-Switching Triggered audio corpus (FACST)
Amazouz Djegdjiga | Martine Adda-Decker | Lori Lamel

pdf bib

Strategies and Challenges for Crowdsourcing Regional Dialect Perception Data for Swiss German and Swiss French
Jean-Philippe Goldman | Simon Clematide | Mathieu Avanzi | Raphael Tandler

pdf bib

Chinese-Portuguese Machine Translation: A Study on Building Parallel Corpora from Comparable Texts
Siyou Liu | Longyue Wang | Chao-Hong Liu

pdf bib

Evaluating the WordsEye Text-to-Scene System: Imaginative and Realistic Sentences
Morgan Ulinski | Bob Coyne | Julia Hirschberg

pdf bib

Computer-assisted Speaker Diarization: How to Evaluate Human Corrections
Pierre-Alexandre Broux | David Doukhan | Simon Petitrenaud | Sylvain Meignier | Jean Carrive

pdf bib

Performance Impact Caused by Hidden Bias of Training Data for Recognizing Textual Entailment
Masatoshi Tsuchiya

pdf bib

Evaluation of Croatian Word Embeddings
Lukáš Svoboda | Slobodan Beliga

pdf bib

C-HTS: A Concept-based Hierarchical Text Segmentation approach
Mostafa Bayomi | Séamus Lawless

pdf bib

Semantic Supersenses for English Possessives
Austin Blodgett | Nathan Schneider

pdf bib

A Corpus of Metaphor Novelty Scores for Syntactically-Related Word Pairs
Natalie Parde | Rodney Nielsen

pdf bib

Improving Hypernymy Extraction with Distributional Semantic Classes
Alexander Panchenko | Dmitry Ustalov | Stefano Faralli | Simone P. Ponzetto | Chris Biemann

pdf bib

Laying the Groundwork for Knowledge Base Population: Nine Years of Linguistic Resources for TAC KBP
Jeremy Getman | Joe Ellis | Stephanie Strassel | Zhiyi Song | Jennifer Tracey

pdf bib

A Dataset for Inter-Sentence Relation Extraction using Distant Supervision
Angrosh Mandya | Danushka Bollegala | Frans Coenen | Katie Atkinson

pdf bib

Diacritics Restoration Using Neural Networks
Jakub Náplava | Milan Straka | Pavel Straňák | Jan Hajič

pdf bib

Ensemble Romanian Dependency Parsing with Neural Networks
Radu Ion | Elena Irimia | Verginica Barbu Mititelu

pdf bib

Classifying Sluice Occurrences in Dialogue
Austin Baird | Anissa Hamza | Daniel Hardt

pdf bib

EmotionLines: An Emotion Corpus of Multi-Party Conversations
Chao-Chun Hsu | Sheng-Yeh Chen | Chuan-Chun Kuo | Ting-Hao Huang | Lun-Wei Ku

pdf bib

Academic-Industrial Perspective on the Development and Deployment of a Moderation System for a Newspaper Website
Dietmar Schabus | Marcin Skowron

pdf bib

Community-Driven Crowdsourcing: Data Collection with Local Developers
Christina Funk | Michael Tseng | Ravindran Rajakumar | Linne Ha

pdf bib

An Integrated Representation of Linguistic and Social Functions of Code-Switching
Silvana Hartmann | Monojit Choudhury | Kalika Bali

pdf bib

A Corpus of eRulemaking User Comments for Measuring Evaluability of Arguments
Joonsuk Park | Claire Cardie

pdf bib

Discourse Coherence Through the Lens of an Annotated Text Corpus: A Case Study
Eva Hajičová | Jiří Mírovský

pdf bib

Automatic Prediction of Discourse Connectives
Eric Malmi | Daniele Pighin | Sebastian Krause | Mikhail Kozhevnikov

pdf bib

Handling Rare Word Problem using Synthetic Training Data for Sinhala and Tamil Neural Machine Translation
Pasindu Tennage | Prabath Sandaruwan | Malith Thilakarathne | Achini Herath | Surangika Ranathunga

pdf bib

BDPROTO: A Database of Phonological Inventories from Ancient and Reconstructed Languages
Egidio Marsico | Sebastien Flavier | Annemarie Verkerk | Steven Moran

pdf bib

Creating a Translation Matrix of the Bible’s Names Across 591 Languages
Winston Wu | Nidhi Vyas | David Yarowsky

pdf bib

Simple Semantic Annotation and Situation Frames: Two Approaches to Basic Text Understanding in LORELEI
Kira Griffitt | Jennifer Tracey | Ann Bies | Stephanie Strassel

pdf bib

Evaluating Scoped Meaning Representations
Rik van Noord | Lasha Abzianidze | Hessel Haagsma | Johan Bos

pdf bib

Huge Automatically Extracted Training-Sets for Multilingual Word Sense Disambiguation
Tommaso Pasini | Francesco Elia | Roberto Navigli

pdf bib

SentEval: An Evaluation Toolkit for Universal Sentence Representations
Alexis Conneau | Douwe Kiela

pdf bib

A Survey on Automatically-Constructed WordNets and their Evaluation: Lexical and Word Embedding-based Approaches
Steven Neale

pdf bib

Linguistically-driven Framework for Computationally Efficient and Scalable Sign Recognition
Dimitris Metaxas | Mark Dilsizian | Carol Neidle

pdf bib

CONDUCT: An Expressive Conducting Gesture Dataset for Sound Control
Lei Chen | Sylvie Gibet | Camille Marteau

pdf bib

Neural Caption Generation for News Images
Vishwash Batra | Yulan He | George Vogiatzis

pdf bib

MPST: A Corpus of Movie Plot Synopses with Tags
Sudipta Kar | Suraj Maharjan | A. Pastor López-Monroy | Thamar Solorio

pdf bib

OpenSubtitles2018: Statistical Rescoring of Sentence Alignments in Large, Noisy Parallel Corpora
Pierre Lison | Jörg Tiedemann | Milen Kouylekov

pdf bib

A Deep Neural Network based Approach for Entity Extraction in Code-Mixed Indian Social Media Text
Deepak Gupta | Asif Ekbal | Pushpak Bhattacharyya

pdf bib

Annotating If the Authors of a Tweet are Located at the Locations They Tweet About
Vivek Doudagiri | Alakananda Vempala | Eduardo Blanco

pdf bib

MOCCA: Measure of Confidence for Corpus Analysis - Automatic Reliability Check of Transcript and Automatic Segmentation
Thomas Kisler | Florian Schiel

pdf bib

Towards an ISO Standard for the Annotation of Quantification
Harry Bunt | James Pustejovsky | Kiyong Lee

pdf bib

Lightweight Grammatical Annotation in the TEI: New Perspectives
Piotr Bański | Susanne Haaf | Martin Mueller

pdf bib

A Gold Standard for Multilingual Automatic Term Extraction from Comparable Corpora: Term Structure and Translation Equivalents
Ayla Rigouts Terryn | Véronique Hoste | Els Lefever

pdf bib

Handling Big Data and Sensitive Data Using EUDAT’s Generic Execution Framework and the WebLicht Workflow Engine.
Claus Zinn | Wei Qui | Marie Hinrichs | Emanuel Dima | Alexandr Chernov

pdf bib

Building a Web-Scale Dependency-Parsed Corpus from CommonCrawl
Alexander Panchenko | Eugen Ruppert | Stefano Faralli | Simone P. Ponzetto | Chris Biemann

pdf bib

Developing the Bangla RST Discourse Treebank
Debopam Das | Manfred Stede

pdf bib

A New Version of the Składnica Treebank of Polish Harmonised with the Walenty Valency Dictionary
Marcin Woliński | Elżbieta Hajnicz | Tomasz Bartosiak

pdf bib

Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions
Kira Droganova | Daniel Zeman | Jenna Kanerva | Filip Ginter

pdf bib

Semi-Automatic Construction of Word-Formation Networks (for Polish and Spanish)
Mateusz Lango | Magda Ševčíková | Zdeněk Žabokrtský

pdf bib

A multilingual collection of CoNLL-U-compatible morphological lexicons
Benoît Sagot

pdf bib

A Computational Architecture for the Morphology of Upper Tanana
Olga Lovick | Christopher Cox | Miikka Silfverberg | Antti Arppe | Mans Hulden

pdf bib

Expanding Abbreviations in a Strongly Inflected Language: Are Morphosyntactic Tags Sufficient?
Piotr Żelasko

pdf bib

A High-Quality Gold Standard for Citation-based Tasks
Michael Färber | Alexander Thiemann | Adam Jatowt

pdf bib

Measuring Innovation in Speech and Language Processing Publications.
Joseph Mariani | Gil Francopoulo | Patrick Paroubek

pdf bib

PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles
Daniel Ferrés | Horacio Saggion | Francesco Ronzano | Àlex Bravo

pdf bib

A «Portrait» Approach to Multichannel Discourse
Andrej Kibrik | Olga Fedorova

pdf bib

Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank
Deniz Zeyrek | Amália Mendes | Murathan Kurfalı

pdf bib

Building a Macro Chinese Discourse Treebank
Xiaomin Chu | Feng Jiang | Sheng Xu | Qiaoming Zhu

pdf bib

Enhancing the AI2 Diagrams Dataset Using Rhetorical Structure Theory
Tuomo Hiippala | Serafina Orekhova

pdf bib

QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation
Kordula De Kuthy | Nils Reiter | Arndt Riester

pdf bib

The Spot the Difference corpus: a multi-modal corpus of spontaneous task oriented spoken interactions
José Lopes | Nils Hemmingsson | Oliver Åstrand

pdf bib

Attention for Implicit Discourse Relation Recognition
Andre Cianflone | Leila Kosseim

pdf bib

A Context-based Approach for Dialogue Act Recognition using Simple Recurrent Neural Networks
Chandrakant Bothe | Cornelius Weber | Sven Magg | Stefan Wermter

pdf bib

TreeAnnotator: Versatile Visual Annotation of Hierarchical Text Relations
Philipp Helfrich | Elias Rieb | Giuseppe Abrami | Andy Lücking | Alexander Mehler

pdf bib

Chats and Chunks: Annotation and Analysis of Multiparty Long Casual Conversations
Emer Gilmartin | Carl Vogel | Nick Campbell

pdf bib

Extending the gold standard for a lexical substitution task: is it worth it?
Ludovic Tanguy | Cécile Fabre | Laura Rivière

pdf bib

Lexical and Semantic Features for Cross-lingual Text Reuse Classification: an Experiment in English and Latin Paraphrases
Maria Moritz | David Steding

pdf bib

Investigating the Influence of Bilingual MWU on Trainee Translation Quality
Yu Yuan | Serge Sharoff

pdf bib

Evaluation of Dictionary Creating Methods for Finno-Ugric Minority Languages
Zsanett Ferenczi | Iván Mittelholcz | Eszter Simon | Tamás Váradi

pdf bib

Dysarthric speech evaluation: automatic and perceptual approaches
Imed Laaridh | Christine Meunier | Corinne Fredouille

pdf bib

Towards an Automatic Assessment of Crowdsourced Data for NLU
Patricia Braunger | Wolfgang Maier | Jan Wessling | Maria Schmidt

pdf bib

Is it worth it? Budget-related evaluation metrics for model selection
Filip Klubička | Giancarlo D. Salton | John D. Kelleher

pdf bib

Automated Evaluation of Out-of-Context Errors
Patrick Huber | Jan Niehues | Alex Waibel

pdf bib

Matics Software Suite: New Tools for Evaluation and Data Exploration
Olivier Galibert | Guillaume Bernard | Agnes Delaborde | Sabrina Lecadre | Juliette Kahn

pdf bib

MGAD: Multilingual Generation of Analogy Datasets
Mostafa Abdou | Artur Kulmizev | Vinit Ravishankar

pdf bib

MIsA: Multilingual “IsA” Extraction from Corpora
Stefano Faralli | Els Lefever | Simone Paolo Ponzetto

pdf bib

Biomedical term normalization of EHRs with UMLS
Naiara Perez-Miguel | Montse Cuadros | German Rigau

pdf bib

Revisiting the Task of Scoring Open IE Relations
William Léchelle | Philippe Langlais

pdf bib

A supervised approach to taxonomy extraction using word embeddings
Rajdeep Sarkar | John P. McCrae | Paul Buitelaar

pdf bib

A Chinese Dataset with Negative Full Forms for General Abbreviation Prediction
Yi Zhang | Xu Sun

pdf bib

Korean TimeBank Including Relative Temporal Information
Chae-Gyun Lim | Young-Seob Jeong | Ho-Jin Choi

pdf bib

Mining Biomedical Publications With The LAPPS Grid
Nancy Ide | Keith Suderman | Jin-Dong Kim

pdf bib

An Initial Test Collection for Ranked Retrieval of SMS Conversations
Rashmi Sankepally | Douglas W. Oard

pdf bib

PyRATA, Python Rule-based feAture sTructure Analysis
Nicolas Hernandez | Amir Hazem

pdf bib

Multi Modal Distance - An Approach to Stemma Generation With Weighting
Armin Hoenen

pdf bib

A Corpus of Natural Multimodal Spatial Scene Descriptions
Ting Han | David Schlangen

pdf bib

The Effects of Unimodal Representation Choices on Multimodal Learning
Fernando Tadao Ito | Helena de Medeiros Caseli | Jander Moreira

pdf bib

An Evaluation Framework for Multimodal Interaction
Nikhil Krishnaswamy | James Pustejovsky

pdf bib

The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic
Ahmed Abdelali | Irina Temnikova | Samy Hedaya | Stephan Vogel

pdf bib

Polish Corpus of Annotated Descriptions of Images
Alina Wróblewska

pdf bib

Action Verb Corpus
Stephanie Gross | Matthias Hirschmanner | Brigitte Krenn | Friedrich Neubarth | Michael Zillich

pdf bib

EMO&LY (EMOtion and AnomaLY): A new corpus for anomaly detection in an audiovisual stream with emotional context.
Cédric Fayet | Arnaud Delhay | Damien Lolive | Pierre-François Marteau

pdf bib

Development of an Annotated Multimodal Dataset for the Investigation of Classification and Summarisation of Presentations using High-Level Paralinguistic Features
Keith Curtis | Nick Campbell | Gareth Jones

pdf bib

BKTreebank: Building a Vietnamese Dependency Treebank
Kiem-Hieu Nguyen

pdf bib

GeCoTagger: Annotation of German Verb Complements with Conditional Random Fields
Roman Schneider | Monica Fürbacher

pdf bib

AET: Web-based Adjective Exploration Tool for German
Tatiana Bladier | Esther Seyffarth | Oliver Hellwig | Wiebke Petersen

pdf bib

ZAP: An Open-Source Multilingual Annotation Projection Framework
Alan Akbik | Roland Vollgraf

pdf bib

Palmyra: A Platform Independent Dependency Annotation Tool for Morphologically Rich Languages
Talha Javed | Nizar Habash | Dima Taji

pdf bib

A Web-based System for Crowd-in-the-Loop Dependency Treebanking
Stephen Tratz | Nhien Phan

pdf bib

Building Universal Dependency Treebanks in Korean
Jayeol Chun | Na-Rae Han | Jena D. Hwang | Jinho D. Choi

pdf bib

Moving TIGER beyond Sentence-Level
Agnieszka Falenska | Kerstin Eckart | Jonas Kuhn

pdf bib

Spanish HPSG Treebank based on the AnCora Corpus
Luis Chiruzzo | Dina Wonsever

pdf bib

Universal Dependencies for Amharic
Binyam Ephrem Seyoum | Yusuke Miyao | Baye Yimam Mekonnen

pdf bib

A Parser for LTAG and Frame Semantics
David Arps | Simon Petitjean

pdf bib

Multilingual Dependency Parsing for Low-Resource Languages: Case Studies on North Saami and Komi-Zyrian
KyungTae Lim | Niko Partanen | Thierry Poibeau

pdf bib

FonBund: A Library for Combining Cross-lingual Phonological Segment Data
Alexander Gutkin | Martin Jansche | Tatiana Merkulova

pdf bib

Voice Builder: A Tool for Building Text-To-Speech Voices
Pasindu De Silva | Theeraphol Wattanavekin | Tang Hao | Knot Pipatsrisawat

pdf bib

Using Discourse Information for Education with a Spanish-Chinese Parallel Corpus
Shuyuan Cao | Harritxu Gete

pdf bib

A 2nd Longitudinal Corpus for Children’s Writing with Enhanced Output for Specific Spelling Patterns
Kay Berkling

pdf bib

Development of a Mobile Observation Support System for Students: FishWatchr Mini
Masaya Yamaguchi | Masanori Kitamura | Naomi Yanagida

pdf bib

Infant Word Comprehension-to-Production Index Applied to Investigation of Noun Learning Predominance Using Cross-lingual CDI database
Yasuhiro Minami | Tessei Kobayashi | Yuko Okumura

pdf bib

Building a TOCFL Learner Corpus for Chinese Grammatical Error Diagnosis
Lung-Hao Lee | Yuen-Hsien Tseng | Li-Ping Chang

pdf bib

MIAPARLE: Online training for the discrimination of stress contrasts
Jean-Philippe Goldman | Sandra Schwab

pdf bib

ESCRITO - An NLP-Enhanced Educational Scoring Toolkit
Torsten Zesch | Andrea Horbach

pdf bib

A Leveled Reading Corpus of Modern Standard Arabic
Muhamed Al Khalil | Hind Saddiki | Nizar Habash | Latifa Alfalasi

pdf bib

Developing New Linguistic Resources and Tools for the Galician Language
Rodrigo Agerri | Xavier Gómez Guinovart | German Rigau | Miguel Anxo Solla Portela

pdf bib

Modeling Northern Haida Verb Morphology
Jordan Lachler | Lene Antonsen | Trond Trosterud | Sjur Moshagen | Antti Arppe

pdf bib

Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation
Caitlin Richter | Matthew Wickes | Deniz Beser | Mitch Marcus

pdf bib

Low Resource Methods for Medieval Document Sections Analysis
Petra Galuščáková | Lucie Neužilová

pdf bib

SB-CH: A Swiss German Corpus with Sentiment Annotations
Ralf Grubenmann | Don Tuggener | Pius von Däniken | Jan Deriu | Mark Cieliebak

pdf bib

Universal Dependencies for Ainu
Hajime Senuma | Akiko Aizawa

pdf bib

Building a List of Synonymous Words and Phrases of Japanese Compound Verbs
Kyoko Kanzaki | Hitoshi Isahara

pdf bib

Evaluating EcoLexiCAT: a Terminology-Enhanced CAT Tool
Pilar León-Araúz | Arianne Reimerink

pdf bib

A Danish FrameNet Lexicon and an Annotated Corpus Used for Training and Evaluating a Semantic Frame Classifier
Bolette Pedersen | Sanni Nimb | Anders Søgaard | Mareike Hartmann | Sussi Olsen

pdf bib

SLIDE - a Sentiment Lexicon of Common Idioms
Charles Jochim | Francesca Bonin | Roy Bar-Haim | Noam Slonim

pdf bib

PronouncUR: An Urdu Pronunciation Lexicon Generator
Haris Bin Zia | Agha Ali Raza | Awais Athar

pdf bib

SimLex-999 for Polish
Agnieszka Mykowiecka | Małgorzata Marciniak | Piotr Rychlik

pdf bib

Finely Tuned, 2 Billion Token Based Word Embeddings for Portuguese
João Rodrigues | António Branco

pdf bib

Teanga: A Linked Data based platform for Natural Language Processing
Housam Ziad | John P. McCrae | Paul Buitelaar

pdf bib

Automatic and Manual Web Annotations in an Infrastructure to handle Fake News and other Online Media Phenomena
Georg Rehm | Julian Moreno-Schneider | Peter Bourgonje

pdf bib

The LODeXporter: Flexible Generation of Linked Open Data Triples from NLP Frameworks for Automatic Knowledge Base Construction
René Witte | Bahar Sateli

pdf bib

LiDo RDF: From a Relational Database to a Linked Data Graph of Linguistic Terms and Bibliographic Data
Bettina Klimek | Robert Schädlich | Dustin Kröger | Edwin Knese | Benedikt Elßmann

pdf bib

Towards a Linked Open Data Edition of Sumerian Corpora
Christian Chiarcos | Émilie Pagé-Perron | Ilya Khait | Niko Schenk | Lucas Reckling

pdf bib

A Bird’s-eye View of Language Processing Projects at the Romanian Academy
Dan Tufiș | Dan Cristea

pdf bib

PMKI: an European Commission action for the interoperability, maintainability and sustainability of Language Resources
Peter Schmitz | Enrico Francesconi | Najeh Hajlaoui | Brahim Batouche

pdf bib

The Abkhaz National Corpus
Paul Meurer

pdf bib

Collecting Language Resources from Public Administrations in the Nordic and Baltic Countries
Andrejs Vasiļjevs | Rihards Kalniņš | Roberts Rozis | Aivars Bērziņš

pdf bib

LIdioms: A Multilingual Linked Idioms Data Set
Diego Moussallem | Mohamed Ahmed Sherif | Diego Esteves | Marcos Zampieri | Axel-Cyrille Ngonga Ngomo

pdf bib

Annotating Modality Expressions and Event Factuality for a Japanese Chess Commentary Corpus
Suguru Matsuyoshi | Hirotaka Kameko | Yugo Murawaki | Shinsuke Mori

pdf bib

Annotating Chinese Light Verb Constructions according to PARSEME guidelines
Menghan Jiang | Natalia Klyueva | Hongzhi Xu | Chu-Ren Huang

pdf bib

Using English Baits to Catch Serbian Multi-Word Terminology
Cvetana Krstev | Branislava Šandrih | Ranka Stanković | Miljana Mladenović

pdf bib

Construction of Large-scale English Verbal Multiword Expression Annotated Corpus
Akihiko Kato | Hiroyuki Shindo | Yuji Matsumoto

pdf bib

Konbitzul: an MWE-specific database for Spanish-Basque
Uxoa Iñurrieta | Itziar Aduriz | Arantza Díaz de Ilarraza | Gorka Labaka | Kepa Sarasola

pdf bib

Towards the Inference of Semantic Relations in Complex Nominals: a Pilot Study
Melania Cabezas-García | Pilar León-Araúz

pdf bib

Generation of a Spanish Artificial Collocation Error Corpus
Sara Rodríguez-Fernández | Roberto Carlini | Leo Wanner

pdf bib

Improving a Neural-based Tagger for Multiword Expressions Identification
Dušan Variš | Natalia Klyueva

pdf bib

Designing a Russian Idiom-Annotated Corpus
Katsiaryna Aharodnik | Anna Feldman | Jing Peng

pdf bib

DeepTC – An Extension of DKPro Text Classification for Fostering Reproducibility of Deep Learning Experiments
Tobias Horsmann | Torsten Zesch

pdf bib

Improving Hate Speech Detection with Deep Learning Ensembles
Steven Zimmerman | Udo Kruschwitz | Chris Fox

pdf bib

Distributional Term Set Expansion
Amaru Cuba Gyllensten | Magnus Sahlgren

pdf bib

Can Domain Adaptation be Handled as Analogies?
Núria Bel | Joel Pocostales

pdf bib

Author Profiling from Facebook Corpora
Fernando Hsieh | Rafael Dias | Ivandré Paraboni

pdf bib

Experiments with Convolutional Neural Networks for Multi-Label Authorship Attribution
Dainis Boumber | Yifan Zhang | Arjun Mukherjee

pdf bib

A Fast and Accurate Vietnamese Word Segmenter
Dat Quoc Nguyen | Dai Quoc Nguyen | Thanh Vu | Mark Dras | Mark Johnson

pdf bib

Finite-state morphological analysis for Gagauz
Sevilay Bayatli | Güllü Karanfil | Memduh Gökırmak | Francis M. Tyers

pdf bib

Albanian Part-of-Speech Tagging: Gold Standard and Evaluation
Besim Kabashi | Thomas Proisl

pdf bib

Morphology Injection for English-Malayalam Statistical Machine Translation
Sreelekha S | Pushpak Bhattacharyya

pdf bib

The Morpho-syntactic Annotation of Animacy for a Dependency Parser
Mohammed Attia | Vitaly Nikolaev | Ali Elkahky

pdf bib

A Morphological Analyzer for St. Lawrence Island / Central Siberian Yupik
Emily Chen | Lane Schwartz

pdf bib

EMTC: Multilabel Corpus in Movie Domain for Emotion Analysis in Conversational Text
Duc-Anh Phan | Yuji Matsumoto

pdf bib

Complex and Precise Movie and Book Annotations in French Language for Aspect Based Sentiment Analysis
Stefania Pecore | Jeanne Villaneau

pdf bib

Lingmotif-lex: a Wide-coverage, State-of-the-art Lexicon for Sentiment Analysis
Antonio Moreno-Ortiz | Chantal Pérez-Hernández

pdf bib

A Japanese Corpus for Analyzing Customer Loyalty Information
Yiou Wang | Takuji Tahara

pdf bib

FooTweets: A Bilingual Parallel Corpus of World Cup Tweets
Henny Sluyter-Gäthje | Pintu Lohar | Haithem Afli | Andy Way

pdf bib

The SSIX Corpora: Three Gold Standard Corpora for Sentiment Analysis in English, Spanish and German Financial Microblogs
Thomas Gaillat | Manel Zarrouk | André Freitas | Brian Davis

pdf bib

Sarcasm Target Identification: Dataset and An Introductory Approach
Aditya Joshi | Pranav Goel | Pushpak Bhattacharyya | Mark Carman

pdf bib

Annotating Opinions and Opinion Targets in Student Course Feedback
Janaka Chathuranga | Shanika Ediriweera | Ravindu Hasantha | Pranidhith Munasinghe | Surangika Ranathunga

pdf bib

Generating a Gold Standard for a Swedish Sentiment Lexicon
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide

pdf bib

WordKit: a Python Package for Orthographic and Phonological Featurization
Stéphan Tulkens | Dominiek Sandra | Walter Daelemans

pdf bib

Pronunciation Variants and ASR of Colloquial Speech: A Case Study on Czech
David Lukeš | Marie Kopřivová | Zuzana Komrsková | Petra Poukarová

pdf bib

Epitran: Precision G2P for Many Languages
David R. Mortensen | Siddharth Dalmia | Patrick Littell

pdf bib

A Multilingual Approach to Question Classification
Aikaterini-Lida Kalouli | Katharina Kaiser | Annette Hautli-Janisz | Georg A. Kaiser | Miriam Butt

pdf bib

A Multi-Domain Framework for Textual Similarity. A Case Study on Question-to-Question and Question-Answering Similarity Tasks
Amir Hazem | Basma El Amal Boussaha | Nicolas Hernandez

pdf bib

WorldTree: A Corpus of Explanation Graphs for Elementary Science Questions supporting Multi-hop Inference
Peter Jansen | Elizabeth Wainwright | Steven Marmorstein | Clayton Morrison

pdf bib

Analysis of Implicit Conditions in Database Search Dialogues
Shun-ya Fukunaga | Hitoshi Nishikawa | Takenobu Tokunaga | Hikaru Yokono | Tetsuro Takahashi

pdf bib

An Information-Providing Closed-Domain Human-Agent Interaction Corpus
Jelte van Waterschoot | Guillaume Dubuisson Duplessis | Lorenzo Gatti | Merijn Bruijnes | Dirk Heylen

pdf bib

Augmenting Image Question Answering Dataset by Exploiting Image Captions
Masashi Yokota | Hideki Nakayama

pdf bib

Semi-supervised Training Data Generation for Multilingual Question Answering
Kyungjae Lee | Kyoungho Yoon | Sunghyun Park | Seung-won Hwang

pdf bib

BioRead: A New Dataset for Biomedical Reading Comprehension
Dimitris Pappas | Ion Androutsopoulos | Haris Papageorgiou

pdf bib

MMQA: A Multi-domain Multi-lingual Question-Answering Framework for English and Hindi
Deepak Gupta | Surabhi Kumari | Asif Ekbal | Pushpak Bhattacharyya

pdf bib

The First 100 Days: A Corpus Of Political Agendas on Twitter
Nathan Green | Septina Larasati

pdf bib

Medical Sentiment Analysis using Social Media: Towards building a Patient Assisted System
Shweta Yadav | Asif Ekbal | Sriparna Saha | Pushpak Bhattacharyya

pdf bib

An Italian Twitter Corpus of Hate Speech against Immigrants
Manuela Sanguinetti | Fabio Poletto | Cristina Bosco | Viviana Patti | Marco Stranisci

pdf bib

A Large Multilingual and Multi-domain Dataset for Recommender Systems
Giorgia Di Tommaso | Stefano Faralli | Paola Velardi

pdf bib

RtGender: A Corpus for Studying Differential Responses to Gender
Rob Voigt | David Jurgens | Vinodkumar Prabhakaran | Dan Jurafsky | Yulia Tsvetkov

pdf bib

A Neural Network Model for Part-Of-Speech Tagging of Social Media Texts
Sara Meftah | Nasredine Semmar

pdf bib

Utilizing Large Twitter Corpora to Create Sentiment Lexica
Valerij Fredriksen | Brage Jahren | Björn Gambäck

pdf bib

The Nautilus Speaker Characterization Corpus: Speech Recordings and Labels of Speaker Characteristics and Voice Descriptions
Laura Fernández Gallardo | Benjamin Weiss

pdf bib

Evaluation of Automatic Formant Trackers
Florian Schiel | Thomas Zitzelsberger

pdf bib

A First South African Corpus of Multilingual Code-switched Soap Opera Speech
Ewald van der Westhuizen | Thomas Niesler

pdf bib

A Web Service for Pre-segmenting Very Long Transcribed Speech Recordings
Nina Poerner | Florian Schiel

pdf bib

Creating Lithuanian and Latvian Speech Corpora from Inaccurately Annotated Web Data
Askars Salimbajevs

Preparing Data from Psychotherapy for Natural Language Processing
Margot Mieskes | Andreas Stiegelmayr

pdf bib

MirasVoice: A bilingual (English-Persian) speech corpus
Amir Vaheb | Ali Janalizadeh Choobbasti | S.H.E. Mortazavi Najafabadi | Saeid Safavi | Behnam Sabeti

pdf bib

Dialog Intent Structure: A Hierarchical Schema of Linked Dialog Acts
Silvia Pareti | Tatiana Lando

pdf bib

JDCFC: A Japanese Dialogue Corpus with Feature Changes
Tetsuaki Nakamura | Daisuke Kawahara

pdf bib

Japanese Dialogue Corpus of Information Navigation and Attentive Listening Annotated with Extended ISO-24617-2 Dialogue Act Tags
Koichiro Yoshino | Hiroki Tanaka | Kyoshiro Sugiyama | Makoto Kondo | Satoshi Nakamura

pdf bib

Constructing a Chinese Medical Conversation Corpus Annotated with Conversational Structures and Actions
Nan Wang | Yan Song | Fei Xia

pdf bib

Predicting Nods by using Dialogue Acts in Dialogue
Ryo Ishii | Ryuichiro Higashinaka | Junji Tomita

pdf bib

Modeling Collaborative Multimodal Behavior in Group Dialogues: The MULTISIMO Corpus
Maria Koutsombogera | Carl Vogel

pdf bib

Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas
Sashi Novitasari | Quoc Truong Do | Sakriani Sakti | Dessi Lestari | Satoshi Nakamura

pdf bib

TF-LM: TensorFlow-based Language Modeling Toolkit
Lyan Verwimp | Hugo Van hamme | Patrick Wambacq

pdf bib

Grapheme-level Awareness in Word Embeddings for Morphologically Rich Languages
Suzi Park | Hyopil Shin

pdf bib

Building a Constraint Grammar Parser for Plains Cree Verbs and Arguments
Katherine Schmirler | Antti Arppe | Trond Trosterud | Lene Antonsen

pdf bib

BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages
Benjamin Heinzerling | Michael Strube

pdf bib

Reference production in human-computer interaction: Issues for Corpus-based Referring Expression Generation
Danillo Rocha | Ivandré Paraboni

pdf bib

Definite Description Lexical Choice: taking Speaker’s Personality into account
Alex Lan | Ivandré Paraboni

pdf bib

Referring Expression Generation in time-constrained communication
André Mariotti | Ivandré Paraboni

pdf bib

Incorporating Semantic Attention in Video Description Generation
Natsuda Laokulrat | Naoaki Okazaki | Hideki Nakayama

pdf bib

GenDR: A Generic Deep Realizer with Complex Lexicalization
François Lareau | Florie Lambrey | Ieva Dubinskaite | Daniel Galarreta-Piquette | Maryam Nejat

pdf bib

A Detailed Evaluation of Neural Sequence-to-Sequence Models for In-domain and Cross-domain Text Simplification
Sanja Štajner | Sergiu Nisioi

pdf bib

Don’t Annotate, but Validate: a Data-to-Text Method for Capturing Event Data
Piek Vossen | Filip Ilievski | Marten Postma | Roxane Segers

pdf bib

Towards a music-language mapping
Michele Berlingerio | Francesca Bonin

pdf bib

Up-cycling Data for Natural Language Generation
Amy Isard | Jon Oberlander | Claire Grover

pdf bib

Neural Models of Selectional Preferences for Implicit Semantic Role Labeling
Minh Le | Antske Fokkens

pdf bib

A database of German definitory contexts from selected web sources
Adrien Barbaresi | Lothar Lemnitzer | Alexander Geyken

pdf bib

Annotating Abstract Meaning Representations for Spanish
Noelia Migueles-Abraira | Rodrigo Agerri | Arantza Diaz de Ilarraza

pdf bib

Browsing the Terminological Structure of a Specialized Domain: A Method Based on Lexical Functions and their Classification
Marie-Claude L’Homme | Benoît Robichaud | Nathalie Prévil

pdf bib

Rollenwechsel-English: a large-scale semantic role corpus
Asad Sayeed | Pavel Shkadzko | Vera Demberg

pdf bib

Towards a Standardized Dataset for Noun Compound Interpretation
Girishkumar Ponkiya | Kevin Patel | Pushpak Bhattacharyya | Girish K Palshikar

pdf bib

Structured Interpretation of Temporal Relations
Yuchen Zhang | Nianwen Xue

pdf bib

NL2Bash: A Corpus and Semantic Parser for Natural Language Interface to the Linux Operating System
Xi Victoria Lin | Chenglong Wang | Luke Zettlemoyer | Michael D. Ernst

pdf bib

World Knowledge for Abstract Meaning Representation Parsing
Charles Welch | Jonathan K. Kummerfeld | Song Feng | Rada Mihalcea

pdf bib

Improved Transcription and Indexing of Oral History Interviews for Digital Humanities Research
Michael Gref | Joachim Köhler | Almut Leh

pdf bib

Sound Signal Processing with Seq2Tree Network
Weicheng Ma | Kai Cao | Zhaoheng Ni | Peter Chin | Xiang Li

pdf bib

Open ASR for Icelandic: Resources and a Baseline System
Anna Björk Nikulásdóttir | Inga Rún Helgadóttir | Matthías Pétursson | Jón Guðnason

pdf bib

Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models
Zhao Meng | Lili Mou | Zhi Jin

pdf bib

Discriminating between Similar Languages on Imbalanced Conversational Texts
Junqing He | Xian Huang | Xuemin Zhao | Yan Zhang | Yonghong Yan

pdf bib

Data-Driven Pronunciation Modeling of Swiss German Dialectal Speech for Automatic Speech Recognition
Michael Stadtschnitzer | Christoph Schmidt

pdf bib

Simulating ASR errors for training SLU systems
Edwin Simonnet | Sahar Ghannay | Nathalie Camelin | Yannick Estève

pdf bib

Evaluation of Feature-Space Speaker Adaptation for End-to-End Acoustic Models
Natalia Tomashenko | Yannick Estève

pdf bib

Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform
Ingmar Steiner | Sébastien Le Maguer

pdf bib

Speech Rate Calculations with Short Utterances: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task
Akira Hayakawa | Carl Vogel | Saturnino Luz | Nick Campbell

pdf bib

Beyond Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous Data
Christopher Tauchmann | Thomas Arnold | Andreas Hanselowski | Christian M. Meyer | Margot Mieskes

pdf bib

A New Annotated Portuguese/Spanish Corpus for the Multi-Sentence Compression Task
Elvys Linhares Pontes | Juan-Manuel Torres-Moreno | Stéphane Huet | Andréa Carneiro Linhares

pdf bib

Live Blog Corpus for Summarization
Avinesh P.V.S. | Maxime Peyrard | Christian M. Meyer

pdf bib

TSix: A Human-involved-creation Dataset for Tweet Summarization
Minh-Tien Nguyen | Dac Viet Lai | Huy-Tien Nguyen | Le-Minh Nguyen

pdf bib

A Workbench for Rapid Generation of Cross-Lingual Summaries
Nisarg Jhaveri | Manish Gupta | Vasudeva Varma

pdf bib

Annotation and Analysis of Extractive Summaries for the Kyutech Corpus
Takashi Yamamura | Kazutaka Shimada

pdf bib

A Repository of Corpora for Summarization
Franck Dernoncourt | Mohammad Ghassemi | Walter Chang

pdf bib

Auto-hMDS: Automatic Construction of a Large Heterogeneous Multilingual Multi-Document Summarization Corpus
Markus Zopf

pdf bib

PyrEval: An Automated Method for Summary Content Analysis
Yanjun Gao | Andrew Warner | Rebecca Passonneau

pdf bib

Mapping Texts to Scripts: An Entailment Study
Simon Ostermann | Hannah Seitz | Stefan Thater | Manfred Pinkal

pdf bib

Semantic Equivalence Detection: Are Interrogatives Harder than Declaratives?
João Rodrigues | Chakaveh Saedi | António Branco | João Silva

pdf bib

CEFR-based Lexical Simplification Dataset
Satoru Uchida | Shohei Takada | Yuki Arase

pdf bib

CLARIN: Towards FAIR and Responsible Data Science Using Language Resources
Franciska de Jong | Bente Maegaard | Koenraad De Smedt | Darja Fišer | Dieter Van Uytvanck

pdf bib

New directions in ELRA activities
Valérie Mapelli | Victoria Arranz | Hélène Mazo | Pawel Kamocki | Vladimir Popescu

pdf bib

A Framework for Multi-Language Service Design with the Language Grid
Donghui Lin | Yohei Murakami | Toru Ishida

pdf bib

Language Technology for Multilingual Europe: An Analysis of a Large-Scale Survey regarding Challenges, Demands, Gaps and Needs
Georg Rehm | Stefanie Hegele

pdf bib

Annotating High-Level Structures of Short Stories and Personal Anecdotes
Boyang Li | Beth Cardier | Tong Wang | Florian Metze

pdf bib

Discovering the Language of Wine Reviews: A Text Mining Account
Els Lefever | Iris Hendrickx | Ilja Croijmans | Antal van den Bosch | Asifa Majid

pdf bib

Toward An Epic Epigraph Graph
Francis Bond | Graham Matthews

pdf bib

An Attribution Relations Corpus for Political News
Edward Newell | Drew Margolin | Derek Ruths

pdf bib

Adapting Serious Game for Fallacious Argumentation to German: Pitfalls, Insights, and Best Practices
Ivan Habernal | Patrick Pauli | Iryna Gurevych

pdf bib

Grounding Gradable Adjectives through Crowdsourcing
Rebecca Sharp | Mithun Paul | Ajay Nagesh | Dane Bell | Mihai Surdeanu

pdf bib

Chahta Anumpa: A multimodal corpus of the Choctaw Language
Jacqueline Brixey | Eli Pincus | Ron Artstein

pdf bib

Researching Less-Resourced Languages – the DigiSami Corpus
Kristiina Jokinen

pdf bib

Designing a Collaborative Process to Create Bilingual Dictionaries of Indonesian Ethnic Languages
Arbi Haza Nasution | Yohei Murakami | Toru Ishida

pdf bib

Constructing a Lexicon of Relational Nouns
Edward Newell | Jackie C.K. Cheung

pdf bib

Creating Large-Scale Multilingual Cognate Tables
Winston Wu | David Yarowsky

pdf bib

Lexical Profiling of Environmental Corpora
Patrick Drouin | Marie-Claude L’Homme | Benoît Robichaud

pdf bib

Linking, Searching, and Visualizing Entities in Wikipedia
Marcus Klang | Pierre Nugues

pdf bib

Learning to Map Natural Language Statements into Knowledge Base Representations for Knowledge Base Construction
Chin-Ho Lin | Hen-Hsen Huang | Hsin-Hsi Chen

pdf bib

Building a Knowledge Graph from Natural Language Definitions for Interpretable Text Entailment Recognition
Vivian Silva | André Freitas | Siegfried Handschuh

pdf bib

Combining rule-based and embedding-based approaches to normalize textual entities with an ontology
Arnaud Ferré | Louise Deléger | Pierre Zweigenbaum | Claire Nédellec

pdf bib

Multilingual Parallel Corpus for Global Communication Plan
Kenji Imamura | Eiichiro Sumita

pdf bib

A Large Parallel Corpus of Full-Text Scientific Articles
Felipe Soares | Viviane Moreira | Karin Becker

pdf bib

NegPar: A parallel corpus annotated for negation
Qianchu Liu | Federico Fancellu | Bonnie Webber

pdf bib

The IIT Bombay English-Hindi Parallel Corpus
Anoop Kunchukuttan | Pratik Mehta | Pushpak Bhattacharyya

pdf bib

Extracting an English-Persian Parallel Corpus from Comparable Corpora
Akbar Karimi | Ebrahim Ansari | Bahram Sadeghi Bigham

pdf bib

Learning Word Vectors for 157 Languages
Edouard Grave | Piotr Bojanowski | Prakhar Gupta | Armand Joulin | Tomas Mikolov

pdf bib

A Diachronic Corpus for Literary Style Analysis
Carmen Klaussner | Carl Vogel

pdf bib

Text Simplification from Professionally Produced Corpora
Carolina Scarton | Gustavo Paetzold | Lucia Specia

pdf bib

Intertextual Correspondence for Integrating Corpora
Jacky Visser | Rory Duthie | John Lawrence | Chris Reed

pdf bib

A Gold Anaphora Annotation Layer on an Eye Movement Corpus
Olga Seminck | Pascal Amsili

pdf bib

Annotating Zero Anaphora for Question Answering
Yoshihiko Asao | Ryu Iida | Kentaro Torisawa

pdf bib

TAP-DLND 1.0: A Corpus for Document Level Novelty Detection
Tirthankar Ghosal | Amitra Salam | Swati Tiwari | Asif Ekbal | Pushpak Bhattacharyya

pdf bib

A Corpus for Multilingual Document Classification in Eight Languages
Holger Schwenk | Xian Li

pdf bib

Analyzing Citation-Distance Networks for Evaluating Publication Impact
Drahomira Herrmannova | Petr Knoth | Robert Patton

pdf bib

Annotating Educational Questions for Student Response Analysis
Andreea Godea | Rodney Nielsen

pdf bib

Incorporating Global Contexts into Sentence Embedding for Relational Extraction at the Paragraph Level with Distant Supervision
Eun-kyung Kim | Key-Sun Choi

pdf bib

MCScript: A Novel Dataset for Assessing Machine Comprehension Using Script Knowledge
Simon Ostermann | Ashutosh Modi | Michael Roth | Stefan Thater | Manfred Pinkal

pdf bib

A Neural Network Based Model for Loanword Identification in Uyghur
Chenggang Mi | Yating Yang | Lei Wang | Xi Zhou | Tonghai Jiang

pdf bib

Revisiting Distant Supervision for Relation Extraction
Tingsong Jiang | Jing Liu | Chin-Yew Lin | Zhifang Sui

pdf bib

Incorporating Contextual Information for Language-Independent, Dynamic Disambiguation Tasks
Tobias Staron | Özge Alaçam | Wolfgang Menzel

pdf bib

Overcoming the Long Tail Problem: A Case Study on CO2-Footprint Estimation of Recipes using Information Retrieval
Melanie Geiger | Martin Braschler

pdf bib

Comparison of Pun Detection Methods Using Japanese Pun Corpus
Motoki Yatsu | Kenji Araki

pdf bib

A vision-grounded dataset for predicting typical locations for verbs
Nelson Mukuze | Anna Rohrbach | Vera Demberg | Bernt Schiele

pdf bib

Creating dialect sub-corpora by clustering: a case in Japanese for an adaptive method
Yo Sato | Kevin Heffernan

pdf bib

A Fast and Flexible Webinterface for Dialect Research in the Low Countries
Roeland van Hout | Nicoline van der Sijs | Erwin Komen | Henk van den Heuvel

pdf bib

Arabic Dialect Identification in the Context of Bivalency and Code-Switching
Mahmoud El-Haj | Paul Rayson | Mariam Aboelezz

pdf bib

Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach
Houda Saâdane | Hosni Seffih | Christian Fluhr | Khalid Choukri | Nasredine Semmar

pdf bib

Shami: A Corpus of Levantine Arabic Dialects
Kathrein Abu Kwaik | Motaz Saad | Stergios Chatzikyriakidis | Simon Dobnik

pdf bib

You Tweet What You Speak: A City-Level Dataset of Arabic Dialects
Muhammad Abdul-Mageed | Hassan Alhuzali | Mohamed Elaraby

pdf bib

Visualizing the “Dictionary of Regionalisms of France” (DRF)
Ada Wan

pdf bib

DART: A Large Dataset of Dialectal Arabic Tweets
Israa Alsarsour | Esraa Mohamed | Reem Suwaileh | Tamer Elsayed

pdf bib

Classification of Closely Related Sub-dialects of Arabic Using Support-Vector Machines
Samantha Wray

pdf bib

Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features
Gregor Wiedemann | Gerhard Heyer

pdf bib

Automating Document Discovery in the Systematic Review Process: How to Use Chaff to Extract Wheat
Christopher Norman | Mariska Leeflang | Pierre Zweigenbaum | Aurélie Névéol

pdf bib

Two Multilingual Corpora Extracted from the Tenders Electronic Daily for Machine Learning and Machine Translation Applications
Oussama Ahmia | Nicolas Béchet | Pierre-François Marteau

pdf bib

Using Adversarial Examples in Natural Language Processing
Petr Bělohlávek | Ondřej Plátek | Zdeněk Žabokrtský | Milan Straka

pdf bib

Modeling Trolling in Social Media Conversations
Luis Gerardo Mojica de la Vega | Vincent Ng

pdf bib

Automatic Annotation of Semantic Term Types in the Complete ACL Anthology Reference Corpus
Anne-Kathrin Schumann | Héctor Martínez Alonso

pdf bib

Annotated Corpus of Scientific Conference’s Homepages for Information Extraction
Piotr Andruszkiewicz | Rafał Hazan

pdf bib

Improving Unsupervised Keyphrase Extraction using Background Knowledge
Yang Yu | Vincent Ng

pdf bib

WikiDragon: A Java Framework For Diachronic Content And Network Analysis Of MediaWikis
Rüdiger Gleim | Alexander Mehler | Sung Y. Song

pdf bib

Studying Muslim Stereotyping through Microportrait Extraction
Antske Fokkens | Nel Ruigrok | Camiel Beukeboom | Sarah Gagestein | Wouter van Atteveldt

pdf bib

Interpersonal Relationship Labels for the CALLHOME Corpus
Denys Katerenchuk | David Guy Brizan | Andrew Rosenberg

pdf bib

Text Mining for History: first steps on building a large dataset
Suemi Higuchi | Cláudia Freitas | Bruno Cuconato | Alexandre Rademaker

pdf bib

Building Evaluation Datasets for Cultural Microblog Retrieval
Lorraine Goeuriot | Josiane Mothe | Philippe Mulhem | Eric SanJuan

pdf bib

Training and Adapting Multilingual NMT for Less-resourced and Morphologically Rich Languages
Matīss Rikters | Mārcis Pinnis | Rihards Krišlauks

pdf bib

Cross-lingual Terminology Extraction for Translation Quality Estimation
Yu Yuan | Yuze Gao | Yue Zhang | Serge Sharoff

pdf bib

Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German
Pierre-Edouard Honnet | Andrei Popescu-Belis | Claudiu Musat | Michael Baeriswyl

pdf bib

Improving domain-specific SMT for low-resourced languages using data from different domains
Fathima Farhath | Pranavan Theivendiram | Surangika Ranathunga | Sanath Jayasena | Gihan Dias

pdf bib

Discovering Parallel Language Resources for Training MT Engines
Vassilis Papavassiliou | Prokopis Prokopidis | Stelios Piperidis

pdf bib

A fine-grained error analysis of NMT, SMT and RBMT output for English-to-Dutch
Laura Van Brussel | Arda Tezcan | Lieve Macken

pdf bib

Collection and Analysis of Code-switch Egyptian Arabic-English Speech Corpus
Injy Hamed | Mohamed Elmahdy | Slim Abdennadher

pdf bib

Multimodal Lexical Translation
Chiraag Lala | Lucia Specia

pdf bib

Literality and cognitive effort: Japanese and Spanish
Isabel Lacruz | Michael Carl | Masaru Yamada

pdf bib

Evaluation of Machine Translation Performance Across Multiple Genres and Languages
Marlies van der Wees | Arianna Bisazza | Christof Monz

pdf bib

A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
Pierre Zweigenbaum | Serge Sharoff | Reinhard Rapp

pdf bib

Manual vs Automatic Bitext Extraction
Aibek Makazhanov | Bagdat Myrzakhmetov | Zhenisbek Assylbekov

pdf bib

Manually Annotated Corpus of Polish Texts Published between 1830 and 1918
Witold Kieraś | Marcin Woliński

pdf bib

Massively Translingual Compound Analysis and Translation Discovery
Winston Wu | David Yarowsky

pdf bib

Building a Morphological Treebank for German from a Linguistic Database
Petra Steiner | Josef Ruppenhofer

pdf bib

Baselines and Test Data for Cross-Lingual Inference
Željko Agić | Natalie Schluter

pdf bib

CATS: A Tool for Customized Alignment of Text Simplification Corpora
Sanja Štajner | Marc Franco-Salvador | Paolo Rosso | Simone Paolo Ponzetto

pdf bib

KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus
Thanh-Le Ha | Jan Niehues | Matthias Sperber | Ngoc Quan Pham | Alexander Waibel

pdf bib

SemR-11: A Multi-Lingual Gold-Standard for Semantic Similarity and Relatedness for Eleven Languages
Siamak Barzegar | Brian Davis | Manel Zarrouk | Siegfried Handschuh | Andre Freitas

pdf bib

Part-of-Speech Tagging for Arabic Gulf Dialect Using Bi-LSTM
Randah Alharbi | Walid Magdy | Kareem Darwish | Ahmed AbdelAli | Hamdy Mubarak

pdf bib

Web-based Annotation Tool for Inflectional Language Resources
Abdulrahman Alosaimy | Eric Atwell

pdf bib

HiNTS: A Tagset for Middle Low German
Fabian Barteld | Sarah Ihden | Katharina Dreessen | Ingrid Schröder

pdf bib

Leveraging Lexical Resources and Constraint Grammar for Rule-Based Part-of-Speech Tagging in Welsh
Steven Neale | Kevin Donnelly | Gareth Watkins | Dawn Knight

pdf bib

Graph Based Semi-Supervised Learning Approach for Tamil POS tagging
Mokanarangan Thayaparan | Surangika Ranathunga | Uthayasanker Thayasivam

pdf bib

What Causes the Differences in Communication Styles? A Multicultural Study on Directness and Elaborateness
Juliana Miehle | Wolfgang Minker | Stefan Ultes

pdf bib

Exploring Conversational Language Generation for Rich Content about Hotels
Marilyn Walker | Albry Smither | Shereen Oraby | Vrindavan Harrison | Hadar Shemtov

pdf bib

Identification of Personal Information Shared in Chat-Oriented Dialogue
Sarah Fillwock | David Traum

pdf bib

A Vietnamese Dialog Act Corpus Based on ISO 24617-2 standard
Thi-Lan Ngo | Pham Khac Linh | Hideaki Takeda

pdf bib

Annotating Reflections for Health Behavior Change Therapy
Nishitha Guntakandla | Rodney Nielsen

pdf bib

Annotating Attribution Relations in Arabic
Amal Alsaif | Tasniem Alyahya | Madawi Alotaibi | Huda Almuzaini | Abeer Algahtani

pdf bib

An Assessment of Explicit Inter- and Intra-sentential Discourse Connectives in Turkish Discourse Bank
Deniz Zeyrek | Murathan Kurfalı

pdf bib

Compilation of Corpora for the Study of the Information Structure–Prosody Interface
Alicia Burga | Mónica Domínguez | Mireia Farrús | Leo Wanner

pdf bib

Preliminary Analysis of Embodied Interactions between Science Communicators and Visitors Based on a Multimodal Corpus of Japanese Conversations in a Science Museum
Rui Sakaida | Ryosaku Makino | Mayumi Bono

pdf bib

Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations
Yudai Kishimoto | Shinnosuke Sawada | Yugo Murawaki | Daisuke Kawahara | Sadao Kurohashi

pdf bib

Persian Discourse Treebank and coreference corpus
Azadeh Mirzaei | Pegah Safari

pdf bib

Automatic Labeling of Problem-Solving Dialogues for Computational Microgenetic Learning Analytics
Yuanliang Meng | Anna Rumshisky | Florence Sullivan

pdf bib

Increasing Argument Annotation Reproducibility by Using Inter-annotator Agreement to Improve Guidelines
Milagro Teruel | Cristian Cardellino | Fernando Cardellino | Laura Alonso Alemany | Serena Villata

pdf bib

Semi-Supervised Clustering for Short Answer Scoring
Andrea Horbach | Manfred Pinkal

pdf bib

Analyzing Vocabulary Commonality Index Using Large-scaled Database of Child Language Development
Yan Cao | Yasuhiro Minami | Yuko Okumura | Tessei Kobayashi

pdf bib

The ICoN Corpus of Academic Written Italian (L1 and L2)
Mirko Tavosanis | Federica Cominetti

pdf bib

Revita: a Language-learning Platform at the Intersection of ITS and CALL
Anisia Katinskaia | Javad Nouri | Roman Yangarber

pdf bib

The Distribution and Prosodic Realization of Verb Forms in German Infant-Directed Speech
Bettina Braun | Katharina Zahner

pdf bib

Cross-linguistically Small World Networks are Ubiquitous in Child-directed Speech
Steven Moran | Danica Pajović | Sabine Stoll

pdf bib

L1-L2 Parallel Treebank of Learner Chinese: Overused and Underused Syntactic Structures
Keying Li | John Lee

pdf bib

The Use of Text Alignment in Semi-Automatic Error Analysis: Use Case in the Development of the Corpus of the Latvian Language Learners
Roberts Darģis | Ilze Auziņa | Kristīne Levāne-Petrova

pdf bib

Error annotation in a Learner Corpus of Portuguese
Iria del Río | Amália Mendes

pdf bib

An SLA Corpus Annotated with Pedagogically Relevant Grammatical Structures
Leonardo Zilio | Rodrigo Wilkens | Cédrick Fairon

pdf bib

Portable Spelling Corrector for a Less-Resourced Language: Amharic
Andargachew Mekonnen Gezmu | Andreas Nürnberger | Binyam Ephrem Seyoum

pdf bib

A Speaking Atlas of the Regional Languages of France
Philippe Boula de Mareüil | Albert Rilliard | Frédéric Vernier

pdf bib

Towards Language Technology for Mi’kmaq
Anant Maheshwari | Léo Bouscarrat | Paul Cook

pdf bib

Pronunciation Dictionaries for the Alsatian Dialects to Analyze Spelling and Phonetic Variation
Lucie Steiblé | Delphine Bernhard

pdf bib

ChAnot: An Intelligent Annotation Tool for Indigenous and Highly Agglutinative Languages in Peru
Rodolfo Mercado-Gonzales | José Pereira-Noriega | Marco Sobrevilla | Arturo Oncevay

pdf bib

The DLDP Survey on Digital Use and Usability of EU Regional and Minority Languages
Claudia Soria | Valeria Quochi | Irene Russo

pdf bib

ASR for Documenting Acutely Under-Resourced Indigenous Languages
Robbie Jimerson | Emily Prud’hommeaux

pdf bib

Building a Sentiment Corpus of Tweets in Brazilian Portuguese
Henrico Brum | Maria das Graças Volpe Nunes

pdf bib

‘Aye’ or ‘No’? Speech-level Sentiment Analysis of Hansard UK Parliamentary Debate Transcripts
Gavin Abercrombie | Riza Batista-Navarro

pdf bib

Scalable Visualisation of Sentiment and Stance
Jon Chamberlain | Udo Kruschwitz | Orland Hoeber

pdf bib

SenSALDO: Creating a Sentiment Lexicon for Swedish
Jacobo Rouces | Nina Tahmasebi | Lars Borin | Stian Rødven Eide

pdf bib

Application and Analysis of a Multi-layered Scheme for Irony on the Italian Twitter Corpus TWITTIRÒ
Alessandra Teresa Cignarella | Cristina Bosco | Viviana Patti | Mirko Lai

pdf bib

Classifier-based Polarity Propagation in a WordNet
Jan Kocoń | Arkadiusz Janz | Maciej Piasecki

pdf bib

IPSL: A Database of Iconicity Patterns in Sign Languages. Creation and Use
Vadim Kimmelman | Anna Klezovich | George Moroz

pdf bib

Sign Languages and the Online World: Online Dictionaries & Lexicostatistics
Shi Yu | Carlo Geraci | Natasha Abner

pdf bib

Elicitation protocol and material for a corpus of long prepared monologues in Sign Language
Michael Filhol | Mohamed Nassime Hadjadj

pdf bib

Deep JSLC: A Multimodal Corpus Collection for Data-driven Generation of Japanese Sign Language Expressions
Heike Brock | Kazuhiro Nakadai

pdf bib

Modeling French Sign Language: a proposal for a semantically compositional system
Mohamed Nassime Hadjadj | Michael Filhol | Annelies Braffort

pdf bib

A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks
Arif Khan | Ingmar Steiner | Yusuke Sugano | Andreas Bulling | Ross Macdonald

pdf bib

Statistical Analysis of Missing Translation in Simultaneous Interpretation Using A Large-scale Bilingual Speech Corpus
Zhongxi Cai | Koichiro Ryu | Shigeki Matsubara

pdf bib

SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis
Aghilas Sini | Damien Lolive | Gaëlle Vidal | Marie Tahon | Élisabeth Delais-Roussarie

pdf bib

Increasing the Accessibility of Time-Aligned Speech Corpora with Spokes Mix
Piotr Pęzik

pdf bib

The MonPaGe_HA Database for the Documentation of Spoken French Throughout Adulthood
Cécile Fougeron | Véronique Delvaux | Lucie Ménard | Marina Laganaro

pdf bib

Bringing Order to Chaos: A Non-Sequential Approach for Browsing Large Sets of Found Audio Data
Per Fallgren | Zofia Malisz | Jens Edlund

pdf bib

CoLoSS: Cognitive Load Corpus with Speech and Performance Data from a Symbol-Digit Dual-Task
Robert Herms | Maria Wirzberger | Maximilian Eibl | Günter Daniel Rey

pdf bib

VAST: A Corpus of Video Annotation for Speech Technologies
Jennifer Tracey | Stephanie Strassel

pdf bib

Enriching a Lexicon of Discourse Connectives with Corpus-based Data
Anna Feltracco | Elisabetta Jezek | Bernardo Magnini

pdf bib

SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain
Carolina Scarton | Gustavo Paetzold | Lucia Specia

pdf bib

The brWaC Corpus: A New Open Resource for Brazilian Portuguese
Jorge A. Wagner Filho | Rodrigo Wilkens | Marco Idiart | Aline Villavicencio

pdf bib

Czech Text Document Corpus v 2.0
Pavel Král | Ladislav Lenc

pdf bib

Corpora of Typical Sentences
Lydia Müller | Uwe Quasthoff | Maciej Sumalvico

pdf bib

The German Reference Corpus DeReKo: New Developments – New Opportunities
Marc Kupietz | Harald Lüngen | Paweł Kamocki | Andreas Witt

pdf bib

Risamálheild: A Very Large Icelandic Text Corpus
Steinþór Steingrímsson | Sigrún Helgadóttir | Eiríkur Rögnvaldsson | Starkaður Barkarson | Jón Guðnason

pdf bib

TriMED: A Multilingual Terminological Database
Federica Vezzani | Giorgio Maria Di Nunzio | Geneviève Henrot

pdf bib

Preparation and Usage of Xhosa Lexicographical Data for a Multilingual, Federated Environment
Sonja Bosch | Thomas Eckart | Bettina Klimek | Dirk Goldhahn | Uwe Quasthoff

pdf bib

A Lexicon of Discourse Markers for Portuguese – LDM-PT
Amália Mendes | Iria del Rio | Manfred Stede | Felix Dombek

pdf bib

One Language to rule them all: modelling Morphological Patterns in a Large Scale Italian Lexicon with SWRL
Fahad Khan | Andrea Bellandi | Francesca Frontini | Monica Monachini

pdf bib

Metaphor Suggestions based on a Semantic Metaphor Repository
Gerard de Melo

pdf bib

The Linguistic Category Model in Polish (LCM-PL)
Aleksander Wawer | Justyna Sarzyńska

pdf bib

WordNet-Shp: Towards the Building of a Lexical Database for a Peruvian Minority Language
Diego Maguiño-Valencia | Arturo Oncevay-Marcos | Marco A. Sobrevilla Cabezudo

pdf bib

Retrieving Information from the French Lexical Network in RDF/OWL Format
Alexsandro Fonseca | Fatiha Sadat | François Lareau

pdf bib

Transforming Wikipedia into a Large-Scale Fine-Grained Entity Type Corpus
Abbas Ghaddar | Philippe Langlais

pdf bib

BiLSTM-CRF for Persian Named-Entity Recognition ArmanPersoNERCorpus: the First Entity-Annotated Persian Dataset
Hanieh Poostchi | Ehsan Zare Borzeshi | Massimo Piccardi

pdf bib

Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task
Juyeon Kang | Jungyeul Park

pdf bib

A Corpus Study and Annotation Schema for Named Entity Recognition and Relation Extraction of Business Products
Saskia Schön | Veselina Mironova | Aleksandra Gabryszak | Leonhard Hennig

pdf bib

Portuguese Named Entity Recognition using Conditional Random Fields and Local Grammars
Juliana Pirovani | Elias Oliveira

pdf bib

M-CNER: A Corpus for Chinese Named Entity Recognition in Multi-Domains
Qi Lu | YaoSheng Yang | Zhenghua Li | Wenliang Chen | Min Zhang

pdf bib

SlugNERDS: A Named Entity Recognition Tool for Open Domain Dialogue Systems
Kevin Bowden | Jiaqi Wu | Shereen Oraby | Amita Misra | Marilyn Walker

pdf bib

Transfer Learning for Named-Entity Recognition with Neural Networks
Ji Young Lee | Franck Dernoncourt | Peter Szolovits

pdf bib

ForFun 1.0: Prague Database of Forms and Functions – An Invaluable Resource for Linguistic Research
Marie Mikulová | Eduard Bejček

pdf bib

Errator: a Tool to Help Detect Annotation Errors in the Universal Dependencies Project
Guillaume Wisniewski

pdf bib

SandhiKosh: A Benchmark Corpus for Evaluating Sanskrit Sandhi Tools
Shubham Bhardwaj | Neelamadhav Gantayat | Nikhil Chaturvedi | Rahul Garg | Sumeet Agarwal

pdf bib

Czech Legal Text Treebank 2.0
Vincent Kríž | Barbora Hladká

pdf bib

Test Sets for Chinese Nonlocal Dependency Parsing
Manjuan Duan | William Schuler

pdf bib

Adding Syntactic Annotations to Flickr30k Entities Corpus for Multimodal Ambiguous Prepositional-Phrase Attachment Resolution
Sebastien Delecraz | Alexis Nasr | Frederic Bechet | Benoit Favre

pdf bib

Analyzing Middle High German Syntax with RDF and SPARQL
Christian Chiarcos | Benjamin Kosmehl | Christian Fäth | Maria Sukhareva

pdf bib

Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer
Djamé Seddah | Eric de la Clergerie | Benoît Sagot | Héctor Martínez Alonso | Marie Candito

pdf bib

Universal Dependencies and Quantitative Typological Trends. A Case Study on Word Order
Chiara Alzetta | Felice Dell’Orletta | Simonetta Montemagni | Giulia Venturi

pdf bib

Undersampling Improves Hypernymy Prototypicality Learning
Koki Washio | Tsuneaki Kato

pdf bib

Interoperability of Language-related Information: Mapping the BLL Thesaurus to Lexvo and Glottolog
Vanya Dimitrova | Christian Fäth | Christian Chiarcos | Heike Renner-Westermann | Frank Abromeit

pdf bib

Browsing and Supporting Pluricentric Global Wordnet, or just your Wordnet of Interest
António Branco | Ruben Branco | Chakaveh Saedi | João Silva

pdf bib

Cross-checking WordNet and SUMO Using Meronymy
Javier Álvez | Itziar Gonzalez-Dios | German Rigau

pdf bib

Extended HowNet 2.0 – An Entity-Relation Common-Sense Representation Model
Wei-Yun Ma | Yueh-Yin Shih

pdf bib

The Circumstantial Event Ontology (CEO) and ECB+/CEO: an Ontology and Corpus for Implicit Causal Relations between Events
Roxane Segers | Tommaso Caselli | Piek Vossen

pdf bib

Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger
Mahmoud El-Haj | Paul Rayson | Scott Piao | Jo Knight

pdf bib

Towards a Conversation-Analytic Taxonomy of Speech Overlap
Felix Gervits | Matthias Scheutz

pdf bib

Indian Language Wordnets and their Linkages with Princeton WordNet
Diptesh Kanojia | Kevin Patel | Pushpak Bhattacharyya