Sandrine Zufferey
2023
Exploring the Sensitivity to Alternative Signals of Coherence Relations
Ekaterina Tskhovrebova | Sandrine Zufferey | Pascal Gygax
Dialogue Discourse Volume 14
Ekaterina Tskhovrebova | Sandrine Zufferey | Pascal Gygax
Dialogue Discourse Volume 14
Coherence relations between elements of discourse can be signaled by linguistic devices such as connectives and/or alternative signals. While the use and comprehension of connectives have been studied in different categories of speakers, less is known about the functioning of alternative signals of coherence relations, especially in younger populations. In the current study, we aim to examine the sensitivity of French-speaking teenagers to the alternative signals of list relation (words such as plusieurs ‘several’ and différents ‘various’), combined with connectives varying in frequency and signaling two types of coherence relations (addition: en plus, en outre; consequence: donc, ainsi). Our results reveal that, as early as in teenage years, speakers are sensitive (i.e., they produce list continuation sentences) to alternative signals of list relation. Furthermore, the inference of list relation is not significantly changed when an alternative signal is combined with the more frequent additive connective en plus. However, this inference is inhibited by the less frequent additive connective en outre, and is almost completely hindered by the consequence connectives donc and ainsi. Overall, these results show that alternative list signals are an important source for the inference of the list relation, even in the presence of more salient signals of coherence such as connectives.
2015
Factors Influencing the Implicitation of Discourse Relations across Languages
Jet Hoek | Sandrine Zufferey
Proceedings of the 11th Joint ACL-ISO Workshop on Interoperable Semantic Annotation (ISA-11)
Jet Hoek | Sandrine Zufferey
Proceedings of the 11th Joint ACL-ISO Workshop on Interoperable Semantic Annotation (ISA-11)
Using a unified taxonomy to annotate discourse markers in speech and writing
Ludivine Crible | Sandrine Zufferey
Proceedings of the 11th Joint ACL-ISO Workshop on Interoperable Semantic Annotation (ISA-11)
Ludivine Crible | Sandrine Zufferey
Proceedings of the 11th Joint ACL-ISO Workshop on Interoperable Semantic Annotation (ISA-11)
2013
Annotating the meaning of discourse connectives by looking at their translation: The translation-spotting technique
Bruno Cartoni | Sandrine Zufferey | Thomas Meyer
Dialogue Discourse Volume 4
Bruno Cartoni | Sandrine Zufferey | Thomas Meyer
Dialogue Discourse Volume 4
The various meanings of discourse connectives like while and however are difficult to identify and annotate, even for trained human annotators. This problem is all the more important that connectives are salient textual markers of cohesion and need to be correctly interpreted for many NLP applications. In this paper, we suggest an alternative route to reach a reliable annotation of connectives, by making use of the information provided by their translation in large parallel corpora. This method thus replaces the difficult explicit reasoning involved in traditional sense annotation by an empirical clustering of the senses emerging from the translations. We argue that this method has the advantage of providing more reliable reference data than traditional sense annotation. In addition, its simplicity allows for the rapid constitution of large annotated datasets.
2012
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
Andrei Popescu-Belis | Thomas Meyer | Jeevanthi Liyanapathirana | Bruno Cartoni | Sandrine Zufferey
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Andrei Popescu-Belis | Thomas Meyer | Jeevanthi Liyanapathirana | Bruno Cartoni | Sandrine Zufferey
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
This paper describes methods and results for the annotation of two discourse-level phenomena, connectives and pronouns, over a multilingual parallel corpus. Excerpts from Europarl in English and French have been annotated with disambiguation information for connectives and pronouns, for about 3600 tokens. This data is then used in several ways: for cross-linguistic studies, for training automatic disambiguation software, and ultimately for training and testing discourse-aware statistical machine translation systems. The paper presents the annotation procedures and their results in detail, and overviews the first systems trained on the annotated resources and their use for machine translation.
2011
How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives
Bruno Cartoni | Sandrine Zufferey | Thomas Meyer | Andrei Popescu-Belis
Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web
Bruno Cartoni | Sandrine Zufferey | Thomas Meyer | Andrei Popescu-Belis
Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web
Multilingual Annotation and Disambiguation of Discourse Connectives for Machine Translation
Thomas Meyer | Andrei Popescu-Belis | Sandrine Zufferey | Bruno Cartoni
Proceedings of the SIGDIAL 2011 Conference
Thomas Meyer | Andrei Popescu-Belis | Sandrine Zufferey | Bruno Cartoni
Proceedings of the SIGDIAL 2011 Conference
2007
Contrasting the Automatic Identification of Two Discourse Markers in Multiparty Dialogues
Andrei Popescu-Belis | Sandrine Zufferey
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue
Andrei Popescu-Belis | Sandrine Zufferey
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue