Erwin Marsi


2017

pdf bib
NTNU-1@ScienceIE at SemEval-2017 Task 10: Identifying and Labelling Keyphrases with Conditional Random Fields
Erwin Marsi | Utpal Kumar Sikdar | Cristina Marco | Biswanath Barik | Rune Sætre
Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)

We present NTNU’s systems for Task A (prediction of keyphrases) and Task B (labelling as Material, Process or Task) at SemEval 2017 Task 10: Extracting Keyphrases and Relations from Scientific Publications (Augenstein et al., 2017). Our approach relies on supervised machine learning using Conditional Random Fields. Our system yields a micro F-score of 0.34 for Tasks A and B combined on the test data. For Task C (relation extraction), we relied on an independently developed system described in (Barik and Marsi, 2017). For the full Scenario 1 (including relations), our approach reaches a micro F-score of 0.33 (5th place). Here we describe our systems, report results and discuss errors.

pdf bib
NTNU-2 at SemEval-2017 Task 10: Identifying Synonym and Hyponym Relations among Keyphrases in Scientific Documents
Biswanath Barik | Erwin Marsi
Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)

This paper presents our relation extraction system for subtask C of SemEval-2017 Task 10: ScienceIE. Assuming that the keyphrases are already annotated in the input data, our work explores a wide range of linguistic features, applies various feature selection techniques, optimizes the hyper parameters and class weights and experiments with different problem formulations (single classification model vs individual classifiers for each keyphrase type, single-step classifier vs pipeline classifier for hyponym relations). Performance of five popular classification algorithms are evaluated for each problem formulation along with feature selection. The best setting achieved an F1 score of 71.0% for synonym and 30.0% for hyponym relation on the test data.

pdf bib
Marine Variable Linker: Exploring Relations between Changing Variables in Marine Science Literature
Erwin Marsi | Pinar Pinar Øzturk | Murat V. Ardelan
Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics

We report on a demonstration system for text mining of literature in marine science and related disciplines. It automatically extracts variables (“CO2”) involved in events of change/increase/decrease (“increasing CO2”), as well as co-occurrence and causal relations among these events (“increasing CO2 causes a decrease in pH in seawater”), resulting in a big knowledge graph. A web-based graphical user interface targeted at marine scientists facilitates searching, browsing and visualising events and their relations in an interactive way.

2016

pdf bib
IDI@NTNU at SemEval-2016 Task 6: Detecting Stance in Tweets Using Shallow Features and GloVe Vectors for Word Representation
Henrik Bøhler | Petter Asla | Erwin Marsi | Rune Sætre
Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)

2015

pdf bib
Extraction and generalisation of variables from scientific publications
Erwin Marsi | Pinar Öztürk
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing

2014

pdf bib
Care Episode Retrieval
Hans Moen | Erwin Marsi | Filip Ginter | Laura-Maria Murtola | Tapio Salakoski | Sanna Salanterä
Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis (Louhi)

2013

pdf bib
Improving Word Translation Disambiguation by Capturing Multiword Expressions with Dictionaries
Lars Bungum | Björn Gambäck | André Lynum | Erwin Marsi
Proceedings of the 9th Workshop on Multiword Expressions

pdf bib
Towards Dynamic Word Sense Discrimination with Random Indexing
Hans Moen | Erwin Marsi | Björn Gambäck
Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality

pdf bib
NTNU-CORE: Combining strong features for semantic similarity
Erwin Marsi | Hans Moen | Lars Bungum | Gleb Sizov | Björn Gambäck | André Lynum
Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity

2011

pdf bib
Comparing Phrase-based and Syntax-based Paraphrase Generation
Sander Wubben | Erwin Marsi | Antal van den Bosch | Emiel Krahmer
Proceedings of the Workshop on Monolingual Text-To-Text Generation

2010

pdf bib
Automatic analysis of semantic similarity in comparable text through syntactic tree matching
Erwin Marsi | Emiel Krahmer
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)

2009

pdf bib
Is Sentence Compression an NLG task?
Erwin Marsi | Emiel Krahmer | Iris Hendrickx | Walter Daelemans
Proceedings of the 12th European Workshop on Natural Language Generation (ENLG 2009)

pdf bib
Clustering and Matching Headlines for Automatic Paraphrase Acquisition
Sander Wubben | Antal van den Bosch | Emiel Krahmer | Erwin Marsi
Proceedings of the 12th European Workshop on Natural Language Generation (ENLG 2009)

pdf bib
Reducing Redundancy in Multi-document Summarization Using Lexical Semantic Similarity
Iris Hendrickx | Walter Daelemans | Erwin Marsi | Emiel Krahmer
Proceedings of the 2009 Workshop on Language Generation and Summarisation (UCNLG+Sum 2009)

2008

pdf bib
Query-based Sentence Fusion is Better Defined and Leads to More Preferred Results than Generic Sentence Fusion
Emiel Krahmer | Erwin Marsi | Paul van Pelt
Proceedings of ACL-08: HLT, Short Papers

2007

pdf bib
Dependency-based paraphrasing for recognizing textual entailment
Erwin Marsi | Emiel Krahmer | Wauter Bosma
Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing

2006

pdf bib
CoNLL-X Shared Task on Multilingual Dependency Parsing
Sabine Buchholz | Erwin Marsi
Proceedings of the Tenth Conference on Computational Natural Language Learning (CoNLL-X)

2005

pdf bib
Memory-Based Morphological Analysis Generation and Part-of-Speech Tagging of Arabic
Erwin Marsi | Antal van den Bosch | Abdelhadi Soudi
Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages

pdf bib
Classification of Semantic Relations by Humans and Machines
Erwin Marsi | Emiel Krahmer
Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment

pdf bib
Explorations in Sentence Fusion
Erwin Marsi | Emiel Krahmer
Proceedings of the Tenth European Workshop on Natural Language Generation (ENLG-05)

2003

pdf bib
Learning to Predict Pitch Accents and Prosodic Boundaries in Dutch
Erwin Marsi | Martin Reynaert | Antal van den Bosch | Walter Daelemans | Véronique Hoste
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics

pdf bib
Learning PP attachment for filtering prosodic phrasing
Olga van Herwijnen | Jacques Terken | Antal van den Bosch | Erwin Marsi
10th Conference of the European Chapter of the Association for Computational Linguistics

1998

pdf bib
Introducing Maximal Variation in Text Planning for Small Domains
Erwin Marsi
Natural Language Generation