Marc Vilain

Also published as: Marc B. Vilain


2009

Sources of Performance in CRF Transfer Training: a Business Name-tagging Case Study
Marc Vilain | Jonathan Huggins | Ben Wellner
Proceedings of the International Conference RANLP-2009

A simple feature-copying approach for long-distance dependencies
Marc Vilain | Jonathan Huggins | Ben Wellner
Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL-2009)

2007

Entity Extraction is a Boring Solved Problem—Or is it?
Marc Vilain | Jennifer Su | Suzi Lubar
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers

2006

Leveraging Machine Readable Dictionaries in Discriminative Sequence Models
Ben Wellner | Marc Vilain
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

Many natural language processing tasks make use of a lexicon – typically the words collected from some annotated training data along with their associated properties. We demonstrate here the utility of corpus-independent lexicons derived from machine readable dictionaries. Lexical information is encoded in the form of features in a Conditional Random Field tagger, providing improved performance in cases where: i) limited training data is available, ii) the data is case-less, and iii) the test data genre or domain is different from that of the training data. We show substantial error reductions, especially on unknown words, for the tasks of part-of-speech tagging and shallow parsing, achieving up to 20% error reduction on Penn TreeBank part-of-speech tagging and up to a 15.7% error reduction for shallow parsing using the CoNLL 2000 data. Our results here point towards a simple but effective methodology for increasing the adaptability of text processing systems by training models with annotated data in one genre augmented with general lexical information or lexical information pertinent to the target genre (or domain).

Maytag: A Multi-Staged Approach to Identifying Complex Events in Textual Data
Conrad Chang | Lisa Ferro | John Gibson | Janet Hitzeman | Suzi Lubar | Justin Palmer | Sean Munson | Marc Vilain | Benjamin Wellner
Demonstrations

2004

Building part-of-speech Corpora Through Histogram Hopping
Marc Vilain
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

2000

Phrase Parsing with Rule Sequence Processors: an Application to the Shared CoNLL Task
Marc Vilain | David Day
Fourth Conference on Computational Natural Language Learning and the Second Learning Language in Logic Workshop

1999

Learning Transformation Rules to Find Grammatical Relations
Lisa Ferro | Marc Vilain | Alexander Yeh
EACL 1999: CoNLL-99 Computational Natural Language Learning

1998

Some Properties of Preposition and Subordinate Conjunction Attachments
Alexander S. Yeh | Marc B. Vilain
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2

Some Properties of Preposition and Subordinate Conjunction Attachments
Alexander S. Yeh | Marc B. Vilain
COLING 1998 Volume 2: The 17th International Conference on Computational Linguistics

1997

Mixed-Initiative Development of Language Processing Systems
David Day | John Aberdeen | Lynette Hirschman | Robyn Kozierok | Patricia Robinson | Marc Vilain
Fifth Conference on Applied Natural Language Processing

1996

Finite-state phrase parsing by rule sequences
Marc Vilain | David Day
COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics

MITRE: Description of the Alembic System as Used in MET
John Aberdeen | John Burger | David Day | Lynette Hirschman | David Palmer | Patricia Robinson | Marc Vilain
TIPSTER TEXT PROGRAM PHASE II: Proceedings of a Workshop held at Vienna, Virginia, May 6-8, 1996

1995

A Model-Theoretic Coreference Scoring Scheme
Marc Vilain | John Burger | John Aberdeen | Dennis Connolly | Lynette Hirschman
Sixth Message Understanding Conference (MUC-6): Proceedings of a Conference Held in Columbia, Maryland, November 6-8, 1995

MITRE: Description of the Alembic System Used for MUC-6
John Aberdeen | John Burger | David Day | Lynette Hirschman | Patricia Robinson | Marc Vilain
Sixth Message Understanding Conference (MUC-6): Proceedings of a Conference Held in Columbia, Maryland, November 6-8, 1995

1993

Validation of Terminological Inference in an Information Extraction Task
Marc Vilain
Human Language Technology: Proceedings of a Workshop Held at Plainsboro, New Jersey, March 21-24, 1993

MITRE-Bedford: Description of the Alembic System as Used for MUC-5
John Aberdeen | John Burger | Dennis Connolly | Susan Roberts | Marc Vilain
Fifth Message Understanding Conference (MUC-5): Proceedings of a Conference Held in Baltimore, Maryland, August 25-27, 1993

1992

MITRE-Bedford ALEMBIC: MUC-4 Test Results and Analysis
John Aberdeen | John Burger | Dennis Connolly | Susan Roberts | Marc Vilain
Fourth Message Understanding Conference (MUC-4): Proceedings of a Conference Held in McLean, Virginia, June 16-18, 1992

MITRE-Bedford: Description of the ALEMBIC System as Used for MUC-4
John Aberdeen | John Burger | Dennis Connolly | Susan Roberts | Marc Vilain
Fourth Message Understanding Conference (MUC-4): Proceedings of a Conference Held in McLean, Virginia, June 16-18, 1992