Syntactic Search by Example

Micah Shlain, Hillel Taub-Tabib, Shoval Sadde, Yoav Goldberg


Abstract
We present a system that allows a user to search a large linguistically annotated corpus using syntactic patterns over dependency graphs. In contrast to previous attempts at this, we introduce a lightweight query language that does not require the user to know the details of the underlying syntactic representations. Instead, the user queries the corpus by providing an example sentence coupled with simple markup. Search is performed at interactive speed thanks to an efficient linguistic graph-indexing and retrieval engine. This allows for rapid exploration, development, and refinement of syntax-based queries. We demonstrate the system using queries over two corpora: the English Wikipedia and a collection of English PubMed abstracts. A demo of the Wikipedia system is available at https://allenai.github.io/spike/.
Anthology ID:
2020.acl-demos.3
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
Month:
July
Year:
2020
Address:
Online
Editors:
Asli Celikyilmaz, Tsung-Hsien Wen
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
17–23
URL:
https://aclanthology.org/2020.acl-demos.3
DOI:
10.18653/v1/2020.acl-demos.3
Cite (ACL):
Micah Shlain, Hillel Taub-Tabib, Shoval Sadde, and Yoav Goldberg. 2020. Syntactic Search by Example. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 17–23, Online. Association for Computational Linguistics.
Cite (Informal):
Syntactic Search by Example (Shlain et al., ACL 2020)
PDF:
https://aclanthology.org/2020.acl-demos.3.pdf
Video:
http://slideslive.com/38928592