Nikhil Londhe
2017
Summarizing World Speak : A Preliminary Graph Based Approach
Nikhil Londhe | Rohini Srihari
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
Social media platforms play a crucial role in piecing together global news stories via their corresponding online discussions. In this work, we therefore introduce the problem of automatically summarizing massively multilingual microblog text streams. We discuss the challenges involved in both generating and evaluating summaries. We introduce a simple word-graph-based approach that uses node neighborhoods to identify keyphrases and, in turn, to pick summary candidates. We also demonstrate the effectiveness of our method in generating precise summaries compared to other popular techniques.
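As a rough illustration of the general idea in the abstract above (a word graph whose node neighborhoods drive keyphrase and summary selection), here is a minimal sketch. It is not the authors' method: the co-occurrence edge weighting, the weighted-degree score, whitespace tokenization, and the function names `build_word_graph`, `neighborhood_score`, and `pick_summary` are all assumptions made for this example.

```python
from collections import defaultdict
from itertools import combinations


def build_word_graph(posts):
    """Build an undirected co-occurrence graph.

    Words are nodes; an edge weight counts how many posts contain both
    words. (Assumed construction, chosen only for illustration.)
    """
    graph = defaultdict(lambda: defaultdict(int))
    for post in posts:
        words = sorted(set(post.lower().split()))
        for a, b in combinations(words, 2):
            graph[a][b] += 1
            graph[b][a] += 1
    return graph


def neighborhood_score(graph, word):
    """Score a node by the total weight of the edges to its neighbors."""
    return sum(graph[word].values())


def pick_summary(posts):
    """Return the post whose words have the highest mean neighborhood score."""
    graph = build_word_graph(posts)

    def post_score(post):
        words = set(post.lower().split())
        total = sum(neighborhood_score(graph, w) for w in words)
        return total / max(len(words), 1)

    return max(posts, key=post_score)


posts = [
    "earthquake hits city",
    "earthquake hits coast",
    "lovely weather today",
]
graph = build_word_graph(posts)
summary = pick_summary(posts)
```

Because the sketch relies only on whitespace tokens and co-occurrence counts, it needs no language-specific resources, which is one plausible reason a graph formulation suits multilingual streams; real microblog text would need more careful tokenization.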
2016
Time-Independent and Language-Independent Extraction of Multiword Expressions From Twitter
Nikhil Londhe | Rohini Srihari | Vishrawas Gopalakrishnan
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Multiword Expressions (MWEs) are crucial lexico-semantic units in any language. However, most work on MWEs has focused on standard monolingual corpora. In this work, we examine MWE usage on Twitter, an inherently multilingual medium with an extremely short average text length that is often replete with grammatical errors. We present a new graph-based, language-agnostic method for automatically extracting MWEs from tweets. We show that our method outperforms standard association measures. We also present a novel unsupervised evaluation technique to ascertain the accuracy of MWE extraction.
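For context on the baseline named in the abstract above, pointwise mutual information (PMI) is one commonly used association measure for ranking candidate multiword expressions. The sketch below scores adjacent word pairs by PMI. It illustrates that kind of baseline only, not the paper's graph-based method; the function name `bigram_pmi` and the toy token lists are assumptions made for this example.

```python
import math
from collections import Counter


def bigram_pmi(tokenized_texts):
    """Return a dict mapping each adjacent word pair to its PMI in bits.

    PMI(a, b) = log2( P(a, b) / (P(a) * P(b)) ), with probabilities
    estimated from raw unigram and bigram counts (no smoothing).
    """
    unigrams = Counter()
    bigrams = Counter()
    for tokens in tokenized_texts:
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))

    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())

    scores = {}
    for (a, b), count in bigrams.items():
        p_ab = count / n_bi
        p_a = unigrams[a] / n_uni
        p_b = unigrams[b] / n_uni
        scores[(a, b)] = math.log2(p_ab / (p_a * p_b))
    return scores


# Toy, already tokenized input (hypothetical data).
tweets = [
    ["new", "york", "is", "big"],
    ["new", "york", "is", "far"],
    ["it", "is", "big"],
]
scores = bigram_pmi(tweets)
best_pair = max(scores, key=scores.get)
```

Unsmoothed PMI tends to overrate rare pairs, so practical baselines often add a minimum frequency cutoff; that limitation matters on short, noisy text such as tweets.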