Parth Gupta


PRHLT: Combination of Deep Autoencoders with Classification and Regression Techniques for SemEval-2015 Task 11
Parth Gupta | Jon Ander Gómez
Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015)


English-to-Hindi system description for WMT 2014: Deep Source-Context Features for Moses
Marta R. Costa-jussà | Parth Gupta | Paolo Rosso | Rafael E. Banchs
Proceedings of the Ninth Workshop on Statistical Machine Translation

Enrichment of Bilingual Dictionary through News Stream Data
Ajay Dubey | Parth Gupta | Vasudeva Varma | Paolo Rosso
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

Bilingual dictionaries are a key component of cross-lingual similarity estimation methods. Such dictionaries are usually generated by manual or automatic means. Automatic generation approaches exploit parallel or comparable data to derive dictionary entries, and they require a large amount of bilingual data to produce a good-quality dictionary. Often the language pair does not have large bilingual comparable corpora, and in such cases the best automatic dictionary is upper bounded by the quality and coverage of those corpora. In this work we propose a method that exploits continuous quasi-comparable corpora to derive term-level associations for the enrichment of such a limited dictionary. Though we conduct our experiments on English and Hindi, our approach can easily be extended to other languages. We evaluate the dictionary by manually computing its precision. Our experiments show that the approach is able to derive interesting term-level associations across languages.


Text Reuse with ACL: (Upward) Trends
Parth Gupta | Paolo Rosso
Proceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries

Expected Divergence Based Feature Selection for Learning to Rank
Parth Gupta | Paolo Rosso
Proceedings of COLING 2012: Posters