pdf bibSemi-Supervised SimHash for Efficient Document Similarity SearchQixia Jiang | Maosong SunProceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies