Effective distributed representations for academic expert search

Mark Berger, Jakub Zavrel, Paul Groth


Abstract
Expert search aims to find and rank experts based on a user’s query. In academia, retrieving experts is an efficient way to navigate through a large amount of academic knowledge. Here, we study how different distributed representations of academic papers (i.e. embeddings) impact academic expert retrieval. We use the Microsoft Academic Graph dataset and experiment with different configurations of a document-centric voting model for retrieval. In particular, we explore the impact of the use of contextualized embeddings on search performance. We also present results for paper embeddings that incorporate citation information through retrofitting. Additionally, experiments are conducted using different techniques for assigning author weights based on author order. We observe that using contextual embeddings produced by a transformer model trained for sentence similarity tasks produces the most effective paper representations for document-centric expert retrieval. However, retrofitting the paper embeddings and using elaborate author contribution weighting strategies did not improve retrieval performance.
Anthology ID:
2020.sdp-1.7
Volume:
Proceedings of the First Workshop on Scholarly Document Processing
Month:
November
Year:
2020
Address:
Online
Editors:
Muthu Kumar Chandrasekaran, Anita de Waard, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Eduard Hovy, Petr Knoth, David Konopnicki, Philipp Mayr, Robert M. Patton, Michal Shmueli-Scheuer
Venue:
sdp
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
56–71
Language:
URL:
https://aclanthology.org/2020.sdp-1.7
DOI:
10.18653/v1/2020.sdp-1.7
Bibkey:
Cite (ACL):
Mark Berger, Jakub Zavrel, and Paul Groth. 2020. Effective distributed representations for academic expert search. In Proceedings of the First Workshop on Scholarly Document Processing, pages 56–71, Online. Association for Computational Linguistics.
Cite (Informal):
Effective distributed representations for academic expert search (Berger et al., sdp 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.sdp-1.7.pdf
Video:
 https://slideslive.com/38940716
Code
 mabergerx/SDP500_expert_search
Data
Microsoft Academic Graph