MS-LaTTE: A Dataset of Where and When To-do Tasks are Completed

Sujay Kumar Jauhar, Nirupama Chandrasekaran, Michael Gamon, Ryen White


Abstract
Tasks are a fundamental unit of work in the daily lives of people, who are increasingly using digital means to keep track of, organize, triage, and act on them. These digital tools – such as task management applications – provide a unique opportunity to study and understand tasks and their connection to the real world, and through intelligent assistance, help people be more productive. By logging signals such as text, timestamp information, and social connectivity graphs, an increasingly rich and detailed picture of how tasks are created and organized, what makes them important, and who acts on them, can be progressively developed. Yet the context around actual task completion remains fuzzy, due to the basic disconnect between actions taken in the real world and telemetry recorded in the digital world. Thus, in this paper we compile and release a novel, real-life, large-scale dataset called MS-LaTTE that captures two core aspects of the context surrounding task completion: location and time. We describe our annotation framework and conduct a number of analyses on the data that were collected, demonstrating that it captures intuitive contextual properties for common tasks. Finally, we test the dataset on the two problems of predicting spatial and temporal task co-occurrence, concluding that predictors for co-location and co-time are both learnable, with a BERT fine-tuned model outperforming several other baselines. The MS-LaTTE dataset provides an opportunity to tackle many new modeling challenges in contextual task understanding and we hope that its release will spur future research in task intelligence more broadly.
Anthology ID:
2022.lrec-1.577
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5393–5403
Language:
URL:
https://aclanthology.org/2022.lrec-1.577
DOI:
Bibkey:
Cite (ACL):
Sujay Kumar Jauhar, Nirupama Chandrasekaran, Michael Gamon, and Ryen White. 2022. MS-LaTTE: A Dataset of Where and When To-do Tasks are Completed. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 5393–5403, Marseille, France. European Language Resources Association.
Cite (Informal):
MS-LaTTE: A Dataset of Where and When To-do Tasks are Completed (Jauhar et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.577.pdf
Code
 microsoft/ms-latte