NSTM: Real-Time Query-Driven News Overview Composition at Bloomberg

Joshua Bambrick, Minjie Xu, Andy Almonte, Igor Malioutov, Guim Perarnau, Vittorio Selo, Iat Chong Chan


Abstract
Millions of news articles from hundreds of thousands of sources around the globe appear in news aggregators every day. Consuming such a volume of news presents an almost insurmountable challenge. For example, a reader searching on Bloomberg’s system for news about the U.K. would find 10,000 articles on a typical day. Apple Inc., the world’s most journalistically covered company, garners around 1,800 news articles a day. We realized that a new kind of summarization engine was needed, one that would condense large volumes of news into short, easy to absorb points. The system would filter out noise and duplicates to identify and summarize key news about companies, countries or markets. When given a user query, Bloomberg’s solution, Key News Themes (or NSTM), leverages state-of-the-art semantic clustering techniques and novel summarization methods to produce comprehensive, yet concise, digests to dramatically simplify the news consumption process. NSTM is available to hundreds of thousands of readers around the world and serves thousands of requests daily with sub-second latency. At ACL 2020, we will present a demo of NSTM.
Anthology ID:
2020.acl-demos.40
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
Month:
July
Year:
2020
Address:
Online
Editors:
Asli Celikyilmaz, Tsung-Hsien Wen
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
350–361
Language:
URL:
https://aclanthology.org/2020.acl-demos.40
DOI:
10.18653/v1/2020.acl-demos.40
Bibkey:
Cite (ACL):
Joshua Bambrick, Minjie Xu, Andy Almonte, Igor Malioutov, Guim Perarnau, Vittorio Selo, and Iat Chong Chan. 2020. NSTM: Real-Time Query-Driven News Overview Composition at Bloomberg. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 350–361, Online. Association for Computational Linguistics.
Cite (Informal):
NSTM: Real-Time Query-Driven News Overview Composition at Bloomberg (Bambrick et al., ACL 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.acl-demos.40.pdf
Video:
 http://slideslive.com/38928599