Ethan Selfridge

Also published as: Ethan O. Selfridge


2024

pdf bib
Two-tiered Encoder-based Hallucination Detection for Retrieval-Augmented Generation in the Wild
Ilana Zimmerman | Jadin Tredup | Ethan Selfridge | Joseph Bradley
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track

Detecting hallucinations, where Large Language Models (LLMs) are not factually consistent with a Knowledge Base (KB), is a challenge for Retrieval-Augmented Generation (RAG) systems. Current solutions rely on public datasets to develop prompts or fine-tune a Natural Language Inference (NLI) model. However, these approaches are not focused on developing an enterprise RAG system; they do not consider latency, train or evaluate on production data, nor do they handle non-verifiable statements such as small talk or questions. To address this, we leverage the customer service conversation data of four large brands to evaluate existing solutions and propose a set of small encoder models trained on a new dataset. We find the proposed models to outperform existing methods and highlight the value of combining a small amount of in-domain data with public datasets.

2023

pdf bib
The economic trade-offs of large language models: A case study
Kristen Howell | Gwen Christian | Pavel Fomitchov | Gitit Kehat | Julianne Marzulla | Leanne Rolston | Jadin Tredup | Ilana Zimmerman | Ethan Selfridge | Joseph Bradley
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track)

Contacting customer service via chat is a common practice. Because employing customer service agents is expensive, many companies are turning to NLP that assists human agents by auto-generating responses that can be used directly or with modifications. With their ability to handle large context windows, Large Language Models (LLMs) are a natural fit for this use case. However, their efficacy must be balanced with the cost of training and serving them. This paper assesses the practical cost and impact of LLMs for the enterprise as a function of the usefulness of the responses that they generate. We present a cost framework for evaluating an NLP model’s utility for this use case and apply it to a single brand as a case study in the context of an existing agent assistance product. We compare three strategies for specializing an LLM — prompt engineering, fine-tuning, and knowledge distillation — using feedback from the brand’s customer service agents. We find that the usability of a model’s responses can make up for a large difference in inference cost for our case study brand, and we extrapolate our findings to the broader enterprise space.

2014

pdf bib
MVA: The Multimodal Virtual Assistant
Michael Johnston | John Chen | Patrick Ehlen | Hyuckchul Jung | Jay Lieske | Aarthi Reddy | Ethan Selfridge | Svetlana Stoyanchev | Brant Vasilieff | Jay Wilpon
Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)

2013

pdf bib
Continuously Predicting and Processing Barge-in During a Live Spoken Dialogue Task
Ethan Selfridge | Iker Arizmendi | Peter Heeman | Jason Williams
Proceedings of the SIGDIAL 2013 Conference

2012

pdf bib
A Temporal Simulator for Developing Turn-Taking Methods for Spoken Dialogue Systems
Ethan O. Selfridge | Peter A. Heeman
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue

pdf bib
Integrating Incremental Speech Recognition and POMDP-Based Dialogue Systems
Ethan O. Selfridge | Iker Arizmendi | Peter A. Heeman | Jason D. Williams
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue

2011

pdf bib
Proceedings of the ACL 2011 Student Session
Sasa Petrovic | Ethan Selfridge | Emily Pitler | Miles Osborne | Thamar Solorio
Proceedings of the ACL 2011 Student Session

pdf bib
Stability and Accuracy in Incremental Speech Recognition
Ethan Selfridge | Iker Arizmendi | Peter Heeman | Jason Williams
Proceedings of the SIGDIAL 2011 Conference

2010

pdf bib
Importance-Driven Turn-Bidding for Spoken Dialogue Systems
Ethan Selfridge | Peter Heeman
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics

pdf bib
Autism and Interactional Aspects of Dialogue
Peter Heeman | Rebecca Lunsford | Ethan Selfridge | Lois Black | Jan van Santen
Proceedings of the SIGDIAL 2010 Conference