Zhenduo Wang
2024
ChatHF: Collecting Rich Human Feedback from Real-time Conversations
Andrew Li
|
Zhenduo Wang
|
Ethan Mendes
|
Duong Minh Le
|
Wei Xu
|
Alan Ritter
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
We introduce ChatHF, an interactive annotation framework for chatbot evaluation, which integrates configurable annotation within a chat interface. ChatHF can be flexibly configured to accommodate various chatbot evaluation tasks, for example detecting offensive content, identifying incorrect or misleading information in chatbot responses, and chatbot responses that might compromise privacy. It supports post-editing of chatbot outputs and supports visual inputs, in addition to an optional voice interface. ChatHF is suitable for collection and annotation of NLP datasets, and Human-Computer Interaction studies, as demonstrated in case studies on image geolocation and assisting older adults with daily activities. ChatHF is publicly accessible at https://chat-hf.com.
2018
UMDSub at SemEval-2018 Task 2: Multilingual Emoji Prediction Multi-channel Convolutional Neural Network on Subword Embedding
Zhenduo Wang
|
Ted Pedersen
Proceedings of the 12th International Workshop on Semantic Evaluation
This paper describes the UMDSub system that participated in Task 2 of SemEval-2018. We developed a system that predicts an emoji given the raw text in a English tweet. The system is a Multi-channel Convolutional Neural Network based on subword embeddings for the representation of tweets. This model improves on character or word based methods by about 2%. Our system placed 21st of 48 participating systems in the official evaluation.
ALANIS at SemEval-2018 Task 3: A Feature Engineering Approach to Irony Detection in English Tweets
Kevin Swanberg
|
Madiha Mirza
|
Ted Pedersen
|
Zhenduo Wang
Proceedings of the 12th International Workshop on Semantic Evaluation
This paper describes the ALANIS system that participated in Task 3 of SemEval-2018. We develop a system for detection of irony, as well as the detection of three types of irony: verbal polar irony, other verbal irony, and situational irony. The system uses a logistic regression model in subtask A and a voted classifier system with manually developed features to identify ironic tweets. This model improves on a naive bayes baseline by about 8 percent on training set.
Search
Co-authors
- Ted Pedersen 2
- Andrew Li 1
- Ethan Mendes 1
- Duong Minh Le 1
- Wei Xu 1
- show all...