Matthieu Geist
2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion
Yannis Flet-Berliac
|
Nathan Grinsztajn
|
Florian Strub
|
Eugene Choi
|
Bill Wu
|
Chris Cremer
|
Arash Ahmadian
|
Yash Chandak
|
Mohammad Gheshlaghi Azar
|
Olivier Pietquin
|
Matthieu Geist
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
2023
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
Paul Roit
|
Johan Ferret
|
Lior Shani
|
Roee Aharoni
|
Geoffrey Cideron
|
Robert Dadashi
|
Matthieu Geist
|
Sertan Girgin
|
Leonard Hussenot
|
Orgad Keller
|
Nikola Momchev
|
Sabela Ramos Garea
|
Piotr Stanczyk
|
Nino Vieillard
|
Olivier Bachem
|
Gal Elidan
|
Avinatan Hassidim
|
Olivier Pietquin
|
Idan Szpektor
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2013
Model-free POMDP optimisation of tutoring systems with echo-state networks
Lucie Daubigney
|
Matthieu Geist
|
Olivier Pietquin
Proceedings of the SIGDIAL 2013 Conference
2012
Optimisation d’un tuteur intelligent à partir d’un jeu de données fixé (Optimization of a tutoring system from a fixed set of data) [in French]
Lucie Daubigney
|
Matthieu Geist
|
Olivier Pietquin
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, volume 1: JEP
2010
Sparse Approximate Dynamic Programming for Dialog Management
Senthilkumar Chandramohan
|
Matthieu Geist
|
Olivier Pietquin
Proceedings of the SIGDIAL 2010 Conference
Co-authors
- Olivier Pietquin 5
- Lucie Daubigney 2
- Yannis Flet-Berliac 1
- Nathan Grinsztajn 1
- Florian Strub 1
- show all...
- Eugene Choi 1
- Bill Wu 1
- Chris Cremer 1
- Arash Ahmadian 1
- Yash Chandak 1
- Mohammad Gheshlaghi Azar 1
- Paul Roit 1
- Johan Ferret 1
- Lior Shani 1
- Roee Aharoni 1
- Geoffrey Cideron 1
- Robert Dadashi 1
- Sertan Girgin 1
- Leonard Hussenot 1
- Orgad Keller 1
- Nikola Momchev 1
- Sabela Ramos Garea 1
- Piotr Stanczyk 1
- Nino Vieillard 1
- Olivier Bachem 1
- Gal Elidan 1
- Avinatan Hassidim 1
- Idan Szpektor 1
- Senthilkumar Chandramohan 1