Olivier Pietquin
Olivier Pietquin
Google Brain (On leave of Professor at University Lille 1 - CRIStAL - SequeL team)
Verified email at univ-lille.fr - Homepage
TitleCited byYear
Deep q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2052018
A probabilistic framework for dialog simulation and optimal strategy learning
O Pietquin, T Dutoit
IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 589-599, 2006
1742006
Noisy networks for exploration
M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ...
arXiv preprint arXiv:1706.10295, 2017
1672017
Guesswhat?! visual object discovery through multi-modal dialogue
H De Vries, F Strub, S Chandar, O Pietquin, H Larochelle, A Courville
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
1282017
A framework for unsupervised learning of dialogue strategies
O Pietquin
Presses univ. de Louvain, 2005
1232005
Machine learning for spoken dialogue systems
O Lemon, O Pietquin
1152007
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards
M Večerík, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...
arXiv preprint arXiv:1707.08817, 2017
862017
Modulating early visual processing by language
H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, AC Courville
Advances in Neural Information Processing Systems, 6594-6604, 2017
852017
A survey on metrics for the evaluation of user simulations
O Pietquin, H Hastie
The knowledge engineering review 28 (1), 59-73, 2013
812013
Sample-efficient batch reinforcement learning for dialogue management optimization
O Pietquin, M Geist, S Chandramohan, H Frezza-Buet
ACM Transactions on Speech and Language Processing (TSLP) 7 (3), 7, 2011
762011
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
762010
Algorithmic Survey of Parametric Value Function Approximation
M Geist, O Pietquin
Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013
71*2013
User simulation in dialogue systems using inverse reinforcement learning
S Chandramohan, M Geist, F Lefevre, O Pietquin
702011
Data-driven methods for adaptive spoken dialogue systems: Computational learning for conversational interfaces
O Lemon, O Pietquin
Springer Science & Business Media, 2012
642012
ASR system modeling for automatic evaluation and optimization of dialogue systems
O Pietquin, S Renals
2002 IEEE International Conference on Acoustics, Speech, and Signal …, 2002
612002
End-to-end optimization of goal-driven and visually grounded dialogue systems
F Strub, H De Vries, J Mary, B Piot, A Courville, O Pietquin
arXiv preprint arXiv:1703.05423, 2017
572017
Inverse reinforcement learning through structured classification
E Klein, M Geist, B Piot, O Pietquin
Advances in neural information processing systems, 1007-1015, 2012
552012
Laugh-aware virtual agent and its impact on user amusement
R Niewiadomski, J Hofmann, J Urbain, T Platt, J Wagner, B Piot, ...
Proceedings of the 2013 international conference on Autonomous agents and …, 2013
542013
A comprehensive reinforcement learning framework for dialogue management optimization
L Daubigney, M Geist, S Chandramohan, O Pietquin
IEEE Journal of Selected Topics in Signal Processing 6 (8), 891-902, 2012
492012
Listen and translate: A proof of concept for end-to-end speech-to-text translation
A Bérard, O Pietquin, C Servan, L Besacier
arXiv preprint arXiv:1612.01744, 2016
412016
The system can't perform the operation now. Try again later.
Articles 1–20