Pierre-Luc Bacon
University of Montreal
Verified email at mila.quebec
Title
Cited by
Year
The option-critic architecture
PL Bacon, J Harb, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Cited by: 621, Year: 2017
Conditional computation in neural networks for faster models
E Bengio, PL Bacon, J Pineau, D Precup
arXiv preprint arXiv:1511.06297, 2015
Cited by: 194, Year: 2015
When waiting is not an option: Learning options with a deliberation cost
J Harb, PL Bacon, M Klissarov, D Precup
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Cited by: 81, Year: 2018
OptionGAN: Learning joint reward-policy options using generative adversarial inverse reinforcement learning
P Henderson, WD Chang, PL Bacon, D Meger, J Pineau, D Precup
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
Cited by: 47, Year: 2018
Convergent TREE BACKUP and RETRACE with function approximation
A Touati, PL Bacon, D Precup, P Vincent
International Conference on Machine Learning, 4955-4964, 2018
Cited by: 29, Year: 2018
Learning robust options
DJ Mankowitz, TA Mann, PL Bacon, D Precup, S Mannor
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Cited by: 28, Year: 2018
Learnings options end-to-end for continuous action tasks
M Klissarov, PL Bacon, J Harb, D Precup
arXiv preprint arXiv:1712.00004, 2017
Cited by: 27, Year: 2017
Temporal Representation Learning
PL Bacon
McGill University (Canada), 2018
Cited by: 17, Year: 2018
Learning with options that terminate off-policy
A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 16, Year: 2018
Options of interest: Temporal abstraction with interest functions
K Khetarpal, M Klissarov, M Chevalier-Boisvert, PL Bacon, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4444-4451, 2020
Cited by: 13, Year: 2020
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling
Y Liu, PL Bacon, E Brunskill
International Conference on Machine Learning, 6184-6193, 2020
Cited by: 12, Year: 2020
Learning with options: Just deliberate and relax
PL Bacon, D Precup
NIPS Bounded Optimality and Rational Metareasoning Workshop, 2015
Cited by: 9, Year: 2015
On the bottleneck concept for options discovery: Theoretical underpinnings and extension in continuous state spaces
PL Bacon
Cited by: 9, Year: 2014
XLVIN: executed latent value iteration nets
A Deac, P Veličković, O Milinković, PL Bacon, J Tang, M Nikolić
arXiv preprint arXiv:2010.13146, 2020
Cited by: 7, Year: 2020
Policy evaluation networks
J Harb, T Schaul, D Precup, PL Bacon
arXiv preprint arXiv:2002.11833, 2020
Cited by: 7, Year: 2020
On the bottleneck concept for options discovery
PL Bacon
Ph. D. dissertation, Masters thesis, 2013
Cited by: 6, Year: 2013
Using label propagation for learning temporally abstract actions in reinforcement learning
PL Bacon, D Precup
Proceedings of the Workshop on Multiagent Interaction Networks, 1-7, 2013
Cited by: 6, Year: 2013
The Barbados 2018 list of open issues in continual learning
T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...
arXiv preprint arXiv:1811.07004, 2018
Cited by: 5, Year: 2018
Conditional computation in neural networks using a decision-theoretic approach
PL Bacon, E Bengio, J Pineau, D Precup
Proceedings of the 2nd Multidisciplinary Conference on Reinforcement …, 2015
Cited by: 5, Year: 2015
Graph neural induction of value iteration
A Deac, PL Bacon, J Tang
arXiv preprint arXiv:2009.12604, 2020
Cited by: 4, Year: 2020