GHEORGHE COMANICI

Cited by

	All	Since 2019
Citations	353	281
h-index	10	8
i10-index	10	7

Citations per year
Year	Citations
2011	7
2012	7
2013	12
2014	10
2015	9
2016	7
2017	11
2018	7
2019	9
2020	28
2021	47
2022	90
2023	80
2024	27

Public access (based on funding mandates)
Available	1 article
Not available	0 articles

Research Scientist, DeepMind

Verified email at deepmind.com

Reinforcement Learning, Hierarchical Behavior, Bisimulation metrics, Spectral Learning


Title	Authors	Venue	Cited by	Year
The option keyboard: Combining skills in reinforcement learning	A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ...	Advances in Neural Information Processing Systems 32, 2019	91	2019
What can I do here? A theory of affordances in reinforcement learning	K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup	International Conference on Machine Learning, 5243-5253, 2020	61	2020
Optimal policy switching algorithms for reinforcement learning	G Comanici, D Precup	Proceedings of the 9th International Conference on Autonomous Agents and …, 2010	47	2010
AndroidEnv: A reinforcement learning platform for Android	D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ...	arXiv preprint arXiv:2105.13231, 2021	42	2021
On-the-fly algorithms for bisimulation metrics	G Comanici, P Panangaden, D Precup	2012 Ninth International Conference on Quantitative Evaluation of Systems …, 2012	24	2012
Basis function discovery using spectral clustering and bisimulation metrics	G Comanici, D Precup	International Workshop on Adaptive and Learning Agents, 85-99, 2011	20	2011
Representation discovery for MDPs using bisimulation metrics	S Ruan, G Comanici, P Panangaden, D Precup	Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015	17	2015
Temporally abstract partial models	K Khetarpal, Z Ahmed, G Comanici, D Precup	Advances in Neural Information Processing Systems 34, 1979-1991, 2021	11	2021
Knowledge representation for reinforcement learning using general value functions	G Comanici, D Precup, A Barreto, DK Toyama, E Aygün, P Hamel, ...		10	2018
An empirical analysis of off-policy learning in discrete MDPs	C Păduraru, D Precup, J Pineau, G Comănici	European Workshop on Reinforcement Learning, 89-102, 2013	10	2013
Basis refinement strategies for linear value function approximation in MDPs	G Comanici, D Precup, P Panangaden	Advances in Neural Information Processing Systems 28, 2015	8	2015
What can I do here	K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup	A theory of affordances in reinforcement learning. arXiv [cs. LG], 2020	5	2020
A study of off-policy learning in computational sustainability	C Paduraru, D Precup, J Pineau, G Comanici	European Workshop on Reinforcement Learning (EWRL) 24, 89-102, 2012	4	2012
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search	A Mehrabian, A Anand, H Kim, N Sonnerat, M Balog, G Comanici, ...	arXiv preprint arXiv:2311.03583, 2023	2	2023
Vision-language models as a source of rewards	K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ...	arXiv preprint arXiv:2312.09187, 2023	1	2023
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning	G Comanici, A Glaese, A Gergely, D Toyama, Z Ahmed, T Jackson, ...	arXiv preprint arXiv:2204.10374, 2022		2022
Representation discovery for Markov decision processes using behavioural similarity	G Comanici	McGill University (Canada), 2016		2016
Optimal Time Scales for Reinforcement Learning Behaviour Strategies	G Comanici, D Precup	McGill University, 2010		2010
