Timothy A Mann
Title
Cited by
Cited by
Year
Deep reinforcement learning in large discrete action spaces
G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ...
arXiv preprint arXiv:1512.07679, 2015
1632015
A Dual Approach to Scalable Verification of Deep Networks.
K Dvijotham, R Stanforth, S Gowal, TA Mann, P Kohli
UAI 1, 2, 2018
1132018
On the effectiveness of interval bound propagation for training verifiably robust models
S Gowal, K Dvijotham, R Stanforth, R Bunel, C Qin, J Uesato, ...
arXiv preprint arXiv:1810.12715, 2018
632018
Adaptive skills adaptive partitions (ASAP)
DJ Mankowitz, TA Mann, S Mannor
Advances in Neural Information Processing Systems, 1588-1596, 2016
362016
Scaling up approximate value iteration with options: Better policies with fewer iterations
T Mann, S Mannor
International conference on machine learning, 127-135, 2014
352014
Time regularized interrupting options
DJ Mankowitz, TA Mann, S Mannor
Internation Conference on Machine Learning, 2014
33*2014
Approximate Value Iteration with Temporally Extended Actions
SMDP Timothy A. Mann
Journal of Artificial Intelligence Research 53, 375-438, 2015
302015
How hard is my MDP?" The distribution-norm to the rescue"
OA Maillard, TA Mann, S Mannor
Advances in Neural Information Processing Systems, 1835-1843, 2014
222014
Off-policy model-based learning under unknown factored dynamics
A Hallak, F Schnitzler, T Mann, S Mannor
International Conference on Machine Learning, 711-719, 2015
162015
Directed Exploration in Reinforcement Learning with Transferred Knowledge.
TA Mann, Y Choe
Ewrl, 59-76, 2012
162012
Beyond greedy ranking: Slate optimization via list-CVAE
R Jiang, S Gowal, TA Mann, DJ Rezende
arXiv preprint arXiv:1803.01682, 2018
152018
Learning robust options
DJ Mankowitz, TA Mann, PL Bacon, D Precup, S Mannor
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
102018
Autonomous and interactive improvement of binocular visual depth estimation through sensorimotor interaction
TA Mann, Y Park, S Jeong, M Lee, Y Choe
IEEE Transactions on Autonomous Mental Development 5 (1), 74-84, 2012
102012
Adaptive lambda least-squares temporal difference learning
TA Mann, H Penedones, S Mannor, T Hester
arXiv preprint arXiv:1612.09465, 2016
92016
The advantage of planning with options
TA Mann, S Mannor
RLDM 2013, 9, 2013
82013
Evolution of time in neural networks: from the present to the past, and forward to the future
JR Chung, J Kwon, TA Mann, Y Choe
The Relevance of the Time Domain to Neural Network Models, 99-115, 2012
82012
An alternative surrogate loss for pgd-based adversarial testing
S Gowal, J Uesato, C Qin, PS Huang, T Mann, P Kohli
arXiv preprint arXiv:1910.09338, 2019
72019
Scaling up reinforcement learning through targeted exploration
TA Mann, Y Choe
Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011
72011
Iterative hierarchical optimization for misspecified problems (IHOMP)
DJ Mankowitz, TA Mann, S Mannor
arXiv preprint arXiv:1602.03348, 2016
62016
Prenatal to postnatal transfer of motor skills through motor-compatible sensory representations
TA Mann, Y Choe
2010 IEEE 9th International Conference on Development and Learning, 185-190, 2010
62010
The system can't perform the operation now. Try again later.
Articles 1–20