Tom Zahavy
Title
Cited by
Cited by
Year
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, DJ Mankowitz, S Mannor
Thirty-First AAAI Conference on Artificial Intelligence, 2017
1752017
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem, S Mannor
International Conference on Machine Learning, 1899-1908, 2016
1262016
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce
T Zahavy, A Krishnan, A Magnani, S Mannor
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
64*2018
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor
Advances in Neural Information Processing Systems, 2018
41*2018
Deep learning reconstruction of ultrashort pulses
T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ...
Optica 5 (5), 666-673, 2018
322018
Shallow updates for deep reinforcement learning
N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor
Advances in Neural Information Processing Systems, 3135-3145, 2017
232017
Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms
T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor
International Conference on Learning Representations Workshop (ICLRW'18), 2016
20*2016
Sub-Nyquist sampling of OFDM signals for cognitive radios
T Zahavy, O Shayer, D Cohen, A Tolmachev, YC Eldar
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
82014
Visualizing Dynamics: from t-SNE to SEMI-MDPs
NB Zrihem, T Zahavy, S Mannor
ICML Workshop on Human Interpretability in Machine Learning (WHI 2016),, 2016
7*2016
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
C Tessler, T Zahavy, D Cohen, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1905.09700, 2019
32019
Deep neural linear bandits: Overcoming catastrophic forgetting through likelihood matching
T Zahavy, S Mannor
arXiv preprint arXiv:1901.08612, 2019
32019
Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot
R Ziv, A Dikopoltsev, T Zahavy, I Rubinstein, P Sidorenko, O Cohen, ...
Optics Express 28 (5), 7528-7538, 2020
12020
Average reward reinforcement learning with unknown mixing times
T Zahavy, A Cohen, H Kaplan, Y Mansour
The 36th Conference on Uncertainty in Artificial Intelligence, 2019
12019
Train on validation: squeezing the data lemon
G Tennenholtz, T Zahavy, S Mannor
arXiv preprint arXiv:1802.05846, 2018
12018
Planning in Hierarchical Reinforcement Learning: Guarantees for Using Local Policies
T Zahavy, A Hasidim, H Kaplan, Y Mansour
Algorithmic Learning Theory (ALT) 2020, 0
1*
Deep neural networks in single-shot ptychography
O Wengrowicz, O Peleg, T Zahavy, B Loevsky, O Cohen
Optics Express 28 (12), 17511-17520, 2020
2020
Learning to Ask Medical Questions using Reinforcement Learning
U Shaham, T Zahavy, C Caraballo, S Mahajan, D Massey, H Krumholz
arXiv preprint arXiv:2004.00994, 2020
2020
Self-Tuning Deep Reinforcement Learning
T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, H van Hasselt, D Silver, ...
arXiv preprint arXiv:2002.12928, 2020
2020
Apprenticeship Learning via Frank-Wolfe
T Zahavy, A Cohen, H Kaplan, Y Mansour
Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2020
Inverse Reinforcement Learning in Contextual MDPs
P Korsunsky, S Belo, T Zahavy, C Tessler, S Mannor
arXiv preprint arXiv:1905.09710, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–20