Tom Zahavy
Tom Zahavy
Google, Technion
Verified email at - Homepage
TitleCited byYear
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, DJ Mankowitz, S Mannor
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem, S Mannor
International Conference on Machine Learning, 1899-1908, 2016
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce
T Zahavy, A Krishnan, A Magnani, S Mannor
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor
Advances in Neural Information Processing Systems, 2018
Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms
T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor
International Conference on Learning Representations Workshop (ICLRW'18), 2016
Deep learning reconstruction of ultrashort pulses
T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ...
Optica 5 (5), 666-673, 2018
Shallow updates for deep reinforcement learning
N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor
Advances in Neural Information Processing Systems, 3135-3145, 2017
Sub-Nyquist sampling of OFDM signals for cognitive radios
T Zahavy, O Shayer, D Cohen, A Tolmachev, YC Eldar
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Visualizing Dynamics: from t-SNE to SEMI-MDPs
NB Zrihem, T Zahavy, S Mannor
ICML Workshop on Human Interpretability in Machine Learning (WHI 2016),, 2016
Planning in Hierarchical Reinforcement Learning: Guarantees for Using Local Policies
T Zahavy, A Hasidim, H Kaplan, Y Mansour
The 14th European Workshop on Reinforcement Learning (EWRL 2018), 2019
Train on Validation: Squeezing the Data Lemon
G Tennenholtz, T Zahavy, S Mannor
arXiv preprint arXiv:1802.05846, 2018
Average reward reinforcement learning with unknown mixing times
T Zahavy, A Cohen, H Kaplan, Y Mansour
arXiv preprint arXiv:1905.09704, 2019
Inverse Reinforcement Learning in Contextual MDPs
P Korsunsky, S Belo, T Zahavy, C Tessler, S Mannor
arXiv preprint arXiv:1905.09710, 2019
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
C Tessler, T Zahavy, D Cohen, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1905.09700, 2019
Improving techniques for diagnostics of laser pulses by compact representations
P Sidorenko, A Dikopoltsev, T Zahavy, O Lahav, S Gazit, Y Shechtman, ...
Optics express 27 (6), 8920-8934, 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
T Zahavy, S Mannor
arXiv preprint arXiv:1901.08612, 2019
Reconstruction of Ultrashort Pulses Using Deep Neural Networks
A Dikopoltsev, T Zahavy, R Ziv, I Rubinstein, P Sidorenko, S Mannor, ...
2018 2nd URSI Atlantic Radio Science Meeting (AT-RASC), 1-1, 2018
Systems, method, and non-transitory computer-readable storage media for multi-modal product classification
A Magnani, TBZ Zahavy, A Krishnan, S Mannor
US Patent App. 15/339,711, 2018
The mind of a machine
A Rutkin
New Scientist 229 (3061), 22, 2016
The system can't perform the operation now. Try again later.
Articles 1–19