Shie Mannor
Professor of Electrical Engineering, Technion
Verified email at technion.ac.il
Title, Cited by, Year
A Tutorial on the Cross-Entropy Method
PT de Boer, DP Kroese, S Mannor, RY Rubinstein
Annals of Operations Research 134 (1), 19-67, 2005
Cited by: 1234*; Year: 2005
The kernel recursive least-squares algorithm
Y Engel, S Mannor, R Meir
IEEE Transactions on Signal Processing 52 (8), 2275-2285, 2004
Cited by: 795; Year: 2004
Reinforcement learning with Gaussian processes
Y Engel, S Mannor, R Meir
ICML, 201-208, 2005
Cited by: 358; Year: 2005
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
E Even-Dar, S Mannor, Y Mansour
Journal of Machine Learning Research 7 (Jun), 1079-1105, 2006
Cited by: 292; Year: 2006
Robustness and regularization of support vector machines
H Xu, C Caramanis, S Mannor
Journal of Machine Learning Research 10 (Jul), 1485-1510, 2009
Cited by: 279; Year: 2009
PAC bounds for multi-armed bandit and Markov decision processes
E Even-Dar, S Mannor, Y Mansour
International Conference on Computational Learning Theory, 255-270, 2002
Cited by: 268; Year: 2002
The sample complexity of exploration in the multi-armed bandit problem
S Mannor, JN Tsitsiklis
Journal of Machine Learning Research 5 (Jun), 623-648, 2004
Cited by: 254; Year: 2004
Q-cut—dynamic discovery of sub-goals in reinforcement learning
I Menache, S Mannor, N Shimkin
European Conference on Machine Learning, 295-306, 2002
Cited by: 238; Year: 2002
Bayes meets Bellman: The Gaussian process approach to temporal difference learning
Y Engel, S Mannor, R Meir
Proceedings of the 20th International Conference on Machine Learning (ICML …, 2003
Cited by: 223; Year: 2003
Dynamic abstraction in reinforcement learning via clustering
S Mannor, I Menache, A Hoze, U Klein
Proceedings of the Twenty-First International Conference on Machine Learning, 71, 2004
Cited by: 204; Year: 2004
Basis function adaptation in temporal difference reinforcement learning
I Menache, S Mannor, N Shimkin
Annals of Operations Research 134 (1), 215-238, 2005
Cited by: 194; Year: 2005
Robust regression and lasso
H Xu, C Caramanis, S Mannor
Advances in Neural Information Processing Systems, 1801-1808, 2009
Cited by: 186; Year: 2009
Percentile optimization for Markov decision processes with parameter uncertainty
E Delage, S Mannor
Operations Research 58 (1), 203-213, 2010
Cited by: 183*; Year: 2010
Fully parallel stochastic LDPC decoders
SS Tehrani, S Mannor, WJ Gross
IEEE Transactions on Signal Processing 56 (11), 5692-5703, 2008
Cited by: 176; Year: 2008
Automatic basis function construction for approximate dynamic programming and reinforcement learning
PW Keller, S Mannor, D Precup
Proceedings of the 23rd International Conference on Machine Learning, 449-456, 2006
Cited by: 168; Year: 2006
Robustness and generalization
H Xu, S Mannor
Machine Learning 86 (3), 391-423, 2012
Cited by: 161; Year: 2012
Stochastic decoding of LDPC codes
SS Tehrani, WJ Gross, S Mannor
IEEE Communications Letters 10 (10), 716-718, 2006
Cited by: 144; Year: 2006
Regularized policy iteration
AM Farahmand, M Ghavamzadeh, S Mannor, C Szepesvári
Advances in Neural Information Processing Systems, 441-448, 2009
Cited by: 140; Year: 2009
The cross entropy method for fast policy search
S Mannor, RY Rubinstein, Y Gat
Proceedings of the 20th International Conference on Machine Learning (ICML …, 2003
Cited by: 139; Year: 2003
A deep hierarchical approach to lifelong learning in Minecraft
C Tessler, S Givony, T Zahavy, DJ Mankowitz, S Mannor
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Cited by: 134; Year: 2017