Zheng Wen

Cited by

	All	Since 2019
Citations	5159	4342
h-index	31	30
i10-index	54	50

1000

500

250

750

2014201520162017201820192020202120222023202428 68 146 184 330 502 738 847 985 924 344

Public access

View all

8 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Branislav KvetonAmazonVerified email at amazon.com
Benjamin Van RoyStanford UniversityVerified email at stanford.edu
Ian OsbandOpenAIVerified email at openai.com
Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Azin AshkanGoogleVerified email at uwaterloo.ca
Xiuyuan LuGoogle DeepMindVerified email at google.com
Yasin Abbasi YadkoriDeepMindVerified email at google.com
Vikranth DwaracherlaDeepMindVerified email at google.com
Morteza IbrahimiStanford UniversityVerified email at stanford.edu
Mohammad GhavamzadehAmazonVerified email at amazon.com
Sharan VaswaniSimon Fraser UniversityVerified email at sfu.ca
Daniel RussoColumbia UniversityVerified email at gsb.columbia.edu
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Seyed Mohammad AsghariResearch Engineer, DeepMindVerified email at google.com
Brian ErikssonAdobeVerified email at adobe.com
Botao HaoDeepmindVerified email at google.com
S MuthukrishnanRutgers UnivVerified email at cs.rutgers.edu
Sumeet KatariyaAmazonVerified email at wisc.edu
Shlomo BerkovskyMacquarie UniversityVerified email at mq.edu.au
Claire VernadeUniversity of TuebingenVerified email at uni-tuebingen.de

Zheng Wen

Google DeepMind

Verified email at google.com - Homepage

Artificial Intelligence Reinforcement Learning Operations Research Large Language Models


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A Tutorial on Thompson Sampling D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen arXiv, https://arxiv.org/pdf/1707.02038.pdf, 0	1048*
Generalization and exploration via randomized value functions I Osband, B Van Roy, Z Wen International Conference on Machine Learning, 2377-2386, 2016	327	2016
Deep exploration via randomized value functions I Osband, B Van Roy, DJ Russo, Z Wen Journal of Machine Learning Research 20 (124), 1-62, 2019	320	2019
Cascading bandits: Learning to rank in the cascade model B Kveton, C Szepesvári, Z Wen, A Ashkan ICML, 2015	306	2015
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits B Kveton, Z Wen, A Ashkan, C Szepesvari International Conference on Artificial Intelligence and Statistics (AISTATS …, 2014	305	2014
Optimal demand response using device based reinforcement learning Z Wen, D O'Neill, HR Maei IEEE Transactions on Smart Grid, 2014	303	2014
Online influence maximization under independent cascade model with semi-bandit feedback Z Wen, B Kveton, M Valko, S Vaswani Advances in neural information processing systems 30, 2017	143*	2017
Nearly optimal adaptive procedure with change detection for piecewise-stationary bandit Y Cao, Z Wen, B Kveton, Y Xie The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	128*	2019
Cascading bandits for large-scale recommendation problems S Zong, H Ni, K Sung, NR Ke, Z Wen, B Kveton arXiv preprint arXiv:1603.05359, 2016	125	2016
Combinatorial cascading bandits B Kveton, Z Wen, A Ashkan, C Szepesvari Advances in Neural Information Processing Systems 28, 2015	125	2015
Matroid bandits: Fast combinatorial optimization with learning B Kveton, Z Wen, A Ashkan, H Eydgahi, B Eriksson UAI 2014, 2014	125	2014
Efficient learning in large-scale combinatorial semi-bandits Z Wen, B Kveton, A Ashkan http://jmlr.org/proceedings/papers/v37/wen15.html, 2014	108	2014
Optimal Greedy Diversity for Recommendation A Ashkan, B Kveton, S Berkovsky, Z Wen	107	2015
Online learning to rank in stochastic click models M Zoghi, T Tunys, M Ghavamzadeh, B Kveton, C Szepesvari, Z Wen International conference on machine learning, 4199-4208, 2017	105	2017
DCM Bandits: Learning to Rank with Multiple Clicks S Katariya, B Kveton, C Szepesvári, Z Wen arXiv, 2016	88	2016
Efficient Exploration and Value Function Generalization in Deterministic Systems Z Wen, B Van Roy Advances in Neural Information Processing Systems, 3021--3029, 2013	86	2013
Model-independent online learning for influence maximization S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, LVS Lakshmanan, ... International conference on machine learning, 3530-3539, 2017	81*	2017
Epistemic neural networks I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... Advances in Neural Information Processing Systems 36, 2024	79	2024
Stochastic rank-1 bandits S Katariya, B Kveton, C Szepesvari, C Vernade, Z Wen Artificial Intelligence and Statistics, 392-401, 2017	74	2017
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh International Conference on Machine Learning, 3601-3610, 2019	72	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors