Bei Peng

Cited by

	All	Since 2019
Citations	2056	1875
h-index	16	15
i10-index	19	16

640

320

160

480

201520162017201820192020202120222023202416 36 43 77 56 139 313 519 628 217

Public access

View all

14 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Matthew E. TaylorAssociate Professor, University of AlbertaVerified email at ualberta.ca
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoVerified email at cs.ox.ac.uk
David L. RobertsAssociate Professor, Assistant Director Undergraduate Programs, Interim Director Digital GamesVerified email at csc.ncsu.edu
Michael LittmanBrown UniversityVerified email at brown.edu
Robert LoftinLecturer, University of SheffieldVerified email at sheffield.ac.uk
James MacGlashanSony AIVerified email at sony.com
Wendelin BöhmerSequential Decision Making Group, Delft University of TechnologyVerified email at tudelft.nl
Tabish RashidMicrosoft ResearchVerified email at microsoft.com
Christian Schroeder de WittUniversity of OxfordVerified email at robots.ox.ac.uk
Tarun GuptaUniversity of Oxford, Microsoft ResearchVerified email at microsoft.com
Jeff HuangBrown UniversityVerified email at jeffhuang.com
Philip TorrProfessor, University of OxfordVerified email at eng.ox.ac.uk
Anuj MahajanAmazonVerified email at cs.ox.ac.uk
Sanmit NarvekarResearch Scientist, WaymoVerified email at cs.utexas.edu
Peter StoneProfessor of Computer Science, The University of Texas at AustinVerified email at cs.utexas.edu
Jivko SinapovAssistant Professor, Tufts UniversityVerified email at cs.tufts.edu
Matteo LeonettiDepartment of Informatics, King's College LondonVerified email at kcl.ac.uk
Gregory FarquharDeepMindVerified email at google.com
Tonghan WangEcon CS group, Harvard UniversityVerified email at g.harvard.edu
Shariq IqbalResearch Scientist, DeepmindVerified email at deepmind.com

Bei Peng

Lecturer (Assistant Professor), University of Liverpool

Verified email at liverpool.ac.uk - Homepage

Machine Learning Reinforcement Learning Interactive Learning Multi-Agent Systems Curriculum Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey S Narvekar, B Peng, M Leonetti, J Sinapov, ME Taylor, P Stone Journal of Machine Learning Research (JMLR 2020) 21, 1-50, 2020	445	2020
Weighted QMIX: Expanding Monotonic Value Function Factorisation T Rashid, G Farquhar, B Peng, S Whiteson Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020), 2020	314*	2020
Interactive learning from policy-dependent human feedback J MacGlashan, MK Ho, R Loftin, B Peng, G Wang, DL Roberts, ME Taylor, ... 34th International Conference on Machine Learning (ICML 2017), 2285-2294, 2017	301	2017
RODE: Learning Roles to Decompose Multi-Agent Tasks T Wang, T Gupta, A Mahajan, B Peng, S Whiteson, C Zhang International Conference on Learning Representations (ICLR 2021), 2020	181	2020
FACMAC: Factored Multi-Agent Centralised Policy Gradients B Peng, T Rashid, CAS de Witt, PA Kamienny, PHS Torr, W Böhmer, ... 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021	168	2021
Learning behaviors via human-delivered discrete feedback: modeling implicit feedback strategies to speed up learning R Loftin, B Peng, J MacGlashan, ML Littman, ME Taylor, J Huang, ... Autonomous agents and multi-agent systems (JAAMAS 2016) 30 (1), 30-59, 2016	120	2016
A strategy-aware technique for learning behaviors from discrete human feedback RT Loftin, J MacGlashan, B Peng, ME Taylor, ML Littman, J Huang, ... Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2014), 2014	79	2014
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning S Iqbal, CAS de Witt, B Peng, W Böhmer, S Whiteson, F Sha 38th International Conference on Machine Learning (ICML 2021), 2021	73*	2021
Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control CS de Witt, B Peng (equal contribution), PA Kamienny, P Torr, W Böhmer, ... arXiv preprint arXiv:2003.06709, 2020	72	2020
A need for speed: Adapting agent action speed to improve task learning from non-expert humans B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor Autonomous Agents and Multiagent Systems (AAMAS 2016), 2016	56	2016
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning T Gupta, A Mahajan, B Peng, W Böhmer, S Whiteson 38th International Conference on Machine Learning (ICML 2021), 2021	48	2021
Optimistic Exploration even with a Pessimistic Initialisation T Rashid, B Peng, W Böhmer, S Whiteson International Conference on Learning Representations (ICLR 2020), 2020	46	2020
Learning something from nothing: Leveraging implicit human feedback strategies R Loftin, B Peng, J MacGlashan, ML Littman, ME Taylor, J Huang, ... The 23rd IEEE international symposium on robot and human interactive …, 2014	30	2014
Regularized Softmax Deep Multi-Agent Q-Learning L Pan, T Rashid, B Peng, L Huang, S Whiteson 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021	26*	2021
Training an agent to ground commands with reward and punishment J MacGlashan, M Littman, R Loftin, B Peng, D Roberts, M Taylor Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014	25	2014
Curriculum Design for Machine Learners in Sequential Decision Tasks B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor IEEE Transactions on Emerging Topics in Computational Intelligence 2 (4 …, 2018	18	2018
An empirical study of non-expert curriculum design for machine learners B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor Proceedings of the IJCAI Interactive Machine Learning Workshop, 2016	14	2016
Convergent Actor Critic by Humans J MacGlashan, ML Littman, DL Roberts, R Loftin, B Peng, ME Taylor International Conference on Intelligent Robots and Systems (IROS 2016), 2016	12	2016
Towards integrating real-time crowd advice with reinforcement learning GV de la Cruz, B Peng, WS Lasecki, ME Taylor Proceedings of the 20th International Conference on Intelligent User …, 2015	10	2015
Generating real-time crowd advice to improve reinforcement learning agents GV de la Cruz, B Peng, WS Lasecki, ME Taylor Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015	4	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors