Yi Wu

Cited by

	All	Since 2019
Citations	9311	8835
h-index	24	23
i10-index	33	32

Citations per year

Year	Citations
2017	114
2018	317
2019	667
2020	1032
2021	1495
2022	2061
2023	2730
2024	842

Public access

12 articles available
0 articles not available

Based on funding mandates

Co-authors

Aviv Tamar - Technion - Verified email at technion.ac.il
Stuart Russell - Professor of Computer Science, University of California, Berkeley - Verified email at cs.berkeley.edu
Yuandong Tian - Research Scientist, Meta AI (FAIR) - Verified email at fb.com
Yu Wang (汪玉) - Department of Electronic Engineering, Tsinghua University, China - Verified email at mail.tsinghua.edu.cn
Fei Fang - Carnegie Mellon University - Verified email at cmu.edu
Igor Mordatch - Google DeepMind - Verified email at google.com
Pieter Abbeel - UC Berkeley | Covariant - Verified email at cs.berkeley.edu
Huazhe Xu - Tsinghua University - Verified email at berkeley.edu
Xiaolong Wang - Assistant Professor, UC San Diego - Verified email at ucsd.edu
Ryan Lowe - OpenAI - Verified email at openai.com
Jean Harb - OpenAI - Verified email at openai.com
Akash Velu - Student, Stanford University - Verified email at stanford.edu
Eugene Vinitsky - Assistant Professor, NYU - Verified email at nyu.edu
Chao Yu (于超) - Tsinghua University - Verified email at mail.tsinghua.edu.cn
Georgia Gkioxari - Caltech - Verified email at caltech.edu
Yunfei Li - Tsinghua University - Verified email at mails.tsinghua.edu.cn
Shusheng Xu - IIIS, Tsinghua University - Verified email at mails.tsinghua.edu.cn
Alexandre Bayen - Professor Electrical Engineering and Computer Science, UC Berkeley - Verified email at berkeley.edu
Yuxin Wu - Verified email at google.com
Ingmar Kanitscheider - OpenAI - Verified email at openai.com

Yi Wu

Institute for Interdisciplinary Information Sciences, Tsinghua University

Verified email at mail.tsinghua.edu.cn - Homepage

Reinforcement Learning, Human-AI Interaction, Multi-Agent Learning, Robot Learning


Title	Cited by	Year
Multi-agent actor-critic for mixed cooperative-competitive environments. R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch. Advances in Neural Information Processing Systems 30, 2017	4577	2017
The surprising effectiveness of PPO in cooperative multi-agent games. C Yu, A Velu, E Vinitsky, J Gao, Y Wang, A Bayen, Y Wu. Advances in Neural Information Processing Systems 35, 24611-24624, 2022	883	2022
Emergent tool use from multi-agent autocurricula. B Baker, I Kanitscheider, T Markov, Y Wu, G Powell, B McGrew, ... arXiv preprint arXiv:1909.07528, 2019	788	2019
Value iteration networks. A Tamar, Y Wu, G Thomas, S Levine, P Abbeel. Advances in Neural Information Processing Systems 29, 2016	726	2016
Building generalizable agents with a realistic and rich 3D environment. Y Wu, Y Wu, G Gkioxari, Y Tian. arXiv preprint arXiv:1801.02209, 2018	367	2018
Robust multi-agent reinforcement learning via minimax deep deterministic policy gradient. S Li, Y Wu, X Cui, H Dong, F Fang, S Russell. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4213-4220, 2019	296	2019
Adversarial training for relation extraction. Y Wu, D Bamman, S Russell. Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017	242	2017
Multi-task reinforcement learning with soft modularization. R Yang, H Xu, Y Wu, X Wang. Advances in Neural Information Processing Systems 33, 4767-4777, 2020	166	2020
Influence-based multi-agent exploration. T Wang, J Wang, Y Wu, C Zhang. arXiv preprint arXiv:1910.05512, 2019	130	2019
Bayesian relational memory for semantic visual navigation. Y Wu, Y Wu, A Tamar, S Russell, G Gkioxari, Y Tian. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	121*	2019
Evolutionary population curriculum for scaling multi-agent reinforcement learning. Q Long, Z Zhou, A Gupta, F Fang, Y Wu, X Wang. arXiv preprint arXiv:2003.10423, 2020	104	2020
NovelD: A simple yet effective exploration criterion. T Zhang, H Xu, X Wang, Y Wu, K Keutzer, JE Gonzalez, Y Tian. Advances in Neural Information Processing Systems 34, 25217-25230, 2021	96*	2021
Deep reinforcement learning for green security games with real-time information. Y Wang, ZR Shi, L Yu, Y Wu, R Singh, L Joppa, F Fang. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 1401-1408, 2019	86	2019
Sequence level contrastive learning for text summarization. S Xu, X Zhang, Y Wu, F Wei. Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 11556 …, 2022	72	2022
Unsupervised extractive summarization by pre-training hierarchical transformers. S Xu, X Zhang, Y Wu, F Wei, M Zhou. arXiv preprint arXiv:2010.08242, 2020	53	2020
Discovering diverse multi-agent strategic behavior via reward randomization. Z Tang, C Yu, B Chen, H Xu, X Wang, F Fang, S Du, Y Wang, Y Wu. arXiv preprint arXiv:2103.04564, 2021	48	2021
Swift: Compiled inference for probabilistic programming languages. Y Wu, L Li, S Russell, R Bodik. arXiv preprint arXiv:1606.09242, 2016	40*	2016
Meta-learning MCMC proposals. T Wang, Y Wu, D Moore, SJ Russell. Advances in Neural Information Processing Systems 31, 2018	38	2018
Maximum entropy population-based training for zero-shot human-AI coordination. R Zhao, J Song, Y Yuan, H Hu, Y Gao, Y Wu, Z Sun, W Yang. Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 6145-6153, 2023	36	2023
Revisiting some common practices in cooperative multi-agent reinforcement learning. W Fu, C Yu, Z Xu, J Yang, Y Wu. arXiv preprint arXiv:2206.07505, 2022	36	2022
