Yi Wu
Institute for Interdisciplinary Information Sciences, Tsinghua University
Verified email at mail.tsinghua.edu.cn
Title
Cited by
Year
Multi-agent actor-critic for mixed cooperative-competitive environments
R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch
arXiv preprint arXiv:1706.02275, 2017
Cited by: 1585, Year: 2017
Value iteration networks
A Tamar, Y Wu, G Thomas, S Levine, P Abbeel
arXiv preprint arXiv:1602.02867, 2016
Cited by: 511, Year: 2016
Emergent tool use from multi-agent autocurricula
B Baker, I Kanitscheider, T Markov, Y Wu, G Powell, B McGrew, ...
arXiv preprint arXiv:1909.07528, 2019
Cited by: 256, Year: 2019
Building generalizable agents with a realistic and rich 3D environment
Y Wu, Y Wu, G Gkioxari, Y Tian
arXiv preprint arXiv:1801.02209, 2018
Cited by: 237, Year: 2018
Adversarial training for relation extraction
Y Wu, D Bamman, S Russell
Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017
Cited by: 172, Year: 2017
Robust multi-agent reinforcement learning via minimax deep deterministic policy gradient
S Li, Y Wu, X Cui, H Dong, F Fang, S Russell
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4213-4220, 2019
Cited by: 100, Year: 2019
Bayesian relational memory for semantic visual navigation
Y Wu, Y Wu, A Tamar, S Russell, G Gkioxari, Y Tian
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 47*, Year: 2019
Deep reinforcement learning for green security games with real-time information
Y Wang, ZR Shi, L Yu, Y Wu, R Singh, L Joppa, F Fang
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 1401-1408, 2019
Cited by: 39, Year: 2019
Swift: Compiled inference for probabilistic programming languages
Y Wu, L Li, S Russell, R Bodik
arXiv preprint arXiv:1606.09242, 2016
Cited by: 32*, Year: 2016
Influence-based multi-agent exploration
T Wang, J Wang, Y Wu, C Zhang
arXiv preprint arXiv:1910.05512, 2019
Cited by: 29, Year: 2019
Meta-learning MCMC proposals
T Wang, Y Wu, DA Moore, SJ Russell
arXiv preprint arXiv:1708.06040, 2017
Cited by: 27, Year: 2017
Multi-task reinforcement learning with soft modularization
R Yang, H Xu, Y Wu, X Wang
arXiv preprint arXiv:2003.13661, 2020
Cited by: 20, Year: 2020
Discrete-continuous mixtures in probabilistic programming: Generalized semantics and inference algorithms
Y Wu, S Srivastava, N Hay, S Du, S Russell
International Conference on Machine Learning, 5343-5352, 2018
Cited by: 18, Year: 2018
Understanding and evaluating sparse linear discriminant analysis
Y Wu, D Wipf, JM Yun
Artificial Intelligence and Statistics, 1070-1078, 2015
Cited by: 18, Year: 2015
Evolutionary population curriculum for scaling multi-agent reinforcement learning
Q Long, Z Zhou, A Gupta, F Fang, Y Wu, X Wang
arXiv preprint arXiv:2003.10423, 2020
Cited by: 17, Year: 2020
Dual-space analysis of the sparse linear model
D Wipf, Y Wu
arXiv preprint arXiv:1207.2422, 2012
Cited by: 16, Year: 2012
The surprising effectiveness of MAPPO in cooperative, multi-agent games
C Yu, A Velu, E Vinitsky, Y Wang, A Bayen, Y Wu
arXiv preprint arXiv:2103.01955, 2021
Cited by: 15, Year: 2021
Unsupervised extractive summarization by pre-training hierarchical transformers
S Xu, X Zhang, Y Wu, F Wei, M Zhou
arXiv preprint arXiv:2010.08242, 2020
Cited by: 7, Year: 2020
A nearly-black-box online algorithm for joint parameter and state estimation in temporal models
YB Erol, Y Wu, L Li, S Russell
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Cited by: 5*, Year: 2017
BeBold: Exploration Beyond the Boundary of Explored Regions
T Zhang, H Xu, X Wang, Y Wu, K Keutzer, JE Gonzalez, Y Tian
arXiv preprint arXiv:2012.08621, 2020
Cited by: 4, Year: 2020