Vinicius Zambaldi
Vinicius Zambaldi
Google Deepmind
Verified email at google.com
Title
Cited by
Cited by
Year
Relational inductive biases, deep learning, and graph networks
PW Battaglia, JB Hamrick, V Bapst, A Sanchez-Gonzalez, V Zambaldi, ...
arXiv preprint arXiv:1806.01261, 2018
6582018
Multi-agent reinforcement learning in sequential social dilemmas
JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel
arXiv preprint arXiv:1702.03037, 2017
2672017
A unified game-theoretic approach to multiagent reinforcement learning
M Lanctot, V Zambaldi, A Gruslys, A Lazaridou, K Tuyls, J Pérolat, D Silver, ...
Advances in neural information processing systems, 4190-4203, 2017
1972017
Deep reinforcement learning with relational inductive biases
V Zambaldi, D Raposo, A Santoro, V Bapst, Y Li, I Babuschkin, K Tuyls, ...
International Conference on Learning Representations, 2018
131*2018
Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward.
P Sunehag, G Lever, A Gruslys, WM Czarnecki, VF Zambaldi, ...
AAMAS, 2085-2087, 2018
882018
Value-decomposition networks for cooperative multi-agent learning
P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ...
arXiv preprint arXiv:1706.05296, 2017
812017
Dawn of the selfie era: The whos, wheres, and hows of selfies on Instagram
F Souza, D de Las Casas, V Flores, SB Youn, M Cha, D Quercia, ...
Proceedings of the 2015 ACM on conference on online social networks, 221-231, 2015
752015
A multi-agent reinforcement learning model of common-pool resource appropriation
J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel
Advances in Neural Information Processing Systems, 3643-3652, 2017
692017
Actor-critic policy optimization in partially observable multiagent environments
S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ...
Advances in neural information processing systems, 3422-3435, 2018
572018
Relational forward models for multi-agent learning
A Tacchetti, HF Song, PAM Mediano, V Zambaldi, NC Rabinowitz, ...
arXiv preprint arXiv:1809.11044, 2018
222018
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
132019
Lightweight contextual ranking of city pictures: urban sociology to the rescue
VF Zambaldi, JP Pesce, D Quercia, V Almeida
Eighth International AAAI Conference on Weblogs and Social Media, 2014
132014
CompILE: Compositional imitation learning and execution
T Kipf, Y Li, H Dai, V Zambaldi, A Sanchez-Gonzalez, E Grefenstette, ...
International Conference on Machine Learning, 3418-3428, 2019
122019
Compositional imitation learning: Explaining and executing one task at a time
T Kipf, Y Li, H Dai, V Zambaldi, E Grefenstette, P Kohli, P Battaglia
arXiv preprint arXiv:1812.01483, 2018
92018
Memo: A deep network for flexible combination of episodic memories
A Banino, AP Badia, R Köster, MJ Chadwick, V Zambaldi, D Hassabis, ...
arXiv preprint arXiv:2001.10913, 2020
42020
Deep Learning Monitor
CT Page, M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, ...
Nature Communications 11 (1), 1760, 2020
2020
Reinforcement learning using a relational network for generating data encoding relationships between entities in an environment
Y Li, VC Bapst, V Zambaldi, DN Raposo, AA Santoro
US Patent App. 16/417,580, 2019
2019
使用 NLP 预测电影类型-多标签...
PW Battaglia, JB Hamrick, V Bapst, A Sanchez-Gonzalez, V Zambaldi, ...
The system can't perform the operation now. Try again later.
Articles 1–18