Title | Authors | Venue | Cited by | Year |
Large Scale GAN Training for High Fidelity Natural Image Synthesis | A Brock, J Donahue, K Simonyan | arXiv preprint arXiv:1809.11096, 2018 | 6289 | 2018 |
Flamingo: a Visual Language Model for Few-Shot Learning | JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ... | arXiv preprint arXiv:2204.14198, 2022 | 3274 | 2022 |
Gemini: A family of highly capable multimodal models | Gemini Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... | arXiv preprint arXiv:2312.11805, 2023 | 2144 | 2023 |
Improving language models by retrieving from trillions of tokens | S Borgeaud, A Mensch, J Hoffmann, T Cai, E Rutherford, K Millican, ... | arXiv preprint arXiv:2112.04426, 2021 | 1004 | 2021 |
Perceiver: General Perception with Iterative Attention | A Jaegle, F Gimeno, A Brock, A Zisserman, O Vinyals, J Carreira | arXiv preprint arXiv:2103.03206, 2021 | 985 | 2021 |
SMASH: One-Shot Model Architecture Search through HyperNetworks | A Brock, T Lim, JM Ritchie, N Weston | arXiv preprint arXiv:1708.05344, 2017 | 896 | 2017 |
Skillful Precipitation Nowcasting using Deep Generative Models of Radar | S Ravuri, K Lenc, M Willson, D Kangin, R Lam, P Mirowski, M Fitzsimons, ... | arXiv preprint arXiv:2104.00954, 2021 | 822 | 2021 |
Generative and Discriminative Voxel Modeling with Convolutional Neural Networks | A Brock, T Lim, JM Ritchie, N Weston | arXiv preprint arXiv:1608.04236, 2016 | 693 | 2016 |
High-Performance Large-Scale Image Recognition Without Normalization | A Brock, S De, SL Smith, K Simonyan | arXiv preprint arXiv:2102.06171, 2021 | 603 | 2021 |
Perceiver IO: A General Architecture for Structured Inputs & Outputs | A Jaegle, S Borgeaud, JB Alayrac, C Doersch, C Ionescu, D Ding, ... | arXiv preprint arXiv:2107.14795, 2021 | 591 | 2021 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... | arXiv preprint arXiv:2403.05530, 2024 | 585 | 2024 |
Neural Photo Editing with Introspective Adversarial Networks | A Brock, T Lim, JM Ritchie, N Weston | arXiv preprint arXiv:1609.07093, 2016 | 574 | 2016 |
FreezeOut: Accelerate Training by Progressively Freezing Layers | A Brock, T Lim, JM Ritchie, N Weston | arXiv preprint arXiv:1706.04983, 2017 | 152 | 2017 |
Characterizing signal propagation to close the performance gap in unnormalized ResNets | A Brock, S De, SL Smith | arXiv preprint arXiv:2101.08692, 2021 | 139 | 2021 |
Implicit weight uncertainty in neural networks | N Pawlowski, A Brock, MCH Lee, M Rajchl, B Glocker | arXiv preprint arXiv:1711.01297, 2017 | 122 | 2017 |
BYOL works even without batch statistics | PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ... | arXiv preprint arXiv:2010.10241, 2020 | 108 | 2020 |
Evolving Normalization-Activation Layers | H Liu, A Brock, K Simonyan, QV Le | arXiv preprint arXiv:2004.02967, 2020 | 103 | 2020 |
Towards Learning Universal Audio Representations | L Wang, P Luc, Y Wu, A Recasens, L Smaira, A Brock, A Jaegle, ... | arXiv preprint arXiv:2111.12124, 2021 | 76 | 2021 |
Training generative adversarial networks by solving ordinary differential equations | C Qin, Y Wu, JT Springenberg, A Brock, J Donahue, T Lillicrap, P Kohli | Advances in Neural Information Processing Systems 33, 5599-5609, 2020 | 40 | 2020 |
Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation | B He, J Martens, G Zhang, A Botev, A Brock, SL Smith, YW Teh | International Conference on Learning Representations | 33* | |