Flamingo: a visual language model for few-shot learning JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ... Advances in neural information processing systems 35, 23716-23736, 2022 | 1881 | 2022 |
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 1291* | 2023 |
Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021 | 746 | 2021 |
Improving language models by retrieving from trillions of tokens S Borgeaud, A Mensch, J Hoffmann, T Cai, E Rutherford, K Millican, ... International conference on machine learning, 2206-2240, 2022 | 644 | 2022 |
Automated curriculum learning for neural networks A Graves, MG Bellemare, J Menick, R Munos, K Kavukcuoglu international conference on machine learning, 1311-1320, 2017 | 586 | 2017 |
Multimodal few-shot learning with frozen language models M Tsimpoukelli, JL Menick, S Cabi, SM Eslami, O Vinyals, F Hill Advances in Neural Information Processing Systems 34, 200-212, 2021 | 547 | 2021 |
Rigging the lottery: Making all tickets winners U Evci, T Gale, J Menick, PS Castro, E Elsen International conference on machine learning, 2943-2952, 2020 | 483 | 2020 |
Chatgpt: Optimizing language models for dialogue J Schulman, B Zoph, C Kim, J Hilton, J Menick, J Weng, JFC Uribe, ... OpenAI blog 2, 4, 2022 | 340 | 2022 |
Generating high fidelity images with subscale pixel networks and multidimensional upscaling J Menick, N Kalchbrenner arXiv preprint arXiv:1812.01608, 2018 | 139 | 2018 |
Teaching language models to support answers with verified quotes J Menick, M Trebacz, V Mikulik, J Aslanides, F Song, M Chadwick, ... arXiv preprint arXiv:2203.11147, 2022 | 137 | 2022 |
Multiplicative interactions and where to find them SM Jayakumar, WM Czarnecki, J Menick, J Schwarz, J Rae, S Osindero, ... | 119 | 2020 |
Generating images with sparse representations C Nash, J Menick, S Dieleman, PW Battaglia arXiv preprint arXiv:2103.03841, 2021 | 113 | 2021 |
Cyprien de Masson d’Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew J JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, HF Song, J Aslanides, ... Johnson, Blake A. Hechtman, Laura Weidinger, Iason Gabriel, William S. Isaac …, 2021 | 47 | 2021 |
A practical sparse approximation for real time recurrent learning J Menick, E Elsen, U Evci, S Osindero, K Simonyan, A Graves arXiv preprint arXiv:2006.07232, 2020 | 46* | 2020 |
Noisy networks for exploration. arXiv 2017 M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ... arXiv preprint arXiv:1706.10295, 0 | 39 | |
Introducing chatgpt J Schulman, B Zoph, C Kim, J Hilton, J Menick, J Weng, JFC Uribe, ... OpenAI Blog, 2022 | 28 | 2022 |
Associative compression networks for representation learning A Graves, J Menick, A Oord arXiv preprint arXiv:1804.02476, 2018 | 19 | 2018 |
Data compression using jointly trained encoder, decoder, and prior neural networks JL Menick, AB Graves US Patent App. 16/767,010, 2021 | 14 | 2021 |
Alethea Power, Stanislas Polu, Jesse Han, Raul Puri, Shawn Jain J Schulman, B Zoph, C Kim, J Hilton, J Menick, J Weng, JFC Uribe, ... Benjamin Chess, Christian Gibson, Oleg Boiko, Emy Parparita, Amin …, 2022 | 8 | 2022 |
Noisy neural network layers O Pietquin, JL Menick, MG Azar, B Piot, V Mnih, C Blundell, M Fortunato, ... US Patent App. 16/439,026, 2019 | 5 | 2019 |