Follow
Jordan Hoffmann
Jordan Hoffmann
Microsoft AI
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Training compute-optimal large language models
J Hoffmann, S Borgeaud, A Mensch, E Buchatskaya, T Cai, E Rutherford, ...
arXiv preprint arXiv:2203.15556, 2022
1285*2022
Scaling language models: Methods, analysis & insights from training gopher
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
887*2021
Infograph: Unsupervised and semi-supervised graph-level representation learning via mutual information maximization
FY Sun, J Hoffmann, V Verma, J Tang
arXiv preprint arXiv:1908.01000, 2019
8622019
Improving language models by retrieving from trillions of tokens
S Borgeaud, A Mensch, J Hoffmann, T Cai, E Rutherford, K Millican, ...
International conference on machine learning, 2206-2240, 2022
6532022
Recurrent independent mechanisms
A Goyal, A Lamb, J Hoffmann, S Sodhani, S Levine, Y Bengio, ...
arXiv preprint arXiv:1909.10893, 2019
3252019
Reconnaissance of the HR 8799 exosolar system. II. Astrometry and orbital motion
L Pueyo, R Soummer, J Hoffmann, R Oppenheimer, JR Graham, ...
The Astrophysical Journal 803 (1), 31, 2015
1182015
vgraph: A generative model for joint community detection and node representation learning
FY Sun, M Qu, J Hoffmann, CW Huang, J Tang
Advances in Neural Information Processing Systems 32, 2019
972019
Unified scaling laws for routed language models
A Clark, D de Las Casas, A Guy, A Mensch, M Paganini, J Hoffmann, ...
International conference on machine learning, 4057-4086, 2022
87*2022
Data-driven approach to encoding and decoding 3-d crystal structures
J Hoffmann, L Maestrati, Y Sawada, J Tang, JM Sellier, Y Bengio
arXiv preprint arXiv:1909.00949, 2019
682019
Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
J Hoffmann, Y Bar-Sinai, LM Lee, J Andrejevic, S Mishra, SM Rubinstein, ...
Science advances 5 (4), eaau6792, 2019
682019
An empirical analysis of compute-optimal large language model training
J Hoffmann, S Borgeaud, A Mensch, E Buchatskaya, T Cai, E Rutherford, ...
Advances in Neural Information Processing Systems 35, 30016-30030, 2022
642022
Ion correlations in nanofluidic channels: Effects of ion size, valence, and concentration on voltage-and pressure-driven currents
J Hoffmann, D Gillespie
Langmuir 29 (4), 1303-1317, 2013
562013
A simple developmental model recapitulates complex insect wing venation patterns
J Hoffmann, S Donoughe, K Li, MK Salcedo, CH Rycroft
Proceedings of the National Academy of Sciences 115 (40), 9905-9910, 2018
422018
A systematic investigation of commonsense knowledge in large language models
XL Li, A Kuncoro, J Hoffmann, CM d'Autume, P Blunsom, A Nematzadeh
arXiv preprint arXiv:2111.00607, 2021
372021
Computational analysis of size, shape and structure of insect wings
MK Salcedo, J Hoffmann, S Donoughe, L Mahadevan
Biology Open 8 (10), bio040774, 2019
362019
The role of negative selection in protein evolution revealed through the energetics of the native state ensemble
J Hoffmann, JO Wrabl, VJ Hilser
Proteins: Structure, Function, and Bioinformatics 84 (4), 435-447, 2016
202016
Nuclear speed and cycle length co-vary with local density during syncytial blastoderm formation in a cricket
S Donoughe, J Hoffmann, T Nakamura, CH Rycroft, CG Extavour
Nature communications 13 (1), 3889, 2022
19*2022
Training compute-optimal large language models. arXiv
J Hoffmann, S Borgeaud, A Mensch, E Buchatskaya, T Cai, E Rutherford, ...
arXiv preprint arXiv:2203.15556, 2022
172022
Training compute-optimal large language models. arXiv 2022
J Hoffmann, S Borgeaud, A Mensch, E Buchatskaya, T Cai, E Rutherford, ...
arXiv preprint arXiv:2203.15556 10, 2022
142022
Scaling language models: Methods, analysis & insights from training gopher. arXiv
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
Preprint posted online on December 1, 2021
132021
The system can't perform the operation now. Try again later.
Articles 1–20