Deep speech 2: End-to-end speech recognition in english and mandarin D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ... International conference on machine learning, 173-182, 2016 | 3619 | 2016 |
Megatron-lm: Training multi-billion parameter language models using model parallelism M Shoeybi, M Patwary, R Puri, P LeGresley, J Casper, B Catanzaro arXiv preprint arXiv:1909.08053, 2019 | 1328 | 2019 |
Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 502 | 2022 |
Efficient large-scale language model training on gpu clusters using megatron-lm D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ... Proceedings of the International Conference for High Performance Computing …, 2021 | 420 | 2021 |
Large calculation of the flow over a hypersonic vehicle using a GPU E Elsen, P LeGresley, E Darve Journal of Computational Physics 227 (24), 10148-10161, 2008 | 294 | 2008 |
Investigation of non-linear projection for POD based reduced order models for aerodynamics P LeGresley, J Alonso 39th aerospace sciences meeting and exhibit, 926, 2001 | 201 | 2001 |
Airfoil design optimization using reduced order models based on proper orthogonal decomposition P LeGresley, J Alonso Fluids 2000 conference and exhibit, 2545, 2000 | 192 | 2000 |
Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, and Bryan Catanzaro S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... Using deepspeed and megatron to train megatron-turing nlg 530b, a large …, 2022 | 120 | 2022 |
Aircraft design optimization JJ Alonso, P LeGresley, V Pereyra Mathematics and Computers in Simulation 79 (6), 1948-1958, 2009 | 97 | 2009 |
Application of proper orthogonal decomposition (POD) to design decomposition methods PA LeGresley Stanford University, 2006 | 81 | 2006 |
pyMDO: A framework for high-fidelity multi-disciplinary optimization J Alonso, P LeGresley, E Van Der Weide, JRRA Martins, J Reuther 10th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference, 4480, 2004 | 64 | 2004 |
Dynamic domain decomposition and error correction for reduced order models P LeGresley, J Alonso 41st Aerospace Sciences Meeting and Exhibit, 250, 2003 | 62 | 2003 |
Chimps: A high-performance scalable module for multi-physics simulations J Alonso, S Hahn, F Ham, M Herrmann, G Iaccarino, G Kalitzin, ... 42nd AIAA/ASME/SAE/ASEE Joint Propulsion Conference & Exhibit, 5274, 2006 | 60 | 2006 |
Improving the performance of design decomposition methods with POD P LeGresley, J Alonso 10th AIAA/ISSMO multidisciplinary analysis and optimization conference, 4465, 2004 | 41 | 2004 |
GPU Enhancement of the Trigger to Extend Physics Reach at the LHC V Halyo, A Hunt, P Jindal, P LeGresley, P Lujan Journal of Instrumentation 8 (10), P10005, 2013 | 36 | 2013 |
First evaluation of the CPU, GPGPU and MIC architectures for real time particle tracking based on Hough transform at the LHC VHV Halyo, P LeGresley, P Lujan, V Karpusenko, A Vladimirov Journal of Instrumentation 9 (04), P04005, 2014 | 35 | 2014 |
High performance computing with CUDA M Fatica, P LeGresley, I Buck, J Stone, J Phillips, S Morton, P Micikevicius SC08, 2008 | 33 | 2008 |
Massively parallel computing and the search for jets and black holes at the LHC V Halyo, P LeGresley, P Lujan Nuclear Instruments and Methods in Physics Research Section A: Accelerators …, 2014 | 15 | 2014 |
Neural odes for image segmentation with level sets R Valle, F Reda, M Shoeybi, P Legresley, A Tao, B Catanzaro arXiv preprint arXiv:1912.11683, 2019 | 6 | 2019 |
GPU Enhancement of the Trigger to Extend Physics Reach at the LHC P Lujan, V Halyo, A Hunt, P Jindal, P LeGresley Journal of Physics: Conference Series 513 (1), 012019, 2014 | 4 | 2014 |