Deep speech 2: End-to-end speech recognition in english and mandarin D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ... International conference on machine learning, 173-182, 2016 | 3780 | 2016 |
Deep Speech: Scaling up end-to-end speech recognition A Hannun arXiv preprint arXiv:1412.5567, 2014 | 2699 | 2014 |
Megatron-lm: Training multi-billion parameter language models using model parallelism M Shoeybi, M Patwary, R Puri, P LeGresley, J Casper, B Catanzaro arXiv preprint arXiv:1909.08053, 2019 | 1646 | 2019 |
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1500 | 2023 |
Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 597 | 2022 |
Efficient large-scale language model training on gpu clusters using megatron-lm D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ... Proceedings of the International Conference for High Performance Computing …, 2021 | 572 | 2021 |
An effective hybrid transactional memory system with strong isolation guarantees CC Minh, M Trautmann, JW Chung, A McDonald, N Bronson, J Casper, ... Proceedings of the 34th annual international symposium on Computer …, 2007 | 463 | 2007 |
A practical concurrent binary search tree NG Bronson, J Casper, H Chafi, K Olukotun ACM Sigplan Notices 45 (5), 257-268, 2010 | 307 | 2010 |
The vector-thread architecture R Krashinsky, C Batten, M Hampton, S Gerding, B Pharris, J Casper, ... ACM SIGARCH Computer Architecture News 32 (2), 52, 2004 | 272 | 2004 |
Hardware acceleration of database operations J Casper, K Olukotun Proceedings of the 2014 ACM/SIGDA international symposium on Field …, 2014 | 229 | 2014 |
A scalable, non-blocking approach to transactional memory H Chafi, J Casper, BD Carlstrom, A McDonald, CC Minh, W Baek, ... 2007 IEEE 13th International Symposium on High Performance Computer …, 2007 | 188 | 2007 |
Reducing activation recomputation in large transformer models VA Korthikanti, J Casper, S Lym, L McAfee, M Andersch, M Shoeybi, ... Proceedings of Machine Learning and Systems 5, 341-353, 2023 | 169 | 2023 |
Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, and Bryan Catanzaro S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... Using deepspeed and megatron to train megatron-turing nlg 530b, a large …, 2022 | 131 | 2022 |
Eigenbench: A simple exploration tool for orthogonal TM characteristics S Hong, T Oguntebi, J Casper, N Bronson, C Kozyrakis, K Olukotun IEEE International Symposium on Workload Characterization (IISWC'10), 1-11, 2010 | 97 | 2010 |
A practical FPGA-based framework for novel CMP research S Wee, J Casper, N Njoroge, Y Tesylar, D Ge, C Kozyrakis, K Olukotun Proceedings of the 2007 ACM/SIGDA 15th international symposium on Field …, 2007 | 95 | 2007 |
Atlas: A chip-multiprocessor with transactional memory support N Njoroge, J Casper, S Wee, Y Teslyar, D Ge, C Kozyrakis, K Olukotun 2007 Design, Automation & Test in Europe Conference & Exhibition, 1-6, 2007 | 87 | 2007 |
Systems and methods for speech transcription A Hannun, C Case, J Casper, B Catanzaro, G Diamos, E Elsen, ... US Patent 10,540,957, 2020 | 75 | 2020 |
Transactional predication: high-performance concurrent sets and maps for stm NG Bronson, J Casper, H Chafi, K Olukotun Proceedings of the 29th ACM SIGACT-SIGOPS symposium on Principles of …, 2010 | 68 | 2010 |
Deep speech: Scaling up end-to-end speech recognition. arXiv 2014 A Hannun, C Case, J Casper, B Catanzaro, G Diamos, E Elsen, ... arXiv preprint arXiv:1412.5567, 2014 | 62 | 2014 |
Hardware acceleration of transactional memory on commodity systems J Casper, T Oguntebi, S Hong, NG Bronson, C Kozyrakis, K Olukotun ACM SIGPLAN Notices 46 (3), 27-38, 2011 | 41 | 2011 |