Beidi Chen

Cited by

	All	Since 2019
Citations	1182	1130
h-index	18	18
i10-index	23	23

420

210

105

315

2016201720182019202020212022202320247 13 18 29 61 113 148 412 366

Public access

View all

14 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Christopher RéComputer Science, Stanford UniversityVerified email at cs.stanford.edu
Anshumali ShrivastavaRice University, ThirdAI Corp.Verified email at rice.edu
Anima AnandkumarCalifornia Institute of Technology and NVIDIAVerified email at caltech.edu
Randy KatzUniversity of California, BerkeleyVerified email at cs.Berkeley.edu
Dan S. WallachProfessor, Rice University, Department of Computer ScienceVerified email at cs.rice.edu
Farinaz KoushanfarProfessor and Henry Booker Faculty Scholar of ECE, UC San DiegoVerified email at ucsd.edu
Sadegh RiaziTechnology Enthusiast | Founder CEO at Pyte (FKA CipherMode) | PhD UCSD, Microsoft ResearchVerified email at ucsd.edu
Sara AlspaughUniversity of California, BerkeleyVerified email at eecs.berkeley.edu
Rebecca SteortsDuke UniversityVerified email at stat.duke.edu
Kaifei ChenSoftware Engineer, WaymoVerified email at berkeley.edu
David CULLERUniversity of California, BerkeleyVerified email at berkeley.edu

Beidi Chen

Carnegie Mellon University

Verified email at andrew.cmu.edu

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU Y Sheng, L Zheng, B Yuan, Z Li, M Ryabinin, B Chen, P Liang, C Re, ... International Conference on Machine Learning, 2023	123	2023
SLIDE: In Defense of Smart Algorithms over Hardware Acceleration for Large-scale Deep Learning Systems B Chen, T Medini, J Farwell, S Gobriel, C Tai, A Shrivastava Proceedings of Machine Learning and System 2, 291--306, 2020	123	2020
Efficient streaming language models with attention sinks G Xiao, Y Tian, B Chen, S Han, M Lewis arXiv preprint arXiv:2309.17453, 2023	89	2023
Scatterbrain: Unifying sparse and low-rank attention B Chen, T Dao, E Winsor, Z Song, A Rudra, C Ré Advances in Neural Information Processing Systems 34, 17413-17426, 2021	80*	2021
Deja vu: Contextual sparsity for efficient llms at inference time Z Liu, J Wang, T Dao, T Zhou, B Yuan, Z Song, A Shrivastava, C Zhang, ... International Conference on Machine Learning, 22137-22176, 2023	76	2023
MONGOOSE: A learnable LSH framework for efficient neural network training B Chen, Z Liu, B Peng, Z Xu, JL Li, T Dao, Z Song, A Shrivastava, C Re International Conference on Learning Representations, 2021	71	2021
Analyzing log analysis: An empirical study of user log mining S Alspaugh, B Chen, J Lin, A Ganapathi, M Hearst, R Katz 28th Large Installation System Administration Conference (LISA14), 62-77, 2014	65	2014
Monarch: Expressive structured matrices for efficient and accurate training T Dao, B Chen, NS Sohoni, A Desai, M Poli, J Grogan, A Liu, A Rao, ... International Conference on Machine Learning, 4690-4721, 2022	56	2022
Pixelated butterfly: Simple and efficient sparse training for neural network models B Chen, T Dao, K Liang, J Yang, Z Song, A Rudra, C Re International Conference on Learning Representations, 2022	54*	2022
H O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models Z Zhang, Y Sheng, T Zhou, T Chen, L Zheng, R Cai, Z Song, Y Tian, C Ré, ... International Conference on Machine Learning, 2023	52	2023
Decentralized training of foundation models in heterogeneous environments B Yuan, Y He, JQ Davis, T Zhang, T Dao, B Chen, P Liang, C Re, C Zhang Neural Information Processing Systems., 2022	48	2022
Angular visual hardness B Chen, W Liu, Z Yu, J Kautz, A Shrivastava, A Garg, A Anandkumar International Conference on Machine Learning, 1637-1648, 2020	47	2020
Fast and accurate stochastic gradient estimation B Chen, Y Xu, A Shrivastava Advances in Neural Information Processing Systems 32, 2019	47*	2019
Unique entity estimation with application to the Syrian conflict B Chen, A Shrivastava, RC Steorts The Annals of Applied Statistics 12 (2), 1039-1067, 2018	36	2018
Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer Y Tian, Y Wang, B Chen, S Du International Conference on Machine Learning, 2023	30	2023
Sub-linear privacy-preserving near-neighbor search MS Riazi, B Chen, A Shrivastava, D Wallach, F Koushanfar arXiv preprint arXiv:1612.01835, 2016	26*	2016
Densified winner take all (WTA) hashing for sparse datasets B Chen, A Shrivastava Uncertainty in artificial intelligence, 2018	23*	2018
Locality sensitive teaching Z Xu, B Chen, C Li, W Liu, L Song, Y Lin, A Shrivastava Advances in Neural Information Processing Systems 34, 18049-18062, 2021	18	2021
CocktailSGD: Fine-tuning foundation models over 500Mbps networks J Wang, Y Lu, B Yuan, B Chen, P Liang, C De Sa, C Re, C Zhang International Conference on Machine Learning, 36058-36076, 2023	15	2023
Compress, then prompt: Improving accuracy-efficiency trade-off of llm inference with transferable prompt Z Xu, Z Liu, B Chen, Y Tang, J Wang, K Zhou, X Hu, A Shrivastava arXiv preprint arXiv:2305.11186, 2023	15	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors