Follow
Beidi Chen
Beidi Chen
Verified email at andrew.cmu.edu
Title
Cited by
Cited by
Year
SLIDE: In Defense of Smart Algorithms over Hardware Acceleration for Large-scale Deep Learning Systems
B Chen, T Medini, J Farwell, S Gobriel, C Tai, A Shrivastava
Proceedings of Machine Learning and System 2, 291--306, 2020
1102020
Analyzing log analysis: An empirical study of user log mining
S Alspaugh, B Chen, J Lin, A Ganapathi, M Hearst, R Katz
28th Large Installation System Administration Conference (LISA14), 62-77, 2014
622014
MONGOOSE: A learnable LSH framework for efficient neural network training
B Chen, Z Liu, B Peng, Z Xu, JL Li, T Dao, Z Song, A Shrivastava, C Re
International Conference on Learning Representations, 2021
612021
Scatterbrain: Unifying sparse and low-rank attention
B Chen, T Dao, E Winsor, Z Song, A Rudra, C Ré
Advances in Neural Information Processing Systems 34, 17413-17426, 2021
59*2021
Angular visual hardness
B Chen, W Liu, Z Yu, J Kautz, A Shrivastava, A Garg, A Anandkumar
International Conference on Machine Learning, 1637-1648, 2020
442020
Pixelated butterfly: Simple and efficient sparse training for neural network models
B Chen, T Dao, K Liang, J Yang, Z Song, A Rudra, C Re
International Conference on Learning Representations, 2022
42*2022
Fast and accurate stochastic gradient estimation
B Chen, Y Xu, A Shrivastava
Advances in Neural Information Processing Systems 32, 2019
42*2019
Monarch: Expressive structured matrices for efficient and accurate training
T Dao, B Chen, NS Sohoni, A Desai, M Poli, J Grogan, A Liu, A Rao, ...
International Conference on Machine Learning, 4690-4721, 2022
412022
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Y Sheng, L Zheng, B Yuan, Z Li, M Ryabinin, B Chen, P Liang, C Re, ...
International Conference on Machine Learning, 2023
34*2023
Unique entity estimation with application to the Syrian conflict
B Chen, A Shrivastava, RC Steorts
The Annals of Applied Statistics 12 (2), 1039-1067, 2018
322018
Decentralized training of foundation models in heterogeneous environments
B Yuan, Y He, JQ Davis, T Zhang, T Dao, B Chen, P Liang, C Re, C Zhang
Neural Information Processing Systems., 2022
272022
Sub-linear privacy-preserving near-neighbor search
MS Riazi, B Chen, A Shrivastava, D Wallach, F Koushanfar
arXiv preprint arXiv:1612.01835, 2016
26*2016
Deja vu: Contextual sparsity for efficient llms at inference time
Z Liu, J Wang, T Dao, T Zhou, B Yuan, Z Song, A Shrivastava, C Zhang, ...
International Conference on Machine Learning, 22137-22176, 2023
192023
Densified winner take all (WTA) hashing for sparse datasets
B Chen, A Shrivastava
Uncertainty in artificial intelligence, 2018
19*2018
Locality sensitive teaching
Z Xu, B Chen, C Li, W Liu, L Song, Y Lin, A Shrivastava
Advances in Neural Information Processing Systems 34, 18049-18062, 2021
152021
A tale of two efficient and informative negative sampling distributions
S Daghaghi, T Medini, N Meisburger, B Chen, M Zhao, A Shrivastava
International Conference on Machine Learning, 2319-2329, 2021
102021
H O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Z Zhang, Y Sheng, T Zhou, T Chen, L Zheng, R Cai, Z Song, Y Tian, C Ré, ...
International Conference on Machine Learning, 2023
82023
Fine-tuning language models over slow networks using activation compression with guarantees
J Wang, B Yuan, L Rimanic, Y He, T Dao, B Chen, C Re, C Zhang
arXiv preprint arXiv:2206.01299, 2022
8*2022
HALOS: Hashing Large Output Space for Cheap Inference
Z Liu, Z Xu, A Ji, J Zhang, J Li, B Chen, A Shrivastava
Proceedings of Machine Learning and Systems 4, 110-125, 2022
8*2022
Efficient streaming language models with attention sinks
G Xiao, Y Tian, B Chen, S Han, M Lewis
arXiv preprint arXiv:2309.17453, 2023
72023
The system can't perform the operation now. Try again later.
Articles 1–20