Nexus: a GPU cluster engine for accelerating DNN-based video analysis H Shen, L Chen, Y Jin, L Zhao, B Kong, M Philipose, A Krishnamurthy, ... Proceedings of the 27th ACM Symposium on Operating Systems Principles, 322-337, 2019 | 107 | 2019 |
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving Y Zhao, CY Lin, K Zhu, Z Ye, L Chen, S Zheng, L Ceze, A Krishnamurthy, ... arXiv preprint arXiv:2310.19102, 2023 | 16 | 2023 |
Enabling Strong Database Integrity using Trusted Execution Environments K Mast, L Chen, EG Sirer arXiv preprint arXiv:1801.01618, 2018 | 9 | 2018 |
Punica: Multi-Tenant LoRA Serving L Chen, Z Ye, Y Wu, D Zhuo, L Ceze, A Krishnamurthy arXiv preprint arXiv:2310.18547, 2023 | 7 | 2023 |
A Vision for Autonomous Blockchains backed by Secure Hardware K Mast, L Chen, EG Sirer Proceedings of the 4th Workshop on System Software for Trusted Execution, 1, 2019 | 6 | 2019 |
Scaling databases through trusted hardware proxies K Mast, L Chen, EG Sirer Proceedings of the 2nd Workshop on System Software for Trusted Execution, 1-6, 2017 | 5 | 2017 |
ADARES: Adaptive Resource Management for Virtual Machines I Cano, L Chen, P Fonseca, T Chen, C Cheah, K Gupta, R Chandra, ... arXiv preprint arXiv:1812.01837, 2018 | 4 | 2018 |
Symphony: Optimized Model Serving using Centralized Orchestration L Chen, W Deng, A Canumalla, Y Xin, M Philipose, A Krishnamurthy arXiv preprint arXiv:2308.07470, 2023 | 2 | 2023 |
Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native Y Lu, S Bian, L Chen, Y He, Y Hui, M Lentz, B Li, F Liu, J Li, Q Liu, R Liu, ... arXiv preprint arXiv:2401.12230, 2024 | 1 | 2024 |