Volgen
Zeyuan Allen-Zhu (朱澤園)
Zeyuan Allen-Zhu (朱澤園)
Andere namenZeyuan Allen Zhu
Meta FAIR Labs
Geverifieerd e-mailadres voor csail.mit.edu - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
A convergence theory for deep learning via over-parameterization
Z Allen-Zhu, Y Li, Z Song
ICML 2019: International Conference on Machine Learning, 2019
10742019
Is Q-learning Provably Efficient?
C Jin, Z Allen-Zhu, S Bubeck, MI Jordan
NIPS 2018: Neural Information Processing Systems, 2018
6352018
Learning and generalization in overparameterized neural networks, going beyond two layers
Z Allen-Zhu, Y Li, Y Liang
NeurIPS 2019: Neural Information Processing Systems, 2019
6212019
Katyusha: the first direct acceleration of stochastic gradient methods
Z Allen-Zhu
STOC 2017: Symposium on Theory of Computing, 19-23, 2017
5722017
Variance reduction for faster non-convex optimization
Z Allen-Zhu, E Hazan
ICML 2016: International Conference on Machine Learning, 699-707, 2016
3792016
Linear coupling: An ultimate unification of gradient and mirror descent
Z Allen-Zhu, L Orecchia
ITCS 2017: Innovations in Theoretical Computer Science, 2017
339*2017
Finding approximate local minima faster than gradient descent
N Agarwal, Z Allen-Zhu, B Bullins, E Hazan, T Ma
STOC 2017: Symposium on Theory of Computing, 1195-1199, 2017
305*2017
A simple, combinatorial algorithm for solving SDD systems in nearly-linear time
JA Kelner, L Orecchia, A Sidford, ZA Zhu
STOC 2013: Symposium on Theory of Computing, 911-920, 2013
2622013
Natasha 2: Faster Non-Convex Optimization Than SGD
Z Allen-Zhu
NIPS 2018: Neural Information Processing Systems, 2018
2312018
Byzantine Stochastic Gradient Descent
D Alistarh, Z Allen-Zhu, J Li
NIPS 2018: Neural Information Processing Systems, 2018
2282018
Improved SVRG for non-strongly-convex or sum-of-non-convex objectives
Z Allen-Zhu, Y Yuan
ICML 2016: International Conference on Machine Learning, 1080-1089, 2016
2002016
Even faster accelerated coordinate descent using non-uniform sampling
Z Allen-Zhu, Z Qu, P Richtárik, Y Yuan
ICML 2016: International Conference on Machine Learning, 1110-1119, 2016
1782016
Lora: Low-rank adaptation of large language models
EJ Hu, Y Shen, P Wallis, Z Allen-Zhu, Y Li, S Wang, L Wang, W Chen
arXiv preprint arXiv:2106.09685, 2021
1642021
Asymptotically optimal strategy-proof mechanisms for two-facility games
P Lu, X Sun, Y Wang, ZA Zhu
ACM-EC 2010: Conference on Economics and Computation, 315-324, 2010
1632010
Towards understanding ensemble, knowledge distillation and self-distillation in deep learning
Z Allen-Zhu, Y Li
arXiv preprint arXiv:2012.09816, 2020
1512020
On the convergence rate of training recurrent neural networks
Z Allen-Zhu, Y Li, Z Song
NeurIPS 2019: Neural Information Processing Systems, 2019
1482019
What Can ResNet Learn Efficiently, Going Beyond Kernels?
Z Allen-Zhu, Y Li
NeurIPS 2019: Neural Information Processing Systems, 2019
1372019
Neon2: Finding Local Minima via First-Order Oracles
Z Allen-Zhu, Y Li
NIPS 2018: Neural Information Processing Systems, 2018
1262018
LazySVD: Even faster SVD decomposition yet without agonizing pain
Z Allen-Zhu, Y Li
NIPS 2016: Neural Information Processing Systems, 974-982, 2016
1202016
A novel click model and its applications to online advertising
ZA Zhu, W Chen, T Minka, C Zhu, Z Chen
WSDM 2010: International Conference on Web Search and Data Mining, 321-330, 2010
1202010
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20