Volgen
Kaixuan Huang
Kaixuan Huang
Geverifieerd e-mailadres voor princeton.edu - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
On the Convergence of FedAvg on Non-IID Data
X Li, K Huang, W Yang, S Wang, Z Zhang
arXiv preprint arXiv:1907.02189, 2019
20682019
Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks?---A Neural Tangent Kernel Perspective
K Huang, Y Wang, M Tao, T Zhao
Advances in neural information processing systems 33, 2698-2709, 2020
972020
Visual adversarial examples jailbreak aligned large language models
X Qi, K Huang, A Panda, P Henderson, M Wang, P Mittal
Proceedings of the AAAI Conference on Artificial Intelligence 38 (19), 21527 …, 2024
66*2024
Fast federated learning in the presence of arbitrary device unavailability
X Gu, K Huang, J Zhang, L Huang
Advances in Neural Information Processing Systems 34, 12052-12064, 2021
662021
Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data
M Chen, K Huang, T Zhao, M Wang
International Conference on Machine Learning, 4672-4712, 2023
592023
Optimal gradient-based algorithms for non-concave bandit optimization
B Huang, K Huang, S Kakade, JD Lee, Q Lei, R Wang, J Yang
Advances in Neural Information Processing Systems 34, 29101-29115, 2021
142021
Reward-directed conditional diffusion: Provable distribution estimation and reward improvement
H Yuan, K Huang, C Ni, M Chen, M Wang
Advances in Neural Information Processing Systems 36, 2024
122024
Going beyond linear rl: Sample efficient neural function approximation
B Huang, K Huang, S Kakade, JD Lee, Q Lei, R Wang, J Yang
Advances in Neural Information Processing Systems 34, 8968-8983, 2021
92021
A Short Note on the Relationship of Information Gain and Eluder Dimension
K Huang, SM Kakade, JD Lee, Q Lei
arXiv preprint arXiv:2107.02377, 2021
82021
Deep Reinforcement Learning for Cost-Effective Medical Diagnosis
Z Yu, Y Li, J Kim, K Huang, Y Luo, M Wang
arXiv preprint arXiv:2302.10261, 2023
42023
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
B Wei, K Huang, Y Huang, T Xie, X Qi, M Xia, P Mittal, M Wang, ...
arXiv preprint arXiv:2402.05162, 2024
32024
A 5′ UTR language model for decoding untranslated regions of mRNA and function predictions
Y Chu, D Yu, Y Li, K Huang, Y Shen, L Cong, J Zhang, M Wang
Nature Machine Intelligence, 1-12, 2024
12024
Diffusion Model for Data-Driven Black-Box Optimization
Z Li, H Yuan, K Huang, C Ni, Y Ye, M Chen, M Wang
arXiv preprint arXiv:2403.13219, 2024
12024
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
K Huang, Y Wu, X Zhang, S Tu, Q Wu, M Wang, H Wang
arXiv preprint arXiv:2206.14846, 2022
12022
Embodied LLM Agents Learn to Cooperate in Organized Teams
X Guo, K Huang, J Liu, W Fan, N Vélez, Q Wu, H Wang, TL Griffiths, ...
arXiv preprint arXiv:2403.12482, 2024
2024
Deep Reinforcement Learning for Efficient and Fair Allocation of Health Care Resources
Y Li, C Mao, K Huang, H Wang, Z Yu, M Wang, Y Luo
arXiv preprint arXiv:2309.08560, 2023
2023
Scaling In-Context Demonstrations with Structured Attention
T Cai, K Huang, JD Lee, M Wang
arXiv preprint arXiv:2307.02690, 2023
2023
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–17