A diffusion theory for deep learning dynamics: Stochastic gradient descent exponentially favors flat minima Z Xie, I Sato, M Sugiyama International Conference on Learning Representations (ICLR 2021), 2021 | 135 | 2021 |
Dataset Pruning: Reducing Training Data by Examining Generalization Influence S Yang, Z Xie, H Peng, M Xu, M Sun, P Li International Conference on Learning Representations (ICLR 2023), 2023 | 62 | 2023 |
Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting Z Xie, F He, S Fu, I Sato, D Tao, M Sugiyama Neural Computation 33 (8), 2163–2192, 2021 | 56 | 2021 |
Adaptive Inertia: Disentangling the effects of adaptive learning rate and momentum Z Xie, X Wang, H Zhang, I Sato, M Sugiyama International Conference on Machine Learning (ICML 2022, Oral), 2022 | 54* | 2022 |
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization Z Xie, L Yuan, Z Zhu, M Sugiyama International Conference on Machine Learning (ICML 2021) 139, 11448--11458, 2021 | 33 | 2021 |
Stable weight decay regularization Z Xie, I Sato, M Sugiyama | 28 | 2020 |
Sparse Double Descent: Where Network Pruning Aggravates Overfitting Z He, Z Xie, Q Zhu, Z Qin International Conference on Machine Learning (ICML 2022), 2022 | 26 | 2022 |
On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective Z Xie, Z Xu, J Zhang, I Sato, M Sugiyama Neural Information Processing Systems (NeurIPS 2023), 2024 | 23* | 2024 |
S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields Z Xie, X Yang, Y Yang, Q Sun, Y Jiang, H Wang, Y Cai, M Sun International Conference on Computer Vision (ICCV 2023), 2023 | 19 | 2023 |
On the power-law spectrum in deep learning: A bridge to protein science Z Xie, QY Tang, Y Cai, M Sun, P Li arXiv preprint arXiv:2201.13011 2, 2022 | 17* | 2022 |
On the Overlooked Structure of Stochastic Gradients Z Xie, QY Tang, M Sun, P Li Neural Information Processing Systems (NeurIPS 2023), 2024 | 7* | 2024 |
A Quantum-Inspired Ensemble Method and Quantum-Inspired Forest Regressors Z Xie, I Sato Asian Conference on Machine Learning 2017, PMLR 77, 81-96, 2017 | 3 | 2017 |
SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior Z Yu, H Wang, J Yang, H Wang, Z Xie, Y Cai, J Cao, Z Ji, M Sun arXiv preprint arXiv:2403.20079, 2024 | 2 | 2024 |
HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models H Wang, H Wang, J Yang, Z Yu, Z Xie, L Tian, X Xiao, J Jiang, X Liu, ... arXiv preprint arXiv:2401.05870, 2024 | 2 | 2024 |
Not All Noises Are Created Equally: Diffusion Noise Selection and Optimization Z Qi, L Bai, H Xiong, Z Xie arXiv preprint arXiv:2407.14041, 2024 | | 2024 |
Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents H Xiong, Z Wang, X Li, J Bian, Z Xie, S Mumtaz, LE Barnes arXiv preprint arXiv:2407.08516, 2024 | | 2024 |
VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model J Yang, H Wang, Z Zhu, C Liu, MW Wu, Z Xie, Z Ji, J Han, M Sun arXiv preprint arXiv:2406.01059, 2024 | | 2024 |
Variance-enlarged Poisson Learning for Graph-based Semi-Supervised Learning with Extremely Sparse Labeled Data X Zhou, X Liu, H Yu, J Wang, Z Xie, J Jiang, X Ji International Conference on Learning Representations (ICLR 2024), 2024 | | 2024 |
Neural Field Classifiers via Target Encoding and Classification Loss X Yang, Z Xie, X Zhou, B Liu, B Liu, Y Liu, H Wang, Y Cai, M Sun International Conference on Learning Representations (ICLR 2024), 2024 | | 2024 |