MoE-LLaVA: Mixture of experts for large vision-language models. B Lin, Z Tang, Y Ye, J Cui, B Zhu, P Jin, J Zhang, M Ning, L Yuan. arXiv preprint arXiv:2401.15947, 2024 | 158 | 2024 |
Open-Sora Plan: Open-source large video generation model. B Lin, Y Ge, X Cheng, Z Li, B Zhu, S Wang, X He, Y Ye, S Yuan, L Chen, ... arXiv preprint arXiv:2412.00131, 2024 | 5 | 2024 |