Yining Li
Shanghai AI Lab
Verified email at pjlab.org.cn
Title
Cited by
Year
Learning deep representation for imbalanced classification
C Huang, Y Li, CC Loy, X Tang
Proceedings of the IEEE conference on computer vision and pattern …, 2016
Cited by: 1130, Year: 2016
Deep imbalanced learning for face recognition and attribute prediction
C Huang, Y Li, CC Loy, X Tang
IEEE transactions on pattern analysis and machine intelligence 42 (11), 2781 …, 2019
Cited by: 351, Year: 2019
OpenMMLab pose estimation toolbox and benchmark
MMPose Contributors
Cited by: 236, Year: 2020
Dense intrinsic appearance flow for human pose transfer
Y Li, C Huang, CC Loy
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 193, Year: 2019
Human attribute recognition by deep hierarchical contexts
Y Li, C Huang, CC Loy, X Tang
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
Cited by: 190, Year: 2016
RTMPose: Real-time multi-person pose estimation based on MMPose
T Jiang, P Lu, L Zhang, N Ma, R Han, C Lyu, Y Li, K Chen
arXiv preprint arXiv:2303.07399, 2023
Cited by: 36, Year: 2023
Learning to disambiguate by asking discriminative questions
Y Li, C Huang, X Tang, C Change Loy
Proceedings of the IEEE International Conference on Computer Vision, 3419-3428, 2017
Cited by: 29, Year: 2017
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
Cited by: 20, Year: 2024
OMG-Seg: Is One Model Good Enough For All Segmentation?
X Li, H Yuan, W Li, H Ding, S Wu, W Zhang, Y Li, K Chen, CC Loy
arXiv preprint arXiv:2401.10229, 2024
Cited by: 9, Year: 2024
DST-Det: Simple dynamic self-training for open-vocabulary object detection
S Xu, X Li, S Wu, W Zhang, Y Li, G Cheng, Y Tong, K Chen, CC Loy
arXiv preprint arXiv:2310.01393, 2023
Cited by: 6, Year: 2023
InternLM2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
Cited by: 5, Year: 2024
Towards language-driven video inpainting via multimodal large language models
J Wu, X Li, C Si, S Zhou, J Yang, J Zhang, Y Li, K Chen, Y Tong, Z Liu, ...
arXiv preprint arXiv:2401.10226, 2024
Cited by: 3, Year: 2024
An open and comprehensive pipeline for unified object grounding and detection
X Zhao, Y Chen, S Xu, X Li, X Wang, Y Li, H Huang
arXiv preprint arXiv:2401.02361, 2024
Cited by: 3, Year: 2024
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
H Yuan, X Li, C Zhou, Y Li, K Chen, CC Loy
arXiv preprint arXiv:2401.02955, 2024
Cited by: 2, Year: 2024
RAP-SAM: Towards Real-Time All-Purpose Segment Anything
S Xu, H Yuan, Q Shi, L Qi, J Wang, Y Yang, Y Li, K Chen, Y Tong, ...
arXiv preprint arXiv:2401.10228, 2024
Cited by: 1, Year: 2024
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation
P Lu, T Jiang, Y Li, X Li, K Chen, W Yang
arXiv preprint arXiv:2312.07526, 2023
Cited by: 1, Year: 2023
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...
arXiv preprint arXiv:2404.06512, 2024
Year: 2024
Articles 1–17