Volgen
Shu Zhang
Shu Zhang
Salesforce Inc.
Geverifieerd e-mailadres voor salesforce.com
Titel
Geciteerd door
Geciteerd door
Jaar
Heterogeneous memory enhanced multimodal attention model for video question answering
C Fan, X Zhang, S Zhang, W Wang, C Zhang, H Huang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
3152019
Context-aware surveillance video summarization
S Zhang, Y Zhu, AK Roy-Chowdhury
IEEE Transactions on Image Processing 25 (11), 5469-5478, 2016
1012016
A camera network tracking (camnet) dataset and performance baseline
S Zhang, E Staudt, T Faltemier, AK Roy-Chowdhury
2015 IEEE winter conference on applications of computer vision, 365-372, 2015
782015
Ulip-2: Towards scalable multimodal pre-training for 3d understanding
L Xue, N Yu, S Zhang, A Panagopoulou, J Li, R Martín-Martín, J Wu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
732024
Unicontrol: A unified diffusion model for controllable visual generation in the wild
C Qin, S Zhang, N Yu, Y Feng, X Yang, Y Zhou, H Wang, JC Niebles, ...
arXiv preprint arXiv:2305.11147, 2023
692023
Use all the labels: A hierarchical multi-label contrastive learning framework
S Zhang, R Xu, C Xiong, C Ramaiah
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
682022
Hive: Harnessing human feedback for instructional visual editing
S Zhang, X Yang, Y Feng, C Qin, CC Chen, N Yu, Z Chen, H Wang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
632024
Tracking multiple interacting targets in a camera network
S Zhang, Y Zhu, A Roy-Chowdhury
Computer Vision and Image Understanding 134, 64-73, 2015
492015
Gluegen: Plug and play multi-modal encoders for x-to-image generation
C Qin, N Yu, C Xing, S Zhang, Z Chen, S Ermon, Y Fu, C Xiong, R Xu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
182023
Video summarization through change detection in a non-overlapping camera network
S Zhang, AK Roy-Chowdhury
2015 IEEE International Conference on Image Processing (ICIP), 3832-3836, 2015
92015
Online social behavior modeling for multi-target tracking
S Zhang, A Das, C Ding, A Roy-Chowdhury
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013
72013
Adaptive algorithm selection, with applications in pedestrian detection
S Zhang, Q Zhu, A Roy-Chowdhury
2016 IEEE International Conference on Image Processing (ICIP), 3768-3772, 2016
52016
xgen-mm (blip-3): A family of open large multimodal models
L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ...
arXiv preprint arXiv:2408.08872, 2024
42024
Template-based key-value extraction for inferring OCR key values within form images
S Zhang, C Ramaiah, R Xu, C Xiong
US Patent 11,495,011, 2022
12022
Adaptive algorithm and platform selection for visual detection and tracking
S Zhang, Q Zhu, A Roy-Chowdhury
arXiv preprint arXiv:1605.06597, 2016
12016
Systems and methods for multimodal pretraining for three-dimensional understanding models
L Xue, N Yu, S Zhang, LI Junnan, C Xiong, S Savarese, JCN Duque, R Xu
US Patent App. 18/493,035, 2024
2024
Systems and methods for feedback based instructional visual editing
S Zhang, X Yang, Y Feng, R Xu, N Yu, CC Chen
US Patent App. 18/350,876, 2024
2024
Systems and methods for text-to-image generation using language models
N Yu, C Qin, C Xing, S Zhang, S Ermon, C Xiong, R Xu
US Patent App. 18/162,535, 2024
2024
Systems and methods for vision-language distribution alignment
S Zhang, LI Junnan, R Xu, C Xiong, C Ramaiah
US Patent App. 17/589,725, 2023
2023
Systems and methods for hierarchical multi-label contrastive learning
S Zhang, C Ramaiah, C Xiong, R Xu
US Patent App. 17/328,779, 2022
2022
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20