Shu Zhang
Salesforce Inc.
Verified email at salesforce.com
Title
Cited by
Year
Heterogeneous memory enhanced multimodal attention model for video question answering
C Fan, X Zhang, S Zhang, W Wang, C Zhang, H Huang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
Cited by: 249; Year: 2019
Context-aware surveillance video summarization
S Zhang, Y Zhu, AK Roy-Chowdhury
IEEE Transactions on Image Processing 25 (11), 5469-5478, 2016
Cited by: 91; Year: 2016
A camera network tracking (camnet) dataset and performance baseline
S Zhang, E Staudt, T Faltemier, AK Roy-Chowdhury
2015 IEEE Winter Conference on Applications of Computer Vision, 365-372, 2015
Cited by: 74; Year: 2015
Tracking multiple interacting targets in a camera network
S Zhang, Y Zhu, A Roy-Chowdhury
Computer Vision and Image Understanding 134, 64-73, 2015
Cited by: 47; Year: 2015
Use all the labels: A hierarchical multi-label contrastive learning framework
S Zhang, R Xu, C Xiong, C Ramaiah
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 29; Year: 2022
ULIP-2: Towards Scalable Multimodal Pre-training For 3D Understanding
L Xue, N Yu, S Zhang, J Li, R Martín-Martín, J Wu, C Xiong, R Xu, ...
arXiv preprint arXiv:2305.08275, 2023
Cited by: 10; Year: 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
S Zhang, X Yang, Y Feng, C Qin, CC Chen, N Yu, Z Chen, H Wang, ...
arXiv preprint arXiv:2303.09618, 2023
Cited by: 10; Year: 2023
Video summarization through change detection in a non-overlapping camera network
S Zhang, AK Roy-Chowdhury
2015 IEEE International Conference on Image Processing (ICIP), 3832-3836, 2015
Cited by: 9; Year: 2015
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
C Qin, S Zhang, N Yu, Y Feng, X Yang, Y Zhou, H Wang, JC Niebles, ...
arXiv preprint arXiv:2305.11147, 2023
Cited by: 8; Year: 2023
Online social behavior modeling for multi-target tracking
S Zhang, A Das, C Ding, A Roy-Chowdhury
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013
Cited by: 7; Year: 2013
Adaptive algorithm selection, with applications in pedestrian detection
S Zhang, Q Zhu, A Roy-Chowdhury
2016 IEEE International Conference on Image Processing (ICIP), 3768-3772, 2016
Cited by: 5; Year: 2016
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
C Qin, N Yu, C Xing, S Zhang, Z Chen, S Ermon, Y Fu, C Xiong, R Xu
arXiv preprint arXiv:2303.10056, 2023
Cited by: 4; Year: 2023
Adaptive algorithm and platform selection for visual detection and tracking
S Zhang, Q Zhu, A Roy-Chowdhury
arXiv preprint arXiv:1605.06597, 2016
Cited by: 2; Year: 2016
Systems and methods for vision-language distribution alignment
S Zhang, J Li, R Xu, C Xiong, C Ramaiah
US Patent App. 17/589,725, 2023
2023
Template-based key-value extraction for inferring OCR key values within form images
S Zhang, C Ramaiah, R Xu, C Xiong
US Patent 11,495,011, 2022
2022
The Plug and Play of Language Models for Text-to-image Generation
C Qin, N Yu, C Xing, S Zhang, S Ermon, Y Fu, C Xiong, R Xu
2022
Systems and methods for hierarchical multi-label contrastive learning
S Zhang, C Ramaiah, C Xiong, R Xu
US Patent App. 17/328,779, 2022
2022
Wide-Area Video Understanding: Tracking, Video Summarization and Algorithm-Platform Co-Design
S Zhang
University of California, Riverside, 2015
2015
Articles 1–18