Follow
(Jacob) Zhiyuan Fang
Title
Cited by
Cited by
Year
Range Loss for Deep Face Recognition with Long-tailed Training Data
X Zhang, Z Fang, Y Wen, Z Li, Y Qiao
Proceedings of the International Conference on Computer Vision (ICCV), 2017 …, 2017
4912017
SEED: Self-supervised Distillation For Visual Representation
Z Fang, J Wang, L Wang, L Zhang, Y Yang, Z Liu
International Conference on Learning Representations (ICLR), 2021, 2021
1792021
ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language
Z Wang, Z Fang, J Wang, Y Yang
Conference on European Conference On Computer Vision (ECCV), 2020, 2020
1442020
A Multi-Resolution Approach for Spinal Metastasis Detection using Deep Siamese Neural Networks
J Wang, Z Fang, N Lang, H Yuan, M Su, P Baldi
Computers in Biology and Medicine 84 (C), 137-146, 2017
1402017
Injecting Semantic Concepts into End-to-End Image Captioning
Z Fang, J Wang, X Hu, L Liang, Z Gan, L Wang, Y Yang, Z Liu
Conference on Computer Vision and Pattern Recognition (CVPR) 2022, 2022
952022
Compressing Visual-linguistic Model via Knowledge Distillation
Z Fang, J Wang, X Hu, L Wang, Y Yang, Z Liu
Proceedings of the International Conference on Computer Vision (ICCV), 2021, 2021
742021
Video2commonsense: Generating Commonsense Descriptions to Enrich Video Captioning
Z Fang, T Gokhale, P Banerjee, C Baral, Y Yang
Empirical Methods in Natural Language Processing (EMNLP), 2020, 2020
642020
Modularized Textual Grounding for Counterfactual Resilience
Z Fang, S Kong, C Fowlkes, Y Yang
Conference on Computer Vision and Pattern Recognition (CVPR) 2019, 6378-6388, 2019
332019
A Behavior Mining Based Hybrid Recommender System
Z Fang, L Zhang, K Chen
2016 IEEE International Conference on Big Data Analysis (ICBDA), 1-5, 2017
16*2017
Weak Supervision and Referring Attention for Temporal-textual Association Learning
Z Fang, S Kong, Z Wang, C Fowlkes, Y Yang
arXiv preprint arXiv:2006.11747, 2020
152020
Weakly Supervised Attention Learning for Textual Phrases Grounding
Z Fang, S Kong, Y Tianshu, Y Yang
2018 CVPRW on Vision and Language, 2018
102018
Blocksworld revisited: Learning and Reasoning to Generate Event-sequences From Image Pairs
T Gokhale, S Sampat, Z Fang, Y Yang, C Baral
arXiv preprint arXiv:1905.12042, 2019
72019
End-to-end Knowledge Retrieval with Multi-modal Queries
M Luo, Z Fang, T Gokhale, Y Yang, C Baral
(ACL'23) The 61st Annual Meeting of the Association for Computational …, 2023
62023
Knowledge Distillation Across Vision and Language
Z Fang, Y Yang
Advancements in Knowledge Distillation: Towards New Horizons of Intelligent …, 2023
52023
Text-to-image editing by image information removal
Z Zhang, J Zheng, JZ Fang, BA Plummer
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024, 2023
52023
Cooking with Blocks: A Recipe for Visual Reasoning on Image-pairs
T Gokhale, S Sampat, Z Fang, Y Yang, C Baral
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
52019
Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation
Y Chang, Y Zhang, Z Fang, Y Wu, Y Bisk, F Gao
arXiv preprint arXiv:2403.16394, 2024
12024
CAVAN: Commonsense Knowledge Anchored Video Captioning
H Shao, Z Fang, Y Yang
2022 26th International Conference on Pattern Recognition (ICPR), 4095-4102, 2022
12022
Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
A Chakravarthy, Z Fang, Y Yang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
12022
FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation
X He, J Zheng, JZ Fang, R Piramuthu, M Bansal, V Ordonez, ...
arXiv preprint arXiv:2405.04834, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20