Msr-vtt: A large video description dataset for bridging video and language J Xu, T Mei, T Yao, Y Rui Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 2204 | 2016 |
Learning spatio-temporal representation with pseudo-3d residual networks Z Qiu, T Yao, T Mei proceedings of the IEEE International Conference on Computer Vision, 5533-5541, 2017 | 2177 | 2017 |
Exploring visual relationship for image captioning T Yao, Y Pan, Y Li, T Mei Proceedings of the European conference on computer vision (ECCV), 684-699, 2018 | 1061 | 2018 |
Boosting Image Captioning with Attributes T Yao, Y Pan, Y Li, Z Qiu, T Mei ICCV, 2017 | 844 | 2017 |
Jointly modeling embedding and translation to bridge video and language Y Pan, T Mei, T Yao, H Li, Y Rui Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 709 | 2016 |
X-linear attention networks for image captioning Y Pan, T Yao, Y Li, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 672 | 2020 |
Contextual transformer networks for visual recognition Y Li, T Yao, Y Pan, T Mei IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (2), 1489-1500, 2022 | 552 | 2022 |
Transferrable prototypical networks for unsupervised domain adaptation Y Pan, T Yao, Y Li, Y Wang, CW Ngo, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 429 | 2019 |
Gaussian temporal awareness networks for action localization F Long, T Yao, Z Qiu, X Tian, J Luo, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 422 | 2019 |
Fully convolutional adaptation networks for semantic segmentation Y Zhang, Z Qiu, T Yao, D Liu, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 418 | 2018 |
Video captioning with transferred semantic attributes Y Pan, T Yao, H Li, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 417 | 2017 |
Memory matching networks for one-shot image recognition Q Cai, Y Pan, T Yao, C Yan, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 374 | 2018 |
Exploring object relation in mean teacher for cross-domain detection Q Cai, Y Pan, CW Ngo, X Tian, L Duan, T Yao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 365 | 2019 |
Highlight detection with pairwise deep ranking for first-person video summarization T Yao, T Mei, Y Rui Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 337 | 2016 |
Semi-supervised domain adaptation with subspace learning for visual recognition T Yao, Y Pan, CW Ngo, H Li, T Mei Proceedings of the IEEE conference on Computer Vision and Pattern …, 2015 | 271 | 2015 |
Relation distillation networks for video object detection J Deng, Y Pan, T Yao, W Zhou, H Li, T Mei Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 265 | 2019 |
Learning spatio-temporal representation with local and global diffusion Z Qiu, T Yao, CW Ngo, X Tian, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 235 | 2019 |
Hierarchy parsing for image captioning T Yao, Y Pan, Y Li, T Mei Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 229 | 2019 |
Jointly localizing and describing events for dense video captioning Y Li, T Yao, Y Pan, H Chao, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 215 | 2018 |
Multi-scale triplet cnn for person re-identification J Liu, ZJ Zha, QI Tian, D Liu, T Yao, Q Ling, T Mei Proceedings of the 24th ACM international conference on Multimedia, 192-196, 2016 | 214 | 2016 |