Towards Perceptual Image Dehazing by Physics-based Disentanglement and Adversarial Training
X Yang, Z Xu, J Luo
AAAI Conference on Artificial Intelligence (AAAI), 2018
Cross-X learning for fine-grained visual categorization
W Luo, X Yang, X Mo, Y Lu, LS Davis, J Li, J Yang, SN Lim
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
X Yang, X Yang, MY Liu, F Xiao, L Davis, J Kautz
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
Deep multimodal representation learning from temporal data
X Yang, P Ramesh, R Chitta, S Madhvanath, EA Bernal, J Luo
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
Tracking Illicit Drug Dealing and Abuse on Instagram Using Multimodal Analysis
X Yang, J Luo
ACM Transactions on Intelligent Systems and Technology (TIST) 8 (4), 2017
Deep temporal multimodal fusion for medical procedure monitoring using wearable sensors
EA Bernal, X Yang, Q Li, J Kumar, S Madhvanath, P Ramesh, R Bala
IEEE Transactions on Multimedia 20 (1), 107-118, 2017
Pinterest board recommendation for Twitter users
X Yang, Y Li, J Luo
Proceedings of the 23rd ACM International Conference on Multimedia, 963-966, 2015
Temporal fusion of multimodal data from multiple data acquisition systems to automatically recognize and classify an action
X Yang, EA Bernal, S Madhvanath, R Bala, PS Ramesh, Q Li, J Kumar
US Patent 9,805,255, 2017
Understanding the variational lower bound
X Yang
variational lower bound, ELBO, hard attention 22, 1-4, 2017
Strong Baseline for Single Image Dehazing with Deep Features and Instance Normalization
Z Xu, X Yang, X Li, X Sun
BMVC 2 (3), 5, 2018
GTA: Global temporal attention for video action understanding
B He, X Yang, Z Wu, H Chen, SN Lim, A Shrivastava
arXiv preprint arXiv:2012.08510, 2020
The effectiveness of instance normalization: a strong baseline for single image dehazing
Z Xu, X Yang, X Li, X Sun
arXiv preprint arXiv:1805.03305, 2018
Efficient video transformers with spatial-temporal token selection
J Wang, X Yang, H Li, L Liu, Z Wu, YG Jiang
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
Iterative spatio-temporal action detection in video
X Yang, X Yang, F Xiao, MY Liu, J Kautz
US Patent 11,017,556, 2021
ASM-Loc: action-aware segment modeling for weakly-supervised temporal action localization
B He, X Yang, L Kang, Z Cheng, X Zhou, A Shrivastava
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Hierarchical contrastive motion learning for video action recognition
X Yang, X Yang, S Liu, D Sun, L Davis, J Kautz
arXiv preprint arXiv:2007.10321, 2020
Semantic video entity linking based on visual content and metadata
Y Li, X Yang, J Luo
Proceedings of the IEEE International Conference on Computer Vision, 4615-4623, 2015
Semi-supervised vision transformers
Z Weng, X Yang, A Li, Z Wu, YG Jiang
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
Beyond short clips: End-to-end video-level learning with collaborative memories
X Yang, H Fan, L Torresani, LS Davis, H Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Two stream self-supervised learning for action recognition
A Taha, M Meshry, X Yang, YT Chen, L Davis
arXiv preprint arXiv:1806.07383, 2018
