A survey on hallucination in large vision-language models H Liu, W Xue, Y Chen, D Chen, X Zhao, K Wang, L Hou, R Li, W Peng arXiv preprint arXiv:2402.00253, 2024 | 22 | 2024 |
ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition W Xue, D Chen, B Yu, Y Chen, S Zhou, W Peng arXiv preprint arXiv:2308.07743, 2023 | 2 | 2023 |
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction H Li, Y Chen, Y Chen, W Yang, B Ding, Y Han, L Wang, R Yu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |
Video Action Recognition with Attentive Semantic Units Y Chen, D Chen, R Liu, H Li, W Peng Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 1 | 2023 |
Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition Y Chen, D Chen, R Liu, S Zhou, W Xue, W Peng Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |