Sanjoy Chowdhury
Title
Cited by
Year
V-desirr: Very fast deep embedded single image reflection removal
BH Prasad, LR Boregowda, K Mitra, S Chowdhury
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by 23, 2021
APoLLo: Unified Adapter and Prompt Learning for Vision Language Models
S Chowdhury, S Nag, D Manocha
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
Cited by 16, 2023
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models
S Chowdhury, S Nag, KJ Joseph, BV Srinivasan, D Manocha
CVPR 2024 (Highlight), 26826-26835, 2024
Cited by 9, 2024
Adverb: Visually guided audio dereverberation
S Chowdhury, S Ghosh, S Dasgupta, A Ratnarajah, U Tyagi, D Manocha
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by 9, 2023
Measured albedo in the wild: Filling the gap in intrinsics evaluation
J Wu, S Chowdhury, H Shanmugaraja, D Jacobs, S Sengupta
2023 IEEE International Conference on Computational Photography (ICCP), 1-12, 2023
Cited by 8, 2023
Meerkat: Audio-visual large language model for grounding in space and time
S Chowdhury, S Nag, S Dasgupta, J Chen, M Elhoseiny, R Gao, ...
European Conference on Computer Vision, 52-70, 2024
Cited by 5, 2024
Towards determining perceived audience intent for multimodal social media posts using the theory of reasoned action
T Mittal, S Chowdhury, P Guhan, S Chelluri, D Manocha
Scientific Reports 14 (1), 10606, 2024
Cited by 4, 2024
Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis
VS Dorbala, S Chowdhury, D Manocha
arXiv preprint arXiv:2403.11487, 2024
Cited by 4, 2024
Listen to the pixels
S Chowdhury, S Dasgupta, S Das, U Bhattacharya
2021 IEEE International Conference on Image Processing (ICIP), 2568-2572, 2021
Cited by 4, 2021
AudViSum: Self-Supervised Deep Reinforcement Learning for Diverse Audio-Visual Summary Generation
S Chowdhury, AP Patra, S Dasgupta, U Bhattacharya
British Machine Vision Conference - BMVC 2021, 2021
Cited by 2, 2021
AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs
S Chowdhury, S Nag, S Dasgupta, Y Wang, M Elhoseiny, R Gao, ...
arXiv preprint arXiv:2501.02135, 2025
2025
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
S Ghosh, CKR Evuru, S Kumar, U Tyagi, S Sakshi, S Chowdhury, ...
Findings of the Association for Computational Linguistics ACL 2024, 386-406, 2024
2024
Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis
V Sashank Dorbala, S Chowdhury, D Manocha
arXiv e-prints, arXiv: 2403.11487, 2024
2024
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models
S Chowdhury, S Nag, KJ Joseph, BV Srinivasan, D Manocha
CVPR, https://www.arxiv.org/pdf/2406.04673, 2024
2024
ASPIRE: Language-Guided Augmentation for Robust Image Classification
S Ghosh, CKR Evuru, S Kumar, U Tyagi, S Singh, S Chowdhury, ...
arXiv preprint arXiv:2308.10103, 2023
2023
Not Too Deep CNN for Face Detection in Real Life Scenario
S Chowdhury, P Mukherjee, U Bhattacharya
International Conference on Next Generation Computing Technologies 828, 870-886, 2017
2017
Classification of Citation in Scientific Articles
S Chowdhury, H Vardhan, P Mitra, D Bhandari
National Conference on Recent Advances in Science & Technology - 2016, 2016
2016
Articles 1–17