Data Augmentation Using GANs for Speech Emotion Recognition | A Chatziagapi, G Paraskevopoulos, D Sgouropoulos, G Pantazopoulos, ... | Interspeech, 171-175, 2019 | 158 | 2019 |
Task formulation matters when learning continually: A case study in visual question answering | M Nikandrou, L Yu, A Suglia, I Konstas, V Rieser | arXiv preprint arXiv:2210.00044, 2022 | 7 | 2022 |
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion | G Pantazopoulos, M Nikandrou, A Parekh, B Hemanthage, A Eshghi, ... | arXiv preprint arXiv:2311.04067, 2023 | 5 | 2023 |
ViCA: Combining visual, social, and task-oriented conversational AI in a healthcare setting | G Pantazopoulos, J Bruyere, M Nikandrou, T Boissier, S Hemanthage, ... | Proceedings of the 2021 International Conference on Multimodal Interaction …, 2021 | 5 | 2021 |
Quality-agnostic image captioning to safely assist people with vision impairment | L Yu, M Nikandrou, J Jin, V Rieser | arXiv preprint arXiv:2304.14623, 2023 | 4 | 2023 |
Going for GOAL: A resource for grounded football commentaries | A Suglia, J Lopes, E Bastianelli, A Vanzo, S Agarwal, M Nikandrou, L Yu, ... | arXiv preprint arXiv:2211.04534, 2022 | 4 | 2022 |
Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks | G Pantazopoulos, A Parekh, M Nikandrou, A Suglia | arXiv preprint arXiv:2405.04403, 2024 | 3 | 2024 |
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling | G Pantazopoulos, M Nikandrou, A Suglia, O Lemon, A Eshghi | arXiv preprint arXiv:2409.05395, 2024 | 1 | 2024 |
EMMA: A foundation model for embodied, interactive, multimodal task completion in 3D environments | A Parekh, M Nikandrou, G Pantazopoulos, B Hemanthage, A Eshghi, ... | Alexa Prize SimBot Challenge Proceedings, 2023 | 1 | 2023 |
Demonstrating EMMA: Embodied MultiModal Agent for Language-guided Action Execution in 3D Simulated Environments | A Suglia, B Hemanthage, M Nikandrou, G Pantazopoulos, A Parekh, ... | Proceedings of the 23rd Annual Meeting of the Special Interest Group on …, 2022 | 1 | 2022 |
CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts | M Nikandrou, G Pantazopoulos, N Vitsakis, I Konstas, A Suglia | arXiv preprint arXiv:2410.15453, 2024 | | 2024 |
Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation | M Nikandrou, G Pantazopoulos, I Konstas, A Suglia | arXiv preprint arXiv:2406.19297, 2024 | | 2024 |