Follow
Govind Thattai
Govind Thattai
Verified email at amazon.com
Title
Cited by
Cited by
Year
Embodied bert: A transformer model for embodied, language-guided visual task completion
A Suglia, Q Gao, J Thomason, G Thattai, G Sukhatme
arXiv preprint arXiv:2108.04927, 2021
642021
Transform-retrieve-generate: Natural language-centric outside-knowledge visual question answering
F Gao, Q Ping, G Thattai, A Reganti, YN Wu, P Natarajan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
552022
Dialfred: Dialogue-enabled agents for embodied instruction following
X Gao, Q Gao, R Gong, K Lin, G Thattai, GS Sukhatme
IEEE Robotics and Automation Letters 7 (4), 10049-10056, 2022
462022
Learning better visual dialog agents with pretrained visual-linguistic representation
T Tu, Q Ping, G Thattai, G Tur, P Natarajan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
202021
LRTA: A transparent neural-symbolic reasoning framework with modular supervision for visual question answering
W Liang, F Niu, A Reganti, G Thattai, G Tur
arXiv preprint arXiv:2011.10731, 2020
182020
Alexa arena: A user-centric interactive platform for embodied ai
Q Gao, G Thattai, S Shakiah, X Gao, S Pansare, V Sharma, G Sukhatme, ...
Advances in Neural Information Processing Systems 36, 2024
172024
A thousand words are worth more than a picture: Natural language-centric outside-knowledge visual question answering
F Gao, Q Ping, G Thattai, A Reganti, YN Wu, P Natarajan
arXiv preprint arXiv:2201.05299, 2022
162022
Learning to act with affordance-aware multimodal neural slam
Z Jia, K Lin, Y Zhao, Q Gao, G Thattai, GS Sukhatme
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
142022
" polyaural" array processing for automatic speech recognition in degraded environments.
RM Stern, EB Gouvęa, G Thattai
Interspeech, 926-929, 2007
142007
Neural architecture search for parameter-efficient fine-tuning of large pre-trained language models
N Lawton, A Kumar, G Thattai, A Galstyan, GV Steeg
arXiv preprint arXiv:2305.16597, 2023
102023
Luminous: Indoor scene generation for embodied ai challenges
Y Zhao, K Lin, Z Jia, Q Gao, G Thattai, J Thomason, GS Sukhatme
arXiv preprint arXiv:2111.05527, 2021
102021
Givl: Improving geographical inclusivity of vision-language models with pre-training methods
D Yin, F Gao, G Thattai, M Johnston, KW Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
82023
Are we there yet? learning to localize in embodied instruction following
S Storks, Q Gao, G Thattai, G Tur
arXiv preprint arXiv:2101.03431, 2021
82021
Interactive teaching for conversational ai
Q Ping, F Niu, G Thattai, J Chengottusseriyil, Q Gao, A Reganti, ...
arXiv preprint arXiv:2012.00958, 2020
82020
Towards reasoning-aware explainable vqa
R Vaideeswaran, F Gao, A Mathur, G Thattai
arXiv preprint arXiv:2211.05190, 2022
62022
Lemma: Learning language-conditioned multi-robot manipulation
R Gong, X Gao, Q Gao, S Shakiah, G Thattai, GS Sukhatme
IEEE Robotics and Automation Letters, 2023
42023
Alexa, play with robot: Introducing the first alexa prize simbot challenge on embodied ai
H Shi, L Ball, G Thattai, D Zhang, L Hu, Q Gao, S Shakiah, X Gao, ...
arXiv preprint arXiv:2308.05221, 2023
32023
Opend: A benchmark for language-driven door and drawer opening
Y Zhao, Q Gao, L Qiu, G Thattai, GS Sukhatme
arXiv preprint arXiv:2212.05211, 2022
32022
Ch-marl: A multimodal benchmark for cooperative, heterogeneous multi-agent reinforcement learning
V Sharma, P Goyal, K Lin, G Thattai, Q Gao, GS Sukhatme
arXiv preprint arXiv:2208.13626, 2022
32022
Privacy preserving visual question answering
CP Bara, Q Ping, A Mathur, G Thattai, R MV, GS Sukhatme
arXiv preprint arXiv:2202.07712, 2022
32022
The system can't perform the operation now. Try again later.
Articles 1–20