I am a fourth-year CS PhD student at the University of Maryland, College Park, where I work in the GAMMA lab under the supervision of Prof. Dinesh Manocha. My current research deals with Embodied Navigation in Social Environments, where I work on interfacing robot agents with language and visual understanding capabilities. In particular, I am interested in utilizing Large Language Models (LLMs) and Vision-Language Models (VLMs) for robot decision-making, especially in generalized, few-shot, or zero-shot settings.
Prior to starting my PhD studies, I obtained a master's degree in Robotics at UMD. During this time, my primary research focus was Social Robot Navigation, and I worked with Prof. Aniket Bera.
I was a Research Fellow at the Center for Visual Information Technology (CVIT), IIIT-Hyderabad under Prof. C.V. Jawahar from Sept. 2017 to April 2019. During my time there, I collaborated with Prof. A.H. Abdul Hafez on a visual servoing project.
Before that, I received my undergraduate degree in 2017 from Symbiosis International University. For my B.Tech. thesis, I worked under Prof. Madhura Ingalhalikar on medical image processing, classifying brain tumor mutations from MRI scans.
My current research interests lie in the Embodied AI domain, at the intersection of natural language processing and robotics.
When I do get time, I love to travel, hike, compose music, and play chess/topoi! Maybe all together, someday.
Publications
- Improving Zero-Shot ObjectNav via Generative Communication. Under review at ICRA, 2025. [Paper Link]
- Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals. Under review at a computer vision conference. [Paper Link]
- S-EQA: Tackling Situational Queries in Embodied Question Answering. Under review at RA-L, 2024. [Paper Link]
- Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis. Published at NAACL Main Conference, 2024. [Paper Link]
- Can an Embodied Agent Find Your "Cat-shaped Mug"? LLM-Based Zero-Shot Object Navigation. Published at RA-L, 2023. [Paper Link]
- CLIP-Nav: Using CLIP for Zero-Shot Vision-and-Language Navigation. Published at the CoRL LangRob Workshop, 2022. [Paper Link]
- Can a Robot Trust You? A DRL-Based Approach to Trust-Driven, Human-Guided Navigation. Published at ICRA, 2021. [arXiv Link, Project Page]
- ProxEmo: Gait-based Emotion Learning and Multi-view Proxemic Fusion for Socially-Aware Robot Navigation. Published at IROS, 2020. [arXiv Link, Project Page]
- A Deep Learning Approach for Autonomous Corridor Following. Published at IROS, 2019. [Publication, PDF, Video]
Work Experience
- (Summer 2024) Internship at Sony Corp., where I studied the spatial reasoning capabilities of VLMs for embodied exploration and reasoning. I was supervised by Akira Nakamura, and mentored by Marzieh Edraki and Selim Engin.
- (Summer 2023) Internship at Amazon Alexa AI, where I worked on utilizing LLMs for embodied exploration and reasoning. I was supervised by Reza Ghanadhan, and mentored by Robinson Piramuthu and Prasoon Goyal.
- (Summer 2022) Internship at Amazon Alexa AI, where I worked on Vision-and-Language Navigation, an Embodied AI problem. I was supervised by Gaurav Sukhatme, and mentored by Robinson Piramuthu, Jesse Thomason, and Gunnar Sigurdsson.
- (Summer 2020) Internship at Nokia Bell Labs, where I worked on enabling Visual SLAM on an autonomous indoor Loomo robot.
- (Aug. 2019 - Present) Graduate Student at University of Maryland, College Park
- (Sept. 2017 - April 2019) Research Fellow at CVIT, IIIT-Hyderabad