Graduate Student

Carnegie Mellon University


I am a second-year Master's in Robotics student at Carnegie Mellon University working with Prof. Katerina Fragkiadaki. I am also working as a Research Intern at Apple with Miguel A. Bautista and Navdeep Jaitly. Previously, I received a B.E. in Computer Science from BITS Pilani, Pilani Campus. During that time, I spent one year working with Prof. Katerina Fragkiadaki as a research assistant in the Machine Learning Department at Carnegie Mellon University.

I am interested in Computer Vision, Natural Language Processing, and their applications to embodied learning. I believe that both perception and language (plus other modalities like audio) are crucial to building truly intelligent and useful robots. I envision developing robots that continually learn by actively moving around and interacting with objects and humans. In my free time, I love listening to podcasts and lectures related to philosophy and ethics (my recent favourite). I also like reading fiction, watching (and playing) cricket, and travelling.


  • Embodied AI
  • Computer Vision
  • Natural Language Processing


  • Master's in Robotics, 2021-Present

    Carnegie Mellon University

  • B.E. in Computer Science, 2017-2021

    BITS Pilani, Pilani Campus


[Aug 2022] I’ll be attending the Alexa Prize Simbot Bootcamp in Seattle!

[May 2022] I’ll be spending my summer as a research intern at Apple with Miguel and Navdeep!

[Dec 2021] Our team was selected for the Alexa Prize Simbot Challenge. We will be working on advancing teachable embodied household agents.

[Aug 2021] I’ll be starting my Master's in Robotics at Carnegie Mellon University. I will also continue as a Research Assistant with Prof. Katerina Fragkiadaki.

[Aug 2020] I’ll be attending Google Research India’s AI Summer School (Computer Vision track) from August 20-22, 2020!

[May 2020] Joined Prof. Katerina Fragkiadaki’s lab as a Research Associate at Carnegie Mellon University.

Recent Publications

Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds

Most models tasked to ground referential utterances in 2D and 3D scenes learn to select the referred …
ECCV 2022.

Move to See Better: Towards Self-Improving Embodied Object Detection

Humans learn to better understand the world by moving around their environment to get more …
BMVC 2021.

AI-enabled Object Detection in Unmanned Aerial Vehicles for Edge Computing Applications

Unmanned Aerial Vehicles (UAVs) are emerging as a powerful tool for various industrial and smart …
IEEE Network, 2021.

VisDrone-DET2020: The Vision Meets Drone Object Detection in Image Challenge Results

The Vision Meets Drone Object Detection in Image Challenge (VisDrone-DET 2020) is the third annual …
ECCV 2020 Workshop.

NukeBERT: A Pre-trained language model for Low Resource Nuclear Domain

Significant advances have been made in recent years on Natural Language Processing with machines …



Research Intern

Apple

May 2022 – Present Cupertino, California, USA

Graduate Research Assistant

Carnegie Mellon University

Aug 2021 – Present Pittsburgh, Pennsylvania, USA
With Prof. Katerina Fragkiadaki, I am working on language grounding in static 2D and 3D scenes, robot manipulation following language instructions, and instruction following in indoor household environments.

Research Associate

Carnegie Mellon University

May 2020 – Jul 2021 Pittsburgh, Pennsylvania, USA
Under the supervision of Prof. Katerina Fragkiadaki, I worked on enabling an embodied agent to learn about objects without ground truth supervision in an unseen 3D environment just by moving around.

Research Assistant

BITS Pilani

Aug 2019 – Apr 2020 Rajasthan, India
As a research assistant, I worked on computer vision under the guidance of Prof. Pratik Narang. My team and I proposed a novel architectural design that improved on previous methods by a substantial margin.

Academic Service and Teaching

I have reviewed for BMVC (2020, 2021), CVPR (2022), ECCV (2022), and TPAMI (2021, 2022).

I have served as a teaching assistant for the following courses at BITS Pilani:

  • CS F407 Artificial Intelligence, Fall 2020
  • CS F464 Machine Learning, Spring 2020
  • CS F111 Computer Programming, Spring 2019 and Fall 2019