Manan Shah

mt-shah@hotmail.com


I am a master's student in computer vision (MSCV) at Carnegie Mellon University. My research interests lie in vision-language models (VLMs), 3D vision, and generative models.
Previously, I worked at streamingo.ai on human activity recognition with VLMs, and as a research assistant at the Vision and AI Lab, IISc, under the mentorship of Prof. Venkatesh Babu, where I explored diffusion models and 3D Gaussian splatting. Before that, I was a software development engineer at Cashfree Payments.


I also maintain a blog where I jot down learnings, experiences, and ideas. Writing helps me clarify concepts and share insights with others.
Feel free to reach out if you'd like to discuss interesting problems in computer vision!



News

  • [Aug 25] Started my master's in Computer Vision (MSCV) at CMU RI.
  • [Feb 25] Our work, MirrorVerse, on enhanced generation of mirror reflections, was accepted to CVPR 2025.
  • [Nov 24] Our work, Reflecting Reality, on generating consistent mirror reflections, was accepted to 3DV 2025.
  • [Mar 24] Joined Vision and AI Lab, IISc as a Research Assistant.
  • [Jul 23] Joined Cashfree Payments as a Software Development Engineer.
  • [Jul 23] Won the NCVPRIPG'23 challenge on writer verification.
  • [Jun 23] Graduated with a B.Tech in CSE from IIT Mandi.
  • [Jun 22] Interned with the Microsoft Search Technology Center, India (STCI) team.

Publications

2025


MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World
Ankit Dhiman*, Manan Shah*, R Venkatesh Babu
CVPR 2025
[project page] [paper] [code]

Reflecting Reality: Enabling Diffusion Models to Produce Faithful Mirror Reflections
Ankit Dhiman*, Manan Shah*, Rishubh Parihar, Yash Bhalgat, Lokesh R Boregowda, R Venkatesh Babu
3DV 2025
[project page] [paper] [code]

Blog Posts