About me
Hello, I am Siddhant, I am currently a PreDoc at the Indian Institute of Science, Bangalore where I work on visual quality assement for AI generated images using MLLMs under Prof. Rajiv Soundararajan. Previously I worked on the AI side of the Oral Cancer Screening Project under the supervision of Prof. Rajesh Sundaresan and Prof. Chandra Sekhar Seelamantula.
In addition to my work at IISc, I am currently collaborating with Prof. Shruti Vyas at University of Central Florida, where I work on evaluating and enhancing MLLMs capabilities for Geolocalization. I am also a part of Cohere Community’s Maya, where I focus on spatial reasoning ability of MLLMs and VLMs.
I graduated with a major in Electrical and Electronics and a minor in Data Science at Manipal Institute of Technology, Manipal in 2024. My interest lies in deep learning, computer vision, and image processing. My current research primarily focuses on Visual Language Models (VLMs) and Multimodal Large Language Models (MLLMs).
Previously, I interned at Spectrum Lab in the Indian Institute of Science, Bangalore, where I worked on adapting the Segment Anything Model for the task of optic disc and optic cup segmentation in fundus image.
During my undergraduate, I conducted research on AI in Health Care, AI Security and the use of Deep Learning in battery health management. The bulk of the work was done under Prof. Harish Kumar J.R. (MIT, Manipal), and Prof. Munesh Chandra Trivedi (NIT Agartala).
When I am not programming or doing mathematics, you will find me reading, watching sitcoms, scribbling in my journal, and more often than not posting hot takes on X (formerly known as Twitter).
If you would like to collobarate or talk about one of my projects, feel free to drop a mail.