Siddhant Bharadwaj

Namaste, Hello, Hola

I am currently a PreDoc at the Indian Institute of Science, Bangalore where I work on visual quality assessment for AI generated images using MLLMs under Prof. Rajiv Soundararajan, and on the AI side of the Oral Cancer Screening Project under Prof. Rajesh Sundaresan and Prof. Chandra Sekhar Seelamantula.

In addition to my work at IISc, I am collaborating with Prof. Min Xu at Carnegie Mellon University on adapting 2D image-pretrained Transformers to 3D Cryo-ET subtomogram classification, and with Prof. Shruti Vyas at University of Central Florida on MLLM-based geolocalization. I also lead Computer Vision efforts at CORD.ai, a non-profit deep learning research community with 800+ members across 12 countries, and contribute to Cohere for AI's Maya project on spatial reasoning in VLMs.

I graduated with a major in Electrical and Electronics and a minor in Data Science at Manipal Institute of Technology, Manipal in 2024. My interest lies in deep learning, computer vision, and image processing. My current research primarily focuses on Visual Language Models (VLMs) and Multimodal Large Language Models (MLLMs).

Previously, I interned at Spectrum Lab in the Indian Institute of Science, Bangalore, where I worked on adapting the Segment Anything Model for the task of optic disc and optic cup segmentation in fundus images.

During my undergraduate, I conducted research on AI in Health Care, AI Security and the use of Deep Learning in battery health management. The bulk of the work was done under Prof. Harish Kumar J.R. (MIT, Manipal), and Prof. Munesh Chandra Trivedi (NIT Agartala).

When I am not programming or doing mathematics, you will find me reading, watching sitcoms, scribbling in my journal, and more often than not posting hot takes on X (formerly known as Twitter).

If you would like to collaborate or talk about one of my projects, feel free to drop a mail.

Email LinkedIn GitHub X (Twitter) Google Scholar

Publications

My research focuses on multimodal AI, vision-language models, medical imaging, and computer vision.

The Spatial Blindspot of Vision-Language Models
N. Alam, L. K. Murali, S. Bharadwaj, P. Liu, T. Chung, D. Sharma, Akshata A, K. Kiran, W. Tam, B. K. S. Vegesna
arXiv 2025
[Paper]

Multiscale Diagnostics of Visual Language Models
S. Bharadwaj, A. Vashist, M. Azfar, R. K. Salla, D. Verma, R. M. Amancherla
ICCV 2025 Workshop on CV4DC (Non-Archival Track)
[Paper]

InfUI: Harnessing Deep Learning to Bridge Eye Care Gap
S. Kotian, A. Batra, S. Bharadwaj, J. R. H. Kumar
BMC Medical Imaging, under review

Obscure to Observe: A Lesion-Aware MAE for Glaucoma Detection from Retinal Context
S. Bharadwaj, P. Seth, C. S. Seelamantula
Medical Imaging with Deep Learning (MIDL) 2025, Short Papers Track
[Paper]

Balanced CutMix: Enhancing Image Classification Through Curriculum Learning
M. Azfar*, S. Bharadwaj*
CV4DC Workshop at ACCV 2024
[Paper]

Improving Smooth GradCAM++ with Gradient Weighting Techniques
S. Bharadwaj*, M. Azfar*, A. Sasikumar
IEEE INDICON 2024 (Oral Presentation)
[Paper]

Adaptive Multi-Scale Document Binarisation Using Vision Mamba
M. Azfar*, S. Bharadwaj*, A. Sasikumar
ICVGIP 2024 - Tiny Papers Track
[Paper]

Vulnerability Analysis of Deep Learning Model for OCTA Image Classification
M. C. Trivedi, S. Bharadwaj
AICTC 2024
[Paper]

* Equal contribution

Achievements

Breakthrough Concept Award - Google Cloud Agentic AI Hackathon (Prize: INR 50,000)

Second Prize - Accel x Anthropic Developers Day

Department Rank 1 - Final year of undergraduate program

Achiever Scholarship - MIT Manipal (INR 86,000 merit-based award for top 5% in department)