About

I am a Computer Vision researcher and engineer driven by the need for explainable and reliable multimodal reasoning. I am currently a Master’s student at the University at Buffalo, advised by Prof. Junsong Yuan, where I investigate low-level perception bottlenecks in Vision-Language Models (VLMs) and explore the mechanistic interpretability of vision models, with a long-term interest in explainable and verifiable AI systems.

My work spans the entire AI lifecycle, from data curation to edge-device optimization. Most recently, I have focused on safety-critical applications, developing frameworks that align human intuition with model reasoning for tasks like high-density object counting. Previously, I spent three years at Citi developing innovative in-house tools to streamline and optimize inter team data workflows.

🔍 I am actively looking for Pre-Doc / Fellowship positions in Computer Vision and Machine Learning starting June 2026. Please reach out if you think I might be a good fit for your team!

News

[May 2026]: Won the CSE Faculty Choice Award at the University at Buffalo.
[May 2026]: Successfully defended my Master’s thesis: Structured Spatial Reasoning for Robust and Transparent Object Counting (Committee: Dr. J. Yuan, Dr. K. Ji, Dr. N. Xi).
[Nov 2025]: “Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting” paper accepted at WACV 2026 [Paper] [Code] [Video]
[May 2025]: Summer’25 Internship at Mercedes-Benz R&D in San Jose as a Machine Learning Computing Intern on the Autonomous Driving - Middleware team. Focused on model optimization and deployment across various hardware targets.

Email: rbhyri [at] buffalo dot edu

Rishikesh Bhyri

News