Welcome! I'm Adheesh Juvekar

PhD Student at University of Illinois at Urbana-Champaign

I'm a Computer Science PhD student at PLAN Lab, UIUC, advised by Dr. Ismini Lourentzou. My research interests are in Multimodal Learning and Generative AI, with a focus on building models that can understand and create visual content in ways that are grounded, controllable, and interpretable.

I’m open to collaborations and always interested in discussing new research ideas, feel free to reach out via email. If you’re a self-motivated undergraduate (especially at UIUC) interested in working on Generative AI / multimodal projects feel free to reach out via email.

Research Areas

Multimodal Learning & Generative AI

I'm particularly interested in multimodal understanding, image/video generation. Currently, I am exploring hallucination mitigation in grounded vision-language models and controllable image/video generation with diffusion/flow-based models. I'm also interested in emerging direction at the intersection of generation and perception such as 3D-consistent generation, egocentric video understanding/editing as a step towards reliable and robust world models.

Recent Publications

One Editor, Many Edits: A Unified Training-free Framework for Diverse Video Edits

Adheesh Juvekar, Onkar Kishor Susladkar, Kiet A. Nguyen, Nabeel Bashir, Xiaona Zhou, Muntasir Wahed, Vedant Shah, Ismini Lourentzou

In preparation • 2026

In preparation

GraphVid: Interactive Graph Control Video Generation

Vedant Shah, Onkar Kishor Susladkar, Tushar Prakash, Kiet A. Nguyen, Tianjiao Yu, Adheesh Juvekar, Muntasir Wahed, Ismini Lourentzou

In preparation • 2026

In preparation

Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching

Onkar Susladkar, Tushar Prakash, Gayatri Deshmukh, Kiet A Nguyen, Jiaxun Zhang, Adheesh Juvekar, Tianshu Bao, Lin Chai, Sparsh Mittal, Inderjit S Dhillon, Ismini Lourentzou

arXiv preprint arXiv:2602.12221 • 2026

PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation

Onkar Susladkar, Tushar Prakash, Adheesh Juvekar, Kiet A Nguyen, Dong-Hwan Jang, Inderjit S Dhillon, Ismini Lourentzou

arXiv preprint arXiv:2601.16210 • Accepted at CVPR 2026

In preparation for camera-ready

Counterfactual Segmentation Reasoning: Diagnosing and Mitigating Pixel-Grounding Hallucination

Xinzhuo Li*, Adheesh Juvekar*, Xingyou Liu, Muntasir Wahed, Kiet A Nguyen, Ismini Lourentzou

arXiv preprint arXiv:2506.21546 • Accepted at CVPR 2026

* Equal contribution • In preparation for camera-ready

RewardFlow: Generate Images by Optimizing What You Reward

Onkar Kishor Susladkar, Dong-Hwan Jang, Tushar Prakash, Adheesh Juvekar, Vedant Shah, Ayush Barik, Muntasir Wahed, Ritish Shrirao, Ismini Lourentzou

Accepted at CVPR 2026

In preparation for camera-ready

CALICO: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models

Kiet A Nguyen, Adheesh Juvekar, Tianjiao Yu, Muntasir Wahed, Ismini Lourentzou

Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) • 2025

Uncertainty in Action: Confidence Elicitation in Embodied Agents

Tianjiao Yu, Vedant Shah, Muntasir Wahed, Kiet A. Nguyen, Adheesh Juvekar, Tal August, Ismini Lourentzou

arXiv preprint arXiv:2503.10628 • 2025

Prima: Multi-image vision-language models for reasoning segmentation

Muntasir Wahed*, Kiet A Nguyen*, Adheesh Juvekar, Xinzhuo Li, Xiaona Zhou, Vedant Shah, Tianjiao Yu, Pinar Yanardag, Ismini Lourentzou

arXiv preprint arXiv:2412.15209 • 2024

* Equal contribution

MetaCompare 2.0: Differential ranking of ecological and human health resistome risks

Monjura Afrin Rumi, Min Oh, Benjamin C Davis, Connor L Brown, Adheesh Juvekar, Peter J Vikesland, Amy Pruden, Liqing Zhang

FEMS Microbiology Ecology • 2024