Ismini Lourentzou

RECONTEXT boosts long-context reasoning in LLMs by effectively reusing evidence from the input, leading to superior performance without the need for retraining.

Yanjun Zhao, Ruizhong Qiu, Tianxin Wei +6

Reasoning & Chain-of-Thought

May 28, 2026

Xiaona Zhou +6May 28, 2026

Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection

Forget brute-force VLMs: parameter-efficient fine-tuning with high-quality rationales unlocks surprisingly accurate and interpretable time-series anomaly detection.

Xiaona Zhou, Muntasir Wahed, Muntasir Wahed +4

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Apr 9, 2026

Onkar Susladkar +10Apr 9, 2026

RewardFlow: Generate Images by Optimizing What You Reward

Forget GAN inversions – now you can steer diffusion models with a dynamically weighted soup of differentiable rewards, including a VQA-based reward for language-vision reasoning, and get SOTA image edits.

Onkar Susladkar, Dong-Hwan Jang, Tushar Prakash +8

Computer Vision Multimodal Models

Ying Shen +4Apr 9, 2026

Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics

Training video generation models to explicitly infer latent physical properties yields more physically plausible videos than simply scaling data and model size.

Ying Shen, J. Xiong, Jerry Xiong +2

Computer Vision Multimodal Models World Models & Planning

M. Ogunleye +2Apr 9, 2026

3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding

Hallucinations in 3D embodied agents can be significantly reduced at inference time by contrasting predictions under original and geometrically/semantically perturbed 3D scene graphs.

M. Ogunleye, Eman Abdelrahman, Ismini Lourentzou

Multimodal Models Robotics & Embodied AI Tool Use & Agents

Mar 19, 2026

Tianjiao Yu +7Mar 19, 2026

DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising

Text-to-3D generation gets a semantic upgrade: DreamPartGen creates 3D objects with parts that not only look right but also understand their relationships and align with textual descriptions.

Tianjiao Yu, Xinzhuo Li, Muntasir Wahed +5

Architecture Design (Transformers, SSMs, MoE)Computer Vision Multimodal Models

Feb 12, 2026

Onkar Susladkar +10Feb 12, 2026

Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching

Achieve SOTA multimodal performance across eight benchmarks and strong zero-shot generalization without task-specific training by decoupling understanding and generation via unified discrete flow matching.

Onkar Susladkar, Tushar Prakash, Gayatri S Deshmukh +8

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Reasoning & Chain-of-Thought

Search

Ismini Lourentzou

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (8)