Simon Jenni

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (3)Computer Vision (2)Recommendation & Information Retrieval (1)Tool Use & Agents (1)

Frequent co-authors

Jianglin Lu (2)Kushal Kafle (2)Jing Shi (2)Handong Zhao (2)

Papers (3)

Apr 6, 2026

Jianglin Lu +42w ago·also Northeastern

The Indra Representation Hypothesis for Multimodal Alignment

Unimodal models might already understand each other better than we thought: a shared relational structure, formalized via category theory, unlocks zero-shot cross-modal alignment.

Jianglin Lu, Hailing Wang, Kuo Yang +2

Multimodal Models

Feb 24, 2026

Seeing Through Words: Controlling Visual Retrieval Quality with Language Models

You can drastically improve text-to-image retrieval from short, ambiguous queries by using a language model to generate richer, quality-aware descriptions.

Jianglin Lu, Simon Jenni, Kushal Kafle +2

Computer Vision Multimodal Models Recommendation & Information Retrieval

Feb 19, 2026

Feb 19, 2026·also SJTU

RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward

Forget handcrafted metrics: RetouchIQ uses an RL-tuned MLLM to generate its own reward signals for instruction-based image editing, leading to more semantically consistent and perceptually pleasing results.

Qiucheng Wu, Jing Shi, Simon Jenni +3

Computer Vision Multimodal Models Tool Use & Agents

Search

Simon Jenni

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)