Hongyu Li

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Computer Vision (3)Multimodal Models (3)Architecture Design (Transformers, SSMs, MoE) (2)Eval Frameworks & Benchmarks (1)

Frequent co-authors

Manyuan Zhang (3)Dian Zheng (2)Xiang Chen (1)Hao Li (1)

Papers (5)

Apr 21, 2026

Xiang Chen +49Apr 21, 2026·also TU Munich, University of Science and Technology

LoViF 2026 Challenge on Real-World All-in-One Image Restoration: Methods and Results

Unified benchmarks reveal the state-of-the-art in simultaneously addressing multiple real-world image degradations like blur, low-light, and rain.

Xiang Chen, Hao Li, Jiangxin Dong +47

Computer Vision Eval Frameworks & Benchmarks

Mar 30, 2026

Kaituo Feng +8Mar 30, 2026

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Image generation takes a leap towards real-world knowledge by training an agent that actively searches for and integrates external information, substantially boosting performance on knowledge-intensive tasks.

Kaituo Feng, Manyuan Zhang, Yunlong Lin +6

Computer Vision Multimodal Models Tool Use & Agents

Mar 29, 2026

Meituan LongCat Team +89Mar 29, 2026·also Central South University, LongCat Team, Meituan

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

LongCat-Next shatters the language-centric paradigm by unifying text, vision, and audio into a single autoregressive model with minimal modality-specific design, finally reconciling understanding and generation in discrete vision modeling.

Meituan LongCat Team, Mei Xiao, Chao Wang +87

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Natural Language Processing

Mar 19, 2026

Yue Gong +10Mar 19, 2026

RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing

Representation-Pivoted Autoencoders enable diffusion models to generate and edit images with higher fidelity by learning a compressed latent space that preserves the semantics of pre-trained visual representations.

Yue Gong, Hongyu Li, Shanyuan Liu +8

Architecture Design (Transformers, SSMs, MoE)Computer Vision

Feb 23, 2026

Feb 23, 2026·also CMU ML

NovaPlan: Zero-Shot Long-Horizon Manipulation via Closed-Loop Video Language Planning

Robots can now perform intricate assembly tasks and recover from errors in real-time, without any training, by fusing vision-language models with video-based kinematic priors for action planning.

Jiahui Fu, Junyu Nan, Lingfeng Sun +4

Multimodal Models Robotics & Embodied AI World Models & Planning