Ruize Han

Fudan University

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Computer Vision (5)Multimodal Models (4)Training Efficiency & Optimization (2)Natural Language Processing (1)

Frequent co-authors

Liang Wan (4)Wei Feng (3)Yuzhong Feng (2)Yifeng Wu (1)

Papers (5)

Jun 18, 2026

Jun 18, 2026·also Shenzhen University of Advanced, SUSTech, Tencent AI

Timage: A Generative Text-in-Image Paradigm for Fine-Tuning Vision-Language Models

Timage transforms the way we align text and images, achieving superior multimodal reasoning with a modest model size that outperforms larger competitors.

Yifeng Wu, Huimin Huang, Ruiluo Wu +5

Computer Vision Multimodal Models

Jun 8, 2026

Jun 8, 2026·also Fudan, Monash, Nanchang University

ExDet: Open-Domain Open-Vocabulary Detection with Cross-modal Extrapolation and Rectification

ExDet achieves state-of-the-art performance in open-domain open-vocabulary detection while significantly reducing training costs through innovative cross-modal techniques.

Yuzhong Feng, Ruize Han, Zhiwei Chen +2

Computer Vision Multimodal Models Training Efficiency & Optimization

Jun 8, 2026·also Fudan, Monash, State Administration of Cultural

RT-SDGOD: Real-Time Single-Domain Generalized Object Detection

Real-time object detectors can achieve cross-domain generalization without any extra inference overhead by leveraging collaborative evidence modeling during training.

Fangzhuo Gao, Ruize Han, Wei Feng +1

Computer Vision

May 27, 2026

May 27, 2026·also Monash

LV-OSD: Language-Vision-Complementary Open-Set Object Detection

Object detection gets a flexible upgrade: now you can specify objects with text *and* images, opening the door to more intuitive and practical real-world applications.

Ruize Han, Wei Feng, Liang Wan

Computer Vision Multimodal Models Natural Language Processing

May 26, 2026

May 26, 2026·also TJU

COVD: Continual Open-Vocabulary Object Detection with Novel Concept Injection

Freezing your visual encoder and carefully nudging the text embeddings lets you continually teach an object detector new tricks without catastrophic forgetting.

Ruize Han, Yuzhong Feng, Zixin Ren +2