VLMs struggle to align assembly diagrams and videos because they occupy disjoint visual representation spaces, revealing a fundamental limitation in cross-modal understanding.
Shrinking a 2B vision-language retriever to a 70M text-only model achieves 95% of the original quality and outperforms a 2B baseline, while slashing query latency by 50x.
Ditch global embeddings for text-motion retrieval: this method uses joint-angle motion images and token-patch late interaction to achieve state-of-the-art accuracy and interpretability.
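The token-patch late interaction mentioned above can be sketched in a few lines. This is a generic ColBERT-style MaxSim scorer, not the paper's actual implementation: every name, dimension, and the use of random embeddings here are illustrative assumptions. Each text token is matched against its best motion-image patch, and the per-token maxima are summed into a single retrieval score, which is what makes the matching interpretable at the token level.

```python
import numpy as np

def late_interaction_score(text_tokens: np.ndarray,
                           motion_patches: np.ndarray) -> float:
    """ColBERT-style MaxSim scoring (illustrative sketch).

    text_tokens:    (T, d) L2-normalized text token embeddings
    motion_patches: (P, d) L2-normalized motion-image patch embeddings
    Returns the sum over text tokens of each token's best patch similarity.
    """
    sim = text_tokens @ motion_patches.T      # (T, P) cosine similarities
    return float(sim.max(axis=1).sum())       # per-token best match, summed

def l2norm(x: np.ndarray) -> np.ndarray:
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

# Toy example with random (hypothetical) embeddings.
rng = np.random.default_rng(0)
q = l2norm(rng.normal(size=(4, 8)))    # 4 text tokens, dim 8
p = l2norm(rng.normal(size=(16, 8)))   # 16 motion-image patches, dim 8
score = late_interaction_score(q, p)
print(score)
```

Because each text token contributes one identifiable best-matching patch, inspecting the argmax per token yields the token-to-patch alignments that global single-vector embeddings cannot provide.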