Forget finetuning encoders: representing human motion as structured text unlocks surprisingly strong performance on motion understanding tasks by directly leveraging LLMs' pretrained knowledge.
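The core idea above is serializing motion into text an LLM can read directly. A minimal sketch of one plausible serialization, assuming per-frame named joints with 3D coordinates (the exact format and names are illustrative, not from the paper):

```python
# Hedged sketch: turn a motion sequence (list of frames, each a dict of
# joint name -> (x, y, z)) into structured text an off-the-shelf LLM can
# consume. The "frame i: joint:(x,y,z)" layout is an assumption.

def motion_to_text(frames):
    """Serialize a motion sequence into one line of text per frame."""
    lines = []
    for i, joints in enumerate(frames):
        coords = " ".join(
            f"{name}:({x:.2f},{y:.2f},{z:.2f})" for name, (x, y, z) in joints.items()
        )
        lines.append(f"frame {i}: {coords}")
    return "\n".join(lines)

# Toy example: a two-frame clip with two joints.
clip = [
    {"hip": (0.0, 1.0, 0.0), "knee": (0.1, 0.5, 0.0)},
    {"hip": (0.0, 1.0, 0.1), "knee": (0.2, 0.5, 0.1)},
]
print(motion_to_text(clip))
```

The serialized string can then be placed in a prompt, letting the LLM's pretrained knowledge handle the understanding task with no encoder finetuning.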
Ditch the noise: FAVE achieves 10x faster sequential recommendations by learning a direct, one-step trajectory from user history to predicted item, bypassing the inefficient "noise-to-data" paradigm.
VLMs struggle to align assembly diagrams and videos because they occupy disjoint visual representation spaces, revealing a fundamental limitation in cross-modal understanding.
Shrinking a 2B vision-language retriever to a 70M text-only model achieves 95% of the original quality and outperforms a 2B baseline, while slashing query latency by 50x.
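One plausible way such a shrink works is embedding distillation: train the small text-only student to reproduce the large retriever's embeddings. A minimal sketch of the training objective, assuming both models emit same-dimension vectors (the loss choice and names are assumptions, not the paper's method):

```python
# Hedged sketch of embedding distillation: penalize the student for
# deviating from the teacher's embedding of the same query. Mean squared
# error is one common choice; cosine-based losses are another.

def mse_distill_loss(student_emb, teacher_emb):
    """Mean squared error between student and teacher embedding vectors."""
    assert len(student_emb) == len(teacher_emb), "embeddings must match in dimension"
    n = len(student_emb)
    return sum((s - t) ** 2 for s, t in zip(student_emb, teacher_emb)) / n

# Toy example: a perfectly matched pair has zero loss.
print(mse_distill_loss([0.2, -0.1, 0.7], [0.2, -0.1, 0.7]))  # 0.0
```

At query time only the small student runs, which is where the latency win comes from: the 2B teacher is used once, offline, to generate targets.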
Ditch global embeddings for text-motion retrieval: this method uses joint-angle motion images and token-patch late interaction to achieve state-of-the-art accuracy and interpretability.
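Token-patch late interaction, as named above, can be sketched in the ColBERT style: score each text token against its best-matching motion-image patch and sum the per-token maxima. The function and variable names below are illustrative:

```python
# Hedged sketch of late-interaction (MaxSim) scoring: each text-token
# embedding is matched to its most similar motion-image patch embedding,
# and the per-token maxima are summed into the retrieval score.

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def late_interaction_score(text_tokens, motion_patches):
    """Sum over text tokens of the max similarity to any motion patch."""
    return sum(max(dot(t, p) for p in motion_patches) for t in text_tokens)

# Toy example: 2 text-token embeddings scored against 3 patch embeddings.
text = [[1.0, 0.0], [0.0, 1.0]]
patches = [[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]]
score = late_interaction_score(text, patches)  # 0.9 + 0.8
```

Because each token's best patch is identified explicitly, the scoring is interpretable: one can inspect which body-motion patch each word matched, unlike a single global embedding similarity.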
Spotting coordinated fake reviewers just got easier: a new graph learning method boosts detection accuracy by adaptively weighing network diversity and similarity.
Prompt leakage attacks on multi-tenant LLMs are far more efficient than previously thought: a new RL-based method reconstructs prompts with over 12x fewer requests.