Hao Zhang

Research focus

Computer Vision (3) · Multimodal Models (2) · World Models & Planning (2) · Reasoning & Chain-of-Thought (1) · Eval Frameworks & Benchmarks (1)

Frequent co-authors

Jiachun Jin (1) · Zetong Zhou (1) · Xiao Yang (1) · Pengfei Liu (1)

Papers (4)

Apr 2, 2026

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Ditching pixel-space translation unlocks a unified model (LatentUM) that reasons across modalities with SOTA results, opening doors to more efficient and aligned visual AI.

Jiachun Jin, Zetong Zhou, Xiao Yang +5

Computer Vision · Multimodal Models · Reasoning & Chain-of-Thought

Apr 1, 2026 · also RayNeo.AI

Fast and Accurate Probing of In-Training LLMs' Downstream Performances

Skip the costly generative evals: a simple probe trained on internal LLM representations can accurately predict downstream task performance during training, slashing evaluation time from an hour to just three minutes.

Zhichen Liu, Tianle Lun, Yulin Ou +4

Eval Frameworks & Benchmarks · Scaling Laws & Emergent Abilities · Training Efficiency & Optimization

Apr 1, 2026 · also SenseTime

ReinDriveGen: Reinforcement Post-Training for Out-of-Distribution Driving Scene Generation

Generate safety-critical driving scenarios with full trajectory control, even *beyond* your training data, using RL to fine-tune a video diffusion model.

Hao Zhang, Lue Fan, Zehuan Wu +1

Computer Vision · Data Curation & Synthetic Data · World Models & Planning

Apr 11, 2025 · also ByteDance

In-2-4D: Inbetweening from Two Single-View Images to 4D Generation

Forget generating 4D from text or a single image – this work lets you create compelling 3D animations by simply specifying the start and end poses in two images.

Sauradip Nag, D. Cohen-Or, Hao Zhang +16

Computer Vision · Multimodal Models · World Models & Planning
