Ziwei Liu

Nanyang Technological University

Research focus

Computer Vision (6), Multimodal Models (5), World Models & Planning (2), Robotics & Embodied AI (2)

Frequent co-authors

Yuanhan Zhang (2), Bo Li (2), Runmao Yao (2), Fangzhou Hong (2)

Papers (7)

Jun 15, 2026

2d ago·also Stanford HAI, CUHK, NTU, Shanghai Innovation

PermaVid: Consistent Video Generation Across Edits via Disentangled Context Memory

PermaVid achieves unprecedented long-term consistency in video generation, even after significant edits, by disentangling appearance and geometry in its memory architecture.

Shuai Yang, Bingjie Gao, Ziwei Liu +3

Computer Vision, Multimodal Models

Jun 8, 2026

1w ago·also HUST, NTU, S-Lab, Shopee Pte. Ltd.

Prisma-World: Camera-Controllable Multi-Agent Video World Model

Prisma-World achieves unprecedented cross-view consistency in multi-agent video generation by leveraging a joint geometry-aware denoising process.

Huiqiang Sun, Zhan Peng, Size Wu +8

Multimodal Models, World Models & Planning

Jun 1, 2026

Xiang Xu +7·2w ago·also NJUPT, NTU, SKL-TI

Not All Points Are Equal: Uncertainty-Aware 4D LiDAR Scene Synthesis

U4D reveals that leveraging spatial uncertainty can drastically enhance the quality of LiDAR scene synthesis, achieving unprecedented fidelity and coherence.

Xiang Xu, Alan Liang, Youquan Liu +5

Computer Vision, Robotics & Embodied AI

May 27, 2026

S-Lab·3w ago·also DLUT Website, EvolvingLMMs-Lab/NEO, github.com, NTU +8

From Pixels to Words -- Towards Native One-Vision Models at Scale

Ditching modular architectures unlocks surprisingly competitive vision-language performance, proving that end-to-end pixel-to-word models can rival traditional approaches at scale.

Haiwen Diao, Jiahao Wang, Penghao Wu +16

Architecture Design (Transformers, SSMs, MoE), Computer Vision, Multimodal Models

May 26, 2026

3w ago·also Fudan, HUST, Northwestern, NTU +1

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Spatial foundation models aren't as "all-round" as we thought: SpatialBench reveals surprising generalization gaps and the critical importance of domain alignment over naive data scaling.

Haosong Peng, Hao Li, Jiaqi Chen +9

Computer Vision, Eval Frameworks & Benchmarks, Multimodal Models

May 25, 2026

Xiang An +28·3w ago·also ERNIE Team, NTU, S-Lab, SenseTime +3

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

LLaVA-OV-2's codec-stream tokenization lets it outperform existing video-language models, especially on tasks requiring fine-grained temporal understanding of high-frequency motion.

Xiang An, Yin Xie, Feilong Tang +26

Computer Vision, Eval Frameworks & Benchmarks, Multimodal Models

May 20, 2026

May 20, 2026·also ACE Robotics PhysX-Omni

PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects

A single framework now generates simulation-ready 3D assets for rigid, deformable, and articulated objects, unlocking new possibilities for embodied AI and physics-based simulation.

Ziang Cao, Yinghao Liu, Haitian Li +5

Computer Vision, Robotics & Embodied AI, World Models & Planning
