Nanye Ma


Research focus

Computer Vision (2)
Multimodal Models (2)
Eval Frameworks & Benchmarks (1)
Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

Sihyun Yu (1)
Pinzhi Huang (1)
Hyunseok Lee (1)
Shusheng Yang (1)

Papers (2)

Jun 2, 2026

also KAIST

Benchmarking Visual State Tracking in Multimodal Video Understanding

MLLMs are failing to visually track events in videos, performing only modestly above baseline despite strong results on other benchmarks.

Sihyun Yu, Nanye Ma, Pinzhi Huang +8

Computer Vision · Eval Frameworks & Benchmarks · Multimodal Models

Feb 27, 2026

Stanford HAI · also NVIDIA

Mode Seeking meets Mean Seeking for Fast Long Video Generation

Generates minute-long videos with compelling narrative structure and local realism, even with limited long-form training data, by combining supervised flow matching for global coherence with mode-seeking alignment to a short-video teacher for local fidelity.

Shengqu Cai, Weili Nie, Nanye Ma +3

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Multimodal Models
