Guangming Yao

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Computer Vision (1)Tool Use & Agents (1)Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

Kaixiang Ji (2)Jingdong Chen (2)Cong Chen (1)Cong Chen (1)

Papers (2)

Jun 5, 2026

1w ago·also Ant Group, Central South University, HKU, HKUST +1

MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

MemDreamer narrows the performance gap with human experts in long video understanding to just 3.7 points while processing only 2% of the full context.

Cong Chen, Cong Chen, Guo Gan +16

Computer Vision Multimodal Models Tool Use & Agents

Jun 11, 2025

AI InclusionJun 11, 2025·also Didi International Business Group, Tencent AI, UNSW

Ming-Omni: A Unified Multimodal Model for Perception and Generation

GPT-4o now has open-source competition: Ming-Omni matches its modality support in a single, unified model capable of perception and generation across image, text, audio, and video.

A. Inclusion, Biao Gong, Cheng Zou +5529

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Speech & Audio

Search

Guangming Yao

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)