Lattice: the structure behind the noise

Created by Flynn Lachendro

Changda Zhou | Lattice

Papers on Lattice: 3
Total citations: 0
Topics: 6
h-index: 4

Publication activity (papers/week, last 8 weeks)

Research focus

Multimodal Models (3) · Computer Vision (2) · Architecture Design (Transformers, SSMs, MoE) (1) · Data Curation & Synthetic Data (1)

Frequent co-authors

Yubo Zhang (2) · Yue Zhang (2) · Tingquan Gao (2) · Zelun Zhang (2)

Papers (3)

Jun 11, 2026

Yubo Zhang +15 · 6d ago

PP-OCRv6: From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks

PP-OCRv6 outperforms billion-scale VLMs on OCR tasks with a fraction of the parameters, achieving state-of-the-art accuracy and speed.

Yubo Zhang, Xueqing Wang, Manhui Lin +13

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Multimodal Models

Jun 2, 2026

Zelun Zhang +8 · 2w ago

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

Targeted optimization in underperforming regions boosts document parsing accuracy to a record 96.33%, setting a new benchmark in the field.

Zelun Zhang, Yubo Zhang, Yiqing Xiang +6

Data Curation & Synthetic Data · Multimodal Models · Training Efficiency & Optimization

Mar 4, 2026

Crab$^{+}$: A Scalable and Unified Audio-Visual Scene Understanding Model with Explicit Cooperation

Multi-task AV-LLMs can actually *improve* performance over single-task models, if you carefully design the training data and explicitly model inter-task relationships to avoid negative transfer.

Dongnuan Cai, Dong Cai, Henghui Du +7

Computer Vision · Multimodal Models · Speech & Audio

Other research focus topics: Training Efficiency & Optimization (1) · Speech & Audio (1)

Other co-authors: Xueqing Wang (1)