Lattice AI Research

Research focus

Computer Vision (2)Multimodal Models (2)Eval Frameworks & Benchmarks (1)Natural Language Processing (1)

Frequent co-authors

Pengfei Yue (1)Xingran Zhao (1)Juntao Chen (1)Wang Longchao (1)

Papers (2)

Mar 16, 2026

Pengfei Yue +5Mar 16, 2026

SEA-Vision: A Multilingual Benchmark for Comprehensive Document and Scene Text Understanding in Southeast Asia

Multimodal models stumble badly on low-resource Southeast Asian languages, as revealed by the new SEA-Vision benchmark for document and scene text understanding.

Pengfei Yue, Xingran Zhao, Juntao Chen +3

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Feb 25, 2026

CCCaption: Dual-Reward Reinforcement Learning for Complete and Correct Image Captioning

Ditch imperfect human annotations: this dual-reward RL approach trains image captioning models to be both more complete and more factually correct.

Zhijiang Tang, Linhua Wang, Jiaxin Qi +3

Computer Vision Multimodal Models Natural Language Processing

Search

Peng Hou

Research focus

Frequent co-authors

Papers (2)