Conghui He

Papers on Lattice

Total citations

910

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (3)Computer Vision (2)Data Curation & Synthetic Data (1)Scientific Discovery & Drug Design (1)

Frequent co-authors

Wenqi Shao (2)Xuanhe Zhou (2)Yinan He (2)Songze Li (2)

Papers (5)

Mar 29, 2026

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

The medical imaging AI community is being held back by a fragmented data landscape, but a new metadata-driven fusion paradigm offers a path to unlocking the power of foundation models.

Zhongying Deng, Cheng Tang, Ziyan Huang +121

Computer Vision Data Curation & Synthetic Data Scientific Discovery & Drug Design

Feb 26, 2026

Microsoft ResearchFeb 26, 2026·also Tsinghua AI, Beihang, CAS, Shanghai AI Lab +1

MoDora: Tree-Based Semi-Structured Document Analysis System

LLMs can now more accurately answer questions on complex documents thanks to a new system that understands layout and hierarchical relationships between document components.

Bangrui Xu, Qihang Yao, Qihang Yao +11

Computer Vision Natural Language Processing Recommendation & Information Retrieval

Feb 26, 2026·also CAS, HUST, Shanghai AI Lab, Westlake

The Trinity of Consistency as a Defining Principle for General World Models

A principled framework for General World Models reveals the limitations of current systems and the architectural requirements for future progress.

Jingxuan Wei, Jingxuan Wei, Siyuan Li +38

Multimodal Models Scaling Laws & Emergent Abilities World Models & Planning

Jun 12, 2025

Jun 12, 2025·also NUS, Tsinghua AI, CAS, Hebei University of Science and Technology +3

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Current LLMs and VLMs struggle with multi-step reasoning in long videos, often failing to maintain temporal coherence and procedural validity, as revealed by a new benchmark of hour-long narratives.

Jiashuo Yu, Yue Wu, Meng Chu +149

Eval Frameworks & Benchmarks Multimodal Models Reasoning & Chain-of-Thought

Apr 14, 2025

Tsinghua AIApr 14, 2025·also NUS, CUHK, Deakin, Fudan +10

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Open-source multimodal models just leveled up: InternVL3 rivals closed-source titans like GPT-4o by pre-training vision and language together from the start.

Jinguo Zhu, Weiyun Wang, Zhe Chen +45901

Multimodal Models Open-Source Models & Weights Training Efficiency & Optimization

Search

Conghui He

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (5)