Current vision-language models are surprisingly bad at interpreting scientific figures, failing to match expert-level reasoning on a new benchmark of experimental images.
LLMs can now generate research roadmaps that are not only better but also far faster than those created by human experts, thanks to a novel multi-agent system.
Retrieval improvements in RAG don't always boost reasoning, but NeocorRAG's evidence chains can fix that, achieving state-of-the-art results with 80% fewer tokens.
GPT-5.1 barely cracks 50% accuracy when distinguishing real from AI-generated academic images, highlighting a stark gap between generative capability and forensic detection.
Reward models optimized for single-step generation can fail spectacularly when integrated into multi-stage LLM pipelines, but pipeline-aware training can fix this.