Zhihao Yang

Papers on Lattice

Total citations

Topics

h-index

Research focus

Reasoning & Chain-of-Thought (2)Multimodal Models (1)Training Efficiency & Optimization (1)Code Generation & Program Synthesis (1)Tool Use & Agents (1)

Frequent co-authors

Yukun Chen (1)Ze Gong (1)Jingpeng Li (1)Hengyu Chang (1)

Papers (2)

Feb 25, 2026

Feb 25, 2026·also CAS, Sydney, UNSW

RuCL: Stratified Rubric-Based Curriculum Learning for Multimodal Large Language Model Reasoning

MLLMs can be significantly boosted by curriculum learning that focuses on reward design rather than data selection, dynamically weighting generalized rubrics based on the model's evolving competence.

Yukun Chen, Ze Gong, Jingpeng Li +5

Multimodal Models Reasoning & Chain-of-Thought Training Efficiency & Optimization

Feb 19, 2026

Siyu Wang +9Feb 19, 2026

AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation

Forget static agent communication graphs: AgentConductor uses RL to dynamically rewire agent interactions based on task difficulty, slashing token costs by up to 68% while boosting code generation accuracy.

Siyu Wang, R. Lu, Zhihao Yang +7

Code Generation & Program Synthesis Reasoning & Chain-of-Thought Tool Use & Agents

Search

Zhihao Yang

Research focus

Frequent co-authors

Papers (2)