Jiayu Wang

Current video world models may look good on the surface, but they fail to handle critical reasoning tasks, revealing a gap in trustworthiness that could jeopardize robotic manipulation safety.

Huiqiong Li, Jiayu Wang, Zhiting Mei +2

Eval Frameworks & Benchmarks Robotics & Embodied AI World Models & Planning

May 26, 2026

May 26, 2026·also CAS, HKUST, MBZUAI, The Hong Kong University of Technology +2

GeoFaith: A Spatio-Temporal Dual View of Faithful Chain-of-Thought

LLMs can be steered to generate more faithful reasoning chains without sacrificing accuracy using a novel geometric and entropy-based framework, outperforming even GPT-4 in faithfulness detection.

Weijiang Lv, Wentong Zhao, Jiayu Wang +2

Eval Frameworks & Benchmarks Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought

Apr 21, 2026

Hao Li +31Apr 21, 2026·also CAS, HIT, OPPO, PolyU +2

LoViF 2026 Challenge on Real-World All-in-One Image Restoration: Methods and Results

Unified benchmarks reveal the state-of-the-art in simultaneously addressing multiple real-world image degradations like blur, low-light, and rain.

Hao Li, Naiwei Chen, Shengyuan Li +29

Computer Vision Eval Frameworks & Benchmarks

Apr 1, 2026

More Human, More Efficient: Aligning Annotations with Quantized SLMs

Forget giant models: A carefully trained, quantized SLM can beat proprietary LLMs at aligning with human annotators.

Jiayu Wang

Data Curation & Synthetic Data Inference & Quantization Open-Source Models & Weights

Mar 17, 2026

Mar 17, 2026·also Tsinghua AI, CAS, Hangzhou High-Tech Zone (Binjiang, Institute of Blockchain and Data

$D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation

Achieve diffusion-level perceptual quality in monocular depth estimation at 40x the speed, by replacing the slow initial diffusion steps with a fast ViT-based depth map and refining in a compact latent space.

Ruizhi Wang, Weihan Li, Zunlei Feng +2

Architecture Design (Transformers, SSMs, MoE)Computer Vision Inference & Quantization

Mar 5, 2026

Mar 5, 2026·also ZJU

Functionality-Oriented LLM Merging on the Fisher--Rao Manifold

By merging models on the Fisher-Rao manifold, this work achieves stable and accurate LLM merging even with many heterogeneous models, overcoming the representation collapse issues plaguing simpler weight averaging techniques.

Jiayu Wang, Zuojun Ye, Wenpeng Yin

Architecture Design (Transformers, SSMs, MoE)Open-Source Models & Weights Training Efficiency & Optimization

Feb 23, 2026

Feb 23, 2026·also Salesforce AI, ZJU

SkillOrchestra: Learning to Route Agents via Skill Transfer

SkillOrchestra slashes the learning costs of AI agent orchestration by up to 700x while improving performance by explicitly modeling agent skills and costs, offering a more scalable and interpretable alternative to RL-based methods.

Jiayu Wang, Yifei Ming, Zixuan Ke +3

RLHF & Preference Learning Tool Use & Agents

Search

Jiayu Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (8)