Lan-Zhe Guo

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (2)RLHF & Preference Learning (2)Tool Use & Agents (2)Multimodal Models (2)Reasoning & Chain-of-Thought (2)

Frequent co-authors

Jiaxuan Wang (1)Yulan Hu (1)Wenjin Yang (1)Wenjing Yang (1)

Papers (4)

Apr 9, 2026

Jiaxuan Wang +6Apr 9, 2026·also Group

Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling

Current reward models struggle to distinguish good vs. bad agent behavior in complex tool-using scenarios, especially over long horizons, revealing a critical gap in alignment research.

Jiaxuan Wang, Yulan Hu, Wenjin Yang +4

Eval Frameworks & Benchmarks RLHF & Preference Learning Tool Use & Agents

Mar 18, 2026

Mar 18, 2026·also Didichuxing Co. Ltd

A Progressive Visual-Logic-Aligned Framework for Ride-Hailing Adjudication

An 8B parameter model, RideJudge, outperforms 32B baselines in ride-hailing dispute adjudication by aligning visual semantics with evidentiary protocols, achieving 88.41% accuracy.

Weiming Wu, Zi-Jian Cheng, Jie Meng +6

Computer Vision Multimodal Models Reasoning & Chain-of-Thought

Mar 17, 2026

NeSy-Route: A Neuro-Symbolic Benchmark for Constrained Route Planning in Remote Sensing

Current MLLMs struggle with even basic route planning in remote sensing, highlighting a critical gap in their ability to translate perception into action in complex, real-world scenarios.

Zhi Zhou, Shi-Yu Tian, Kun-Yang Yu +1

Eval Frameworks & Benchmarks Multimodal Models World Models & Planning

Mar 7, 2026

Huihan Tan +9Mar 7, 2026

Hindsight Credit Assignment for Long-Horizon LLM Agents

LLM agents can learn to solve complex, long-horizon tasks much more effectively by using themselves as post-hoc critics to refine Q-values through hindsight reasoning.

Huihan Tan, Xiao-Wen Yang, Hao Chen +7

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Lan-Zhe Guo

Research focus

Frequent co-authors

Papers (4)