Zhouyang Jiang

Papers on Lattice

Total citations

Topics

Research focus

RLHF & Preference Learning (1), Robotics & Embodied AI (1)

Frequent co-authors

Yuanjun Li (1), Bin Zhang (1)

Papers (1)

Feb 26, 2026 · also State Key Laboratory of Nervous System Disorders, ZJU

QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning

By weighting Q-learning updates based on action similarity, QSIM tames overestimation in multi-agent RL, leading to more stable and effective learning.

Yuanjun Li, Bin Zhang +8

RLHF & Preference Learning, Robotics & Embodied AI
