Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China; Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong SAR, China; State Key Laboratory of Nervous System Disorders, Hong Kong SAR, China; HKUST Shenzhen-Hong Kong Collaborative Innovation Research Institute, Futian, Shenzhen, China
By weighting Q-learning updates based on action similarity, QSIM tames overestimation in multi-agent RL, leading to more stable and effective learning.
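To make the idea concrete, the following is a minimal sketch of one possible reading of "weighting by action similarity"; it is not taken from the paper and should not be read as QSIM's actual formulation. It assumes the hard max in the standard Q-learning target is replaced by a softmax-weighted average over candidate next actions, where each weight depends on a similarity score to the greedy action. The function names, the `similarity` scores, and the `temperature` parameter are all hypothetical.

```python
import math


def max_target(reward, gamma, next_q):
    """Standard Q-learning target: bootstraps from the largest next-state Q-value."""
    return reward + gamma * max(next_q)


def similarity_weighted_target(reward, gamma, next_q, similarity, temperature=1.0):
    """Hypothetical similarity-weighted target (an assumption, not the paper's rule).

    next_q      -- Q-value estimates for candidate next actions
    similarity  -- assumed similarity of each candidate to the greedy action
    temperature -- assumed softmax temperature; small values approach the hard max
                   when the greedy action has the highest similarity
    """
    if len(next_q) != len(similarity):
        raise ValueError("next_q and similarity must have the same length")
    scaled = [s / temperature for s in similarity]
    peak = max(scaled)  # subtract the peak for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    weights = [e / total for e in exps]
    # A convex combination of Q-values can never exceed their maximum.
    blended = sum(w * q for w, q in zip(weights, next_q))
    return reward + gamma * blended
```

Because the weights form a convex combination, this target is never larger than the max-based one, which is the intuition for why such weighting could dampen overestimation. How similarity is defined between joint actions is left open here.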