Search papers, labs, and topics across Lattice.
Department of Computer Engineering, Dong-A University, Busan 49315, Republic of Korea
1
2
3
1
By dynamically tuning the discount factor based on policy entropy, RL agents can learn more stably and efficiently in complex robotic tasks, outperforming traditional fixed-discount approaches by 11%.