Huimu Wang

Papers on Lattice

Research focus

RLHF & Preference Learning (1), Robotics & Embodied AI (1), Tool Use & Agents (1)

Frequent co-authors

Zhiqiang Pu (1), Xiaolin Ai (1)

Papers (1)

Mar 18, 2026

Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control

LLMs can act as effective action-level supervisors in reinforcement learning, dramatically boosting the sample efficiency of SAC without sacrificing convergence guarantees.

Zhiqiang Pu, Xiaolin Ai, Huimu Wang

RLHF & Preference Learning, Robotics & Embodied AI, Tool Use & Agents
