Search papers, labs, and topics across Lattice.
3
0
5
Single-step action generation can outperform multi-step diffusion methods in offline reinforcement learning, achieving higher performance with lower computational costs.
LLMs can match or beat human reviewers on specific aspects of peer review like novelty verification and critique prioritization, but they still exhibit critical blind spots that aggregate metrics miss.
Unlock the potential of your offline RL data: a new framework achieves state-of-the-art performance on D4RL benchmarks by quantifying and leveraging data uncertainty with a computationally efficient Rank-One MIMO architecture.