Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Joe Watson | Lattice

Joe Watson

University of Oxford

Papers on Lattice

2

Total citations

0

Topics

4

Publication activitypapers/week, last 8 weeks

Research focus

Robotics & Embodied AI (2)RLHF & Preference Learning (1)Training Efficiency & Optimization (1)World Models & Planning (1)

Frequent co-authors

Ingmar Posner (2)Christian Scherer (1)Theo Gruner (1)Daniel Palenicek (1)

Papers (2)

Jun 1, 2026

3w ago·also DFKI, Hessian.AI, Oxford, Research Department SAIROL +1

Coherent Off-Policy Improvement of Large Behavior Models with Learned Rewards

Learning dense rewards from expert demonstrations allows for over 90% success in complex manipulation tasks, outperforming traditional RL methods.

Christian Scherer, Joe Watson, Theo Gruner +3

RLHF & Preference Learning Robotics & Embodied AI

Mar 9, 2026

Hamish Flynn +2Mar 9, 2026·also Oxford

Posterior Sampling Reinforcement Learning with Gaussian Processes for Continuous Control: Sublinear Regret Bounds for Unbounded State Spaces

GP-PSRL can achieve sublinear regret bounds in continuous control even with unbounded state spaces, resolving prior theoretical limitations and opening the door to more complex RL settings.

Hamish Flynn, Joe Watson, Ingmar Posner

Robotics & Embodied AI Training Efficiency & Optimization World Models & Planning

Hamish Flynn (1)