Francesco Quinzan

University of Oxford

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Tool Use & Agents (1), World Models & Planning (1), Reasoning & Chain-of-Thought (1), RLHF & Preference Learning (1)

Frequent co-authors

Mary Chriselda Antony Oliver (1), Lan Jiang (1), Aaron Bundi Anampiu (1), Elaf Almahmoud (1)

Papers (2)

May 26, 2026

3w ago · also African Institute for Mathematical Sciences, University of Oxford

Learning to Orchestrate Agents under Uncertainty

Coordinating AI agents gets a reliability boost: BOT-Orch uses bandit learning and Optimal Transport to intelligently delegate tasks, even when agents are unpredictable.

Mary Chriselda Antony Oliver, Lan Jiang, Aaron Bundi Anampiu +3

Tool Use & Agents, World Models & Planning

University of Science and Technology · 3w ago · also London School of Economics and Political, University of Oxford

BASIS: Batchwise Advantage Estimation from Single-Rollout Information Sharing for LLM Reasoning

Single-rollout RL can rival multi-rollout performance for LLM reasoning, thanks to a new batchwise advantage estimation technique that dramatically improves value function accuracy.

Shijin Gong, Erhan Xu, Kai Ye +3

Reasoning & Chain-of-Thought, RLHF & Preference Learning, Training Efficiency & Optimization
