Branislav Kveton

Adobe Research

Papers on Lattice

Total citations

Topics

Research focus

Training Efficiency & Optimization (4)Computer Vision (3)RLHF & Preference Learning (2)Recommendation & Information Retrieval (2)Robotics & Embodied AI (1)

Frequent co-authors

B. Kveton (4)Subhojyoti Mukherjee (3)Michal Valko (3)Anup Rao (1)

Papers (8)

May 25, 2026

AdvantageFlow: Advantage-Weighted Least Squares for RL in Flow Models

Stabilizing advantage-weighted forward-process RL in flow models unlocks superior performance in image generation compared to reverse-process methods.

Branislav Kveton, Anup Rao, Subhojyoti Mukherjee +1

RLHF & Preference Learning Training Efficiency & Optimization

Apr 30, 2026

Apr 30, 2026·also INRIA, Intel Labs, Paris-Saclay, Pitt

Online semi-supervised perception: Real-time learning without explicit feedback

Forget reinforcement learning; this algorithm learns in real-time without any feedback at all.

B. Kveton, Branislav Kveton, Matthai Philipose +311

Computer Vision Robotics & Embodied AI

Apr 30, 2026·also INRIA, Paris-Saclay

Learning from a single labeled face and a stream of unlabeled data

Unlock face recognition with just one labeled example and a flood of unlabeled data, achieving state-of-the-art accuracy in a practical authentication scenario.

Branislav Kveton, B. Kveton, Michal Valko

Computer Vision Data Curation & Synthetic Data Training Efficiency & Optimization

Apr 27, 2026

BAIRApr 27, 2026·also Adobe Research, Cisco AI Research, Cisco Research, Dolby Laboratories +6

A Survey on LLM-based Conversational User Simulation

LLMs are revolutionizing conversational AI research, and this survey offers a structured guide to navigating the rapidly evolving landscape of LLM-powered user simulation.

B. Ni, Bo Ni, Yu Wang +34

Natural Language Processing Tool Use & Agents World Models & Planning

Apr 20, 2026

Apr 20, 2026·also Meta AI, Microsoft Research, Adobe Research, CERMICS École des Ponts ParisTech +5

Spectral bandits for smooth graph functions

Learning user preferences for thousands of items can be achieved with just a handful of evaluations, thanks to a novel approach that leverages effective dimension in graph-based bandit problems.

Michal Valko, Rémi Munos, Branislav Kveton +1

Recommendation & Information Retrieval

Mar 30, 2026

Yash Savani +3Mar 30, 2026·also Adobe Research

Stepwise Credit Assignment for GRPO on Flow-Matching Models

Correcting errors early in the diffusion process matters more than fixing them later: Stepwise-Flow-GRPO leverages this insight to dramatically improve RL-based flow model training.

Yash Savani, Branislav Kveton, Subhojyoti Mukherjee +1

Computer Vision RLHF & Preference Learning Training Efficiency & Optimization

Feb 18, 2026

Brno University of TechnologyFeb 18, 2026·also Adobe Research, KInIT, Slovak University of Technology in Bratislava

From Latent to Observable Position-Based Click Models in Carousel Interfaces

Eye-tracking data can boost click prediction in carousel interfaces, but surprisingly, better click prediction doesn't always mean a better model of user behavior.

Santiago de Leon-Martinez, Santiago de Leon-Martinez, Robert Moro +5

Recommendation & Information Retrieval

Feb 17, 2026

LLM-as-Judge on a Budget

Stop wasting compute on LLM evals: a variance-adaptive querying strategy slashes estimation error by focusing on the most uncertain prompt-response pairs.

Aadirupa Saha, Aniket Wagde, Branislav Kveton

Eval Frameworks & Benchmarks Training Efficiency & Optimization

Search

Branislav Kveton

Research focus

Frequent co-authors

Papers (8)