University of Science and Technology of China
Stop blasting your diffusion models with a single, static reward signal: fine-grained credit assignment across denoising steps and objectives unlocks better image and video generation.
Forget imitation: reward-aware trajectory shaping lets few-step generative models outperform their multi-step teachers.
Stop hard-coding reasoning strategies for your LLM agent: a learned router that dynamically picks the best paradigm for each task boosts performance by up to 5.5%, beating even the best fixed strategy.