Spatial awareness is the secret ingredient for better visual in-context learning, boosting performance across diverse vision tasks.
Achieve state-of-the-art long-horizon video understanding by compressing multimodal memories into high-level semantic schemas, enabling efficient reasoning without losing crucial details.
Context-augmented RL lets smaller MLLMs punch *way* above their weight, rivaling much larger models on reasoning tasks while dodging reward hacking.
Task-oriented dialogue agents can now learn to balance user satisfaction and operational costs, thanks to a new RL framework that optimizes for both.
LLMs learn to recommend better by looking inside themselves, using intermediate layer activations to generate harder negatives on the fly.