Kuaishou Technology
MLLMs can't grasp metaphors in videos, revealing a surprising gap in their high-order cognitive abilities compared to humans.
LLMs exhibit a "Utopian bias" when simulating human behavior, converging towards an unrealistic "positive average person" and failing to capture individual differences and long-tail behaviors.
Context-augmented RL lets smaller MLLMs punch *way* above their weight, rivaling much larger models on reasoning tasks while dodging reward hacking.
Diffusion models' spatial reasoning potential can now be unlocked without expensive joint training, thanks to a plug-and-play framework that uses MLLMs for layout planning.