Search papers, labs, and topics across Lattice.
4
0
5
4
LVLMs are better at spotting their own mistakes than generating correct answers in the first place, and this self-awareness can be exploited to reduce hallucinations.
Forget external teachers – the best way to boost your RL model's performance is to learn from its future self.
EasyVideoR1 achieves a 1.47 times throughput improvement in video understanding tasks by eliminating redundant video decoding and leveraging a comprehensive task-aware reward system.
Fake news in short videos often betrays itself through subtle inconsistencies between text, visuals, and audio, a weakness MAGIC3 exploits to achieve VLM-level accuracy at a fraction of the cost.