Search papers, labs, and topics across Lattice.
3
0
6
Forget handcrafted losses: this paper uses human feedback and reinforcement learning to create infrared and visible image fusion that actually looks good to people.
Achieve state-of-the-art universal audio representation by unifying diverse audio tasks into a single next-token prediction framework, outperforming Whisper by a large margin.
Forget ImageNet: Xray-Visual sets a new SOTA for multimodal vision models by scaling to billions of social media data points with a novel three-stage training pipeline.