University of Maryland College Park, Indian Institute of Technology Delhi, Indraprastha Institute of Information Technology Delhi, Jaypee Institute of Information Technology Noida
3
0
7
5
Open-source diffusion models can now achieve state-of-the-art illumination control rivaling closed-source alternatives, thanks to a novel training pipeline and dataset.
Audio-language models can now reason about 30-minute-long audio clips with timestamp-grounded intermediate steps, unlocking a new level of fine-grained understanding.
Large audio-language models (LALMs) can be easily tricked into "hearing" things that aren't there, with success rates as high as 95% on targeted attacks.