Indian Institute of Technology Kharagpur
VLMs hallucinate less when you force them to "think twice" by contrasting language-driven and vision-driven token probabilities at each decoding step.
Pose-guided GANs and diffusion models can faithfully generate complex cultural dance postures, opening new avenues for digital preservation and education.