Search papers, labs, and topics across Lattice.
2
0
6
VLMs can be transformed into pixel-precise structural document parsing experts, achieving state-of-the-art OCR performance by enforcing syntactic validity and structural integrity through reinforcement learning.
Unlock better Automatic Chord Recognition by distilling knowledge from readily available pre-trained models using pseudo-labels on unlabeled data, surpassing the performance of both supervised baselines and the original teacher model.