Search papers, labs, and topics across Lattice.
School of Informatics, University of Edinburgh
1
0
3
9
You can slash the text encoder size in vision-language segmentation models by 88% without sacrificing performance, thanks to surprisingly high redundancy in how these models process prompts.