Forget relying solely on final-layer features: intermediate layers in Vision Transformers hold untapped potential for boosting face image quality assessment.
Turns out, your pre-trained face recognition ViT already knows which faces are high quality, just by looking at the attention maps.