Injecting LLM-generated textual descriptions of facial action units (AUs) into a vision model substantially boosts AU detection performance, suggesting a powerful way to leverage language priors in computer vision.