Search papers, labs, and topics across Lattice.
Corresponding author: Cuntai Guan, Guo-Sen Xie.Yong Li is with the School of Computer Science and Engineering, and the Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications, Southeast University, Nanjing 210096, China. E-mail: mysee1989@gmail.com.Tianyi Zhang is with Key Laboratory of Child Development and Learning Science (Ministry of Education), School of Biological Sciences and Medical Engineering, Southeast University, Nanjing, China. E-mail: t.zhang@seu.edu.cn.Yi Ren, Yizhe Zhang, Wenhua Zhang and Guo-Sen Xie are with the School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China. E-mail: yi.ren@njust.edu.cn, zhangyizhe@njust.edu.cn, whzhang@njust.edu.cn, gsxiehm@gmail.com.Cuntai Guan and Muyun Jiang are with the School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798. E-mail: (ctguan, james.jiang)@ntu.edu.sg.Manuscript received April 19, 2021; revised August 16, 2021
1
0
3
0
Injecting LLM-generated textual descriptions of facial action units into a vision model substantially boosts AU detection performance, suggesting a powerful way to leverage language priors in computer vision.