Pengfei Cai

Papers on Lattice

Total citations

Topics

h-index

Research focus

Speech & Audio (2)Multimodal Models (1)RLHF & Preference Learning (1)Computer Vision (1)

Frequent co-authors

Qing Gu (2)Yanfeng Shi (1)Nan Jiang (1)Li-Rong Dai (1)

Papers (2)

Apr 15, 2026

Yanfeng Shi +5Apr 15, 2026·also USTC

Towards Fine-grained Temporal Perception: Post-Training Large Audio-Language Models with Audio-Side Time Prompt

LALMs can gain a far more precise sense of time by simply interleaving learned time embeddings into their audio feature sequences and then being fine-tuned with RL.

Yanfeng Shi, Pengfei Cai, Qing Gu +3

Multimodal Models RLHF & Preference Learning Speech & Audio

Mar 16, 2026

Singapore Institute of TechnologyMar 16, 2026·also Meta AI, Austrian Institute of Technology, Duke

Spectrogram Features for Audio and Speech Analysis

The optimal spectrogram configuration for audio and speech analysis hinges on a nuanced interplay between front-end feature representation and back-end classifier architecture, varying significantly across tasks.

Ian McLoughlin, L. Pham, Yan Song +8

Computer Vision Speech & Audio

Search

Pengfei Cai

Research focus

Frequent co-authors

Papers (2)