LVLMs hallucinate in predictable bursts, and a self-rewarding decoding strategy cuts those errors roughly in half.
MLLMs aren't just improving video translation quality; they're fundamentally changing how we approach it by jointly optimizing for semantic accuracy, timing, speaker identity, and emotional nuance.
LRMs can slash up to 40% of reasoning tokens without sacrificing accuracy by dynamically adjusting their "thinking speed" at each step.
The chaos of LLM tool use research gets tamed: a new framework reveals the hidden evolutionary relationships between prompting, supervised learning, and RL-based approaches.
Token-level policy gradients fall short in complex reasoning tasks, but treating sequences of tokens as unified actions can significantly boost performance on mathematical and coding benchmarks.
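The sequence-as-action idea above can be illustrated with a minimal REINFORCE-style comparison. This is a hedged sketch, not the paper's method: the function names, the example log-probabilities, and the reward/baseline numbers are all hypothetical, chosen only to show where the two loss formulations differ.

```python
import numpy as np

def token_level_loss(logps, token_advantages):
    # Each token is treated as its own action, weighted by
    # its own (per-token) advantage estimate.
    return -np.sum(logps * token_advantages)

def sequence_level_loss(logps, seq_advantage):
    # The whole token sequence is one action: a single
    # advantage scales the summed log-probability.
    return -seq_advantage * np.sum(logps)

logps = np.array([-0.5, -1.2, -0.3])  # per-token log-probs of a sampled answer
reward = 1.0                           # e.g. the answer passes the checker
baseline = 0.4                         # hypothetical baseline value
adv = reward - baseline

tok = token_level_loss(logps, np.full_like(logps, adv))
seq = sequence_level_loss(logps, adv)
```

With a uniform per-token advantage the two losses coincide; they diverge once credit is assigned token by token (e.g. from per-step value estimates), which is where the sequence-level view changes the optimization landscape.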