Tong Jia

Papers on Lattice

Total citations

Topics

h-index

Research focus

Distributed Systems & Hardware (1)RLHF & Preference Learning (1)Training Efficiency & Optimization (1)

Frequent co-authors

Lingzhe Zhang (1)Yunpeng Zhai (1)Liancheng Fang (1)Kening Zheng (1)

Papers (1)

May 6, 2026

Lingzhe Zhang +6May 6, 2026

Towards Robust LLM Post-Training: Automatic Failure Management for Reinforcement Fine-Tuning

RFT's Achilles heel? This benchmark reveals how fragile reinforcement fine-tuning is, and introduces an automated system to catch and fix training failures before they tank your LLM.

Lingzhe Zhang, Tong Jia, Yunpeng Zhai +4

Distributed Systems & Hardware RLHF & Preference Learning Training Efficiency & Optimization

Search

Tong Jia

Research focus

Frequent co-authors

Papers (1)