Lattice AI Research

Research focus

RLHF & Preference Learning (2)Training Efficiency & Optimization (2)Constitutional AI & AI Ethics (1)Interpretability & Mechanistic Interp (1)Tool Use & Agents (1)

Frequent co-authors

Zhenwei Tang (3)Ashton Anderson (2)Yilun Liu (1)Ye Yuan (1)

Papers (3)

Apr 20, 2026

Difan Jiao +6Apr 20, 2026·also W1H” paradigm

LLM Safety From Within: Detecting Harmful Content with Internal Representations

Harnessing the internal states of LLMs, SIREN outperforms traditional guard models while using a fraction of the parameters, revolutionizing harmful content detection.

Difan Jiao, Yilun Liu, Ye Yuan +4

Constitutional AI & AI Ethics Interpretability & Mechanistic Interp

Apr 8, 2026

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Rollout design in LLM reinforcement learning is more than just sampling trajectories – it's a modular pipeline you can optimize for reliability, coverage, and cost.

Rohan Surana, Gagan Mundada, Xunyi Jiang +19

RLHF & Preference Learning Tool Use & Agents Training Efficiency & Optimization

Apr 2, 2026

Apr 2, 2026·also Coolwei AI Lab

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Jointly training LLMs to reason and refine their answers unlocks significant performance gains, outperforming standard policy optimization by up to 11.5 points on AIME.

Difan Jiao, Qianfeng Wen, Blair Yang +2

Reasoning & Chain-of-Thought RLHF & Preference Learning Training Efficiency & Optimization

Search

Difan Jiao

Research focus

Frequent co-authors

Papers (3)