By reflecting on its own reasoning, ReflectRM achieves a +10.2-point improvement in mitigating positional bias over leading generative reward models, making it a far more stable evaluator.
Models can learn to distinguish on their own between tasks that require rigorous planning and those suited to direct generation in creative writing, unlocking a new level of meta-cognitive ability.
Multimodal embeddings get a serious upgrade with CoCoA, a new pre-training method that forces models to compress all input information into a single token for reconstruction, leading to substantial quality gains.
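The core idea behind compress-then-reconstruct pre-training can be illustrated with a minimal numpy sketch: the whole input sequence is squeezed into one summary vector, and a decoder is trained to rebuild every token from that vector alone. All names, dimensions, and the linear encoder/decoder here are illustrative assumptions, not CoCoA's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dimensions (hypothetical, chosen for illustration only)
seq_len, d_model = 8, 16

# A sequence of token embeddings standing in for the multimodal input
tokens = rng.normal(size=(seq_len, d_model))

# "Compression": collapse the sequence into a single vector, here via a
# linear projection of the mean-pooled tokens (stand-in for a real encoder)
W_compress = rng.normal(size=(d_model, d_model)) * 0.1
summary = tokens.mean(axis=0) @ W_compress          # shape: (d_model,)

# "Reconstruction": the decoder must rebuild every token embedding
# from that one summary vector alone
W_decode = rng.normal(size=(d_model, seq_len * d_model)) * 0.1
reconstruction = (summary @ W_decode).reshape(seq_len, d_model)

# The reconstruction error is the quantity pre-training would minimize;
# a good summary vector is one from which the input can be recovered
loss = float(np.mean((reconstruction - tokens) ** 2))
print(summary.shape, reconstruction.shape)
```

The bottleneck is the point of the exercise: because only one vector survives compression, minimizing the reconstruction loss pressures that vector to retain all of the input's information.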