Ran He

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Reasoning & Chain-of-Thought (5)RLHF & Preference Learning (4)Multimodal Models (3)Architecture Design (Transformers, SSMs, MoE) (2)

Frequent co-authors

Yuanda Xu (3)Hejian Sang (3)Zhengze Zhou (2)Zhipeng Wang (2)

Papers (12)

Jul 6, 2026

Yuanda Xu +132w ago·also LinkedIn Corporation

TREK: Distill to Explore, Reinforce to Refine

TREK transforms the way models tackle challenging prompts by expanding their exploration support, leading to substantial performance gains even in the hardest task scenarios.

Yuanda Xu, Zhengze Zhou, Kayhan Behdin +11

Reasoning & Chain-of-Thought RLHF & Preference Learning

Jun 29, 2026

3w ago·also CAS, NJU

On the Vulnerability of Parameter-Level Defenses to Model Merging

Parameter-level defenses against model merging are fundamentally flawed, allowing attackers to exploit their weaknesses with a new Anchor-Guided Attack.

Kuangpu Guo, Qingyan Zheng, Yongcan Yu +3

Architecture Design (Transformers, SSMs, MoE)Red-Teaming & Adversarial Robustness

Jun 12, 2026

OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

Fine-tuning on the new OmniVideo-100K dataset boosts model performance by over 20% in audio-visual reasoning tasks, revealing the power of structured scripts in enhancing multimodal understanding.

Xinyue Cai, Chaoyou Fu, Yifan Zhang +2

Multimodal Models Reasoning & Chain-of-Thought

May 22, 2026

Pin Wang +2May 22, 2026

Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super Resolution

Hallucinated details in super-resolution are not just random noise; they reveal a fundamental spectral mismatch that can be corrected by shaping the generative process itself.

Pin Wang, Chao Zhou, Ran He

Computer Vision Red-Teaming & Adversarial Robustness

May 1, 2026

Zihan Lin +8May 1, 2026

ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

LLMs can reason better and generate more diverse outputs by projecting negative samples onto a positive subspace during reinforcement learning.

Zihan Lin, Xiaohan Wang, Jie Cao +6

Reasoning & Chain-of-Thought RLHF & Preference Learning

Apr 23, 2026

Apr 23, 2026·also Meituan

Understanding and Mitigating Spurious Signal Amplification in Test-Time Reinforcement Learning for Math Reasoning

Test-time RL's vulnerability to noisy pseudo-labels is amplified by group-relative advantage estimation, but can be mitigated with a surprisingly simple debiasing and denoising approach.

Yongcan Yu, Lingxiao He, Jian Liang +5

Reasoning & Chain-of-Thought RLHF & Preference Learning

Apr 22, 2026

SpeechParaling-Bench: A Comprehensive Benchmark for Paralinguistic-Aware Speech Generation

Current audio-language models are surprisingly bad at controlling and interpreting subtle vocal cues, failing in nearly half of situational dialogue scenarios.

Ruohan Liu, Shukang Yin, Weiji Zhuang +4

Eval Frameworks & Benchmarks Speech & Audio

Apr 20, 2026

Qihang Fan +3Apr 20, 2026·also Department of Cardiology

Advancing Vision Transformer with Enhanced Spatial Priors

EVT achieves 86.6% top-1 accuracy on ImageNet-1k without extra training data, redefining the potential of Vision Transformers in computer vision.

Qihang Fan, Mingrui Chen, Hongmin Liu +1

Architecture Design (Transformers, SSMs, MoE)Computer Vision

Apr 15, 2026

Yuanda Xu +4Apr 15, 2026·also LinkedIn Corporation

TIP: Token Importance in On-Policy Distillation

Overconfident tokens, often missed by entropy-based methods, carry surprisingly dense corrective signals in on-policy distillation, allowing for near-baseline performance with <10% of tokens.

Yuanda Xu, Hejian Sang, Ran He +2

Inference & Quantization Training Efficiency & Optimization

Apr 12, 2026

Apr 12, 2026·also BIT, BUPT, CAS, PKU

OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction

Robots can now learn contact-rich manipulation skills like humans by feeling the forces involved, thanks to a new multimodal interface that captures synchronized visual, tactile, and force data.

Yuanyuan Li, Chaoran Xu, Jiachen Zhang +2

Multimodal Models Robotics & Embodied AI

Feb 26, 2026

Feb 26, 2026·also Beihang, CAS, HUST, NJU +3

The Trinity of Consistency as a Defining Principle for General World Models

A principled framework for General World Models reveals the limitations of current systems and the architectural requirements for future progress.

Jingxuan Wei, Siyuan Li, Yuhang Xu +33

Multimodal Models Scaling Laws & Emergent Abilities World Models & Planning

Feb 24, 2026

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Overconfident errors in RLVR monopolize probability mass and suppress exploration, but a confidence-aware penalty fixes this and boosts mathematical reasoning performance.

Yuanda Xu, Hejian Sang, Zhengze Zhou +2

Reasoning & Chain-of-Thought RLHF & Preference Learning

Search

Ran He

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (12)