Jie Yu

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (2)Distributed Systems & Hardware (1)Training Efficiency & Optimization (1)Natural Language Processing (1)

Frequent co-authors

Shasha Li (2)Dongfang Li (1)Xiaodong Luo (1)Ruoyu Sun (1)

Papers (3)

Jul 22, 2026

4d ago·also Changsha University of Science and Technology

SLAI T-Rex: Full-Parameter Post-training of the DeepSeek-V4 Family on Ascend SuperPOD

Achieving a 71.81% zero-shot Pass@1 score, the DeepSeek-V4-Flash model outperforms leading competitors by leveraging a novel optimization framework on Ascend SuperPOD.

Dongfang Li, Xiaodong Luo, Ruoyu Sun +85

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Training Efficiency & Optimization

Jul 21, 2026

Jinying Xiao +85d ago

SkillSight: Seeing Through Shared Descriptions for Accurate Skill Retrieval

Shared descriptive patterns in skill libraries can obscure task-relevant signals, but SkillSight reveals and calibrates these biases to boost retrieval accuracy by over 20%.

Jinying Xiao, Bing Ji, Shasha Li +6

Natural Language Processing Recommendation & Information Retrieval

Jan 13, 2026

Shezheng Song +3Jan 13, 2026

Where Does Vision Meet Language? Understanding and Refining Visual Fusion in MLLMs via Contrastive Attention

MLLMs don't fuse vision and language uniformly: targeted interventions guided by layer-wise attention analysis can significantly boost multimodal reasoning without retraining.

Shezheng Song, Shezheng Song, Shasha Li +1

Architecture Design (Transformers, SSMs, MoE)Interpretability & Mechanistic Interp Multimodal Models

Search

Jie Yu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)