Lattice: the structure behind the noise

Created by Flynn Lachendro

Xinyang Chen | Lattice

Xinyang Chen

Harbin Institute of Technology (Shenzhen)

Papers on Lattice: 2

Total citations: 0

Topics: 6

Research focus

Inference & Quantization (2) · Eval Frameworks & Benchmarks (1) · Reasoning & Chain-of-Thought (1) · Training Efficiency & Optimization (1) · Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

Weiyang Huang (1) · Xuefeng Bai (1) · Kehai Chen (1) · Xinyan Chen (1)

Papers (2)

Weiyang Huang +7 · Apr 9, 2026 · also HIT

SAT: Balancing Reasoning Accuracy and Efficiency with Stepwise Adaptive Thinking

Large reasoning models (LRMs) can cut up to 40% of reasoning tokens without sacrificing accuracy by dynamically adjusting their "thinking speed" at each step.

Weiyang Huang, Xuefeng Bai, Kehai Chen +5

Eval Frameworks & Benchmarks · Inference & Quantization · Reasoning & Chain-of-Thought · +1

Tsinghua AI · Feb 18, 2026 · also HIT, USTB

FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving

LLM serving can achieve 5.6x higher throughput without sacrificing latency by decoupling preemption granularity from scheduling frequency.

Chia-chi Hsieh, Chia-chi Hsieh, Zan Zong +9

Architecture Design (Transformers, SSMs, MoE) · Distributed Systems & Hardware · Inference & Quantization

More research focus: Distributed Systems & Hardware (1)

More co-authors: Yibin Chen (1) · Chia-chi Hsieh (1)