LRMs can slash up to 40% of reasoning tokens without sacrificing accuracy by dynamically adjusting their "thinking speed" at each step.
Tool-integrated reasoning models often stubbornly stick to their own (wrong) answers, even when a tool provides the correct solution.
Instead of static attention allocation, Flux Attention dynamically routes layers between full and sparse attention based on context, delivering significant speedups without sacrificing performance in long-context LLMs.