Lattice AI Research

Research focus

Eval Frameworks & Benchmarks (1)Tool Use & Agents (1)Architecture Design (Transformers, SSMs, MoE) (1)Distributed Systems & Hardware (1)Inference & Quantization (1)

Frequent co-authors

K. Shi (1)Ziao Zhang (1)Shiting Huang (1)Avery Nie (1)

Papers (2)

May 27, 2026

May 27, 2026·also UofT

AsyncTool: Evaluating the Asynchronous Function Calling Capability under Multi-Task Scenarios

LLM agents struggle to juggle multiple tasks when tool use involves realistic delays, revealing critical weaknesses in temporal reasoning and coordination.

K. Shi, Ziao Zhang, Shiting Huang +10

Eval Frameworks & Benchmarks Tool Use & Agents

Mar 18, 2026

Multi-stage Flow Scheduling for LLM Serving

LLM serving systems can boost Time-To-First-Token (TTFT) attainment by up to 2.4x simply by prioritizing network flows based on a novel approximation of Least-Laxity-First scheduling.

Yijun Sun, Xudong Liao, Songrun Xie +10

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Search

T. China

Research focus

Frequent co-authors

Papers (2)