The Hong Kong University of Science and Technology (Guangzhou)
Naively applying standard LLM inference optimizations can *hurt* the performance of smaller reasoning models, underscoring the need for serving strategies designed specifically for RLLMs.