Search

Search papers, labs, and topics across Lattice.

Yafei Xiang

Papers on Lattice

Total citations

Topics

Research focus

Inference & Quantization (1)Recommendation & Information Retrieval (1)Tool Use & Agents (1)

Frequent co-authors

Yichao Wu (1)Mengwei Yuan (1)Weiran Yan (1)

Papers (1)

Mar 1, 2026

Mar 1, 2026·also UVA

Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

Slash RAG latency by an order of magnitude using a tiny, LoRA-adapted SLM that routes queries, achieving GPT-4o-mini level accuracy at a fraction of the cost.

Yichao Wu, Yafei Xiang, Mengwei Yuan +1

Inference & Quantization Recommendation & Information Retrieval Tool Use & Agents