LLMs can now autonomously retrieve relevant memories from a database using specialized tools, significantly improving performance on long-term conversational question answering.
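A minimal sketch of the idea: the model is given a retrieval tool it can call over a memory store. All names here (`search_memory`, the toy keyword index, the sample memories) are illustrative assumptions, not the paper's actual implementation, which would use an LLM deciding when to invoke the tool and a real retriever.

```python
import re

# Toy memory database of past-conversation facts (illustrative data).
MEMORY_DB = [
    "User's dog is named Bruno.",
    "User moved to Lisbon in 2021.",
    "User is allergic to peanuts.",
]

def _words(text: str) -> set[str]:
    """Lowercase and strip punctuation for simple keyword matching."""
    return set(re.findall(r"[a-z']+", text.lower()))

def search_memory(query: str, top_k: int = 2) -> list[str]:
    """Tool the LLM could call: rank stored memories by keyword overlap."""
    q = _words(query)
    scored = [(len(q & _words(m)), m) for m in MEMORY_DB]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [m for score, m in scored[:top_k] if score > 0]

if __name__ == "__main__":
    # The model would fold the retrieved memory into its answer.
    print(search_memory("What is the name of the user's dog?"))
```

In the actual tool-use setup, the LLM decides autonomously when a question requires long-term memory and issues the tool call itself; a production version would replace keyword overlap with embedding search.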
Slash RAG latency by an order of magnitude using a tiny, LoRA-adapted SLM that routes queries, achieving GPT-4o-mini level accuracy at a fraction of the cost.
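The routing idea can be sketched in a few lines. The tiny LoRA-adapted SLM from the summary is stood in for by a toy heuristic here; the function names and the two model tiers are assumptions for illustration only.

```python
# Illustrative query router: cheap queries stay on a small model,
# hard ones escalate to a large model. In the summarized system this
# decision is made by a small LoRA-adapted language model, not a rule.

def route(query: str) -> str:
    """Return the model tier that should handle the query (toy heuristic)."""
    hard_markers = ("why", "compare", "explain", "derive")
    is_hard = len(query.split()) > 12 or any(
        marker in query.lower() for marker in hard_markers
    )
    return "large-model" if is_hard else "small-model"

if __name__ == "__main__":
    print(route("capital of France"))
    print(route("Compare retrieval-augmented and fine-tuned approaches"))
```

Because most queries take the cheap path, end-to-end latency and cost drop sharply while accuracy on hard queries is preserved by escalation.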