May 11 – May 18, 2026

Distributed Systems & Hardware - Weekly Roundup

2 papers published across 1 lab.

2700% acceleration

Selected Labs publishing this week

NUS1

Top Papers

May 16, 2026

NUS1w ago

MemForest: An Efficient Agent Memory System with Hierarchical Temporal Indexing

LLM agents can now maintain long-term memories with 6x higher throughput thanks to a novel hierarchical temporal indexing approach that avoids costly full-state rewrites.

Distributed Systems & Hardware Inference & Quantization Tool Use & Agents

May 13, 2026

1w ago·also D Pareto candidate set

KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving

Forget static KV cache compression – KVServe dynamically adapts compression strategies to your service context, slashing latency by up to 32.8x in disaggregated LLM serving.

Zedong Liu, Xinyang Ma, Dejun Luo +9

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Search

Distributed Systems & Hardware - Weekly Roundup

Selected Labs publishing this week

Top Papers