Search papers, labs, and topics across Lattice.
Peking University
2
0
3
8
Open-source ATLAS unlocks rapid, accurate co-design of 3D-DRAM memory systems and LLM accelerators, previously hindered by closed-source tools and customized designs.
A hybrid-bonding-based LLM serving accelerator, Helios, tackles the dynamic nature of KV cache management in LLM serving, achieving significant speedup and energy efficiency gains over existing GPU/NMP designs.