State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences
LLM serving gets a boost from PAM, a hierarchical memory architecture that distributes and processes key-value pairs across heterogeneous processing-in-memory (PIM) devices to relieve memory bottlenecks.