Search papers, labs, and topics across Lattice.
1
0
3
6
Inference time can reveal the GPU models behind black-box LLM APIs, offering a way to estimate their hidden energy costs.