Adaptive layer selection in LLMs can significantly improve inference efficiency without sacrificing accuracy, and it outperforms static pruning methods.
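To make the contrast with static pruning concrete, here is a minimal toy sketch in plain Python. It is not the method behind the claim above: the residual "layers", the mean-absolute-activation gate, and all names (`make_residual_layer`, `static_pruned_forward`, `adaptive_forward`, `threshold`) are hypothetical and chosen only for illustration. The point it shows is that a static scheme runs the same fixed subset of layers for every input, while an adaptive scheme decides per input which layers to run.

```python
from typing import Callable, List, Set, Tuple

Vector = List[float]
Layer = Callable[[Vector], Vector]


def make_residual_layer(scale: float) -> Layer:
    """Toy stand-in for a transformer block: returns x + scale * x."""
    def layer(x: Vector) -> Vector:
        return [v + scale * v for v in x]
    return layer


def static_pruned_forward(
    x: Vector, layers: List[Layer], keep: Set[int]
) -> Tuple[Vector, int]:
    """Static pruning: the same fixed layer indices run for every input."""
    executed = 0
    for i, layer in enumerate(layers):
        if i in keep:
            x = layer(x)
            executed += 1
    return x, executed


def adaptive_forward(
    x: Vector, layers: List[Layer], threshold: float
) -> Tuple[Vector, int]:
    """Adaptive selection: a cheap per-input score decides whether each layer runs.

    The gate (mean absolute activation >= threshold) is a hypothetical
    placeholder; a real system would use a learned or calibrated criterion.
    """
    executed = 0
    for layer in layers:
        score = sum(abs(v) for v in x) / len(x)
        if score >= threshold:
            x = layer(x)
            executed += 1
    return x, executed


if __name__ == "__main__":
    layers = [make_residual_layer(0.5) for _ in range(4)]
    easy_input = [0.1, 0.1]   # low-magnitude input: the gate skips every layer
    hard_input = [1.0, 2.0]   # high-magnitude input: the gate runs every layer

    for name, x in [("easy", easy_input), ("hard", hard_input)]:
        _, n_static = static_pruned_forward(list(x), layers, keep={0, 2})
        _, n_adaptive = adaptive_forward(list(x), layers, threshold=0.5)
        print(f"{name}: static ran {n_static} layers, adaptive ran {n_adaptive}")
```

In this toy setup the static variant always spends the same compute, whereas the adaptive variant spends none on the "easy" input and the full depth on the "hard" one. Whether such a scheme preserves accuracy in practice depends entirely on the quality of the gating criterion, which this sketch does not model.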