Search papers, labs, and topics across Lattice.
Shenzhen University
1
0
3
Achieve 50% parameter reduction in LLaMA-2-7B with minimal performance loss and no fine-tuning, thanks to a new global gating-based structured pruning method.