Search papers, labs, and topics across Lattice.
1
0
3
MLLMs can slash 68% of their FLOPs with minimal accuracy loss by pruning visual tokens at the "Entropy Collapse Layer"—where information content plummets—using a new matrix-entropy-guided method.