Oak Ridge National Laboratory
Piper speeds up training of mixture-of-experts (MoE) models, achieving up to 3.5x higher MFU (model FLOPs utilization) by scheduling pipeline parallelism and optimizing communication.