University of Chinese Academy of Sciences, Beijing, China
Cut your debugging time: CCL-D slashes the diagnosis time for slow/hang anomalies in large-scale distributed training from days to just 6 minutes.