Search papers, labs, and topics across Lattice.
Shanghai Jiao Tong University
1
0
2
9
GF-DiT achieves up to 6.01脳 throughput improvement and 95% latency reduction by dynamically adapting GPU parallelism in response to workload demands.