Search papers, labs, and topics across Lattice.
George Washington University,Department of Engineering Management and Systems Engineering,USA
1
3
3
2
Ditch the reactive autoscaling: a new RL-powered Kubernetes autoscaler learns to anticipate traffic spikes and optimize GPU inference deployments entirely in simulation.