Search papers, labs, and topics across Lattice.
This paper tackles the Flexible Job Shop Scheduling Problem (FJSP) with random job arrivals using Deep Reinforcement Learning (DRL). They employ Proximal Policy Optimization (PPO) with MLPs, training an agent to minimize total job completion time by selecting from a set of dispatching rules based on a directly accessible state representation. Results demonstrate that the DRL approach outperforms individual dispatching rules, particularly on heterogeneous datasets, and achieves competitive performance against MILP solvers.
DRL can beat hand-crafted dispatching rules at flexible job shop scheduling, even with unpredictable job arrivals.
The Flexible Job Shop Scheduling Problem (FJSP) is the optimal allocation of a set of jobs to machines. Two primary challenges persist in FJSP: the unpredictable arrival of future jobs and the combinatorial complexity of the problem, rendering it intractable for conventional mixed-integer linear programming solvers. This paper proposes an event-based \gls{DRL} approach to solve FJSP with random job arrivals. Specifically, we employ the Proximal Policy Optimization algorithm and use lightweight Multi-Layer Perceptrons to train the \gls{DRL} agent for minimizing the total completion time of all jobs. We design the state representation to be directly accessible from the environment, and limit the learning agent to selecting from among a set of well-established dispatching rules. Simulations show that our \gls{DRL} approach outperforms any of the individual dispatching rules on datasets with varying heterogeneity and job arrival rates. We benchmark our \gls{DRL} against an arrival-triggered mixed-integer linear programming solution and show that our method achieves good performance especially when the datasets are heterogeneous.