Search papers, labs, and topics across Lattice.
This paper introduces Iterative Learning Control-Informed Reinforcement Learning (IL-CIRL), a framework that integrates ILC with DRL for batch process control. IL-CIRL uses a dual-layer architecture with batch-to-batch and within-batch control, leveraging Kalman filter-based state estimation to guide DRL agents. The method aims to improve the safety and stability of DRL controllers in industrial settings by incorporating ILC's iterative refinement and disturbance compensation capabilities.
Marrying iterative learning control with reinforcement learning yields safer and more stable DRL controllers for batch process control, addressing a key adoption barrier in industrial settings.
A significant limitation of Deep Reinforcement Learning (DRL) is the stochastic uncertainty in actions generated during exploration-exploitation, which poses substantial safety risks during both training and deployment. In industrial process control, the lack of formal stability and convergence guarantees further inhibits adoption of DRL methods by practitioners. Conversely, Iterative Learning Control (ILC) represents a well-established autonomous control methodology for repetitive systems, particularly in batch process optimization. ILC achieves desired control performance through iterative refinement of control laws, either between consecutive batches or within individual batches, to compensate for both repetitive and non-repetitive disturbances. This study introduces an Iterative Learning Control-Informed Reinforcement Learning (IL-CIRL) framework for training DRL controllers in dual-layer batch-to-batch and within-batch control architectures for batch processes. The proposed method incorporates Kalman filter-based state estimation within the iterative learning structure to guide DRL agents toward control policies that satisfy operational constraints and ensure stability guarantees. This approach enables the systematic design of DRL controllers for batch processes operating under multiple disturbance conditions.