Search papers, labs, and topics across Lattice.
This paper investigates whether a single transformer policy can learn to approximate optimal state-feedback control laws for a family of heterogeneous MIMO LTI systems. The approach involves training a transformer on LQR-generated trajectories from systems with varying state and input dimensions, utilizing a shared representation with standardization, padding, dimension encoding, and masked loss. The resulting policy demonstrates empirically small sub-optimality compared to LQR, maintains stability under perturbations, and benefits from fine-tuning on unseen systems, suggesting transformers can effectively approximate near-optimal feedback laws.
Forget hand-tuning controllers for each new linear system: a single transformer can learn near-optimal control policies across diverse MIMO LTI systems.
We study whether optimal state-feedback laws for a family of heterogeneous Multiple-Input, Multiple-Output (MIMO) Linear Time-Invariant (LTI) systems can be captured by a single learned controller. We train one transformer policy on LQR-generated trajectories from systems with different state and input dimensions, using a shared representation with standardization, padding, dimension encoding, and masked loss. The policy maps recent state history to control actions without requiring plant matrices at inference time. Across a broad set of systems, it achieves empirically small sub-optimality relative to Linear Quadratic Regulator (LQR), remains stabilizing under moderate parameter perturbations, and benefits from lightweight fine-tuning on unseen systems. These results support transformer policies as practical approximators of near-optimal feedback laws over structured linear-system families.