Search papers, labs, and topics across Lattice.
This paper introduces a two-level scenario approach framework for data-driven design, distinguishing between baseline appropriateness for design and post-design appropriateness for a posteriori evaluation of user-specified properties. The approach allows for certifying the reliability of a design with respect to these post-design properties using the same dataset used for design, avoiding the need for a separate test set. The paper provides distribution-free upper and, under additional assumptions, lower bounds on the risk of failing to meet the post-design appropriateness criteria, and demonstrates the methodology with H2 and pole-placement problems.
Now you can certify the reliability of your data-driven design for properties you didn't even consider during the design phase, all without needing extra test data.
The scenario approach is an established data-driven design framework that comes equipped with a powerful theory linking design complexity to generalization properties. In this approach, data are simultaneously used both for design and for certifying the design's reliability, without resorting to a separate test dataset. This paper takes a step further by guaranteeing additional properties, useful in post-design usage but not considered during the design phase. To this end, we introduce a two-level framework of appropriateness: baseline appropriateness, which guides the design process, and post-design appropriateness, which serves as a criterion for a posteriori evaluation. We provide distribution-free upper bounds on the risk of failing to meet the post-design appropriateness; these bounds are computable without using any additional test data. Under additional assumptions, lower bounds are also derived. As part of an effort to demonstrate the usefulness of the proposed methodology, the paper presents two practical examples in H2 and pole-placement problems. Moreover, a method is provided to infer comprehensive distributional knowledge of relevant performance indexes from the available dataset.