This paper frames insurance pricing as a decision-making problem and studies it with tools from off-policy evaluation and stochastic control. The authors introduce a kernelized inverse propensity score estimator that exploits local structure in the action space to reduce variance relative to the classical inverse propensity score estimator. Empirically, neural network-based policy optimization outperforms existing techniques in a synthetic travel insurance environment.
Kernel methods can substantially improve off-policy evaluation for insurance pricing, enabling neural networks to discover better pricing strategies.
Traditional insurance pricing relies on risk-based principles that ensure actuarial fairness and solvency but do not explicitly account for policyholders' price sensitivity. We formulate insurance pricing as a decision-making problem and study it using tools from off-policy evaluation and stochastic control. We propose a kernelized inverse propensity score estimator that exploits local structure in the action space and yields variance reduction compared to the classical inverse propensity score estimator. Building on these value estimates, we investigate policy optimization and present two practical approaches for computing optimal pricing rules: an interpretable data-shared Lasso formulation and a flexible policy parameterization based on neural networks. Using a controlled synthetic travel insurance environment, we empirically confirm the theoretical results and show that neural networks outperform existing techniques for policy optimization.
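To make the idea of a kernelized inverse propensity score estimator concrete, the following is a minimal sketch of one common form from the continuous-action off-policy evaluation literature. It is not necessarily the estimator proposed in the paper. It assumes a deterministic target pricing rule, a known logging density, and a Gaussian kernel. The function names, the toy demand model (purchase probability `1 - price`), and the bandwidth value are all illustrative assumptions and do not come from the paper.

```python
import numpy as np


def gaussian_kernel(u):
    """Standard Gaussian kernel K(u)."""
    return np.exp(-0.5 * u**2) / np.sqrt(2.0 * np.pi)


def kernelized_ips(actions, rewards, propensities, target_actions, bandwidth):
    """Kernel-smoothed IPS estimate of a deterministic policy's value.

    actions        : logged prices a_i
    rewards        : observed rewards r_i
    propensities   : logging density evaluated at each logged price
    target_actions : price the target policy would have chosen for each record
    bandwidth      : kernel width h > 0 (an assumed tuning parameter)
    """
    if bandwidth <= 0:
        raise ValueError("bandwidth must be positive")
    u = (target_actions - actions) / bandwidth
    # Logged prices close to the target price get large weight, distant ones
    # get almost none, instead of requiring an exact match as plain IPS would.
    weights = gaussian_kernel(u) / (bandwidth * propensities)
    return float(np.mean(weights * rewards))


# Toy illustration (hypothetical environment, not the paper's simulator):
# prices are logged uniformly on [0, 1], and a customer buys with
# probability 1 - price, so the expected revenue at price p is p * (1 - p).
rng = np.random.default_rng(0)
n = 20_000
prices = rng.uniform(0.0, 1.0, size=n)
propensities = np.ones(n)  # uniform density on [0, 1]
purchased = rng.random(n) < (1.0 - prices)
revenue = prices * purchased

target = np.full(n, 0.5)  # evaluate the rule "always charge 0.5"
estimate = kernelized_ips(prices, revenue, propensities, target, bandwidth=0.1)
print(f"estimated value: {estimate:.3f} (true value in this toy model: 0.25)")
```

The bandwidth controls a bias-variance trade-off: a wider kernel borrows information from more logged prices, which lowers variance but smooths the value estimate, while a narrower kernel does the opposite.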