Search papers, labs, and topics across Lattice.
This paper introduces a tabular foundation model-based approach to generate synthetic landslide datasets, addressing the challenges of data sparsity and imbalance in landslide modeling. The method leverages the model's ability to learn from limited observations to preserve multivariate dependencies and statistical characteristics of landslide occurrences. Experiments on 20 landslide inventories show the generated datasets closely match observed distributions and maintain realistic feature dependencies across diverse environmental contexts.
Generating realistic landslide datasets from sparse, imbalanced real-world data is now possible, thanks to a tabular foundation model that captures complex feature dependencies.
Landslide investigation relies on sufficient and well-balanced observational data influenced by geological, hydrological, and anthropogenic factors. Available landslide inventories are often sparse and imbalanced, which limits understanding of triggering conditions and failure mechanisms. Data generation provides an effective approach to help capture feature dependencies from limited landslide observations. However, existing generation approaches for landslides often struggle to capture complex relationships among features and lack robustness across multiple scenarios and interacting factors. Here, we propose an accurate and robust approach for generating multi-feature landslide datasets by utilizing a tabular foundation model. By leveraging the capacity to learn from limited observations, the proposed approach effectively preserves the multivariate dependencies and statistical characteristics inherent in landslide occurrences. Comparative experiments on 20 landslide inventories demonstrate that the generated datasets closely align with observed distributions, maintain realistic feature dependencies, and exhibit robustness across different environmental contexts. This work provides an effective approach to overcome data sparsity and imbalance and strengthens landslide susceptibility modeling and risk assessment under limited observations.