The authors introduce ForesightSafety Bench, a comprehensive AI safety evaluation framework encompassing 94 risk dimensions across fundamental, embodied, AI4Science, social, environmental, catastrophic, existential, and industrial safety domains. They systematically evaluated over twenty mainstream large AI models using this benchmark, revealing widespread safety vulnerabilities, especially in areas like risky agentic autonomy and AI4Science safety. The benchmark includes tens of thousands of structured risk data points and assessment results, providing a hierarchical and dynamically evolving system for AI safety evaluation.
Frontier AI models exhibit widespread safety vulnerabilities across multiple pillars, including risky agentic autonomy and catastrophic risks, according to a new comprehensive benchmark.
Rapidly evolving AI exhibits increasingly strong autonomy and goal-directed capabilities, accompanied by derivative systemic risks that are more unpredictable, harder to control, and potentially irreversible. However, current AI safety evaluation systems suffer from critical limitations, such as a restricted set of risk dimensions and a failure to detect frontier risks. Lagging safety benchmarks and alignment technologies can hardly address the complex challenges posed by cutting-edge AI models. To bridge this gap, we propose the "ForesightSafety Bench" AI Safety Evaluation Framework, which begins with 7 major Fundamental Safety pillars and progressively extends to advanced Embodied AI Safety, AI4Science Safety, Social and Environmental AI risks, Catastrophic and Existential Risks, as well as 8 critical industrial safety domains, forming a total of 94 refined risk dimensions. To date, the benchmark has accumulated tens of thousands of structured risk data points and assessment results, establishing a broad, clearly hierarchical, and dynamically evolving AI safety evaluation framework. Based on this benchmark, we conduct a systematic evaluation and in-depth analysis of over twenty mainstream advanced large models, identifying key risk patterns and their capability boundaries. The safety evaluation results reveal widespread safety vulnerabilities of frontier AI across multiple pillars, particularly in Risky Agentic Autonomy, AI4Science Safety, Embodied AI Safety, Social AI Safety, and Catastrophic and Existential Risks. Our benchmark is released at https://github.com/Beijing-AISI/ForesightSafety-Bench. The project website is available at https://foresightsafety-bench.beijing-aisi.ac.cn/.