Search papers, labs, and topics across Lattice.
PhysForge is introduced, a two-stage framework for generating physics-grounded 3D assets, addressing the limitations of existing methods that primarily focus on static geometry. The framework uses a VLM to plan a "Hierarchical Physical Blueprint" defining physical constraints, followed by a physics-grounded diffusion model that synthesizes geometry and kinematic parameters using a novel KineVoxel Injection (KVI) mechanism. Experiments show PhysForge generates functionally plausible, simulation-ready assets, supported by a large-scale dataset of 150,000 assets with physical annotations.
Interactive 3D asset generation can now be driven by functional logic and hierarchical physics, thanks to a new framework that synthesizes simulation-ready assets.
Synthesizing physics-grounded 3D assets is a critical bottleneck for interactive virtual worlds and embodied AI. Existing methods predominantly focus on static geometry, overlooking the functional properties essential for interaction. We propose that interactive asset generation must be rooted in functional logic and hierarchical physics. To bridge this gap, we introduce PhysForge, a decoupled two-stage framework supported by PhysDB, a large-scale dataset of 150,000 assets with four-tier physical annotations. First, a VLM acts as a "physical architect" to plan a "Hierarchical Physical Blueprint" defining material, functional, and kinematic constraints. Second, a physics-grounded diffusion model realizes this blueprint by synthesizing high-fidelity geometry alongside precise kinematic parameters via a novel KineVoxel Injection (KVI) mechanism. Experiments demonstrate that PhysForge produces functionally plausible, simulation-ready assets, providing a robust data engine for interactive 3D content and embodied agents.