BrownSFUMay 29, 2026arXiv:2605.30819

Function2Scene: 3D Indoor Scene Layout from Functional Specifications

Ruiqi Wang, Qimin Chen, Daniel Ritchie, Angel X. Chang, M. Savva, Kai Wang, Hao Zhang

AI Summary

Function2Scene generates 3D indoor layouts from natural language functional specifications by parsing occupant personas and activities to derive customized design constraints. It uses a tool-augmented check-and-repair loop, combining geometric measurements, LLM-based contextual reasoning, and VLM-based visual assessment, to iteratively evaluate and refine the layout. Experiments demonstrate that Function2Scene outperforms LLM-based baselines in satisfying functional requirements, achieving a 94.3% preference rate in pairwise comparisons.

Key Contribution

Forget object-centric prompts: Function2Scene designs 3D indoor scenes directly from natural language descriptions of *how* the space will be used, not just *what* furniture to put there.

Abstract

Most text-driven 3D indoor scene synthesis methods generate rooms from object-centric prompts, asking what furniture should be placed rather than how the space is used. Yet in real interior design, a layout is judged by how well it supports its occupants, e.g., their activities and physical needs. We introduce Function2Scene, a framework for generating 3D indoor layouts from functional specifications, i.e., natural-language design briefs describing who will use a room and what they need to do there. Given such a specification, our system parses occupant personas and activities, derives a customized set of functional design constraints from a taxonomy of 17 criteria spanning spatial, ergonomic, activity, and environmental considerations, and uses these constraints to guide layout generation. Rather than relying on an LLM to directly produce a final scene, Function2Scene performs iterative evaluation and refinement through a tool-augmented check-and-repair loop, combining geometric measurements, LLM-based contextual reasoning, and VLM-based visual assessment. Experiments on 30 professionally written interior-design cases show that Function2Scene produces layouts that better satisfy functional requirements than recent LLM-based scene synthesis baselines, with our results preferred in 94.3% of pairwise comparisons. Our work reframes text-driven indoor scene synthesis from placing plausible objects to designing spaces that support human use.

Computer Vision Natural Language Processing Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References130

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Function2Scene: 3D Indoor Scene Layout from Functional Specifications

Related Papers