The paper introduces SimpleProc, a fully procedural synthetic data generator for multi-view stereo (MVS) training, driven by a small set of rules built on NURBS surfaces, displacement, and texture patterns. The authors show that training on 8,000 SimpleProc images outperforms training on the same number of manually curated images sourced from games and real-world objects. Scaling the synthetic dataset to 352,000 images yields performance on par with, and on several benchmarks better than, models trained on over 692,000 manually curated images.
Forget painstakingly curating MVS datasets: simple procedural generation can match or beat manually curated data, even at roughly half the scale.
In this paper, we explore the design space of procedural rules for multi-view stereo (MVS). We demonstrate that we can generate effective training data using SimpleProc: a new, fully procedural generator driven by a very small set of rules using Non-Uniform Rational Basis Splines (NURBS), as well as basic displacement and texture patterns. At a modest scale of 8,000 images, our approach achieves superior results compared to manually curated images (at the same scale) sourced from games and real-world objects. When scaled to 352,000 images, our method yields performance comparable to, and in several benchmarks exceeding, models trained on over 692,000 manually curated images. The source code and the data are available at https://github.com/princeton-vl/SimpleProc.
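To make the three ingredients named in the abstract concrete, here is a minimal sketch of one NURBS patch with a displacement and a texture pattern. It is not the SimpleProc implementation: the function names (`clamped_knots`, `bspline_basis`, `basis_matrix`, `nurbs_surface`), the random control grid, the sinusoidal displacement along z, and the checkerboard texture are all illustrative assumptions, and the actual rules live in the linked repository.

```python
import numpy as np

def clamped_knots(n_ctrl, degree):
    """Clamped uniform knot vector on [0, 1] with n_ctrl + degree + 1 entries."""
    interior = np.linspace(0.0, 1.0, n_ctrl - degree + 1)
    return np.concatenate([np.zeros(degree), interior, np.ones(degree)])

def bspline_basis(i, p, u, knots):
    """Cox-de Boor recursion for the i-th B-spline basis function of degree p."""
    if p == 0:
        if knots[i] <= u < knots[i + 1]:
            return 1.0
        # Close the last non-empty span so that u == knots[-1] is covered.
        if u == knots[-1] and knots[i] < knots[i + 1] and knots[i + 1] == knots[-1]:
            return 1.0
        return 0.0
    left = right = 0.0
    d1 = knots[i + p] - knots[i]
    if d1 > 0:
        left = (u - knots[i]) / d1 * bspline_basis(i, p - 1, u, knots)
    d2 = knots[i + p + 1] - knots[i + 1]
    if d2 > 0:
        right = (knots[i + p + 1] - u) / d2 * bspline_basis(i + 1, p - 1, u, knots)
    return left + right

def basis_matrix(n_ctrl, degree, params):
    """Rows: parameter samples; columns: basis functions."""
    knots = clamped_knots(n_ctrl, degree)
    return np.array([[bspline_basis(i, degree, u, knots) for i in range(n_ctrl)]
                     for u in params])

def nurbs_surface(ctrl, weights, degree, us, vs):
    """Evaluate a rational tensor-product surface on the grid us x vs."""
    Bu = basis_matrix(ctrl.shape[0], degree, us)
    Bv = basis_matrix(ctrl.shape[1], degree, vs)
    denom = Bu @ weights @ Bv.T                      # weighted basis sum
    numer = np.einsum("ui,vj,ij,ijk->uvk", Bu, Bv, weights, ctrl)
    return numer / denom[..., None]

# Hypothetical "rule": a 5x5 control grid with random heights and weights.
rng = np.random.default_rng(0)
n, degree, res = 5, 3, 16
gx, gy = np.meshgrid(np.linspace(-1, 1, n), np.linspace(-1, 1, n), indexing="ij")
ctrl = np.stack([gx, gy, rng.uniform(-0.5, 0.5, (n, n))], axis=-1)
weights = rng.uniform(0.5, 2.0, (n, n))

us = vs = np.linspace(0.0, 1.0, res)
base = nurbs_surface(ctrl, weights, degree, us, vs)

# Simplified displacement: a sinusoidal offset along z (not along the normal).
surface = base.copy()
surface[..., 2] += 0.05 * np.sin(8 * base[..., 0]) * np.sin(8 * base[..., 1])

# Simplified texture pattern: a 4x4 checkerboard in (u, v) space.
uu, vv = np.meshgrid(us, vs, indexing="ij")
texture = (np.floor(uu * 4) + np.floor(vv * 4)) % 2
```

Because the knot vectors are clamped, the patch passes through its four corner control points, which makes the output easy to sanity-check. A real generator would render many such shapes from multiple camera poses to produce image and depth pairs; that step is omitted here.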