This paper introduces PhysScene, the first scene graph dataset designed for scientific visual reasoning in physics experiments, addressing the gap left by existing datasets that focus on generic natural contexts. By emphasizing strong semantic constraints and high relation density rather than data scale, PhysScene enables the evaluation of relational reasoning in complex experimental setups, which matters for the development of intelligent monitoring and analysis tools. Extensive analyses and experiments show that PhysScene complements existing benchmarks and poses new challenges for scene parsing algorithms, leaving room for further improvement.
PhysScene shows that experimental scenes, with their strong semantic constraints and dense functional relations, pose new challenges for existing scene parsing algorithms and complement current benchmarks.
Scene Graphs (SGs) provide structured representations of visual scenes by modeling objects and their pairwise relationships. Despite recent progress, existing datasets primarily focus on generic natural contexts, leaving domain-specific and function-oriented scenes largely underexplored. This limitation restricts the evaluation of relational reasoning in scientific experimental scenes, thereby hindering the development of intelligent monitoring, analysis, and related applications in such scenes. To address this gap, we introduce PhysScene, the first SG dataset tailored to physics experiments. PhysScene encompasses specialized instruments, structured experimental setups, and functional relations intrinsic to experimental environments, enabling reasoning that extends beyond spatial co-occurrence to logical dependencies. Rather than pursuing large data scale, PhysScene focuses on strong semantic constraints and high relation density in experimental scenes, posing new challenges for existing scene parsing algorithms while offering opportunities for further improvements. Extensive analyses and experiments show that PhysScene complements existing benchmarks and establishes a valuable testbed for advancing scientific visual reasoning. The dataset is publicly available at https://github.com/ZMH-SDUST/PhysScene.
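To make the idea of a scene graph concrete, the following minimal Python sketch (Python 3.9+) models objects and their pairwise relations as subject-predicate-object triples. It is an illustration only and does not reflect PhysScene's actual annotation format: the class names, the object and predicate labels, and the definition of relation density used here (relations per object) are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Relation:
    """One directed, pairwise relation: (subject, predicate, object)."""
    subject: str
    predicate: str
    obj: str


@dataclass
class SceneGraph:
    """A toy scene graph: a set of object labels plus relation triples."""
    objects: set[str] = field(default_factory=set)
    relations: list[Relation] = field(default_factory=list)

    def add_relation(self, subject: str, predicate: str, obj: str) -> None:
        # Register both endpoints as objects, then store the triple.
        self.objects.update((subject, obj))
        self.relations.append(Relation(subject, predicate, obj))

    def relation_density(self) -> float:
        # Assumed definition for illustration: relations per object.
        # The paper may define relation density differently.
        if not self.objects:
            return 0.0
        return len(self.relations) / len(self.objects)


# Hypothetical circuit scene; labels are invented, not taken from the dataset.
graph = SceneGraph()
graph.add_relation("battery", "connected_to", "ammeter")
graph.add_relation("ammeter", "connected_to", "resistor")
graph.add_relation("ammeter", "measures_current_of", "resistor")  # functional
graph.add_relation("resistor", "on", "table")                     # spatial

density = graph.relation_density()
print(f"{len(graph.objects)} objects, {len(graph.relations)} relations, "
      f"density = {density:.2f}")
```

In this toy scene, the `measures_current_of` triple stands in for the kind of functional relation the abstract describes, which goes beyond spatial co-occurrence such as `on`.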