The paper introduces DataCube, a platform that streamlines video dataset creation through automatic video processing, multi-dimensional profiling, and query-driven retrieval. It builds structured semantic representations of video clips and retrieves them with a hybrid pipeline that pairs deep semantic matching with neural re-ranking. Users can assemble customized video subsets from large repositories for training, analysis, and evaluation.
Stop manually sifting through video data: DataCube offers an intelligent platform to automatically process, profile, and retrieve video clips based on natural language queries.
Large-scale video repositories are increasingly available for modern video understanding and generation tasks. However, transforming raw videos into high-quality, task-specific datasets remains costly and inefficient. We present DataCube, an intelligent platform for automatic video processing, multi-dimensional profiling, and query-driven retrieval. DataCube constructs structured semantic representations of video clips and supports hybrid retrieval with neural re-ranking and deep semantic matching. Through an interactive web interface, users can efficiently construct customized video subsets from massive repositories for training, analysis, and evaluation, and build searchable systems over their own private video collections. The system is publicly accessible at https://datacube.baai.ac.cn/. Demo Video: https://baai-data-cube.ks3-cn-beijing.ksyuncs.com/custom/Adobe%20Express%20-%202%E6%9C%8818%E6%97%A5%20%281%29%281%29%20%281%29.mp4
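To make the retrieval description concrete, here is a minimal Python sketch of a two-stage hybrid pipeline: filter on a profile attribute, fuse a lexical score with a semantic score, then re-rank a shortlist. Everything in it is an illustrative assumption and not DataCube's implementation: the `ClipProfile` fields, the fusion weight `alpha`, and all scoring functions are hypothetical, and simple token statistics stand in for the neural models the abstract mentions.

```python
import math
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class ClipProfile:
    """Hypothetical structured profile for one video clip."""
    clip_id: str
    caption: str
    tags: list = field(default_factory=list)
    duration_s: float = 0.0


def tokenize(text):
    """Lowercase whitespace tokenizer (commas treated as separators)."""
    return text.lower().replace(",", " ").split()


def keyword_score(query, clip):
    """Lexical signal: fraction of query tokens found among the clip's tags."""
    q = set(tokenize(query))
    tags = {t.lower() for t in clip.tags}
    return len(q & tags) / len(q) if q else 0.0


def cosine_score(query, clip):
    """Stand-in for semantic matching: cosine over bag-of-words counts."""
    a, b = Counter(tokenize(query)), Counter(tokenize(clip.caption))
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rerank_score(query, clip):
    """Placeholder for a neural re-ranker: query coverage over caption and tags."""
    q = set(tokenize(query))
    text = set(tokenize(clip.caption)) | {t.lower() for t in clip.tags}
    return len(q & text) / len(q) if q else 0.0


def hybrid_retrieve(query, clips, alpha=0.5, top_k=3, min_duration_s=0.0):
    """Filter by profile, fuse two scores, then re-rank the shortlist."""
    # Stage 0: structured filter on a profile attribute.
    candidates = [c for c in clips if c.duration_s >= min_duration_s]
    # Stage 1: weighted fusion of the lexical and semantic signals.
    fused = [
        (alpha * keyword_score(query, c) + (1 - alpha) * cosine_score(query, c), c)
        for c in candidates
    ]
    fused.sort(key=lambda pair: pair[0], reverse=True)
    shortlist = [c for _, c in fused[:top_k]]
    # Stage 2: re-rank only the shortlist with the more expensive scorer.
    return sorted(shortlist, key=lambda c: rerank_score(query, c), reverse=True)


if __name__ == "__main__":
    demo = [
        ClipProfile("c1", "a dog running on the beach", ["dog", "beach"], 8.0),
        ClipProfile("c2", "a cat sleeping on a sofa", ["cat", "indoor"], 12.0),
    ]
    print([c.clip_id for c in hybrid_retrieve("dog running beach", demo)])
```

Re-ranking only the shortlist is the usual reason for a two-stage design: the cheap fused score prunes the repository, so the costlier scorer runs on a handful of clips. In a real system the two stand-in scorers would be replaced by learned models.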