The ICPR 2026 TVRID competition introduced a new RGB-Depth dataset for privacy-aware top-view person re-identification, comprising 86 identities captured by synchronized overhead cameras across varied viewpoints. The competition evaluated three tracks, RGB Re-ID, Depth Re-ID, and RGB$\leftrightarrow$Depth cross-modal retrieval, using the mAP and CMC-1 metrics. RGB Re-ID proved the easiest track, Depth Re-ID came next, and cross-modal retrieval was the hardest, which underlines the difficulty of modality-constrained retrieval.
Top-view RGB-D person re-identification is feasible, even across modalities, despite the challenges posed by viewpoint and modality variation.
This companion paper reports on the ICPR 2026 TVRID competition on privacy-aware top-view person re-identification. We present the competition setting, the released RGB-Depth dataset, and a summary of the final results with descriptions of the top entries. TVRID contains 86 identities captured by four synchronized overhead Intel RealSense D455 cameras, with paired RGB/Depth streams and structured geometric variation across flat, ascent, descent, and oblique viewpoints. The evaluation protocol includes three tracks: RGB Re-ID, Depth Re-ID, and RGB$\leftrightarrow$Depth cross-modal retrieval. Submissions are ranked using mAP and CMC-1 under a unified server-side evaluation. The final results show a clear performance ordering (RGB $>$ Depth $>$ Cross-Modal), that is, RGB is the easiest track and cross-modal retrieval the hardest, highlighting both the challenge of modality-constrained retrieval and the feasibility of strong performance with modality-invariant learning. By releasing the dataset at https://zenodo.org/records/17909410, the evaluation scripts at https://github.com/RaphaelDel/ICPR-TVRID, and the accompanying documentation, TVRID establishes a reproducible benchmark for top-view, depth-based, and cross-modal person Re-ID.
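To make the two ranking metrics concrete, the following is a minimal sketch of how mAP and CMC-1 are commonly computed from a query-by-gallery distance matrix. It is not the official TVRID evaluation script: the function names `average_precision` and `evaluate`, the list-of-lists input format, and the choice to skip queries with no gallery match are all assumptions made for illustration, and any camera- or sequence-based filtering the official protocol may apply is omitted. For the cross-modal track, the queries would come from one modality and the gallery from the other.

```python
def average_precision(ranked_matches):
    """AP for one query. `ranked_matches` holds booleans in rank order
    (True where the gallery item shares the query's identity)."""
    hits = 0
    precision_sum = 0.0
    for rank, is_match in enumerate(ranked_matches, start=1):
        if is_match:
            hits += 1
            precision_sum += hits / rank  # precision at this recall point
    return precision_sum / hits if hits else 0.0


def evaluate(dist, query_ids, gallery_ids):
    """Return (mAP, CMC-1) for a distance matrix given as a list of rows,
    one row per query and one column per gallery item.
    Assumption: queries without any true match in the gallery are skipped."""
    ap_values = []
    rank1_hits = 0
    for q, row in enumerate(dist):
        # Sort gallery indices by increasing distance to the query.
        order = sorted(range(len(row)), key=lambda g: row[g])
        matches = [gallery_ids[g] == query_ids[q] for g in order]
        if not any(matches):
            continue
        ap_values.append(average_precision(matches))
        rank1_hits += int(matches[0])  # CMC-1: is the top result correct?
    if not ap_values:
        return 0.0, 0.0
    return sum(ap_values) / len(ap_values), rank1_hits / len(ap_values)


if __name__ == "__main__":
    # Toy example: 2 queries, 3 gallery items (identities are hypothetical).
    dist = [[0.1, 0.5, 0.9],
            [0.2, 0.8, 0.3]]
    m_ap, cmc1 = evaluate(dist, query_ids=[1, 2], gallery_ids=[1, 2, 1])
    print(f"mAP={m_ap:.4f}  CMC-1={cmc1:.2f}")
```

In the toy example, the first query retrieves its true matches at ranks 1 and 3, and the second finds its only match at rank 3. CMC-1 rewards only a correct top-ranked result, while mAP also credits matches found further down the ranking, which is why the two metrics are usually reported together.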