The paper introduces LenghuSky-8, an eight-year all-sky imaging dataset with high night-time coverage and per-pixel alt-az calibration, designed for cloud segmentation and nowcasting. The authors train a linear probe on DINOv3 features for cloud segmentation, reaching 93.3% overall accuracy, and establish a nowcasting benchmark with persistence, optical flow, ConvLSTM, and VideoGPT baselines; ConvLSTM performs best but gains little over persistence. The dataset, calibrations, and toolkit are released to support research in autonomous observatory operations.
This 8-year all-sky dataset with star-aware masks and alt-az calibration could unlock more reliable cloud prediction for ground-based telescopes.
Ground-based time-domain observatories require minute-by-minute, site-scale awareness of cloud cover, yet existing all-sky datasets are short, daylight-biased, or lack astrometric calibration. We present LenghuSky-8, an eight-year (2018-2025) all-sky imaging dataset from a premier astronomical site, comprising 429,620 $512 \times 512$ frames with 81.2% night-time coverage, star-aware cloud masks, background masks, and per-pixel altitude-azimuth (Alt-Az) calibration. For robust cloud segmentation across day, night, and lunar phases, we train a linear probe on DINOv3 local features and obtain 93.3% $\pm$ 1.1% overall accuracy on a balanced, manually labeled set of 1,111 images. Using stellar astrometry, we map each pixel to local alt-az coordinates and measure calibration uncertainties of approximately 0.37 deg at zenith and approximately 1.34 deg at 30 deg altitude, sufficient for integration with telescope schedulers. Beyond segmentation, we introduce a short-horizon nowcasting benchmark over per-pixel three-class logits (sky/cloud/contamination) with four baselines: persistence (copying the last frame), optical flow, ConvLSTM, and VideoGPT. ConvLSTM performs best but yields only limited gains over persistence, underscoring the difficulty of near-term cloud evolution. We release the dataset, calibrations, and an open-source toolkit for loading, evaluation, and scheduler-ready alt-az maps to boost research in segmentation, nowcasting, and autonomous observatory operations.
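To make the persistence baseline concrete, here is a minimal sketch assuming the per-pixel three-class logits are stored as NumPy arrays of shape `(3, H, W)`. The function names `persistence_forecast` and `pixel_accuracy`, the array layout, and the toy data are illustrative assumptions and are not taken from the released toolkit.

```python
import numpy as np


def persistence_forecast(last_logits: np.ndarray, horizon: int) -> np.ndarray:
    """Copy the last observed logit map to every future step.

    last_logits: (3, H, W) logits for sky/cloud/contamination (assumed layout).
    Returns an array of shape (horizon, 3, H, W).
    """
    return np.repeat(last_logits[None, ...], horizon, axis=0)


def pixel_accuracy(pred_logits: np.ndarray, true_labels: np.ndarray) -> float:
    """Fraction of pixels whose argmax class matches the label.

    pred_logits: (T, 3, H, W); true_labels: (T, H, W) integer class ids.
    """
    pred_classes = pred_logits.argmax(axis=1)  # (T, H, W)
    return float((pred_classes == true_labels).mean())


# Toy example: a perfectly static scene, where persistence is exact.
rng = np.random.default_rng(0)
last = rng.normal(size=(3, 4, 4))
forecast = persistence_forecast(last, horizon=5)
static_truth = np.broadcast_to(last.argmax(axis=0), (5, 4, 4))
acc = pixel_accuracy(forecast, static_truth)
```

On real sequences the clouds move, so the score of this baseline falls as the horizon grows; a learned model such as ConvLSTM has to beat that curve to be useful.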