Search papers, labs, and topics across Lattice.
The paper introduces CAFE, a channel-autoregressive factorized encoding scheme for spatial super-resolution of biosignals from low-density recordings. CAFE reconstructs the full montage in geometry-aligned stages, starting from local channels and progressively expanding to distal regions to mitigate artifact propagation. By using step-wise supervision and teacher forcing with scheduled sampling during training, and an autoregressive rollout during inference, CAFE achieves state-of-the-art reconstruction performance across multiple modalities and datasets.
Achieve state-of-the-art biosignal spatial super-resolution by reconstructing high-density montages from sparse recordings with a plug-and-play module that exploits local structure before introducing non-local interactions.
High-density biosignal recordings are critical for neural decoding and clinical monitoring, yet real-world deployments often rely on low-density (LD) montages due to hardware and operational constraints. This motivates spatial super-resolution from LD observations, but heterogeneous dependencies under sparse and noisy measurements often lead to artifact propagation and false non-local correlations. To address this, we propose CAFE, a plug-and-play rollout generation scheme that reconstructs the full montage in geometry-aligned stages. Starting from the LD channels, CAFE first recovers nearby channels and then progressively expands to more distal regions, exploiting reliable local structure before introducing non-local interactions. During training, step-wise supervision is applied over channel groups and teacher forcing with epoch-level scheduled sampling along the group dimension is utilized to reduce exposure bias, enabling parallel computation across steps. At test time, CAFE performs an autoregressive rollout across groups, while remaining plug-and-play by reusing any temporal backbone as the shared predictor. Evaluated on $4$ modalities and $6$ datasets, CAFE demonstrates plug-and-play generality across $3$ backbones (MLP, Conv, Transformer) and achieves consistently better reconstruction than $5$ representative baselines.