The paper introduces SRNeRV, a scale-wise recursive framework for implicit neural video representation that addresses parameter redundancy in existing multi-scale INR generators. SRNeRV uses a hybrid sharing scheme: each processing block is decoupled into a scale-specific spatial mixing module and a scale-invariant channel mixing module, and the shared channel mixing module is applied recursively across scales. Experiments show that SRNeRV improves rate-distortion performance, particularly in INR-friendly scenarios, while significantly reducing model size.
Improve the rate-distortion performance of implicit neural video representations with a compact architecture by recursively applying a shared channel mixing module across scales.
Implicit Neural Representations (INRs) have emerged as a promising paradigm for video representation and compression. However, existing multi-scale INR generators often suffer from significant parameter redundancy by stacking independent processing blocks for each scale. Inspired by the principle of scale self-similarity in the generation process, we propose SRNeRV, a novel scale-wise recursive framework that replaces this stacked design with a parameter-efficient shared architecture. The core of our approach is a hybrid sharing scheme derived from decoupling the processing block into a scale-specific spatial mixing module and a scale-invariant channel mixing module. We recursively apply the same shared channel mixing module, which contains the majority of the parameters, across all scales, significantly reducing the model size while preserving the crucial capacity to learn scale-specific spatial patterns. Extensive experiments demonstrate that SRNeRV achieves a significant rate-distortion performance boost, especially in INR-friendly scenarios, validating that our sharing scheme successfully amplifies the core strengths of the INR paradigm.
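To make the parameter saving concrete, here is a minimal counting sketch that compares a stacked design (independent blocks per scale) with the hybrid sharing scheme (one spatial mixing module per scale, one channel mixing module reused at every scale). The layer choices are assumptions made for illustration only and are not taken from the paper: spatial mixing is modeled as a depthwise k x k convolution, channel mixing as two pointwise convolutions with an expansion ratio, and the channel width is held constant across scales so that the shared module can be reused. All function names are hypothetical.

```python
# Hypothetical parameter-count sketch. The layer shapes below are assumptions
# for illustration; they are not the architecture reported for SRNeRV.


def spatial_mixing_params(channels: int, kernel: int = 3) -> int:
    """Assumed depthwise k x k convolution: one k*k filter and one bias per channel."""
    return channels * kernel * kernel + channels


def channel_mixing_params(channels: int, expansion: int = 4) -> int:
    """Assumed pair of pointwise (1x1) convolutions, C -> r*C -> C, with biases."""
    hidden = channels * expansion
    expand = channels * hidden + hidden      # weights + biases of the first 1x1 conv
    project = hidden * channels + channels   # weights + biases of the second 1x1 conv
    return expand + project


def stacked_params(channels: int, num_scales: int,
                   kernel: int = 3, expansion: int = 4) -> int:
    """Stacked design: every scale owns its spatial AND channel mixing modules."""
    per_block = (spatial_mixing_params(channels, kernel)
                 + channel_mixing_params(channels, expansion))
    return num_scales * per_block


def recursive_params(channels: int, num_scales: int,
                     kernel: int = 3, expansion: int = 4) -> int:
    """Hybrid sharing: per-scale spatial mixing, one channel mixing module reused."""
    return (num_scales * spatial_mixing_params(channels, kernel)
            + channel_mixing_params(channels, expansion))


if __name__ == "__main__":
    channels, num_scales = 64, 5  # illustrative values, not from the paper
    stacked = stacked_params(channels, num_scales)
    shared = recursive_params(channels, num_scales)
    print(f"stacked:   {stacked} parameters")
    print(f"recursive: {shared} parameters")
    print(f"ratio:     {shared / stacked:.3f}")
```

Under these assumed shapes, the channel mixing module dominates each block (33,088 of 33,728 parameters at 64 channels), so sharing it across 5 scales reduces the total from 168,640 to 36,288 parameters. The actual saving in SRNeRV depends on its real layer shapes and channel widths, which this sketch does not reproduce.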