HKUSTSCUTUTokyoMay 25, 2026arXiv:2605.25751

SplitAvatar: One-shot Head Avatar with Autoregressive Gaussian Splitting

Hongzhe Liao, Chuhua Xian, Hongmin Cai, Haiyang Liu, Fa-Ting Hong

AI Summary

This paper introduces SplitAvatar, a novel method for reconstructing animatable head avatars from a single image using 3D Gaussian Splatting. The core innovation is an autoregressive Gaussian splitting network that progressively generates Gaussians from coarse to fine, guided by a mesh topology extension method to maintain graph consistency. A density control mechanism with a gating network prevents over-densification, resulting in improved expression representation and reconstruction quality.

Key Contribution

Achieve higher-fidelity, one-shot head avatars by autoregressively splitting 3D Gaussians, guided by a GNN, to capture fine-grained facial details.

Abstract

3D Gaussian Splatting (3DGS) provides an efficient method for high-quality scene reconstruction using anisotropic Gaussians. Recently, 3DGS-based methods have significantly improved the rendering quality of human avatars while enabling real-time performance. However, existing methods suffer from a magnitude mismatch in the number of Gaussians generated by image-based and 3DMM-based approaches. This discrepancy results in reconstructed expressions that lack fine-grained detail. In this paper, we introduce a novel method for reconstructing an animatable head avatar from a single image. We propose a Graph splitting network to progressively generate Gaussians from coarse to fine using an autoregressive architecture. To address the graph inconsistency caused by split Gaussians, we employ a mesh topology extension method to align the GNN's connectivity with the increased Gaussian count. Furthermore, we introduce a novel density control method that includes a gating mechanism that generates soft masks for Gaussians, preventing over-densification after the splitting operation. This allows for dynamic control over Gaussian density across different facial regions. For smooth and rapid training, we employ a delayed filtering strategy to avoid re-computing the graph topology during training. Experimental results demonstrate that our autoregressive structure effectively improves expression representation ability by progressively splitting Gaussians. This process, enabled by the GNN-guided splitting, synthesizes more precise facial details and achieves higher reconstruction quality.

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

SplitAvatar: One-shot Head Avatar with Autoregressive Gaussian Splitting

Related Papers