Peter Wonka

Papers on Lattice

Total citations

Topics

h-index

Research focus

Computer Vision (5)Multimodal Models (3)Data Curation & Synthetic Data (2)Architecture Design (Transformers, SSMs, MoE) (2)

Frequent co-authors

Xiangjun Tang (2)Aleksandar Cvejic (1)Rameen Abdal (1)A. Eldesokey (1)

Papers (5)

Apr 2, 2026

Aleksandar Cvejic +5Apr 2, 2026·also Served in an advisory role

NearID: Identity Representation Learning via Near-identity Distractors

Pre-trained vision encoders are shockingly bad at distinguishing identity when background context is controlled, but a simple contrastive learning scheme can fix it.

Aleksandar Cvejic, Rameen Abdal, A. Eldesokey +3

Computer Vision Data Curation & Synthetic Data

Mar 18, 2026

Mar 18, 2026·also D geometry and aspect ratio of the subject., D map that represents a distortion-free, Imperial, KAUST

AHOY! Animatable Humans under Occlusion from YouTube Videos with Gaussian Splatting and Video Diffusion Priors

Reconstructing complete, animatable 3D avatars from heavily occluded YouTube videos is now possible, thanks to a hallucination-as-supervision pipeline using diffusion models.

Aymen Mir, Riza Alp Guler, Xiangjun Tang +2

Computer Vision Multimodal Models

Mar 4, 2026

Mar 4, 2026·also KAUST

EgoPoseFormer v2: Accurate Egocentric Human Motion Estimation for AR/VR

Achieve real-time egocentric motion capture with 19% better accuracy and half the jitter of prior art, thanks to a transformer architecture and self-supervised pretraining on millions of unlabeled frames.

Sai Kumar Dwivedi, Filip Maric, Carlos Chacon +9

Architecture Design (Transformers, SSMs, MoE)Computer Vision Data Curation & Synthetic Data

Mar 3, 2026

Any Resolution Any Geometry: From Multi-View To Multi-Patch

Key contribution not extracted.

Wenqing Cui, Mykola Lavreniuk, Ramzi Idoughi +2

Architecture Design (Transformers, SSMs, MoE)Computer Vision Multimodal Models

Feb 12, 2026

Feb 12, 2026·also CAS, KAUST, Luxembourg

OMEGA-Avatar: One-shot Modeling of 360{\deg} Gaussian Avatars

Finally, you can generate a fully animatable, 360-degree 3D head avatar from a single image, without per-instance optimization.

Yiqun Wang, Jun Xiao, Peter Wonka

Computer Vision Multimodal Models

Search

Peter Wonka

Research focus

Frequent co-authors

Papers (5)