G. Dubbelman

Research focus

Computer Vision (5) · Multimodal Models (3) · Architecture Design (Transformers, SSMs, MoE) (2) · Inference & Quantization (1) · Robotics & Embodied AI (1)

Frequent co-authors

Niccolò Cavagnero (3) · Gijs Dubbelman (3) · Daan de Geus (3) · Jing Gu (2)

Papers (5)

Apr 9, 2026 · also TU Eindhoven

Orion-Lite: Distilling LLM Reasoning into Efficient Vision-Only Driving Models

A vision-only driving model, distilled from a massive VLA teacher, not only matches but exceeds its teacher's performance, suggesting that vision-centric architectures for autonomous driving still have headroom.

Jing Gu, Niccolò Cavagnero, G. Dubbelman +1

Computer Vision · Inference & Quantization · Multimodal Models

Apr 9, 2026

Revisiting Radar Perception With Spectral Point Clouds

Radar point clouds, when enriched with spectral information, can outperform traditional dense range-Doppler spectra, suggesting a path toward more robust and generalizable radar perception models.

Hamza Alsharif, Jing Gu, P. Jancura +4

Computer Vision · Robotics & Embodied AI

Apr 6, 2026 · Amazon Science; also Purdue, TU Eindhoven

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Ditch the computational bloat: DeltaWorld slashes parameters by 35x and FLOPs by 2000x while generating more realistic video futures.

Tommie Kerssies, G. Berton, Gabriele Berton +7

Computer Vision · Training Efficiency & Optimization · World Models & Planning

Mar 26, 2026

PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders

Get 3x faster image segmentation, and video segmentation performance comparable to fine-tuned models, all while keeping your vision encoder frozen.

Niccolò Cavagnero, Narges Norouzi, G. Dubbelman +1

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Multimodal Models

Feb 19, 2026

VidEoMT: Your ViT is Secretly Also a Video Segmentation Model

Ditch the complex trackers: a plain ViT encoder, augmented with a clever query propagation trick, delivers state-of-the-art video segmentation at 10x the speed.

Narges Norouzi, Idil Esen Zulfikar, Niccolò Cavagnero +4

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Multimodal Models
