Marco Lettiero

University of Naples Parthenope,Dept. of Science and Technology,Naples,Italy

Papers on Lattice

Total citations

Topics

h-index

Research focus

Computer Vision (1)Distributed Systems & Hardware (1)Inference & Quantization (1)Training Efficiency & Optimization (1)

Frequent co-authors

Pasquale De Luca (1)

Papers (1)

Jun 30, 2025

Parallel Optimization of Quantized CNNs for Efficient GPU Inference

FP8 quantization slashes VGG16's inference time by 40% and memory footprint by 32% on an RTX 4090, making it a sweet spot for efficient GPU deployment compared to INT8 and FP32.

Marco Lettiero, Pasquale De Luca

Computer Vision Distributed Systems & Hardware Inference & Quantization+1

Search

Marco Lettiero

Research focus

Frequent co-authors

Papers (1)