Daniel Ezer

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Distributed Systems & Hardware (1)
Inference & Quantization (1)
Recommendation & Information Retrieval (1)

Frequent co-authors

Roi Pony (1)
Adi Raz Goldfarb (1)
Idan Friedman (1)
U. Barzelay (1)

Papers (1)

May 28, 2026

Roi Pony +4, 3w ago

FLASH-MAXSIM: IO-Aware Fused Kernels for Late-Interaction Scoring

Late-interaction retrieval just got faster and cheaper: by never materializing the massive similarity tensor, Flash-MaxSim cuts memory usage by 16x and speeds up inference by 4.7x on an H100.

Roi Pony, Adi Raz Goldfarb, Idan Friedman +2

Distributed Systems & Hardware, Inference & Quantization, Recommendation & Information Retrieval
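
For readers unfamiliar with late-interaction scoring, the summary above can be illustrated with a minimal sketch. It assumes the standard MaxSim definition (for each query token, take the maximum dot product over all document tokens, then sum over query tokens) and non-empty inputs. The function names and the `tile` parameter are hypothetical, chosen for illustration only. This is plain Python, not the paper's fused GPU kernel; it only shows why a running maximum removes the need to store the full query-by-document similarity matrix.

```python
from typing import List, Sequence

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_naive(query: List[Vector], doc: List[Vector]) -> float:
    """Reference version: builds the full |query| x |doc| similarity matrix."""
    sim = [[dot(q, d) for d in doc] for q in query]
    return sum(max(row) for row in sim)


def maxsim_streaming(query: List[Vector], doc: List[Vector], tile: int = 2) -> float:
    """Streaming version (illustrative): walks the document in tiles and keeps
    only one running maximum per query token, so the full similarity matrix
    is never stored."""
    running_max = [float("-inf")] * len(query)
    for start in range(0, len(doc), tile):
        block = doc[start:start + tile]  # one tile of document tokens
        for i, q in enumerate(query):
            for d in block:
                s = dot(q, d)
                if s > running_max[i]:
                    running_max[i] = s
    return sum(running_max)


if __name__ == "__main__":
    query = [[1.0, 0.0], [0.0, 1.0]]
    doc = [[0.5, 0.5], [1.0, 0.0], [0.0, 2.0]]
    print(maxsim_naive(query, doc), maxsim_streaming(query, doc))
```

Both functions return the same score; the streaming variant holds one value per query token instead of one value per (query token, document token) pair, which is the general idea behind avoiding the large intermediate tensor.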
