Sandeep P. Chinchali

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Eval Frameworks & Benchmarks (1)Inference & Quantization (1)Natural Language Processing (1)

Frequent co-authors

Po-han Li (3)Shenghui Chen (2)U. Topcu (2)Shenghui Chen (1)

Papers (3)

Jul 9, 2026

2w ago

VEGAS: Human-Aligned Video Caption Evaluation via Gaze

Captions selected with VEGAS align significantly better with human attention, boosting retrieval performance and challenging the status quo of video captioning metrics.

Shenghui Chen, Shenghui Chen, Po-han Li +13

Eval Frameworks & Benchmarks Multimodal Models

Mar 4, 2026

Mohammad Omama +8Mar 4, 2026·also UT Austin

SSR: A Generic Framework for Text-Aided Map Compression for Localization

Text, combined with learned image embeddings, can compress maps by 2x while preserving localization accuracy, offering a practical solution to the growing memory demands of robotic mapping.

Mohammad Omama, Po-han Li, Harsh Goel +6

Inference & Quantization Natural Language Processing Robotics & Embodied AI

Jan 14, 2026

ViSIL: Unified Evaluation of Information Loss in Multimodal Video Captioning

Ditch BLEU and ROUGE: ViSIL offers a unified metric for multimodal video captioning that actually correlates with VQA performance and human judgment by measuring information loss via VLM inference.

Po-han Li, Shenghui Chen, U. Topcu +1

Multimodal Models Reasoning & Chain-of-Thought

Search

Sandeep P. Chinchali

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)