Xin Li

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Computer Vision (5)Eval Frameworks & Benchmarks (3)Multimodal Models (3)Inference & Quantization (2)

Frequent co-authors

Zihao Ye (1)Yung Hsiang Lu (1)Xiao Hu (1)Shuai Zhang (1)

Papers (6)

Apr 21, 2026

5d ago·also Ajou University, Loyola University Chicago, UMN

Evaluation of Winning Solutions of 2025 Low Power Computer Vision Challenge

The LPCVC 2025 winning solutions showcase surprisingly effective strategies for balancing accuracy and efficiency in edge-based computer vision, pushing the boundaries of what's possible on resource-constrained devices.

Zihao Ye, Yung Hsiang Lu, Xiao Hu +14

Computer Vision Eval Frameworks & Benchmarks Inference & Quantization

Xiang Chen +555d ago·also TU Munich, WHU

LoViF 2026 Challenge on Real-World All-in-One Image Restoration: Methods and Results

Unified benchmarks reveal the state-of-the-art in simultaneously addressing multiple real-world image degradations like blur, low-light, and rain.

Xiang Chen, Hao Li, Jiangxin Dong +53

Computer Vision Eval Frameworks & Benchmarks

Apr 20, 2026

Pengcheng Laboratory6d ago·also VA and then adds the proposed components

APRVOS: 1st Place Winner of 5th PVUW MeViS-Audio Track

By explicitly verifying the visual existence of spoken references before segmentation, APRVOS substantially improves robustness in noisy audio-conditioned Ref-VOS, outperforming standard pipelines.

Deshui Miao, Yameng Gu, Chao Yang +2

Computer Vision Multimodal Models Speech & Audio

Apr 14, 2026

AI21w ago·also NVIDIA, Communication University of China, LARK Lab, UT Austin +1

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Nemotron 3 Super proves you can achieve comparable accuracy to existing 120B models, but with significantly higher inference throughput, by combining Mamba, Attention, and Mixture-of-Experts.

Aakshita Chandiramani, Aaron Blakeman, Abdullahi Olaoye +529

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Tool Use & Agents

Yingying Zhao +91w ago

Challenging Vision-Language Models with Physically Deployable Multimodal Semantic Lighting Attacks

VLMs can be easily fooled in the real world by strategically manipulating lighting, causing them to misinterpret scenes and hallucinate nonsensical captions.

Yingying Zhao, Chengyin Hu, Qike Zhang +7

Computer Vision Multimodal Models Red-Teaming & Adversarial Robustness

Apr 13, 2026

Xin Li +271w ago

LoViF 2026 Challenge on Human-oriented Semantic Image Quality Assessment: Methods and Results

A new dataset, SeIQA, offers a benchmark to evaluate how humans perceive semantic loss in degraded images, pushing beyond traditional quality metrics.

Xin Li, Daoli Xu, Wei Luo +25

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Search

Xin Li

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (6)