Xuelong Li

Single-pixel imaging gets a deep learning boost: SISTA-Net leverages learned sparsity and hybrid CNN-VSSM architectures to achieve state-of-the-art reconstruction quality, even in noisy underwater environments.

Jijun Lu, Yifan Chen, Libang Chen +5

Architecture Design (Transformers, SSMs, MoE)Computer Vision Training Efficiency & Optimization

Mar 18, 2026

Huan Song +7Mar 18, 2026

Ruyi2.5 Technical Report

Ruyi2.5 achieves comparable performance to Qwen3-VL on general multimodal benchmarks while significantly outperforming it in privacy-constrained surveillance, demonstrating the effectiveness of its edge-cloud architecture.

Huan Song, Shuyu Tian, Qingfei Zhao +5

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Open-Source Models & Weights

Mar 17, 2026

Yiqiang Zhou +5Mar 17, 2026

Advancing Visual Reliability: Color-Accurate Underwater Image Enhancement for Real-Time Underwater Missions

Achieve real-time (409 FPS) underwater image enhancement with a tiny (3,880 parameter) model that significantly improves color accuracy, enabling deployment on resource-constrained underwater platforms.

Yiqiang Zhou, Yifan Chen, Zhe Sun +3

Computer Vision Robotics & Embodied AI

Mar 10, 2026

Xiamen UniversityMar 10, 2026·also Shanghai Innovation, TeleAI, USTC

Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation

Don't fully retrain your draft model after fine-tuning your LLM: EDA restores speculative decoding performance with significantly less compute by adapting only a small, private component and regenerating training data.

Luxi Lin, Zhihang Lin, Zhanpeng Zeng +4

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Training Efficiency & Optimization

Haoran Yang +6Mar 10, 2026

ZeroWBC: Learning Natural Visuomotor Humanoid Control Directly from Human Egocentric Video

Skip the costly robot teleoperation data: ZeroWBC learns surprisingly natural humanoid control policies directly from human egocentric videos.

Haoran Yang, Jiacheng Bao, Yucheng Xin +4

Computer Vision Robotics & Embodied AI World Models & Planning

Jan 22, 2025

Chenjia Bai +5Jan 22, 2025

Online Preference Alignment for Language Models via Count-based Exploration

LLMs can learn better from human feedback by exploring more creatively, thanks to a simple coin-flip counting method that encourages them to try new things.

Chenjia Bai, Yang Zhang, Shuang Qiu +320

Natural Language Processing RLHF & Preference Learning

Search

Xuelong Li

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (8)