Mingli Song

College of Computer Science and Technology, Zhejiang University, State Key Laboratory of Blockchain and Security, Zhejiang University

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (2)Inference & Quantization (2)Computer Vision (2)Multimodal Models (2)

Frequent co-authors

Jie Song (4)Anda Cao (1)Zhuo Gou (1)Kaixuan Chen (1)

Papers (4)

Apr 20, 2026

1w ago

Evolutionary Negative Module Pruning for Better LoRA Merging

Pruning detrimental LoRA modules can lead to substantial performance gains in multi-task models, challenging the assumption that all components contribute positively.

Anda Cao, Zhuo Gou, Kaixuan Chen +3

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Training Efficiency & Optimization

Mar 17, 2026

Mar 17, 2026·also Tsinghua AI, Huawei

DriveFix: Spatio-Temporally Coherent Driving Scene Restoration

DriveFix tackles the "shaky camera" problem in 4D driving scene reconstruction, producing significantly more stable and coherent novel views by explicitly modeling spatio-temporal dependencies.

Heyu Si, Brandon James Denis, Muyang Sun +7

Computer Vision Multimodal Models Robotics & Embodied AI

Mar 17, 2026·also Tsinghua AI, Hangzhou High-Tech Zone (Binjiang, Institute of Blockchain and Data

$D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation

Achieve diffusion-level perceptual quality in monocular depth estimation at 40x the speed, by replacing the slow initial diffusion steps with a fast ViT-based depth map and refining in a compact latent space.

Ruizhi Wang, Weihan Li, Zunlei Feng +4

Architecture Design (Transformers, SSMs, MoE)Computer Vision Inference & Quantization

Feb 24, 2026

Tsinghua AIFeb 24, 2026·also ZJU

SpatiaLQA: A Benchmark for Evaluating Spatial Logical Reasoning in Vision-Language Models

VLMs still can't reason about spatial logic in real-world scenes, but a new benchmark and scene graph method shows how to make progress.

Yuechen Xie, Xiaoyan Zhang, Yicheng Shan +4

Eval Frameworks & Benchmarks Multimodal Models Reasoning & Chain-of-Thought

Search

Mingli Song

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)