Forget compressing entire tokens – selectively routing *parts* of tokens based on query relevance unlocks better compression-quality tradeoffs in LoRA-adapted transformers.
Achieve state-of-the-art remote sensing image-text retrieval without the computational burden of large-scale vision-language model pre-training, thanks to a novel two-stage approach.
Dramatically improve multimodal recommendation accuracy without any training by initializing user embeddings with item modality features and user cluster information.
LLMs are still far from being autonomous scientists, failing to master even simplified, end-to-end physics research workflows.
Neural video codecs can be designed for biological substrates from the ground up, unlocking a new paradigm for DNA storage.
AMRs can now navigate reliably indoors without GPS or external infrastructure, thanks to a new method that simultaneously calibrates magnetometers and estimates robot pose.
Aligning speech VAEs with SSL features isn't a one-size-fits-all game: joint-marginal alignment with adaptive weighting unlocks superior performance across reconstruction, understanding, and generation.
Achieve industrial anomaly detection that not only locates defects, but explains them and generates controlled edits, all in one model.
Stop treating diffusion workflows as monolithic black boxes: LegoDiffusion unlocks 3x higher throughput by decomposing them into independently scalable microservices.
Squeeze 34% more decode speed out of your MoE model without sacrificing accuracy by intelligently budgeting expert activations.
CubeGraph achieves superior RAG performance by unifying vector and spatial search, eliminating the overhead of fragmented sub-index invocations common in existing systems.
VLMs in self-driving cars are shockingly vulnerable: a subtle combination of graffiti and foreign-language commands can hijack their behavior without degrading performance on normal tasks.
Multimodal LLMs are surprisingly vulnerable to backdoor attacks, but a simple patch-based augmentation and cross-view regularization can drastically improve robustness without sacrificing performance.
Stop treating tests as immutable oracles: letting repair agents revise behavioral constraints during search dramatically improves issue resolution.
Achieve near-lossless 4-bit quantization for LLMs in under a minute, without full fine-tuning, by correcting for non-uniform activation distributions.
LLMs can learn to reason *worse* from seemingly better training data: models trained on CoT data with lower loss can generalize poorly due to inheriting inefficient, divergent reasoning patterns.
Achieve significantly better structure preservation in text-guided image editing by injecting structure-related features into visual autoregressive models, guided by reinforcement learning.
LongCat-Next shatters the language-centric paradigm by unifying text, vision, and audio into a single autoregressive model with minimal modality-specific design, finally reconciling understanding and generation in discrete vision modeling.
Color image restoration gets a boost: exploiting saturation-value similarity in nonlocal methods yields significantly better results than relying on individual RGB channels.
Lossless compression can actually *speed up* LLM inference on GPUs, not just shrink model size, thanks to ZipServ's hardware-aware design.
Sports expose surprising limitations in VLMs' spatial reasoning: despite fine-tuning gains on a new, large-scale sports dataset, current models fail to generalize beyond existing benchmarks.
Unleashing heterogeneous robot swarms: a new data-driven method achieves cooperative localization even with sparse, unidirectional measurements, sidestepping restrictive geometric constraints.
Instruction-based image editing models still struggle to edit small objects, with a new benchmark revealing significant performance gaps despite progress on existing benchmarks.
ARLArena reveals the hidden instability of agentic RL, offering a path to more reliable LLM-based agents via a novel stable policy optimization method (SAMPO).
Ditch the quadratic cost and black-box nature of neural operators: Gaussian Particle Operators offer interpretable, near-linear PDE learning by representing fields as learned Gaussian atoms.
Achieve state-of-the-art LDCT image restoration with a Green Learning approach that's mathematically transparent, computationally efficient, and memory-friendly.
Achieve up to 5.48x speedup in merging proximity graph indexes for AKNN search by intelligently exploiting structural information, outperforming naive reconstruction by nearly 10x.
By randomly attending to different time patches and progressively mixing scales, SEMixer achieves state-of-the-art long-term time series forecasting with a lightweight architecture.
Stop letting noisy, low-predictability data ruin your time series models: APTF dynamically identifies and penalizes these samples during training, leading to improved forecasting and classification accuracy.
Forget small, curated datasets: DeepVision-103K unlocks stronger multimodal reasoning in LMMs via diverse, verifiable visual math problems.
This consensus provides expert recommendations for DAA-HJA in elderly patients with FNF, addressing key clinical dilemmas and promoting standardized surgical techniques.
This review highlights the unique challenges in managing PJI after tumor megaprosthetic reconstruction, emphasizing the need for tailored diagnostic and treatment strategies due to the elevated risk compared to standard arthroplasty.