Fengxiang Wang

Papers on Lattice

Research focus

Multimodal Models (7), Computer Vision (3), Speech & Audio (3), Inference & Quantization (2)

Frequent co-authors

Yueying Li (2), Mingshuo Chen (2), Long Lan (2), Zehua Han (2)

Papers (8)

Jul 14, 2026

Mingzhen Xu +10 · 1w ago · also Corresponding author: Weisi Lin, Xidian

MBTI: A Multi-Branch Efficient Fine-Tuning Framework for Hyperspectral Image Classification with Foundation Models

Preserving full-band spectral information in hyperspectral image classification can significantly boost model performance while keeping parameter tuning minimal.

Mingzhen Xu, Haonan Guo, Di Wang +8

Computer Vision, Multimodal Models

Jun 9, 2026

Microsoft Research · Jun 9, 2026 · also Tsinghua AI, NUDT, PKU

LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization

LC-QAT achieves superior performance in 2-bit quantization with just a fraction of the training data, setting a new standard for data-efficient model optimization.

Haoyu Wang, Xingyu Yu, Haiyan Zhao +1

Inference & Quantization, Training Efficiency & Optimization

May 25, 2026

Tsinghua AI · May 25, 2026 · also HKUST, Kling Team, NJU, NTU +2

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Current audio-visual generation models struggle to maintain coherence and alignment when scaling to minute-long content, a problem exposed by the new LongAV-Compass benchmark.

Jiafu Tang, Qixun Wang, Fengxiang Wang +5

Eval Frameworks & Benchmarks, Multimodal Models, Speech & Audio

Apr 13, 2026

Apr 13, 2026 · also BUPT

Semantic-Geometric Dual Compression: Training-Free Visual Token Reduction for Ultra-High-Resolution Remote Sensing Understanding

Achieve faster and more accurate remote sensing interpretation by intelligently pruning visual tokens based on task-specific semantic and geometric importance, without any training.

Yueying Li, Fengxiang Wang, Mingshuo Chen +2

Computer Vision, Inference & Quantization, Multimodal Models

Apr 5, 2026

VA-FastNavi-MARL: Real-Time Robot Control with Multimedia-Driven Meta-Reinforcement Learning

Robots can now nimbly respond to new audio-visual commands in real-time, thanks to a meta-RL approach that bypasses the sensory processing bottleneck.

Shengxi Jing, Fengxiang Wang, Yuan Feng

Multimodal Models, Robotics & Embodied AI, Speech & Audio

Mar 30, 2026

Zehua Han +14 · Mar 30, 2026 · also Tsinghua AI, College of Computer Science and Software Engineering

PReD: An LLM-based Foundation Multimodal Model for Electromagnetic Perception, Recognition, and Decision

PReD leaps ahead by creating the first foundation model to close the loop on perception, recognition, and decision-making for electromagnetic signals.

Zehua Han, Jing Xiao, Yiqi Duan +12

Data Curation & Synthetic Data, Multimodal Models, Scientific Discovery & Drug Design

Mar 9, 2026

Tsinghua AI · Mar 9, 2026 · also Artificial Intelligence Institute of China, Beihang, Beijing Information Science and Technology, BUPT +4

MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals

MLLMs can now reliably interpret electromagnetic signals even in noisy environments, thanks to a new training framework and benchmark designed specifically for this challenging domain.

Junyu Shen, Zhendong She, Chenghanyu Zhang +11

Data Curation & Synthetic Data, Multimodal Models, Speech & Audio

Feb 15, 2026

Feb 15, 2026 · also BUPT, CAS, Northwestern

GeoEyes: On-Demand Visual Focusing for Evidence-Grounded Understanding of Ultra-High-Resolution Remote Sensing Imagery

MLLMs struggle to effectively zoom into relevant details in ultra-high-resolution remote sensing imagery, but a new staged training framework can teach them when and where to focus for substantial accuracy gains.

Fengxiang Wang, Mingshuo Chen, Yueying Li +3

Computer Vision, Multimodal Models, Tool Use & Agents
