Hao Wang

Papers on Lattice

Total citations

Topics

h-index

Research focus

Inference & Quantization (3)Training Efficiency & Optimization (3)Eval Frameworks & Benchmarks (3)Architecture Design (Transformers, SSMs, MoE) (2)RLHF & Preference Learning (2)

Frequent co-authors

Binxing Xu (2)Hao Gu (2)Lujun Li (2)Beisong Liu (2)

Papers (10)

Apr 9, 2026

Binxing Xu +10Apr 9, 2026

Bit-by-Bit: Progressive QAT Strategy with Outlier Channel Splitting for Stable Low-Bit LLMs

Achieve near-lossless 2-bit LLMs with a novel quantization-aware training scheme that progressively reduces precision and intelligently handles outlier channels.

Binxing Xu, Hao Gu, Lujun Li +8

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Training Efficiency & Optimization

Hao Gu +11Apr 9, 2026·also Wenxuan Zhang2 Fumin Shen1

QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training--Inference Mismatch

Quantizing rollouts in LLM RL pipelines introduces a training-inference gap that QaRL closes, leading to +5.5 performance on math problems.

Hao Gu, Hao Wang, Jiacheng Liu +9

Inference & Quantization RLHF & Preference Learning Training Efficiency & Optimization

Apr 8, 2026

Apr 8, 2026·also Key Laboratory of Cyberspace Security, NUDT

How Independent are Large Language Models? A Statistical Framework for Auditing Behavioral Entanglement and Reweighting Verifier Ensembles

LLMs are far more alike than you think: shared biases and failure modes mean that ensembling them is less effective than you'd hope.

Chenchen Kuai, Jiwan Jiang, Zihao Zhu +7

Constitutional AI & AI Ethics Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness

Apr 2, 2026

Hao Wang +10Apr 2, 2026·also HIT

COMPASS: Complete Multimodal Fusion via Proxy Tokens and Shared Spaces for Ubiquitous Sensing

Achieve robust multimodal fusion even with missing modalities by ensuring the fusion head always receives a complete, fixed-size input via learned proxy tokens.

Hao Wang, Hao Wang, Yanyu Qian +8

Architecture Design (Transformers, SSMs, MoE)Multimodal Models

Apr 1, 2026

Haoyu Zheng +9Apr 1, 2026·also WHU

Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions

Stop guessing how long LLM outputs will be – modeling the *distribution* of possible lengths slashes latency by 2x and boosts throughput by 40%.

Haoyu Zheng, Yongqiang Zhang, Fangcheng Fu +7

Distributed Systems & Hardware Inference & Quantization

Mar 31, 2026

Mar 31, 2026·also DUT, HKUST

Conditional Polarization Guidance for Camouflaged Object Detection

Polarization cues, often overlooked, can significantly boost camouflaged object detection by explicitly guiding RGB feature learning, leading to state-of-the-art performance.

Qifan Zhang, Hao Wang, Xiangrong Qin

Computer Vision

Fengjian Xue +5Mar 31, 2026·also Corresponding author, Xi'an Jiaotong Uni- versity, Zhejiang Lab

FED-Bench: A Cross-Granular Benchmark for Disentangled Evaluation of Facial Expression Editing

Current facial expression editing models can't simultaneously preserve identity and accurately manipulate expressions, revealing a critical need for better fine-grained instruction following.

Fengjian Xue, Heli Sun, Yunyun Shi +3

Computer Vision Eval Frameworks & Benchmarks

Mar 30, 2026

Yu Sun +16Mar 30, 2026·also Tsinghua AI

ManipArena: Comprehensive Real-world Evaluation of Reasoning-Oriented Generalist Robot Manipulation

Current robot manipulation benchmarks fail to capture the messy reality of real-world deployment, so this work introduces a new benchmark, ManipArena, to close the sim2real gap.

Yu Sun, Meng Cao, Ping Yang +14

Eval Frameworks & Benchmarks Robotics & Embodied AI World Models & Planning

Mar 29, 2026

Fengxian Li +50Mar 29, 2026

KAT-Coder-V2 Technical Report

Agentic coding models can achieve near-SOTA performance by specializing in distinct coding domains before unifying them via on-policy distillation.

Fengxian Li, Fengxiang Li, Haoyang Huang +48

Code Generation & Program Synthesis RLHF & Preference Learning Tool Use & Agents+1

Mar 3, 2026

Liang Lu +5Mar 3, 2026

Distributed Task Planning Method for Synchronous Execution of Heterogeneous Tasks in Uncertain Environments

Robot swarms can now synchronize and execute diverse tasks in uncertain environments without deadlocks, thanks to a new distributed planning method that dynamically adapts to risk and task dependencies.

Liang Lu, Xiangquan Gao, Hao Wang +3

Distributed Systems & Hardware Robotics & Embodied AI World Models & Planning

Search

Hao Wang

Research focus

Frequent co-authors

Papers (10)