Rui Wang

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Computer Vision (1)Data Curation & Synthetic Data (1)Robotics & Embodied AI (1)

Frequent co-authors

Gang Yu (2)Bo Zhao (2)Jinghong Lan (1)Wei Cheng (1)

Papers (6)

Jun 18, 2026

3d ago·also HKU, Westlake

FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining

FreeStyle achieves a remarkable balance between style alignment and content preservation while effectively suppressing semantic leakage in dual-reference image generation.

Jinghong Lan, Wei Cheng, Yunuo Chen +10

Computer Vision Data Curation & Synthetic Data Multimodal Models

Jun 12, 2026

1w ago·also Tsinghua AI

Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack

A fully integrated robot learning stack that bridges the gap from simulation to real-world deployment, enhancing the efficacy of vision-language-action models.

He Zhang, Lingzhu Xiang, Haitao Lin +23

Multimodal Models Robotics & Embodied AI

Jun 8, 2026

1w ago·also AI Lab, Tencent AI

PBSD: Privileged Bayesian Self-Distillation for Long-Horizon Credit Assignment

Sparse rewards can be transformed into actionable turn-level feedback, enabling agents to learn from both successful and misleading actions in long-horizon tasks.

Yang Tian, Rui Wang, Xumeng Wen +5

Reasoning & Chain-of-Thought Tool Use & Agents

Jun 4, 2026

Rui Wang +22w ago

LLMCodec: Adapting Video Codecs for Efficient Weight Compression of Large Language Models

Video codecs can slash LLM perplexity by over 1.5x while boosting task accuracy by 21%, revolutionizing model compression strategies.

Rui Wang, Yan Zhao, Zhengxue Cheng

Inference & Quantization Scaling Laws & Emergent Abilities

2w ago·also School of Artificial Intelligence and Computer, SYSU

A Sliced-Wasserstein Framework on Correlation Matrices for EEG Decoding

A new framework for EEG decoding boosts generalization across datasets while maintaining low training overhead and no extra inference costs.

Chen Hu, Rui Wang, Jiale Zhou +4

Scientific Discovery & Drug Design

May 22, 2026

Open-Sora Plan TeamMay 22, 2026·also Annenberg School of Communication and Journalism, Department of Foundation Model, Griffith, PKU +4

StepAudio 2.5 Technical Report

Forget specialized architectures: StepAudio 2.5 proves a single audio-language foundation, shaped by RLHF, can dominate ASR, TTS, and real-time dialogue simultaneously.

Bin Lin, Bo Zhao, Boyong Wu +89

Architecture Design (Transformers, SSMs, MoE)Open-Source Models & Weights Speech & Audio

Search

Rui Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (6)