Yang Wang

MOSS-Audio achieves state-of-the-art performance in audio understanding tasks by effectively integrating temporal cues and deep acoustic features, setting a new benchmark for audio-language models.

Chen Yang, Chufan Yu, Hanfu Chen +22

Multimodal Models Speech & Audio

May 26, 2026

DISCOVER Robotics † Advising2w ago·also HFUT, Mitsubishi Electric Research Laboratories (MERL), SJTU, University of California +2

Cesarean Scar Defect Segmentation in Transvaginal Ultrasound Images: a Dataset and Benchmark

A new dataset of 1,111 transvaginal ultrasound images with detailed annotations finally enables AI-powered diagnosis of Cesarean Scar Defects, a condition frequently missed by sonographers.

Yue Li, Wei Xia, Tianyu Xu +7

Computer Vision Data Curation & Synthetic Data Eval Frameworks & Benchmarks

MiniMax +1952w ago·also Columbia, Eastern Institute of Technology, HFUT, HIT +12

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

MiniMax-M2 proves that massive parameter counts don't always translate to better agentic performance; strategic activation of a smaller subset can unlock frontier-level intelligence.

MiniMax, Aili Chen, Aonian Li +193

Architecture Design (Transformers, SSMs, MoE)Open-Source Models & Weights Tool Use & Agents

May 25, 2026

Shipeng Cao +42w ago·also Helmholtz, HFUT

AI-T2I: Aggregating-and-Isolating Cross-Attention to Diffusion Models for Text-to-Image Synthesis

Sharper text-to-image alignment is now possible in diffusion models by explicitly aggregating related attention and isolating unrelated attention.

Shipeng Cao, Biao Qian, Haipeng Liu +2

Architecture Design (Transformers, SSMs, MoE)Computer Vision Multimodal Models

May 5, 2026

OpenAIMay 5, 2026·also HFUT

Resilient AI Supercomputer Networking using MRC and SRv6

AI training jobs can now shrug off network failures that used to halt progress, thanks to a new resilient networking stack deployed at OpenAI and Microsoft.

Joao Araujo, Alex Chow, Mark Handley +149

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Training Efficiency & Optimization

May 4, 2026

May 4, 2026·also Central South University, Fullive Innovation (Beijing) AI Technology Co., HFUT, HKU +4

AcademiClaw: When Students Set Challenges for AI Agents

Today's best AI agents can only solve 55% of real-world academic tasks that university students find challenging, revealing a significant gap between current AI capabilities and the demands of academic workflows.

Junjie Yu, Pengrui Lu, Weiye Si +72

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Tool Use & Agents

Search

Yang Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (8)