Yaowei Wang

Harbin Institute of Technology, Peng Cheng Laboratory

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (5)Computer Vision (4)Natural Language Processing (2)Red-Teaming & Adversarial Robustness (2)

Frequent co-authors

Shu-Tao Xia (2)Jinpeng Wang (2)Niu Lian (1)Alan Chen (1)

Papers (8)

Jul 5, 2026

Tsinghua AI1w ago·also Graduate School, HIT, Peng Cheng Laboratory, ZJU

UI-MOPD: Multi-Platform On-Policy Distillation for Continual GUI Agent Learning

UI-MOPD achieves a remarkable balance between retaining existing capabilities and adapting to new platforms, with task success rates that challenge conventional approaches in GUI agent learning.

Niu Lian, Alan Chen, Zhehao Yu +8

Multimodal Models Tool Use & Agents

Jun 30, 2026

1w ago·also Anhui University, CAS, HIT, Peng Cheng Laboratory

Domain Adaptive Object Detection via Dual-Stream Bilevel-Cycle Optimization

Unreliable pseudo-labels in object detection can be transformed into reliable training signals, leading to substantial performance gains across domains.

Yannan Chen, Wenqiang Wang, Ruoyu Chen +4

Computer Vision

Jun 25, 2026

Tsinghua AI2w ago·also HIT, Peng Cheng Laboratory, Pengcheng Laboratory, SYSU

In-Context Model Predictive Generation: Open-Vocabulary Motion Synthesis from Language Models to Physics

ICMPG achieves a groundbreaking balance between semantic fidelity and physical realism in motion synthesis, outperforming traditional methods in both standard and zero-shot scenarios.

Xiaomeng Fu, Junfan Lin, Yang Liu +4

Natural Language Processing Robotics & Embodied AI

Jun 9, 2026

Jun 9, 2026·also Tsinghua AI, Graduate School, Huawei, Peng Cheng Laboratory +4

MemVenom: Triggered Poisoning of Multimodal Memories in Web Agents

MemVenom reveals that web agents can be compromised with up to 99.15% success through sophisticated memory poisoning attacks that bypass traditional defenses.

Yv Zhang, Hao Sun, Hao Fang +4

Multimodal Models Red-Teaming & Adversarial Robustness

May 28, 2026

Tsinghua AIMay 28, 2026·also HIT, Huawei, Peng Cheng Laboratory

GenEraser: Generalizable Video Object Removal via Balanced Text-Mask Guidance and Decoupled Locator-Preserver

Removing objects from video just got a whole lot cleaner: GenEraser doesn't just erase the object, it intelligently removes associated effects like shadows and reflections, setting a new bar for realistic video editing.

Yuqing Chen, Lin Liu, Hai Wu +4

Computer Vision Multimodal Models Natural Language Processing

May 21, 2026

May 21, 2026·also HIT, Jilin, Meituan, Peng Cheng Laboratory +1

SegCompass: Exploring Interpretable Alignment with Sparse Autoencoders for Enhanced Reasoning Segmentation

Unlock "white-box" reasoning in vision-language models: SegCompass's sparse autoencoder creates an interpretable bridge between visual perception and chain-of-thought, outperforming black-box alignment methods.

Zhenyu Lu, Liupeng Li, Jinpeng Wang +4

Interpretability & Mechanistic Interp Multimodal Models Reasoning & Chain-of-Thought

May 6, 2026

Qiming Li +10May 6, 2026·also HIT, Peng Cheng Laboratory

CAST: Mitigating Object Hallucination in Large Vision-Language Models via Caption-Guided Visual Attention Steering

Steer LVLMs' attention with caption guidance and watch object hallucinations drop by 6%—no training required.

Qiming Li, Zekai Ye, Xiaocheng Feng +8

Computer Vision Multimodal Models

Apr 14, 2026

Apr 14, 2026·also Peng Cheng Laboratory

Efficient Adversarial Training via Criticality-Aware Fine-Tuning

Adversarial training of large vision models doesn't have to break the bank: CAAT achieves comparable robustness to standard methods by tuning just 6% of the parameters.

Wenyun Li, Dongmei Jiang, Yaowei Wang +1

Computer Vision Red-Teaming & Adversarial Robustness Training Efficiency & Optimization

Search

Yaowei Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (8)