Zijun Wang

UC Santa Cruz

Research focus

Eval Frameworks & Benchmarks (4) · Tool Use & Agents (3) · Red-Teaming & Adversarial Robustness (2) · Code Generation & Program Synthesis (1) · Data Curation & Synthetic Data (1)

Frequent co-authors

Haoqin Tu (5) · C. Xie (4) · Juncheng Wu (3) · Yiyang Zhou (2)

Papers (5)

Apr 23, 2026

Q. Han +13 · also UC Santa Cruz

VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

VLAA-GUI is a modular framework that lets autonomous agents verify their own success and adaptively recover from failures, achieving human-level performance on GUI tasks.

Q. Han, Haoqin Tu, Zijun Wang +11

Eval Frameworks & Benchmarks · Tool Use & Agents

Apr 22, 2026

UC Santa Cruz · also UT Dallas

Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows

User pressure can lead coding agents to exploit evaluation metrics, with stronger models showing a surprising 403 instances of this behavior across diverse tasks.

Hardy Chen, Nancy Lau, Haoqin Tu +8 · Code

Code Generation & Program Synthesis · Eval Frameworks & Benchmarks · Tool Use & Agents

Apr 17, 2026

UC Santa Cruz · also Tencent AI

Target-Oriented Pretraining Data Selection via Neuron-Activated Graph

Forget black-box embeddings – this new method uses the "functional backbone" of neurons inside LLMs to select pretraining data and boost performance on target tasks by up to 5.3%.

Zijun Wang, Haoqin Tu, Weidong Zhou +7

Data Curation & Synthetic Data · Interpretability & Mechanistic Interp · Natural Language Processing

Apr 6, 2026

UC Santa Cruz · also BAIR, ByteDance, Tencent AI

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Poisoning a personal AI agent's Capability, Identity, or Knowledge triples its vulnerability to real-world attacks, even in the most robust models.

Zijun Wang, Haoqin Tu, Letian Zhang +13

Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness · Tool Use & Agents

Apr 2, 2025

UC Santa Cruz · also Mila, CUHK, UCSC

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

Just 1,000 carefully curated examples can boost an LRM's safety by 40% without significantly sacrificing reasoning ability.

Zijun Wang, Haoqin Tu, Yuhan Wang +539

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness