S. Cheung

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (3)Eval Frameworks & Benchmarks (3)Code Generation & Program Synthesis (2)Red-Teaming & Adversarial Robustness (1)

Frequent co-authors

Zimo Ji (2)Congying Xu (2)Zongjie Li (2)Yudong Gao (2)

Papers (4)

Jul 2, 2026

Zimo Ji +53w ago

Cloak and Detonate: Scanner Evasion and Dynamic Detection of Agent Skill Malware

Static scanners fail against adaptive evasions, but a new behavior-centric auditor can detect 97% of malicious skills with minimal false positives.

Zimo Ji, Congying Xu, Zongjie Li +3

Red-Teaming & Adversarial Robustness Tool Use & Agents

Zimo Ji +43w ago

Coding Agents Are Guessing: Measuring Action-Boundary Violations in Underspecified DevOps Instructions

Coding agents guess their way through underspecified instructions, leading to alarming action-boundary violations that challenge the notion of safe autonomy.

Zimo Ji, Congying Xu, Zongjie Li +2

Constitutional AI & AI Ethics Eval Frameworks & Benchmarks Tool Use & Agents

Apr 6, 2026

Independent ResearcherApr 6, 2026·also SJTU

Scaling Coding Agents via Atomic Skills

Forget task-specific overfitting: training coding agents on atomic skills unlocks surprisingly broad generalization to complex software engineering tasks.

Yingwei Ma, Yanhao Li, Kelin Fu +7

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Tool Use & Agents

Apr 2, 2026

Zhiyong Chen +3Apr 2, 2026·also HKUST

Can Large Language Models Model Programs Formally?

LLMs struggle to translate code into formal specifications, as evidenced by their poor performance on the new Model-Bench benchmark, revealing a critical gap in their ability to support formal verification.

Zhiyong Chen, Jialun Cao, Jiarong Wu +1

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought

Search

S. Cheung

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)