Lattice AI Research

Research focus

Red-Teaming & Adversarial Robustness (3)Tool Use & Agents (1)Natural Language Processing (1)RLHF & Preference Learning (1)

Frequent co-authors

Charith Peris (2)A.G. Galstyan (2)Kai-Wei Chang (2)Yuting Ning (1)

Papers (3)

Jun 1, 2026

2w ago·also Stanford HAI, HKU

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

Current agents are alarmingly susceptible to skill-based attacks, with success rates reaching over 86%, exposing a critical vulnerability in AI safety.

Yuting Ning, Zhehao Zhang, Yash Kumar Lal +8

Red-Teaming & Adversarial Robustness Tool Use & Agents

May 5, 2026

Amazon ScienceMay 5, 2026·also MIT CSAIL

SWAN: Semantic Watermarking with Abstract Meaning Representation

Semantic watermarks, embedded via AMR, survive paraphrasing attacks that obliterate token-level watermarks.

Ziping Ye, Gourab Dey, Christos Christodoulopoulos +7

Natural Language Processing Red-Teaming & Adversarial Robustness

Apr 20, 2026

Amazon ScienceApr 20, 2026·also MIT CSAIL

ARES: Adaptive Red-Teaming and End-to-End Repair of Policy-Reward System

Current red-teaming efforts miss the forest for the trees: ARES reveals that safety failures often stem from a systemic breakdown between the LLM *and* the reward model, not just the LLM itself.

Jiacheng Liang, Yao Ma, Tharindu Kumarage +8

Red-Teaming & Adversarial Robustness RLHF & Preference Learning

Rahul Gupta

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)

Search

Rahul Gupta

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)