Safe RL policies are designed to avoid unsafe actions, yet they can be effectively attacked by a novel framework that first learns the safety constraints from demonstrations and then crafts adversarial perturbations, all without access to the target policy's gradients.
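The two-stage structure described (learn a constraint proxy from demonstrations, then run a gradient-free perturbation search against it) could look roughly like the sketch below. All names here (`fit_constraint_model`, `black_box_attack`, the logistic-regression proxy, and random search as the black-box optimizer) are illustrative assumptions, not the paper's actual method or API.

```python
# Hypothetical sketch of the two-stage black-box attack, assuming:
# - demonstrations come as (state, action) feature vectors with unsafe labels,
# - the target policy is a callable we can query but not differentiate.
import numpy as np
from sklearn.linear_model import LogisticRegression

def fit_constraint_model(features, labels):
    """Stage 1: approximate the safety constraint from demonstrations.
    `features` are concatenated (state, action) vectors; `labels` mark
    constraint violations (1 = unsafe). A logistic regression stands in
    for whatever richer constraint model the paper actually learns."""
    model = LogisticRegression(max_iter=1000).fit(features, labels)
    return lambda s, a: model.predict_proba(
        np.concatenate([s, a]).reshape(1, -1))[0, 1]

def black_box_attack(policy, unsafe_prob, state, eps=0.05, n_trials=256):
    """Stage 2: gradient-free random search for an observation perturbation
    (inside an L-infinity ball of radius eps) that steers the policy toward
    actions the learned constraint proxy scores as unsafe. Only forward
    queries to `policy` are used, so no gradient access is required."""
    best_delta, best_score = np.zeros_like(state), -np.inf
    for _ in range(n_trials):
        delta = np.random.uniform(-eps, eps, size=state.shape)
        action = policy(state + delta)        # query-only policy access
        score = unsafe_prob(state, action)    # learned proxy, not ground truth
        if score > best_score:
            best_delta, best_score = delta, score
    return state + best_delta
```

Random search is only one possible zeroth-order optimizer in this setting; any query-based method (e.g., evolutionary strategies) would fit the same gradient-free template.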