LLMs can be made significantly more robust to jailbreaks by weighting the reasoning steps in DPO training, leading to more principled refusals.
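The core idea can be sketched as a token-weighted variant of the DPO objective, where per-token log-probabilities belonging to reasoning steps are upweighted before the preference margin is computed. This is a minimal illustrative sketch, not the paper's exact method: the function names (`weighted_seq_logprob`, `weighted_dpo_loss`), the choice of per-token weight masks, and the specific weighting scheme are all assumptions for illustration.

```python
import math

def weighted_seq_logprob(token_logprobs, weights):
    # Weighted sum of per-token log-probs; tokens marked as reasoning
    # steps carry weight > 1 so they dominate the sequence score.
    # (The weighting scheme here is an illustrative assumption.)
    return sum(w * lp for lp, w in zip(token_logprobs, weights))

def weighted_dpo_loss(chosen_lp, chosen_ref_lp,
                      rejected_lp, rejected_ref_lp,
                      chosen_w, rejected_w, beta=0.1):
    # Standard DPO loss: -log sigmoid(beta * margin), where the margin
    # compares policy-vs-reference log-prob gaps for the chosen and
    # rejected responses. The only change from vanilla DPO is that each
    # sequence log-prob is reweighted token-by-token.
    margin = ((weighted_seq_logprob(chosen_lp, chosen_w)
               - weighted_seq_logprob(chosen_ref_lp, chosen_w))
              - (weighted_seq_logprob(rejected_lp, rejected_w)
                 - weighted_seq_logprob(rejected_ref_lp, rejected_w)))
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# Toy usage: two-token responses where the second token is a reasoning
# step (weight 2.0). With policy equal to reference the margin is 0 and
# the loss is log(2), the usual DPO starting point.
chosen = [-0.1, -0.2]
rejected = [-0.3, -0.5]
loss = weighted_dpo_loss(chosen, chosen, rejected, rejected,
                         chosen_w=[1.0, 2.0], rejected_w=[1.0, 2.0])
```

In this sketch, upweighting reasoning-step tokens means disagreements between chosen and rejected responses on those tokens move the margin more than disagreements elsewhere, which is one plausible reading of "weighting the reasoning steps" in the summary above.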