Lattice: the structure behind the noise

Created by Flynn Lachendro

Gayane Ghazaryan

Institute for Natural Language Processing, University of Stuttgart

Papers on Lattice: 1

Total citations: 0

Topics: 3

Publication activity (papers/week, last 8 weeks)

Research focus

Constitutional AI & AI Ethics (1), Eval Frameworks & Benchmarks (1), RLHF & Preference Learning (1)

Frequent co-authors

Esra Dönmez (1)

Papers (1)

May 6, 2026

Misaligned by Reward: Socially Undesirable Preferences in LLMs

Current reward models often *prefer* socially undesirable responses, revealing a critical gap in LLM alignment beyond instruction following.

Gayane Ghazaryan, Esra Dönmez

Constitutional AI & AI Ethics, Eval Frameworks & Benchmarks, RLHF & Preference Learning