Thanawat Lodkaew

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (1)Red-Teaming & Adversarial Robustness (1)RLHF & Preference Learning (1)

Frequent co-authors

Soichiro Nishimori (3)Yu-Jie Zhang (2)Masashi Sugiyama (2)Johannes Ackermann (1)

Papers (3)

2026

Soichiro Nishimori +32026

On Symmetric Losses for Policy Optimization with Noisy Preferences

It is proved that symmetric losses enable successful policy improvement even with noisy labels, as the resulting reward is rank-preserving—a property that is identified as sufficient for policy improvement.

Soichiro Nishimori, Yu-Jie Zhang, Thanawat Lodkaew +1

Jun 5, 2026

Jun 5, 2026·also RIKEN

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Capped evaluation reveals that many high scores from coding agents are just clever shortcuts, not true problem-solving.

Thanawat Lodkaew, Johannes Ackermann, Soichiro Nishimori +1

Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness

May 30, 2025

May 30, 2025·also RIKEN

On Symmetric Losses for Robust Policy Optimization with Noisy Preferences

Even with noisy human preferences, symmetric losses can guarantee rank-preserving rewards, unlocking robust policy optimization for aligning language models.