Lattice AI Research

Research focus

Constitutional AI & AI Ethics (2) · Eval Frameworks & Benchmarks (2) · Natural Language Processing (1) · Red-Teaming & Adversarial Robustness (1)

Frequent co-authors

Jiho Jin (2) · Junho Myung (2) · Junyeong Park (1) · Rifki Afina Putri (1)

Papers (2)

May 26, 2026

Also Google Research, Universitas Gadjah Mada

JuICE: A Benchmark for Evaluating LLM-Judge in Identifying Cultural Errors

Even the best LLM judges miss cultural faux pas that are obvious to locals, reaching an F1 score of only 52% on a new benchmark.

Jiho Jin, Junho Myung, Juhyun Oh +5

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Natural Language Processing

Mar 4, 2026

FINEST: Improving LLM Responses to Sensitive Topics Through Fine-Grained Evaluation

LLMs can be made significantly more helpful and less cautious on sensitive topics by using fine-grained feedback that pinpoints specific errors in content, logic, and appropriateness.

Juhyun Oh, Chani Jung, Jiho Jin +2

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness

Juhyun Oh
