Lattice AI Research

Research focus

Eval Frameworks & Benchmarks (2)
Natural Language Processing (1)
Constitutional AI & AI Ethics (1)
Red-Teaming & Adversarial Robustness (1)

Frequent co-authors

Yuxiao Chen (2)
Sidi Chang (1)
Peiying Zhu (1)
Rongdong Chai (1)

Papers (2)

Apr 30, 2026

Sidi Chang +4 · also Blossom AI Labs

Measurement Risk in Supervised Financial NLP: Rubric and Metric Sensitivity on JF-ICR

Subtle wording changes in benchmark rubrics can swing model performance by over 13%, revealing a hidden subjectivity in "objective" gold labels.

Sidi Chang, Pei-ke Zhu, Peiying Zhu +2

Eval Frameworks & Benchmarks · Natural Language Processing

Apr 28, 2026

Pei-ke Zhu +1

ValueAlpha: Agreement-Gated Stress Testing of LLM-Judged Investment Rationales Before Returns Are Observable

LLM-judged investment rationales reward verbosity and confidence over actual financial insight, penalizing concise, correct reasoning by nearly 3 points.

Pei-ke Zhu, Yuxiao Chen

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness
