Lattice AI Research

Research focus

Eval Frameworks & Benchmarks (3)Natural Language Processing (1)Constitutional AI & AI Ethics (1)Red-Teaming & Adversarial Robustness (1)Robotics & Embodied AI (1)

Frequent co-authors

Pei-ke Zhu (2)Sidi Chang (1)Peiying Zhu (1)Rongdong Chai (1)

Papers (3)

Apr 30, 2026

Sidi Chang +4Apr 30, 2026·also Blossom AI Labs

Measurement Risk in Supervised Financial NLP: Rubric and Metric Sensitivity on JF-ICR

Subtle wording changes in benchmark rubrics can swing model performance by over 13%, revealing a hidden subjectivity in "objective" gold labels.

Sidi Chang, Pei-ke Zhu, Peiying Zhu +2

Eval Frameworks & Benchmarks Natural Language Processing

Apr 28, 2026

Pei-ke Zhu +1Apr 28, 2026

ValueAlpha: Agreement-Gated Stress Testing of LLM-Judged Investment Rationales Before Returns Are Observable

LLM-judged investment rationales reward verbosity and confidence over actual financial insight, penalizing concise, correct reasoning by nearly 3 points.

Pei-ke Zhu, Yuxiao Chen

Constitutional AI & AI Ethics Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness

Mar 18, 2026

Stanford HAIMar 18, 2026·also Georgia Tech

ReSteer: Quantifying and Refining the Steerability of Multitask Robot Policies

Robots often ignore your commands mid-task, but ReSteer offers a way to fix this by pinpointing and patching the "blind spots" in their training data.

Zhenyang Chen, Alan Tian, Alan Tian +11

Eval Frameworks & Benchmarks Robotics & Embodied AI Tool Use & Agents

Search

Yuxiao Chen

Research focus

Frequent co-authors

Papers (3)