Forrest McKee

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (2)Red-Teaming & Adversarial Robustness (2)Code Generation & Program Synthesis (1)Reasoning & Chain-of-Thought (1)Constitutional AI & AI Ethics (1)

Frequent co-authors

David A. Noever (3)

Papers (3)

May 7, 2025

David A. Noever +1May 7, 2025

Alpha Excel Benchmark

Forget abstract reasoning benchmarks – this new Excel-based challenge reveals how LLMs actually perform on the kinds of financial modeling tasks used by 1.5 billion people daily.

David A. Noever, Forrest McKee

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought

Feb 8, 2025

David A. Noever +1Feb 8, 2025

Forbidden Science: Dual-Use AI Challenge Benchmark and Scientific Refusal Tests

LLMs exhibit wildly different safety profiles when probed about dual-use science, with refusal rates ranging from 0% to 73% depending on the model.

David A. Noever, Forrest McKee

Constitutional AI & AI Ethics Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness

Jan 9, 2025

David A. Noever +1Jan 9, 2025

Infecting Generative AI With Viruses

LLMs can be tricked into executing malicious code hidden inside images, exposing a critical security vulnerability in their file handling capabilities.

David A. Noever, Forrest McKee

Computer Vision Multimodal Models Red-Teaming & Adversarial Robustness

Search

Forrest McKee

Research focus

Frequent co-authors

Papers (3)