Jon-Paul Cacioli

Papers on Lattice

Research focus

Natural Language Processing (4); Architecture Design (Transformers, SSMs, MoE) (4); Eval Frameworks & Benchmarks (2); Open-Source Models & Weights (2)

Papers (6)

Apr 30, 2026

Jon-Paul Cacioli, 3w ago

Beyond the Mean: Within-Model Reliable Change Detection for LLM Evaluation

LLM upgrades are a chaotic mix of progress and decay: despite overall gains, up to 47% of questions get *worse* after an update, and single-shot evals miss almost half of these critical regressions.

Jon-Paul Cacioli

Eval Frameworks & Benchmarks; Natural Language Processing; Open-Source Models & Weights

Apr 28, 2026

Jon-Paul Cacioli, 3w ago

Below-Chance Blindness: Prompted Underperformance in Small LLMs Produces Positional Bias Rather than Answer Avoidance

Forget sophisticated deception – small LLMs "sandbagging" on tests just pick option 'E' or 'F' regardless of the question, revealing a surprising positional bias instead of true answer-aware avoidance.

Jon-Paul Cacioli

Constitutional AI & AI Ethics; Eval Frameworks & Benchmarks; Red-Teaming & Adversarial Robustness

Apr 23, 2026

Jon-Paul Cacioli, Apr 23, 2026

Cross-Entropy Is Load-Bearing: A Pre-Registered Scope Test of the K-Way Energy Probe on Bidirectional Predictive Coding

Cross-entropy loss isn't just a detail – it's the unsung hero behind how well energy probes work in predictive coding networks, accounting for up to 66% of the probe-softmax gap.

Jon-Paul Cacioli

Architecture Design (Transformers, SSMs, MoE); Interpretability & Mechanistic Interp

Apr 6, 2026

Jon-Paul Cacioli, Apr 6, 2026

Same Geometry, Opposite Noise: Transformer Magnitude Representations Lack Scalar Variability

Transformers get the magnitude geometry right, but completely botch the noise: unlike brains, their representations become *less* variable for larger numbers.

Jon-Paul Cacioli

Architecture Design (Transformers, SSMs, MoE); Natural Language Processing; Open-Source Models & Weights

Jon-Paul Cacioli, Apr 6, 2026

Exemplar Retrieval Without Overhypothesis Induction: Limits of Distributional Sequence Learning in Early Word Learning

Even with perfect memorization of examples, autoregressive transformers fail to learn higher-order generalizations about word categories, suggesting a fundamental gap in how these models learn compared to children.

Jon-Paul Cacioli

Architecture Design (Transformers, SSMs, MoE); Natural Language Processing; Scaling Laws & Emergent Abilities

Mar 30, 2026

Jon-Paul Cacioli, Mar 30, 2026

Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries

LLMs exhibit categorical perception-like warping in their hidden state representations at digit-count boundaries, even without explicit semantic category knowledge, revealing a surprising sensitivity to structural input discontinuities.

Jon-Paul Cacioli

Architecture Design (Transformers, SSMs, MoE); Interpretability & Mechanistic Interp; Natural Language Processing
