Helena Casademunt

Papers on Lattice

Total citations

Topics

h-index

Papers (2)

Jun 3, 2026

Tsinghua AIJun 3, 2026·also BAIR, Department of Computer Science, Georgia Tech, KU Leuven +7

The hardest AI tasks remain largely unsolved, with current models achieving only a 2.6% success rate on economically valuable workflows.

Mar 5, 2026

Mar 5, 2026·also Warsaw University of Technology IDEAS

Censored LLMs offer a surprisingly natural and effective environment for stress-testing methods that aim to elicit truthfulness and detect deception.