Martin Vechev

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Eval Frameworks & Benchmarks (3) · Red-Teaming & Adversarial Robustness (1) · Data Curation & Synthetic Data (1) · Natural Language Processing (1)

Frequent co-authors

Martin T. Vechev (2) · Fabian Kaczmarczyck (1) · Ivan Petrov (1) · Ilia Shumailov (1)

Papers (3)

May 28, 2026

Google Research · 2w ago · also DeepMind, ETH, AI Sequrity Company

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots

LLM-powered honeypots can trick even frontier models into longer interactions than rule-based systems, all while costing less to run.

Fabian Kaczmarczyck, Ivan Petrov, Ilia Shumailov +5

Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness

Feb 25, 2026

ETH · Feb 25, 2026 · also Sofia University "St. Kliment Ohridski"

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

LLM benchmark translations can be dramatically improved by test-time compute scaling, revealing a surprisingly cheap way to get more reliable multilingual evaluations.

Hanna Yukhymenko, Anton Alexandrov +3

Data Curation & Synthetic Data · Eval Frameworks & Benchmarks · Natural Language Processing

Feb 12, 2026

ETH · Feb 12, 2026

Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?

Context files like AGENTS.md, intended to guide coding agents, often *hurt* performance and increase costs, challenging the common practice of using them.

Thibaud Gloaguen, Niels Mundler +7

Code Generation & Program Synthesis · Eval Frameworks & Benchmarks · Tool Use & Agents
