Tsinghua AIRUC | Apr 8, 2026 | arXiv:2604.06734

TEC: A Collection of Human Trial-and-error Trajectories for Problem Solving

Xinkai Zhang, Jingtao Zhan, Qingyao Ai

AI Summary

The paper introduces the Trial-and-Error Collection (TEC), a new dataset of 5,370 human trial-and-error trajectories across 58 tasks, gathered via a custom data annotation platform that also captures user reflections on error feedback. The dataset aims to address the lack of real-world human trial-and-error data for training AI systems. Experiments using TEC reveal that humans significantly outperform LLMs in trial-and-error problem solving, highlighting a gap in current AI capabilities.

Key Contribution

Humans still substantially outperform LLMs at trial-and-error problem solving, and this new dataset of human problem-solving trajectories provides the data needed to study why.

Abstract

Trial-and-error is a fundamental strategy for humans to solve complex problems and a necessary capability for Artificial Intelligence (AI) systems operating in real-world environments. Although several trial-and-error AI techniques have recently been proposed, most of them rely on simple heuristics designed by researchers and achieve limited performance gains. The core issue is the absence of appropriate data: current models cannot learn from detailed records of how humans actually conduct trial-and-error in practice. To address this gap, we introduce a data annotation platform and a corresponding dataset, termed Trial-and-Error Collection (TEC). The platform records users' complete trajectories across multiple trials and collects their reflections after receiving error feedback. Using this platform, we record the problem-solving processes of 46 participants on 58 tasks, resulting in 5,370 trial trajectories along with error reflections across 41,229 webpages. With this dataset, we observe that humans achieve substantially higher accuracy than LLMs, indicating that humans are more effective at trial-and-error. We believe that the TEC platform and dataset provide a valuable foundation for understanding human trial-and-error behavior and for developing more capable AI systems. Platform and dataset are publicly available.
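To make the kind of record described in the abstract concrete, here is a minimal Python sketch of how one participant's repeated trials on a task might be represented. The abstract does not specify the TEC schema, so every name here (`TrialTrajectory`, `TaskSession`, `pages_visited`, `reflection`, `accuracy`) is a hypothetical illustration, not the dataset's actual format.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TrialTrajectory:
    """One attempt at a task. All field names are hypothetical."""
    pages_visited: List[str]          # webpages opened during this trial
    answer: str                       # answer submitted at the end of the trial
    correct: bool                     # feedback received for this answer
    reflection: Optional[str] = None  # written after error feedback, if any


@dataclass
class TaskSession:
    """All trials by one participant on one task (assumed grouping)."""
    participant_id: str
    task_id: str
    trials: List[TrialTrajectory] = field(default_factory=list)

    def solved(self) -> bool:
        # A session counts as solved if any trial was marked correct.
        return any(t.correct for t in self.trials)

    def trials_to_success(self) -> Optional[int]:
        # 1-based index of the first correct trial, or None if never solved.
        for i, t in enumerate(self.trials, start=1):
            if t.correct:
                return i
        return None


def accuracy(sessions: List[TaskSession]) -> float:
    """Fraction of sessions that were eventually solved."""
    if not sessions:
        return 0.0
    return sum(s.solved() for s in sessions) / len(sessions)
```

Under this assumed layout, a session with one failed trial (with a reflection) followed by a correct one would report `trials_to_success() == 2`. The accuracy measure actually used in the paper may be defined differently.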

Data Curation & Synthetic Data | Reasoning & Chain-of-Thought | Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 37

Year: 2026

Venue: N/A
