Mar 17, 2026 | arXiv:2603.16651

What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline

AI Summary

This thesis introduces \pino, a novel end-to-end pipeline for developing norm-compliant and context-aware reinforcement learning agents, inspired by the Pinocchio story. \pino combines reinforcement learning with argumentation-based normative advisors, building upon the AJAR, Jiminy, and NGRL architectures. A key contribution is a new algorithm for automatically extracting arguments and relationships to inform the advisors' decisions, along with an investigation and mitigation strategy for norm avoidance in RL agents.

Key Contribution

Reinforcement learning agents can learn norm-compliant behavior through a pipeline in which argumentation-based normative advisors supervise the agent, and the arguments and relationships behind the advisors' decisions are extracted automatically.

Abstract

In the past decade, artificial intelligence (AI) has developed quickly. With this rapid progression came the need for systems capable of complying with the rules and norms of our society so that they can be successfully and safely integrated into our daily lives. Inspired by the story of Pinocchio in "Le avventure di Pinocchio - Storia di un burattino", this thesis proposes a pipeline that addresses the problem of developing norm-compliant and context-aware agents. Building on the AJAR, Jiminy, and NGRL architectures, the work introduces \pino, a hybrid model in which reinforcement learning agents are supervised by argumentation-based normative advisors. To make this pipeline operational, the thesis also presents a novel algorithm for automatically extracting the arguments and relationships that underlie the advisors' decisions. Finally, it investigates the phenomenon of norm avoidance, providing a definition and a mitigation strategy within the context of reinforcement learning agents. Each component of the pipeline is empirically evaluated. The thesis concludes with a discussion of related work, current limitations, and directions for future research.
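
To make the idea of an argumentation-based normative advisor more concrete, the following is a minimal sketch and not the thesis's implementation. It assumes a Dung-style abstract argumentation framework evaluated under grounded semantics, and it assumes the advisor acts by filtering the agent's candidate actions. Both are assumptions made for illustration: the abstract does not say which semantics is used or how the advisors feed back into learning (for example, through rewards instead of action filtering). All names (`grounded_extension`, `permitted_actions`, the example arguments) are hypothetical.

```python
from typing import Dict, Iterable, List, Set, Tuple

Attack = Tuple[str, str]  # (attacker, target)


def grounded_extension(arguments: Iterable[str], attacks: Iterable[Attack]) -> Set[str]:
    """Compute the grounded extension of an abstract argumentation framework.

    Iterates the characteristic function from the empty set until it reaches
    its least fixed point.
    """
    args = set(arguments)
    attackers: Dict[str, Set[str]] = {a: set() for a in args}
    for src, dst in attacks:
        # Ignore attacks that mention arguments not active in this context.
        if src in args and dst in args:
            attackers[dst].add(src)

    accepted: Set[str] = set()
    while True:
        # Arguments attacked by at least one currently accepted argument.
        defeated = {a for a in args if attackers[a] & accepted}
        # An argument is defended if every one of its attackers is defeated.
        defended = {a for a in args if attackers[a] <= defeated}
        if defended == accepted:
            return accepted
        accepted = defended


def permitted_actions(
    actions: List[str],
    active_arguments: Set[str],
    attacks: Iterable[Attack],
    prohibits: Dict[str, str],
) -> List[str]:
    """Hypothetical advisor: drop actions prohibited by an accepted argument.

    `prohibits` maps an argument name to the action it forbids (an assumption
    of this sketch, not a structure described in the abstract).
    """
    accepted = grounded_extension(active_arguments, attacks)
    forbidden = {prohibits[a] for a in accepted if a in prohibits}
    return [act for act in actions if act not in forbidden]


# Toy context-aware example, invented for illustration only.
ACTIONS = ["tell_truth", "lie"]
ATTACKS = [("lie_protects_friend", "lying_is_wrong")]
PROHIBITS = {"lying_is_wrong": "lie"}

# Only the prohibition is active, so lying is filtered out.
default_context = permitted_actions(ACTIONS, {"lying_is_wrong"}, ATTACKS, PROHIBITS)

# An exception argument is also active and defeats the prohibition.
exception_context = permitted_actions(
    ACTIONS, {"lying_is_wrong", "lie_protects_friend"}, ATTACKS, PROHIBITS
)
```

In this sketch the context determines which arguments are active, so the same norm can forbid an action in one situation and be overridden in another. A reinforcement learning agent would then choose only among the actions the advisor leaves available.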

Constitutional AI & AI Ethics | RLHF & Preference Learning | Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
