LLM-powered diagnostic AI is ready for prime time: a real-world clinical trial shows it's safe, patients love it, and doctors find it useful.
An AI agent cracked an open problem in theoretical physics, deriving exact analytical solutions for gravitational radiation from cosmic strings, suggesting AI can do more than just pattern recognition.
Multimodal web agents are surprisingly vulnerable to cross-modal attacks, but a novel adversarial training approach can double task completion efficiency while mitigating these risks.
LLMs are becoming "epistemic agents" that shape our knowledge environment, so we need a new framework for evaluating and governing them based on trustworthiness, not just performance.
Gemini 3 Deep Think can now autonomously solve a majority of problems in a challenging math competition, signaling a leap in AI's mathematical reasoning capabilities.
Sequence models can learn to cooperate in multi-agent settings simply by training against diverse partners, no explicit meta-learning required.
Forget prompt engineering: PCAS deterministically enforces complex authorization policies in multi-agent systems, boosting compliance from 48% to 93% without restructuring existing agents.
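The core idea here, deterministic enforcement outside the prompt, can be sketched as a gatekeeper that checks every tool call against an explicit allow-list before execution. This is an illustrative assumption of how such a layer might look, not PCAS's actual API; all names below are hypothetical.

```python
# Hypothetical sketch of deterministic authorization for agent tool calls,
# in the spirit of the PCAS result above. Names and policy structure are
# illustrative assumptions, not the paper's real interface.
from dataclasses import dataclass

@dataclass(frozen=True)
class ToolCall:
    agent: str      # which agent is asking
    tool: str       # which tool it wants to invoke
    resource: str   # what the tool would touch

# Explicit allow-list: (agent, tool, resource-prefix) triples.
POLICY = {
    ("support-bot", "read_ticket", "tickets/"),
    ("support-bot", "send_reply", "tickets/"),
    ("billing-bot", "read_invoice", "invoices/"),
}

def is_authorized(call: ToolCall) -> bool:
    """Deterministic check: the call must match an allow-list entry.
    Unlike a prompt-based guardrail, this cannot be talked out of a 'no'."""
    return any(
        call.agent == agent and call.tool == tool
        and call.resource.startswith(prefix)
        for agent, tool, prefix in POLICY
    )

def execute(call: ToolCall) -> str:
    """Gate every tool call through the policy before running it."""
    if not is_authorized(call):
        return f"DENIED: {call.agent} may not {call.tool} on {call.resource}"
    return f"OK: {call.tool}({call.resource})"
```

Because the check wraps the tool layer rather than the prompt, existing agents need no restructuring: they keep emitting tool calls as before, and only compliant ones go through.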
Coding agents are vulnerable to a new class of stealthy, automated prompt injection attacks via poisoned skills, achieving high success rates even in realistic software engineering tasks.
GPT-5's scientific reasoning skills plummet by nearly 50% when tackling multi-step workflows, revealing a critical gap in current LLM agents' ability to orchestrate complex tool use.
Forget "smart plagiarism" – multi-stage LLM workflows like recursive decomposition and long-context pipelines can actually generate novel research plans, outperforming simpler reflection-based methods.
Forget hand-annotated data: Magnet distills multi-turn tool-use skills into LLMs by automatically generating training trajectories that outperform even Gemini 1.5 Pro.