Allen Institute for AI (AI2)

×Tool Use & Agents

7 papers from Allen Institute for AI (AI2) on Tool Use & Agents

Apr 14, 2026

AI2Apr 14, 2026·also NVIDIA, UT Austin, Waterloo, ZJU

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Nemotron 3 Super proves you can achieve comparable accuracy to existing 120B models, but with significantly higher inference throughput, by combining Mamba, Attention, and Mixture-of-Experts.

Aakshita Chandiramani, Aaron Blakeman, Abdullahi Olaoye +463

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Tool Use & Agents

Apr 10, 2026

Apr 10, 2026·also AI2

ScheMatiQ: From Research Question to Structured Data through Interactive Schema Discovery

Skip the annotation bottleneck: ScheMatiQ lets you turn research questions and text corpora into structured databases with LLMs, guided by a simple web interface.

Shahar Levy, Reshef Mintz, Barak Raveh +1

Data Curation & Synthetic Data Natural Language Processing Tool Use & Agents

Apr 9, 2026

AI2Apr 9, 2026·also Paul G. Allen School of Computer Science

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Open-source web agents can now outperform GPT-4o on key web navigation tasks, thanks to a new dataset and model family that levels the playing field.

Tanmay Gupta, Piper Wolters, Zixian Ma +17

Data Curation & Synthetic Data Open-Source Models & Weights Tool Use & Agents

Apr 7, 2026

AI2Apr 7, 2026·also BUPT

RAGEN-2: Reasoning Collapse in Agentic RL

LLM agents can appear to reason well (high entropy) while completely ignoring the input, and mutual information is a far better metric for catching this failure.

Chi Gui, Chi Gui, Xing Jin +13

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Mar 29, 2026

UWMar 29, 2026·also AI2, Microsoft Research, Stanford HAI, Bake AI +5

Emergent Social Intelligence Risks in Generative Multi-Agent Systems

Generative multi-agent systems spontaneously exhibit collusion and conformity, mirroring societal pathologies, even without explicit programming and bypassing individual agent safeguards.

Wenjie Wang, Yuchen Ma, Zichen Chen +4

Constitutional AI & AI Ethics Red-Teaming & Adversarial Robustness Tool Use & Agents

Mar 16, 2026

Mar 16, 2026·also AI2, Bell Labs

Are We Automating the Joy Out of Work? Designing AI to Augment Work, Not Meaning

AI is poised to automate the most joyful and agentic parts of our jobs, while developers are building AI with the wrong traits.

Jaspreet Ranjit, Swabha Swayamdipta, Daniele Quercia +1

Constitutional AI & AI Ethics Natural Language Processing Tool Use & Agents

Feb 26, 2026

AI2Feb 26, 2026·also Allen Institute of AI, Alongside.care, Bar-Ilan, Northeastern

Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset

Forget simple keyword searches – scientists are using AI research tools as collaborative partners, delegating complex tasks and engaging with results in surprisingly persistent and non-linear ways.

Dany Haddad, Dany Haddad, Daniel Bareket +39

Recommendation & Information Retrieval Scientific Discovery & Drug Design Tool Use & Agents

Search

Allen Institute for AI (AI2)