Pablo Samuel Castro

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Interpretability & Mechanistic Interp (1)Reasoning & Chain-of-Thought (1)Distributed Systems & Hardware (1)Training Efficiency & Optimization (1)

Frequent co-authors

Hugh Blayney (1)Álvaro Arroyo (1)Johan Obando-Ceron (1)Aaron Courville (1)

Papers (2)

Apr 13, 2026

MilaApr 13, 2026

A Mechanistic Analysis of Looped Reasoning Language Models

Looped LLMs don't just perform better reasoning, they also internally mirror the distinct inference stages of standard feedforward models, repeating them cyclically.

Hugh Blayney, Álvaro Arroyo, Johan Obando-Ceron +4

Architecture Design (Transformers, SSMs, MoE)Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought

Mar 2, 2026

Homayoun Honari +10Mar 2, 2026

Align and Filter: Improving Performance in Asynchronous On-Policy RL

Overcome policy lag in distributed RL with TV-ACPO, a method that aligns advantage functions and constrains policy updates, leading to more robust and scalable on-policy learning.

Homayoun Honari, Homayoun Honari, Roger Creus Castanyer +8

Distributed Systems & Hardware Training Efficiency & Optimization

Search

Pablo Samuel Castro

Research focus

Frequent co-authors

Papers (2)