Apple ML Research

×Reasoning & Chain-of-Thought

3 papers from Apple ML Research on Reasoning & Chain-of-Thought

Feb 16, 2026

Apple MLFeb 16, 2026

The Potential of CoT for Reasoning: A Closer Look at Trace Dynamics

Just 20% of a strong model's chain-of-thought can unlock a weaker model's reasoning abilities, revealing the surprising transferability of CoT mechanics.

Gregor Bachmann, Seyed Mohsen Moosavi Dezfooli, Moin Nabi

Eval Frameworks & Benchmarks Interpretability & Mechanistic Interp Reasoning & Chain-of-Thought

Apple MLFeb 16, 2026·also EPFL

Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning

Key contribution not extracted.

Ilia Mahrooghi, Aryo Lotfi, Emmanuel Abbe

Reasoning & Chain-of-Thought RLHF & Preference Learning Training Efficiency & Optimization

Feb 13, 2026

Apple MLFeb 13, 2026

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

RL fine-tuning can make vision-language models *less* reliable reasoners, as gains in benchmark accuracy come at the cost of faithfulness to the underlying visual grounding and chain-of-thought.

Anshul Shah, Xiaoyu Zhu, Xinke Deng +3

Multimodal Models Reasoning & Chain-of-Thought Red-Teaming & Adversarial Robustness

Search

Apple ML Research