Search papers, labs, and topics across Lattice.
The paper trains transformers on letter-string analogy tasks using Meta-Learning for Compositionality (MLC), finding that performance hinges on the model's ability to attend to informative problem elements. Copying tasks are introduced during training to guide attention, significantly improving the model's ability to learn and generalize to new alphabets, especially with heterogeneous datasets. Interpretability analysis reveals an algorithm approximating the model's computations, allowing for precise steering.
Copying, often seen as a rote task, unlocks analogical reasoning in transformers by forcing them to attend to the most relevant features.
Analogical reasoning is a hallmark of human intelligence, enabling us to solve new problems by transferring knowledge from one situation to another. Yet, developing artificial intelligence systems capable of robust human-like analogical reasoning has proven difficult. In this work, we train transformers using Meta-Learning for Compositionality (MLC) on an analogical reasoning task (letter-string analogies) and assess their generalization capabilities. We find that letter-string analogies become learnable when guiding the models to attend to the most informative problem elements induced by including copying tasks in the training data. Furthermore, generalization to new alphabets becomes better when models are trained with more heterogeneous datasets, where our 3-layer encoder-decoder model outperforms most frontier models. The MLC approach also enables some generalization to compositions of trained transformations, but not to completely novel transformations. To understand how the model operates, we identify an algorithm that approximates the model's computations. We verify this using interpretability analyses and show that the model can be steered precisely according to expectations derived from the algorithm. Finally, we discuss implications of our findings for generalization capabilities of larger models and parallels to human analogical reasoning.