Search papers, labs, and topics across Lattice.
This study investigates the generalization capabilities of fine-tuned small language models for graph structural inference, focusing on graph size and family distribution. Through a controlled experimental setup involving three instruction-tuned models and two graph serialization formats, the authors assess performance on larger graphs and diverse graph families. The results reveal that these models maintain strong ordinal consistency and effectively rank graphs by structural properties, even when faced with inputs significantly larger than those encountered during training, highlighting architecture-specific degradation profiles.
Fine-tuned small language models can reliably generalize to larger and structurally distinct graphs, maintaining strong performance in graph property estimation.
Small language models fine-tuned for graph property estimation have demonstrated strong in-distribution performance, yet their generalization capabilities beyond training conditions remain poorly understood. In this work, we systematically investigate the boundaries of structural inference in fine-tuned small language models along two generalization axes - graph size and graph family distribution - and assess domain-learning capability on real-world graph benchmarks. Using a controlled experimental setup with three instruction-tuned models in the 3-4B parameter class and two graph serialization formats, we evaluate performance on graphs substantially larger than the training range and across held-out random graph families. Our results show that fine-tuned models maintain strong ordinal consistency across structurally distinct graph families and continue to rank graphs by structural properties on inputs substantially larger than those seen during training, with distinct architecture-specific degradation profiles. These findings delineate where fine-tuned small language models generalize reliably, providing empirical grounding for their use in graph-based reasoning tasks.