George Mason University
Model internals, not just outputs, hold the key to predicting generalization: circuit-based metrics beat standard proxies by up to 34% in assessing ViT performance under distribution shift.
LLM web agents struggle more with perceptual grounding and low-level execution than high-level reasoning, challenging the assumption that better reasoning alone will solve web navigation.