Search papers, labs, and topics across Lattice.
3
0
7
2
Reasoning across languages doesn't have to break the bank: a new framework slashes token costs by over 50% while maintaining accuracy, especially boosting performance in low-resource languages.
Even the best large vision-language models struggle with multi-image reasoning, scoring only 50% on a new benchmark designed to challenge their capabilities.
Social intelligence may be a qualitatively different beast than analytical reasoning: a 7B model trained with SAVOIR beats GPT-4o and Claude-3.5-Sonnet on social interaction, while large reasoning models lag behind.