Search papers, labs, and topics across Lattice.
Hefei University of Technology
8
0
12
STEDiff enhances text-to-image alignment without the need for costly fine-tuning, achieving remarkable semantic consistency even in complex prompts.
Models trained on the new VideoKR dataset achieve superior performance in knowledge-intensive video reasoning, setting a new benchmark for the field.
MOSS-Audio achieves state-of-the-art performance in audio understanding tasks by effectively integrating temporal cues and deep acoustic features, setting a new benchmark for audio-language models.
A new dataset of 1,111 transvaginal ultrasound images with detailed annotations finally enables AI-powered diagnosis of Cesarean Scar Defects, a condition frequently missed by sonographers.
MiniMax-M2 proves that massive parameter counts don't always translate to better agentic performance; strategic activation of a smaller subset can unlock frontier-level intelligence.
Sharper text-to-image alignment is now possible in diffusion models by explicitly aggregating related attention and isolating unrelated attention.
AI training jobs can now shrug off network failures that used to halt progress, thanks to a new resilient networking stack deployed at OpenAI and Microsoft.
Today's best AI agents can only solve 55% of real-world academic tasks that university students find challenging, revealing a significant gap between current AI capabilities and the demands of academic workflows.