Shanghai Artificial Intelligence Laboratory. The research work was conducted in the JC STEM Lab of Machine Learning and Computer Vision, funded by The Hong Kong Jockey Club Charities Trust. This research received partial support from the Global STEM Professorship Scheme from the Hong Kong Special Administrative Region.
Imagine fixing your robot's mistakes *before* it even makes them: RoboPocket lets you train robots twice as efficiently using just your smartphone and AR.
Automatically generating personas from VR app store reviews can efficiently foster empathy and uncover hidden accessibility needs in VR development.
Current video LLMs falter when faced with the demands of real-time interaction, a gap RIVER Bench directly addresses by providing a challenging new evaluation framework.
Achieve state-of-the-art small object detection by explicitly preserving fine-grained structural details and modeling global relations, even in complex backgrounds.
InterFormer tackles the "interaction illusion" in egocentric hand-object parsing, achieving state-of-the-art results by explicitly modeling hand-object co-occurrence and spatial dynamics.
Escape the bottleneck of translating product intent into ranking system hypotheses: GEARS offers an agentic framework that autonomously discovers and validates superior ranking policies.
Current LLMs and VLMs struggle with multi-step reasoning in long videos, often failing to maintain temporal coherence and procedural validity, as revealed by a new benchmark of hour-long narratives.