Yinyi Guo

You can now build a real-time, privacy-preserving conversational assistant for procedural tasks using *only* audio and IMU data, thanks to a new finetuning method that makes the assistant less chatty and more helpful.

Rehana Mahfuz, Yinyi Guo, Erik Visser +1

Robotics & Embodied AI Speech & Audio Tool Use & Agents

Feb 16, 2026

Naveen Vakada +4Feb 16, 2026

LongAudio-RAG: Event-Grounded Question Answering over Multi-Hour Long Audio

By grounding LLMs in timestamped acoustic events instead of raw audio, LongAudio-RAG enables accurate question answering over multi-hour audio, outperforming standard RAG and text-to-SQL baselines.

Naveen Vakada, Kartik Hegde, Arvind Krishna Sridhar +2

Natural Language Processing Recommendation & Information Retrieval Speech & Audio

Search

Yinyi Guo

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)