Xiaoyu Shen

FPGAs can beat GPUs at dynamically allocating computation for LLM inference, thanks to a new architecture that fuses operations, uses mixed precision, and caches KV values on-chip.

Zicheng He, Anhao Zhao, Xiaoyu Shen +1

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Mar 4, 2026

Mar 4, 2026·also Eastern Institute of Technology, Ningbo Institute of Digital Twin

From Static Inference to Dynamic Interaction: Navigating the Landscape of Streaming Large Language Models

Untangling the mess of "streaming LLMs," this paper delivers a clear taxonomy that distinguishes between streaming generation, streaming inputs, and interactive architectures.

Zilong Wang, YuJie Ren, Peiran Yin +2

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Natural Language Processing

Mar 3, 2026

Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models

LVLMs can reason about video streams *much* faster and better by thinking concurrently with the incoming data, not in batches.

Jialiang Zhang, Yirong Sun, Yunpu Ma +1

Computer Vision Multimodal Models Reasoning & Chain-of-Thought

Mar 1, 2026

Mar 1, 2026·also Eastern Institute of Technology

Beyond Global Similarity: Towards Fine-Grained, Multi-Condition Multimodal Retrieval

Forget simple image search: MCMR reveals how current multimodal models struggle with the complex, interdependent constraints of real-world product search.

Kangle Li, Haohang Huang, Xiaoyu Shen

Eval Frameworks & Benchmarks Multimodal Models Recommendation & Information Retrieval

Feb 16, 2026

Rethinking the Role of LLMs in Time Series Forecasting

LLMs actually *do* improve time series forecasting, especially for cross-domain generalization, overturning prior doubts with a massive 8-billion observation study.

Xin Qiu, Yirong Sun, Yunpu Ma +1

Eval Frameworks & Benchmarks Natural Language Processing

Search

Xiaoyu Shen

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (6)