Hiroya Takamura

National Institute of Advanced Industrial Science and Technology, Japan

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (3)Speech & Audio (1)Eval Frameworks & Benchmarks (1)Reasoning & Chain-of-Thought (1)

Frequent co-authors

Tatsuya Ishigaki (3)Anum Afzal (2)Yuki Saito (2)Shinnosuke Takamichi (2)

Papers (3)

Jun 11, 2026

Jun 11, 2026·also CMU ML, AIST, Keio, NAIST +1

Low-Latency Real-Time Audio Game Commentary System via LLM-Based Parallel Text Generation

Reducing inter-utterance silence from 9.6 seconds to 0.3 seconds transforms the quality of real-time game commentary, making it feel more natural and engaging.

Ryota Kawamatsu, Anum Afzal, Yuki Saito +5

Multimodal Models Speech & Audio

Mar 19, 2026

Multimodal Task Interference: A Benchmark and Analysis of History-Target Mismatch in Multimodal LLMs

Multimodal LLMs suffer a major performance hit when asked to switch from text-based to image-based tasks mid-conversation, revealing a surprising asymmetry in their ability to handle task interference.

Masayuki Kawarada, Masayuki Kawarada, Tatsuya Ishigaki +1

Eval Frameworks & Benchmarks Multimodal Models Reasoning & Chain-of-Thought

Mar 3, 2026

Mar 3, 2026·also CMU ML, Keio, LMU, NAIST +2

Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

Forget fine-tuning: Prompting MLLMs with a dynamic interval-based decoding strategy lets them generate surprisingly human-like, pause-aware real-time game commentary.

Anum Afzal, Yuki Saito, Hiroya Takamura +5

Computer Vision Multimodal Models Natural Language Processing

Search

Hiroya Takamura

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)