The 2025 Automatic Music Transcription (AMT) Challenge benchmarked multi-instrument transcription models; two of the eight valid submissions surpassed the MT3 baseline. The submissions show improved transcription accuracy but also persistent difficulty with polyphony and timbre variation across instruments. The organizers point to two priorities for future challenges: broader genre coverage and a stronger emphasis on instrument detection.
Despite progress, accurately transcribing music with multiple instruments, complex polyphony, and diverse timbres remains a significant hurdle for AI.
This paper presents the results of the 2025 Automatic Music Transcription (AMT) Challenge, an online competition to benchmark progress in multi-instrument transcription. Eight teams submitted valid solutions; two outperformed the baseline MT3 model. The results highlight both advances in transcription accuracy and the remaining difficulties in handling polyphony and timbre variation. We conclude with directions for future challenges: broader genre coverage and stronger emphasis on instrument detection.