Search papers, labs, and topics across Lattice.
This paper introduces the Semantic Timbre Dataset, a collection of electric guitar sounds labeled with 19 semantic timbre descriptors and magnitudes derived from guitar effect units. The dataset aims to bridge the gap between perceptual timbre dimensions and machine learning representations for improved timbre control. Validation using a VAE demonstrates the dataset's ability to capture timbral structure and enable smooth interpolation across descriptors.
Unlock timbre-aware generative AI with a new dataset linking semantic descriptors to electric guitar sounds, enabling nuanced control over audio synthesis.
Understanding and manipulating timbre is central to audio synthesis, yet this remains under-explored in machine learning due to a lack of annotated datasets linking perceptual timbre dimensions to semantic descriptors. We present the Semantic Timbre Dataset, a curated collection of monophonic electric guitar sounds, each labeled with one of 19 semantic timbre descriptors and corresponding magnitudes. These descriptors were derived from a qualitative analysis of physical and virtual guitar effect units and applied systematically to clean guitar tones. The dataset bridges perceptual timbre and machine learning representations, supporting learning for timbre control and semantic audio generation. We validate the dataset by training a variational autoencoder (VAE) on its latent space and evaluating it using human perceptual judgments and descriptor classifiers. Results show that the VAE captures timbral structure and enables smooth interpolation across descriptors. We release the dataset, code, and evaluation protocols to support timbre-aware generative AI research.