MusicNet-16k + EM for YourMT3

Sungkyun Chang; Dixon, Simon; Benetos, Emmanouil

doi:10.5281/zenodo.8082232

Published April 2, 2023 | Version v22

Dataset Open

MusicNet-16k + EM for YourMT3

1. C4DM, Queen Mary University of London

< UNDER CONSTRUCTION >

About this version:

This particular variant of the MusicNet dataset has been resampled to a 16 kHz-mono-16-bit-wav format, which makes it more suitable for certain audio processing tasks, particularly those that require lower sampling rates. We redistribute this data as a part of YourMT3 project. The license for redistribution is attached.

Moreover, this version of the dataset includes various split options derived from previous works on automatic music transcription as python dictionary (see README.md). Below is a brief description of available split options:

MUSICNET_SPLIT_INFO = {
    'train_mt3': [], # the first 300 songs are synth dataset, while the remaining 300 songs are acoustic dataset. 
    'train_mt3_synth' : [], # Note: this is not the synthetic dataset of EM (MIDI Pop 80K) nor pitch-augmented. Just recording of MusicNet MIDI, split by MT3 author's split. But not sure if they used this (maybe not).
    'train_mt3_acoustic': [],
    'validation_mt3': [1733, 1765, 1790, 1818, 2160, 2198, 2289, 2300, 2308, 2315, 2336, 2466, 2477, 2504, 2611],
    'validation_mt3_synth': [1733, 1765, 1790, 1818, 2160, 2198, 2289, 2300, 2308, 2315, 2336, 2466, 2477, 2504, 2611],
    'validation_mt3_acoustic': [1733, 1765, 1790, 1818, 2160, 2198, 2289, 2300, 2308, 2315, 2336, 2466, 2477, 2504, 2611],
    'test_mt3_acoustic': [1729, 1776, 1813, 1893, 2118, 2186, 2296, 2431, 2432, 2487, 2497, 2501, 2507, 2537, 2621],
    'train_thickstun': [], # the first 327 songs are synth dataset, while the remaining 327 songs are acoustic dataset.  
    'test_thickstun': [1819, 2303, 2382],
    'train_mt3_em': [], # 293 tracks. MT3 train set - 7 missing tracks[2194, 2211, 2227, 2230, 2292, 2305, 2310], ours
    'validation_mt3_em': [1733, 1765, 1790, 1818, 2160, 2198, 2289, 2300, 2308, 2315, 2336, 2466, 2477, 2504, 2611], # ours
    'test_mt3_em': [1729, 1776, 1813, 1893, 2118, 2186, 2296, 2431, 2432, 2487, 2497, 2501, 2507, 2537, 2621], # ours
    'train_em_table2' : [], # 317 tracks. Whole set - 7 missing tracks[2194, 2211, 2227, 2230, 2292, 2305, 2310] - 6 test_em
    'test_em_table2' : [2191, 2628, 2106, 2298, 1819, 2416], # strings and winds from Cheuk's split, using EM annotations
    'test_cheuk_table2' : [2191, 2628, 2106, 2298, 1819, 2416], # strings and winds from Cheuk's split, using Thickstun's annotations
}

About MusicNet:

The MusicNet dataset, originally released in 2016 by Thickstun et al., "Learning Features of Music from Scratch". It is a collection of music recordings annotated with labels for various tasks, such as automatic music transcription, instrument recognition, and genre classification. The original dataset contains over 330 hours of audio, sourced from various public domain recordings of classical music, and is labeled with instrument activations and note-wise annotations.

About MusicNet EM:

MusicNetEM are refined labels for the MusicNet dataset, in the form of MIDI files. They are aligned with the recordings, with onset timing within 32ms. They were created using an EM process, similar to the one described in the Ben Maman and Amit H. Bermano, "Unaligned Supervision for Automatic Music Transcription in The Wild". Their split (Table 2 of this paper) derived from another paper, Kin Wai Cheuk et al., "ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data".

License:

CC-BY-4.0

Files

LICENSE_musicnet_em.txt

Files (42.5 GB)

Name	Size	Download all
enstdrums_yourmt3_16k.tar.gz md5:7e28c2a923e4f4162b3d83877cedb5eb	1.6 GB	Download
guitarset_yourmt3_16k.tar.gz md5:e3cfe0cc9394d91d9c290ce888821360	1.1 GB	Download
idmtbass_yourmt3_16k.tar.gz md5:46c0ecf15931419e8b444ea83dac84b8	809.8 MB	Download
LICENSE_musicnet_em.txt md5:3ed98e7f7fb167436e6745b61cfa5fe3	1.2 kB	Preview Download
maestro_yourmt3_16k.tar.gz md5:c17c6a188d936e5ff3870ef27144d397	19.2 GB	Download
maps_yourmt3_16k.tar.gz md5:6b070d162c931cd5e69c16ef2398a649	1.6 GB	Download
mir1k_yourmt3_16k.tar.gz md5:4cbac56a4e971432ca807efd5cb76d67	669.4 MB	Download
mir_st500_yourmt3_16k.tar.gz md5:db1f8f1fe5c50da17741bee0a43d6873	9.0 GB	Download
musicnet_yourmt3_16k.tar.gz md5:a2da7c169e26d452a4e8b9bef498b3d7	6.7 GB	Download
rwc_classical_yourmt3_16k.tar.gz md5:030995ec30e25dd66a9242af939a8375	531.6 MB	Download
rwc_pop_yourmt3_16k.tar.gz md5:55cfce257e6e2d527017797e0aea3416	753.1 MB	Download
urmp_yourmt3_16k.tar.gz md5:4f539c71678a77ba34f6dfca41072102	521.4 MB	Download

Additional details

Is derived from: Conference paper: 10.5281/zenodo.5120004 (DOI)

	All versions	This version
Views	2,112	53
Downloads	1,495	72
Data volume	7.1 TB	337.9 GB

MusicNet-16k + EM for YourMT3

Creators

Description

Files

LICENSE_musicnet_em.txt

Files (42.5 GB)

Additional details

Related works