PBSCR: The Piano Bootleg Score Composer Recognition Dataset

Jain, Arhan; Bunn, Alec; Pham, Austin; Tsai, TJ

arXiv:2401.16803 (cs)

[Submitted on 30 Jan 2024 (v1), last revised 5 Aug 2024 (this version, v3)]

Abstract: This article motivates, describes, and presents the PBSCR dataset for studying composer recognition of classical piano music. Our goal was to design a dataset that facilitates large-scale research on composer recognition and is suitable for modern architectures and training practices. To achieve this goal, we utilize the abundance of sheet music images and rich metadata on IMSLP, use a previously proposed feature representation called a bootleg score to encode the location of noteheads relative to staff lines, and present the data in an extremely simple format (2D binary images) to encourage rapid exploration and iteration. The dataset itself contains 40,000 62x64 bootleg score images for a 9-class recognition task, 100,000 62x64 bootleg score images for a 100-class recognition task, and 29,310 unlabeled variable-length bootleg score images for pretraining. The labeled data is presented in a form that mirrors MNIST images, in order to make the data extremely easy to visualize and manipulate and to make model training efficient. We include relevant information to connect each bootleg score image with its underlying raw sheet music image, and we scrape, organize, and compile metadata from IMSLP on all piano works to facilitate multimodal research and allow for convenient linking to other datasets. We release baseline results in supervised and low-shot settings for future work to compare against, and we discuss open research questions that the PBSCR data is especially well suited to address.
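To make the "2D binary images" format concrete, the following is a minimal sketch that builds random placeholder arrays with the shapes stated in the abstract (62x64 images, 9 classes). It does not load PBSCR itself: the file layout of the release is not described here, so the helper name `make_placeholder_batch`, the 5% fill rate, and the choice of 62 as rows and 64 as columns are all assumptions made for illustration.

```python
import numpy as np

# Shapes taken from the abstract; treating 62 as rows and 64 as columns
# is an assumption, as is everything else in this placeholder.
HEIGHT, WIDTH = 62, 64
NUM_CLASSES = 9


def make_placeholder_batch(n, seed=0):
    """Return (images, labels) of random data with PBSCR-like shapes.

    Hypothetical helper: the pixels are random noise, not real bootleg scores.
    """
    rng = np.random.default_rng(seed)
    # Sparse binary images: roughly 5% of pixels set (arbitrary choice).
    images = (rng.random((n, HEIGHT, WIDTH)) < 0.05).astype(np.uint8)
    labels = rng.integers(0, NUM_CLASSES, size=n)
    return images, labels


images, labels = make_placeholder_batch(8)

# Flatten each image into one feature vector, as is common for MNIST-style data.
flat = images.reshape(len(images), -1)

# Binary pixels can be stored one bit each: 64 columns pack into 8 bytes per row.
packed = np.packbits(images, axis=-1)
restored = np.unpackbits(packed, axis=-1)
```

Because every pixel is 0 or 1, bit-packing is lossless, and the flattened form can be fed to any classifier that accepts fixed-length vectors.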

Comments:	19 pages, 6 figures, to be published in Transactions of the International Society for Music Information Retrieval
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2401.16803 [cs.SD]
	(or arXiv:2401.16803v3 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2401.16803

Submission history

From: T.J. Tsai
[v1] Tue, 30 Jan 2024 07:50:32 UTC (1,399 KB)
[v2] Wed, 7 Feb 2024 06:48:12 UTC (1,399 KB)
[v3] Mon, 5 Aug 2024 21:55:11 UTC (1,320 KB)
