Abstract

Data is crucial in many computing fields, including Music Information Retrieval (MIR), an interdisciplinary area bridging computer science and music. This paper introduces CCMusic, an open and diverse database comprising multiple publicly available datasets for different MIR tasks. Most of the datasets are designed for tasks related to Chinese music, reflecting our focus on this culturally rich domain. The label 'general' is reserved for datasets that cover a broader spectrum of music, typically Western music. The database integrates both published (evaluated) and unpublished (unevaluated) datasets: for the former, our contribution is integrating them into the database, while for the latter, we provide comprehensive evaluations to ensure data reliability. The raw materials from which the datasets are processed are sourced from the ccmusic-database platform.

Cite

@dataset{zhaorui_liu_2021_5676893,
  author       = {Monan Zhou and Shenyang Xu and Zhaorui Liu and Zhaowen Wang and Feng Yu and Wei Li and Baoqiang Han},
  title        = {CCMusic: an Open and Diverse Database for Chinese and General Music Information Retrieval Research},
  month        = {mar},
  year         = {2024},
  publisher    = {HuggingFace},
  version      = {1.2},
  url          = {https://huggingface.co/ccmusic-database}
}