TY - JOUR
T1 - Large vocabulary automatic chord estimation using bidirectional long short-term memory recurrent neural network with even chance training
AU - Deng, Junqi
AU - Kwok, Yu Kwong
N1 - Publisher Copyright: © 2017 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2018/1/1
Y1 - 2018/1/1
AB - This paper presents an argument for the necessity of a large vocabulary in automatic chord recognition systems, on the grounds of the requirements of machine musicianship. It proposes a system framework with a skewed class-sensitive training scheme that leads to a preliminary solution to large vocabulary automatic chord estimation. This framework applies a bidirectional long short-term memory recurrent neural network architecture, which employs an ‘even chance’ training scheme to make up for the lack of uncommon chords’ exposure. The main drawback of this approach is the low segmentation quality, which inevitably lowers the upper bound of chord estimation accuracy. Under a large vocabulary evaluation, the proposed system can significantly outperform the baseline system in terms of the overall weighted chord symbol recall, and there is no significant difference between them in terms of average chord quality accuracy. The results demonstrate preliminary success in our approach, and also prove the even chance training scheme to be effective in boosting uncommon chord symbol recalls as well as the average chord quality accuracy.
KW - Music information retrieval
KW - automatic chord estimation
KW - deep learning
KW - large vocabulary
KW - recurrent neural network
UR - https://www.scopus.com/pages/publications/85032688233
U2 - 10.1080/09298215.2017.1367820
DO - 10.1080/09298215.2017.1367820
M3 - Article
AN - SCOPUS:85032688233
SN - 0929-8215
VL - 47
SP - 53
EP - 67
JO - Journal of New Music Research
JF - Journal of New Music Research
IS - 1
ER -