International Journal of Computational Intelligence Systems

Volume 12, Issue 2, 2019, Pages 1622 - 1634

Music Emotion Recognition by Using Chroma Spectrogram and Deep Visual Features

Authors
Mehmet Bilal Er*, Ibrahim Berkan Aydilek
Department of Computer Engineering, Faculty of Engineering, Harran University, 63050 Sanlıurfa, Turkey
*Corresponding author. Email: bilal.er@harran.edu.tr
Corresponding Author
Mehmet Bilal Er
Received 21 May 2019, Accepted 3 December 2019, Available Online 20 December 2019.
DOI
10.2991/ijcis.d.191216.001How to use a DOI?
Keywords
Music emotion recognition; Deep learning; Deep features; Chroma spectrogram; AlexNet; VGG-16
Abstract

Music has a great role and importance in human life since it has the ability to trigger or convey feelings. As recognizing music emotions is the subject of many studies conducted in many disciplines like science, psychology, musicology and art, it has attracted the attention of researchers as an up-to-date research topic in recent years. Many researchers extract acoustic features from music and investigate relations between emotional tags corresponding to these features. In recent studies, on the other hand, music types are classified emotionally by using deep learning through music spectrograms that involved both time and frequency domain information. In the present study, a new method is presented for music emotion recognition by employing pre-trained deep learning model with chroma spectrograms extracted from music recordings. The AlexNet architecture is used as the pre-trained network model. The conv5, Fc6, Fc7 and Fc8 layers of the AlexNet model are chosen as the feature extracting layer, and deep visual features are extracted from these layers. The extracted deep features are used to train and test the Support Vector Machines (SVM) and the Softmax classifiers. Besides, deep visual features are extracted from conv5_3, Fc6, Fc7 and Fc8 layers of the VGG-16 deep network model and the same experimental applications are made in order to find out the effective power of pre-trained deep networks in music emotion recognition. Several experiments are conducted on two datasets, and better results are obtained with the proposed method. The best result is obtained from the VGG-16 in the Fc7 layer as 89.2% on our dataset. According to the obtained results, it is observed that the presented method performs better.

Copyright
© 2019 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)
View full text (HTML)

Journal
International Journal of Computational Intelligence Systems
Volume-Issue
12 - 2
Pages
1622 - 1634
Publication Date
2019/12/20
ISSN (Online)
1875-6883
ISSN (Print)
1875-6891
DOI
10.2991/ijcis.d.191216.001How to use a DOI?
Copyright
© 2019 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - Mehmet Bilal Er
AU  - Ibrahim Berkan Aydilek
PY  - 2019
DA  - 2019/12/20
TI  - Music Emotion Recognition by Using Chroma Spectrogram and Deep Visual Features
JO  - International Journal of Computational Intelligence Systems
SP  - 1622
EP  - 1634
VL  - 12
IS  - 2
SN  - 1875-6883
UR  - https://doi.org/10.2991/ijcis.d.191216.001
DO  - 10.2991/ijcis.d.191216.001
ID  - BilalEr2019
ER  -