01whole.pdf (1.35 MB)
Download file

The Mel-frequency cepstrum coefficient for music emotion recognition in machine learning

Download (1.35 MB)
thesis
posted on 28.03.2022, 17:29 by Ai-Phee Chris Yong
The Mel-frequency Cepstrum Coefficient (MFCC), a technique designed initially for speech analysis, has in recent years become very popular in music emotion recognition projects. MFCC uses the Mel scaling method to simulate human auditory properties, logarithmic noise reduction techniques, and the Discrete Cosine Transformation (DCT) to generalise all salient features, without losing critical information. These techniques, while applicable to speech analysis, may not always be suitable for music analysis. We suggest, in Music Emotion Recognition (MER) analysis, spectral and temporal (which have a deep historical foundation) should be the more relevant features to use. We propose extracting three feature types, MFCC, Spectral, and Temporal, from the clips of songs in the '1000 songs' dataset to train a simple Artificial neural network (ANN). The trained ANN model will subsequently be able to predict the emotion value of songs. The prediction error is calculated based on the predicted value and actual annotated value. The feature that produces the lowest prediction error is judged as the most suitable feature for MER. Our results show that spectral features produced the lowest error, whereas MFCC produced the highest prediction error; this suggests that MFCC may not be a suitable feature for MER.

History

Table of Contents

1. Introduction -- 2. Music features -- 3. Emotion models -- 4. Mel-frequency cepstrum coefficient -- 5. Temporal and spectral properties -- 6. Existing work in music emotion recognition -- 7. The experiment -- 8. Results and discussion -- 9. References.

Notes

Bibliography: pages 45-49 Empirical thesis.

Awarding Institution

Macquarie University

Degree Type

Thesis MRes

Degree

MRes, Macquarie University, Faculty of Science and Engineering, Department of Computing

Department, Centre or School

Department of Computing

Year of Award

2019

Principal Supervisor

Malcolm Ryan

Rights

Copyright Ai-Phee Chris Yong 2019. Copyright disclaimer: http://mq.edu.au/library/copyright

Language

English

Extent

1 online resource (x, 47 pages) diagrams, graphs, tables

Former Identifiers

mq:71131 http://hdl.handle.net/1959.14/1271177