Image-based music genre classification using convolutional neural networks

  • Juan Sebastian Gallego Villamarin (Student)

Student thesis: Master's Thesis

Abstract

This dissertation presents a full investigation into the application of machine learning, specifically Convolutional Neural Networks (CNNs), in the music genre classification using vinyl album covers. Leveraging the latest advancements in deep learning and computer vision, the study introduces two models: one employing batch normalization and another utilizing Concept Whitening (CW) techniques to enhance model interpretability. The main objectives are to evaluate these models' classification accuracy and interpretability. Using robust evaluation parameters, both models exhibit good accuracy rates in classifying music genres based on vinyl album covers. Notably, the concept whitening model adds another layer of interpretability, unraveling the black-box features found in standard neural networks. Empirical findings indicate that concept whitening enhances model interpretability and competes effectively in predictive performance. This project serves in the pursuit of reliable and transparent image-based music genre classification systems. By comparing the two models on both accuracy and interpretability fronts, the study shines a light on the viability of incorporating concept whitening into standard CNN architectures for more explainable AI applications.
Date of Award18 Oct 2023
Original languageEnglish
Awarding Institution
  • Universidade Católica Portuguesa
SupervisorNicolò Bertani (Supervisor)

Keywords

  • Convolutional neural networks
  • Music genre classification
  • Image-based classification
  • Deep learning
  • Interpretability
  • Explainability

Designation

  • Mestrado em Análise de Dados para Gestão

Cite this

'