Please use this identifier to cite or link to this item:
Title: Piano Music Generation with Deep Learning Transformer Models
Authors: Στάης, Άγγελος
Στάμου Γιώργος
Keywords: Artificial Neural Networks
Deep Neural Networks
Deep Learning
Symbolic Music
Transformer Models
Music Composition
Music Generation
Composition Assistance
Music Representation
Music Transformer
Perceiver AR
Issue Date: 13-Mar-2023
Abstract: William Wordsworth famously wrote in "The Solitary Reaper": "The music in my heart I bore, long after it was heard no more". The emotional impact of music, particularly piano music, has transcended cultural barriers and touched people from all walks of life for centuries. In recent years, the composition of music with the assistance of artificial intelligence has become a notable area of interest that has garnered significant attention. Among multiple deep learning models proposed, the Transformer has been a prominent approach for generating longer piano performances. This thesis delves into the capabilities of symbolic piano music generation with two noteworthy Transformer models, Music Transformer and Perceiver-AR. We explore training these models in six different datasets of various sizes and musical genres, generate large-scale number of outputs for each trained model and evaluate them objectively and subjectively. We investigate the impact of training datasets, model types and trained models on the music generation process. We also examine the correlations of objective and subjective evaluation metrics and propose a set of primary subjective quality indicators for music generated by artificial intelligence models. Finally, we suggest possible improvements and areas for future research.
Appears in Collections:Διπλωματικές Εργασίες - Theses

Files in This Item:
File Description SizeFormat 
final_ntua-ece-library.pdf5.19 MBAdobe PDFView/Open

Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.