Doctor of Philosophy with a Major in Music Technology

Permanent Link

https://hdl.handle.net/1853/73273

Series Type

Degree Series

Associated Organization(s)

Organizational Unit

School of Music

Publication Search Results

Learning to manipulate latent representations of deep generative models

(Georgia Institute of Technology, 2021-01-14) Pati, Kumar Ashis

Deep generative models have emerged as a tool of choice for the design of automatic music composition systems. While these models are capable of learning complex representations from data, a limitation of many of these models is that they allow little to no control over the generated music. Latent representation-based models, such as Variational Auto-Encoders, have the potential to alleviate this limitation as they are able to encode hidden attributes of the data in a low-dimensional latent space. However, the encoded attributes are often not interpretable and cannot be explicitly controlled. The work presented in this thesis seeks to address these challenges by learning to manipulate and design latent spaces in a way that allows control over musically meaningful attributes that are understandable by humans. This in turn can allow explicit control of such attributes during the generation process and help users realize their compositional goals. Specifically, three different approaches are proposed to investigate this problem. The first approach shows that we can learn to traverse latent spaces of generative models to perform complex interactive music composition tasks. The second approach uses a novel latent space regularization technique which can encode individual musical attributes along specific dimensions of the latent space. The third approach attempts to use attribute-informed non-linear transformations over an existing latent space such that the transformed latent space allows controllable generation of data. In addition, the problem of disentanglement learning in the context of symbolic music is investigated systematically by proposing a tailor-made dataset for the task and evaluating the performance of several different methods for unsupervised and supervised disentanglement learning. 
Together, the proposed methods will help address critical shortcomings of deep music generative models and pave the path towards intuitive interfaces which can be used by humans in real compositional settings.
