Title:
Addressing the data challenge in automatic drum transcription with labeled and unlabeled data

dc.contributor.advisor Lerch, Alexander
dc.contributor.author Wu, Chih-Wei
dc.contributor.committeeMember Clements, Mark
dc.contributor.committeeMember Davidson, Grant
dc.contributor.committeeMember Freeman, Jason
dc.contributor.committeeMember Hsu, Timothy
dc.contributor.committeeMember Weinberg, Gil
dc.contributor.department Music
dc.date.accessioned 2019-01-16T17:21:30Z
dc.date.available 2019-01-16T17:21:30Z
dc.date.created 2018-12
dc.date.issued 2018-07-23
dc.date.submitted December 2018
dc.date.updated 2019-01-16T17:21:30Z
dc.description.abstract Automatic Drum Transcription (ADT) is a sub-task of automatic music transcription that involves the conversion of drum-related audio events into musical notations. While noticeable progress has been made in the past by combining pattern recognition methods with audio signal processing techniques, many systems are still impeded by the lack of a meaningful amount of labeled data to support the data-driven algorithms. To address this data challenge in ADT, this work presents three approaches. First, a dataset for ADT tasks is created using a semi-automatic process that minimizes the workload of human annotators. Second, an ADT system that requires minimum training data is designed to account for the presence of other instruments (e.g., non-percussive or pitched instruments). Third, the possibility of improving generic ADT systems with a large amount of unlabeled data from online resources is explored. The main contributions of this work include the introduction of a new ADT dataset, the methods for realizing ADT systems under the constraint of data insufficiency, and a scheme for data-driven methods to benefit from the abundant online resources and might have impact on other audio and music related tasks traditionally impeded by small amounts of labeled data.
dc.description.degree Ph.D.
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/60721
dc.language.iso en_US
dc.publisher Georgia Institute of Technology
dc.subject Automatic drum transcription
dc.subject Music information retrieval
dc.subject Pattern recognition
dc.subject Audio signal processing
dc.title Addressing the data challenge in automatic drum transcription with labeled and unlabeled data
dc.type Text
dc.type.genre Dissertation
dspace.entity.type Publication
local.contributor.corporatename College of Design
local.contributor.corporatename School of Music
local.relation.ispartofseries Doctor of Philosophy with a Major in Music Technology
relation.isOrgUnitOfPublication c997b6a0-7e87-4a6f-b6fc-932d776ba8d0
relation.isOrgUnitOfPublication 92d2daaa-80f2-4d99-b464-ab7c1125fc55
relation.isSeriesOfPublication d1fd9079-2e93-4803-98c8-8365a28e0761
thesis.degree.level Doctoral
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
WU-DISSERTATION-2018.pdf
Size:
3.9 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
3.86 KB
Format:
Plain Text
Description: