Title:
Modifying sparse coding to model imbalanced datasets

dc.contributor.advisor Anderson, David V.
dc.contributor.author Whitaker, Bradley M.
dc.contributor.committeeMember Rozell, Christopher J.
dc.contributor.committeeMember Romberg, Justin K.
dc.contributor.committeeMember Li, Wing
dc.contributor.committeeMember Clifford, Gari D.
dc.contributor.department Electrical and Computer Engineering
dc.date.accessioned 2018-05-31T18:16:13Z
dc.date.available 2018-05-31T18:16:13Z
dc.date.created 2018-05
dc.date.issued 2018-04-06
dc.date.submitted May 2018
dc.date.updated 2018-05-31T18:16:13Z
dc.description.abstract The objective of this research is to explore the use of sparse coding as a tool for unsupervised feature learning to more effectively model imbalanced datasets. Traditional sparse coding dictionaries are learned by minimizing the average approximation error between a vector and its sparse decomposition. As such, these dictionaries may overlook important features that occur infrequently in the data. Without these features, it may be difficult to accurately classify between classes if one or more classes are not well-represented in the training data. To overcome this problem, this work explores novel modifications to the sparse coding dictionary learning framework that encourage dictionaries to learn anomalous features. Sparse coding also inherently assumes that a vector can be represented as a sparse linear combination of a feature set. This work addresses the ability of sparse coding to learn a representative dictionary when the underlying data has a nonlinear sparse structure. Finally, this work illustrates one benefit of improved signal modeling by utilizing sparse coding in three imbalanced classification tasks.
dc.description.degree Ph.D.
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/59919
dc.language.iso en_US
dc.publisher Georgia Institute of Technology
dc.subject Sparse coding
dc.subject Imbalanced data
dc.subject Machine learning
dc.title Modifying sparse coding to model imbalanced datasets
dc.type Text
dc.type.genre Dissertation
dspace.entity.type Publication
local.contributor.advisor Anderson, David V.
local.contributor.corporatename School of Electrical and Computer Engineering
local.contributor.corporatename College of Engineering
relation.isAdvisorOfPublication eefeec08-2c7a-4e05-9f4b-7d25059e20a0
relation.isOrgUnitOfPublication 5b7adef2-447c-4270-b9fc-846bd76f80f2
relation.isOrgUnitOfPublication 7c022d60-21d5-497c-b552-95e489a06569
thesis.degree.level Doctoral
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
WHITAKER-DISSERTATION-2018.pdf
Size:
955.55 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
3.87 KB
Format:
Plain Text
Description: