Title:
The evaluation of the stability of acoustic features in affective conveyance across multiple emotional databases

dc.contributor.author Sun, Rui
dc.contributor.committeeMember Moore, Elliot, II
dc.contributor.committeeMember Clements, Mark A.
dc.contributor.committeeMember Wu, Hongwei
dc.contributor.committeeMember Walker, Bruce N.
dc.contributor.committeeMember Tannenbaum, Allen R.
dc.contributor.department Electrical and Computer Engineering
dc.date.accessioned 2013-09-20T13:25:23Z
dc.date.available 2013-09-20T13:25:23Z
dc.date.created 2013-08
dc.date.issued 2013-05-20
dc.date.submitted August 2013
dc.date.updated 2013-09-20T13:25:23Z
dc.description.abstract The objective of the research presented in this thesis was to systematically investigate the computational structure for cross-database emotion recognition. The research consisted of evaluating the stability of acoustic features, particularly the glottal and Teager Energy based features, and investigating three normalization methods and two data fusion techniques. One of the challenges of cross-database training and testing is accounting for the potential variation in the types of emotions expressed as well as the recording conditions. In an attempt to alleviate the impact of these types of variations, three normalization methods on the acoustic data were studied. Motivated by the lack of large and diverse enough emotional database to train the classifier, using multiple databases to train posed another challenge: data fusion. This thesis proposed two data fusion techniques, pre-classification SDS and post-classification ROVER to study the issue. Using the glottal, TEO and TECC features, of which the stability of emotion distinguishing ability has been highlighted on multiple databases, the systematic computational structure proposed in this thesis could improve the performance of cross-database binary-emotion recognition by up to 23% for neutral vs. emotional and 10% for positive vs. negative.
dc.description.degree Ph.D.
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/49041
dc.language.iso en_US
dc.publisher Georgia Institute of Technology
dc.subject Emotion recognition in speech
dc.subject Cross-database evaluation
dc.subject.lcsh Language and emotions
dc.subject.lcsh Emotions
dc.subject.lcsh Automatic speech recognition
dc.subject.lcsh Speech processing systems
dc.title The evaluation of the stability of acoustic features in affective conveyance across multiple emotional databases
dc.type Text
dc.type.genre Dissertation
dspace.entity.type Publication
local.contributor.corporatename School of Electrical and Computer Engineering
local.contributor.corporatename College of Engineering
relation.isOrgUnitOfPublication 5b7adef2-447c-4270-b9fc-846bd76f80f2
relation.isOrgUnitOfPublication 7c022d60-21d5-497c-b552-95e489a06569
thesis.degree.level Doctoral
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
SUN-DISSERTATION-2013.pdf
Size:
440.66 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
3.86 KB
Format:
Plain Text
Description: