Title:
Topics in sequence analysis

dc.contributor.advisor Houdré, Christian
dc.contributor.author Ma, Jinyong en_US
dc.contributor.committeeMember Bakhtin, Yuri
dc.contributor.committeeMember Foley, Robert
dc.contributor.committeeMember Koltchinskii, Vladimir
dc.contributor.committeeMember Matzinger, Heinrich
dc.contributor.department Mathematics en_US
dc.date.accessioned 2013-01-17T21:59:31Z
dc.date.available 2013-01-17T21:59:31Z
dc.date.issued 2012-11-12 en_US
dc.description.abstract This thesis studies two topics in sequence analysis. In the first part, we investigate the large deviations of the shape of the random RSK Young diagrams, associated with a random word of size n whose letters are independently drawn from an alphabet of size m=m(n). When the letters are drawn uniformly and when both n and m converge together to infinity, m not growing too fast with respect to n, the large deviations of the shape of the Young diagrams are shown to be the same as that of the spectrum of the traceless GUE. Since the length of the top row of the Young diagrams is the length of the longest (weakly) increasing subsequence of the random word, the corresponding large deviations follow. When the letters are drawn with non-uniform probability, a control of both highest probabilities will ensure that the length of the top row of the diagrams satisfies a large deviation principle. In either case, both speeds and rate functions are identified. To complete our study, non-asymptotic concentration bounds for the length of the top row of the diagrams, are obtained for both models. In the second part, we investigate the order of the r-th, 1<= r < +∞, central moment of the length of the longest common subsequence of two independent random words of size n whose letters are identically distributed and independently drawn from a finite alphabet. When all but one of the letters are drawn with small probabilities, which depend on the size of the alphabet, the r-th central moment is shown to be of order n^{r/2}. In particular, when r=2, we get the order of the variance of the longest common subsequence. en_US
dc.description.degree PhD en_US
dc.identifier.uri http://hdl.handle.net/1853/45908
dc.publisher Georgia Institute of Technology en_US
dc.subject Longest common subsequence en_US
dc.subject Young diagrams en_US
dc.subject Large deviations en_US
dc.subject.lcsh Sequential analysis
dc.subject.lcsh Representations of groups
dc.title Topics in sequence analysis en_US
dc.type Text
dc.type.genre Dissertation
dspace.entity.type Publication
local.contributor.advisor Houdré, Christian
local.contributor.corporatename College of Sciences
local.contributor.corporatename School of Mathematics
relation.isAdvisorOfPublication 1fcd2323-5c4e-4e86-92a2-574f8decf21e
relation.isOrgUnitOfPublication 85042be6-2d68-4e07-b384-e1f908fae48a
relation.isOrgUnitOfPublication 84e5d930-8c17-4e24-96cc-63f5ab63da69
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
ma_jinyong_201212_phd.pdf
Size:
358.69 KB
Format:
Adobe Portable Document Format
Description: