Title:
Characterizing the redundancy of universal source coding for finite-length sequences

dc.contributor.advisor Fekri, Faramarz
dc.contributor.author Beirami, Ahmad en_US
dc.contributor.committeeMember Barry, John
dc.contributor.committeeMember Sivakumar, Raghupathy
dc.contributor.committeeMember Steven W McLaughlin
dc.contributor.department Electrical and Computer Engineering en_US
dc.date.accessioned 2013-01-17T20:47:37Z
dc.date.available 2013-01-17T20:47:37Z
dc.date.issued 2011-12-16 en_US
dc.description.abstract In this thesis, we first study what is the average redundancy resulting from the universal compression of a single finite-length sequence from an unknown source. In the universal compression of a source with d unknown parameters, Rissanen demonstrated that the expected redundancy for regular codes is asymptotically d/2 log n + o(log n) for almost all sources, where n is the sequence length. Clarke and Barron also derived the asymptotic average minimax redundancy for memoryless sources. The average minimax redundancy is concerned with the redundancy of the worst parameter vector for the best code. Thus, it does not provide much information about the effect of the different source parameter values. Our treatment in this thesis is probabilistic. In particular, we derive a lower bound on the probability measure of the event that a sequence of length n from an FSMX source chosen using Jeffreys' prior is compressed with a redundancy larger than a certain fraction of d/2 log n. Further, our results show that the average minimax redundancy provides good estimate for the average redundancy of most sources for large enough n and d. On the other hand, when the source parameter d is small the average minimax redundancy overestimates the average redundancy for small to moderate length sequences. Additionally, we precisely characterize the average minimax redundancy of universal coding when the coding scheme is restricted to be from the family of two--stage codes, where we show that the two--stage assumption incurs a negligible redundancy for small and moderate length n unless the number of source parameters is small. %We show that redundancy is significant in the compression of small sequences. Our results, collectively, help to characterize the non-negligible redundancy resulting from the compression of small and moderate length sequences. Next, we apply these results to the compression of a small to moderate length sequence provided that the context present in a sequence of length M from the same source is memorized. We quantify the achievable performance improvement in the universal compression of the small to moderate length sequence using context memorization. en_US
dc.description.degree MS en_US
dc.identifier.uri http://hdl.handle.net/1853/45750
dc.publisher Georgia Institute of Technology en_US
dc.subject Information theory en_US
dc.subject Minimum en_US
dc.subject Parameter estimation en_US
dc.subject.lcsh Source code (Computer science)
dc.subject.lcsh Computer programs
dc.subject.lcsh Communication
dc.title Characterizing the redundancy of universal source coding for finite-length sequences en_US
dc.type Text
dc.type.genre Thesis
dspace.entity.type Publication
local.contributor.advisor Fekri, Faramarz
local.contributor.corporatename School of Electrical and Computer Engineering
local.contributor.corporatename College of Engineering
relation.isAdvisorOfPublication f46b53e9-5ee4-4646-b71c-8bd223cd31c8
relation.isOrgUnitOfPublication 5b7adef2-447c-4270-b9fc-846bd76f80f2
relation.isOrgUnitOfPublication 7c022d60-21d5-497c-b552-95e489a06569
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
Beirami_Ahmad_201108_MS.pdf.pdf
Size:
391.67 KB
Format:
Adobe Portable Document Format
Description: