Organizational Unit:
School of Biological Sciences

Research Organization Registry ID
Description
Previous Names
Parent Organization
Parent Organization
Organizational Unit
Includes Organization(s)

Publication Search Results

Now showing 1 - 5 of 5
Thumbnail Image
Item

Origin and evolution of eukaryotic gene sequences derived from transposable elements

2008-06-09 , Piriyapongsa, Jittima

My dissertation encompasses five different studies that are linked by a common theme the investigation of transposable element (TE) contributions to eukaryotic gene sequences. A detailed analysis of exonization events of LTR elements in the human genome shows the preference towards the fixation of LTR elements in gene untranslated regions, which supports the existing concept of a major role of LTR elements as a natural source of regulatory sequences. The ability of different classes of sequence similarity search methods to detect TE-derived sequences was evaluated. In general, the different search methods are found to be complementary, and combined search approaches are needed to systematically check any data set for all potential TE-associated coding sequences. On average, TE-derived exon sequences have low protein coding potential. In particular, non-coding TEs, are frequently exonized but unlikely to encode protein sequences. Many of these non-coding exonized TEs may be actually involved in gene regulation via the formation of double stranded RNA complexes with complementary TE-derived exons. The investigation of the relationship between human miRNAs and TEs shows that 55 experimentally verified human miRNA genes (~12%) originated from TEs. Overall, TE-derived miRNA genes are less conserved than non TE-derived miRNAs. The potential regulatory and functional significance of TE-derived miRNAs was explored. An ab initio prediction algorithm I developed was used to discover putative cases of novel TE-derived miRNA genes. A miRNA gene family, hsa-mir-548, was found to be derived from Made1 family of MITEs. The palindromic structure of the Made1 elements, and MITEs in general, points to a specific mechanism by which these sequences can be recognized and processed by the miRNA biogenesis pathway. MITEs may also represent an evolutionary link between siRNAs and miRNAs. An original model for a siRNA-to-miRNA evolutionary transition mediated by DNA-type TEs is proposed. This model is supported by the presence of evolutionary intermediate TE sequences that encode both siRNAs and miRNAs in the Arabidopsis and rice genomes. The siRNA-to-miRNA evolutionary transition is representative of a number of other regulatory mechanisms that evolved to silence TEs and were later co-opted to serve as regulators of host gene expression.

Thumbnail Image
Item

Exonization of the LTR transposable elements in human genome

2007-08-28 , Piriyapongsa, Jittima , Polavarapu, Nalini , Borodovsky, Mark , McDonald, John F.

Background: Retrotransposons have been shown to contribute to evolution of both structure and regulation of protein coding genes. It has been postulated that the primary mechanism by which retrotransposons contribute to structural gene evolution is through insertion into an intron or a gene flanking region, and subsequent incorporation into an exon. Results: We found that Long Terminal Repeat (LTR) retrotransposons are associated with 1,057 human genes (5.8%). In 256 cases LTR retrotransposons were observed in protein-coding regions, while 50 distinct protein coding exons in 45 genes were comprised exclusively of LTR RetroTransposon Sequence (LRTS). We go on to reconstruct the evolutionary history of an alternatively spliced exon of the Interleukin 22 receptor, alpha 2 gene (IL22RA2) derived from a sequence of retrotransposon of the Mammalian apparent LTR retrotransposons (MaLR) family. Sequencing and analysis of the homologous regions of genomes of several primates indicate that the LTR retrotransposon was inserted into the IL22RA2 gene at least prior to the divergence of Apes and Old World monkeys from a common ancestor (~25 MYA). We hypothesize that the recruitment of the part of LTR as a novel exon in great ape species occurred prior to the divergence of orangutans and humans from a common ancestor (~14 MYA) as a result of a single mutation in the proto-splice site. Conclusion: Our analysis of LRTS exonization events has shown that the patterns of LRTS distribution in human exons support the hypothesis that LRTS played a significant role in human gene evolution by providing cis-regulatory sequences; direct incorporation of LTR sequences into protein coding regions was observed less frequently. Combination of computational and experimental approaches used for tracing the history of the LTR exonization process of IL22RA2 gene presents a promising strategy that could facilitate further studies of transposon initiated gene evolution.

Thumbnail Image
Item

Dual coding of siRNAs and miRNAs by Plant Transposable Elements

2008-03 , Piriyapongsa, Jittima , Jordan, I. King

We recently proposed a specific model whereby miRNAs encoded from short nonautonomous DNA-type TEs known as MITEs evolved from corresponding ancestral full-length (autonomous) elements that originally encoded short interfering (siRNAs). Our miRNA-origins model predicts that evolutionary intermediates may exist as TEs that encode both siRNAs and miRNAs, and we analyzed Arabidopsis thaliana and Oryza sativa (rice) genomic sequence and expression data to test this prediction. We found a number of examples of individual plant TE insertions that encode both siRNAs and miRNAs. We show evidence that these dual coding TEs can be expressed as readthrough transcripts from the intronic regions of spliced RNA messages. These TE transcripts can fold to form the hairpin (stem–loop) structures characteristic of miRNA genes along with longer doublestranded RNA regions that typically are processed as siRNAs. Taken together with a recent study showing Drosha independent processing of miRNAs from Drosophila introns, our results indicate that ancestral miRNAs could have evolved from TEs prior to the full elaboration of the miRNA biogenesis pathway. Later, as the specific miRNA biogenesis pathway evolved, and numerous other expressed inverted repeat regions came to be recognized by the miRNA processing endonucleases, the host gene-related regulatory functions of miRNAs emerged. In this way, host genomes were afforded an additional level of regulatory complexity as a by-product of TE defense mechanisms. The siRNA-to-miRNA evolutionary transition is representative of a number of other regulatory mechanisms that evolved to silence TEs and were later co-opted to serve as regulators of host gene expression.

Thumbnail Image
Item

A family of human microRNA genes from miniature-inverted repeat transposable elements

2007-02-14 , Piriyapongsa, Jittima , Jordan, I. King

While hundreds of novel microRNA (miRNA) genes have been discovered in the last few years alone, the origin and evolution of these non-coding regulatory sequences remain largely obscure. In this report, we demonstrate that members of a recently discovered family of human miRNA genes, hsa-mir-548, are derived from Made1 transposable elements. Made1 elements are short miniature inverted-repeat transposable elements (MITEs), which consist of two 37 base pair (bp) terminal inverted repeats that flank 6 bp of internal sequence. Thus, Made1 elements are nearly perfect palindromes, and when expressed as RNA they form highly stable hairpin loops. Apparently, these Made1-related structures are recognized by the RNA interference enzymatic machinery and processed to form 22 bp mature miRNA sequences. Consistent with their origin from MITEs, hsa-mir- 548 genes are primate-specific and have many potential paralogs in the human genome. There are more than 3,500 putative hsa-mir-548 target genes; analysis of their expression profiles and functional affinities suggests cancer-related regulatory roles for hsa-mir-548. Taken together, the characteristics of Made1 elements, and MITEs in general, point to a specific mechanism for the generation of numerous small regulatory RNAs and target sites throughout the genome. The evolutionary lineage-specific nature of MITEs could also provide for the generation of novel regulatory phenotypes related to species diversification. Finally, we propose that MITEs may represent an evolutionary link between siRNAs and miRNAs.

Thumbnail Image
Item

Evaluating the protein coding potential of exonized transposable element sequences

2007-11-26 , Piriyapongsa, Jittima , Rutledge, Mark T. , Patel, Sanil , Borodovsky, Mark , Jordan, I. King

Background: Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results: We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: 1-a profile-based hidden Markov model (HMM) approach, 2-BLAST methods and 3-RepeatMasker. Profile based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average. In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion: The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggests that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence.