Organizational Unit:
School of Biological Sciences

Research Organization Registry ID
Description
Previous Names
Parent Organization
Parent Organization
Organizational Unit
Includes Organization(s)

Publication Search Results

Now showing 1 - 5 of 5
Thumbnail Image
Item

Unsupervised and semi-supervised training methods for eukaryotic gene prediction

2008-11-17 , Ter-Hovhannisyan, Vardges

This thesis describes new gene finding methods for eukaryotic gene prediction. The current methods for deriving model parameters for gene prediction algorithms are based on curated or experimentally validated set of genes or gene elements. These training sets often require time and additional expert efforts especially for the species that are in the initial stages of genome sequencing. Unsupervised training allows determination of model parameters from anonymous genomic sequence with. The importance and the practical applicability of the unsupervised training is critical for ever growing rate of eukaryotic genome sequencing. Three distinct training procedures are developed for diverse group of eukaryotic species. GeneMark-ES is developed for species with strong donor and acceptor site signals such as Arabidopsis thaliana, Caenorhabditis elegans and Drosophila melanogaster. The second version of the algorithm, GeneMark-ES-2, introduces enhanced intron model to better describe the gene structure of fungal species with posses with relatively weak donor and acceptor splice sites and well conserved branch point signal. GeneMark-LE, semi-supervised training approach is designed for eukaryotic species with small number of introns. The results indicate that the developed unsupervised training methods perform well as compared to other training methods and as estimated from the set of genes supported by EST-to-genome alignments. Analysis of novel genomes reveals interesting biological findings and show that several candidates of under-annotated and over-annotated fungal species are present in the current set of annotated of fungal genomes.

Thumbnail Image
Item

Heuristic and self-training methods for improving gene prediction in prokaryotes

2002-12 , Besemer, John David

Thumbnail Image
Item

Improving gene annotation of complete viral genomes

2003-08 , Mills, Ryan E.

Thumbnail Image
Item

Making sense of cDNA : automated annotation, storing in an interactive database, mapping to genomic DNA

2002-08 , Shmeleva, Nataliya V.

Thumbnail Image
Item

Developing new computational methods for characterization ORFS with unknown function

2003-05 , Michino, Mayako