Comparative gene prediction based on gene structure conservation

Shu Ju Hsieh*, Chun Yuan Lin, Ning Han Liu, Chuan Yi Tang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Identifying protein coding genes is one of most important task in newly sequenced genomes. With increasing numbers of gene annotations verified by experiments, it is feasible to identify genes in newly sequenced genomes by comparing with genes annotated on phylogenetically close organisms. Here, we propose a program, GeneAlign, which predicts the genes on one sequence by measuring the similarity between the predicted sequence and related genes annotated on another genome. The program applies CORAL, a heuristic linear time alignment tool, to determine whether the regions flanked by candidate signals are similar with the annotated exons or not. The approach, which employs the conservation of gene structures and sequence homologies between protein coding regions, increases the prediction accuracy. GeneAlign was tested on Projector data set of 449 human-mouse homologous sequence pairs. At the gene level, the sensitivity and specificity of GeneAlign are 80%, and larger than 96% at the exon level.

Original languageEnglish
Title of host publicationPattern Recognition in Bioinformatics - International Workshop, PRIB 2006, Proceedings
PublisherSpringer Verlag
Pages32-41
Number of pages10
ISBN (Print)3540374469, 9783540374466
DOIs
StatePublished - 2006
Externally publishedYes
EventInternational Workshop on Pattern Recognition in Bioinformatics, PRIB 2006 - Hong Kong, China
Duration: 20 08 200620 08 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4146 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Workshop on Pattern Recognition in Bioinformatics, PRIB 2006
Country/TerritoryChina
CityHong Kong
Period20/08/0620/08/06

Fingerprint

Dive into the research topics of 'Comparative gene prediction based on gene structure conservation'. Together they form a unique fingerprint.

Cite this