UNION: An efficient mapping tool using UniMark with non-overlapping interval indexing strategy

  • Che Lun Hung*
  • , Chun Yuan Lin
  • , Yu Chen Hu
  • *Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    NGS has become a popular research field in biologists because it was able to produce inexpensive and accuracy short biology sequences very fast. NGS technique has been improved to produce long length sequences, more than 100bp, recently with the same quality, accuracy and speed. Thus, tools for short sequences may be not suitable for long length sequences. We propose a new tool called UNION for re-sequencing applications by mapping long length sequences to a reference genome. UNION uses the UniMarker with a non-overlapping interval indexing strategy and a tool, CORAL, to do sequence alignments. For the experiments we randomly cut ten thousands sequences with a length of 512bp from the genome of Trichomonas and also produce mutations/sequence errors for these sequences to simulate different similarities. UNION has been compared with GMAP in terms of speed and accuracy and achieves better performance than that of GMAP.

    Original languageEnglish
    Title of host publicationDatabase Theory and Application, Bio-Science and Bio-Technology -Int. Conf. DTA and BSBT 2011,Held as Part of the Future Generation Inf. Tech. Conf. FGIT 2011, in Conjunction. with GDC 2011,Proc.
    Pages187-196
    Number of pages10
    DOIs
    StatePublished - 2011
    Event2011 Database Theory and Application,DTA 2011 and Bio-Science and Bio-Technology,BSBT 2011, Held as Part of the Future Generation Information Tech. Conf.FGIT 2011, in Conjunction with GDC 2011 - Jeju Island, Korea, Republic of
    Duration: 08 12 201110 12 2011

    Publication series

    NameCommunications in Computer and Information Science
    Volume258 CCIS
    ISSN (Print)1865-0929

    Conference

    Conference2011 Database Theory and Application,DTA 2011 and Bio-Science and Bio-Technology,BSBT 2011, Held as Part of the Future Generation Information Tech. Conf.FGIT 2011, in Conjunction with GDC 2011
    Country/TerritoryKorea, Republic of
    CityJeju Island
    Period08/12/1110/12/11

    Keywords

    • Genome mapping
    • NGS
    • UniMaker
    • re-seqencing

    Fingerprint

    Dive into the research topics of 'UNION: An efficient mapping tool using UniMark with non-overlapping interval indexing strategy'. Together they form a unique fingerprint.

    Cite this