跳至主導覽 跳至搜尋 跳過主要內容

HaplotypeCN: Copy number haplotype inference with hidden markov model and localized haplotype clustering

  • Yen Jen Lin
  • , Yu Tin Chen
  • , Shu Ni Hsu
  • , Chien Hua Peng
  • , Chuan Yi Tang
  • , Tzu Chen Yen
  • , Wen Ping Hsieh
  • National Tsing Hua University
  • Chang Gung Memorial Hospital
  • Providence University Taiwan

研究成果: 期刊稿件文章同行評審

2 引文 斯高帕斯(Scopus)

摘要

Copy number variation (CNV) has been reported to be associated with disease and various cancers. Hence, identifying the accurate position and the type of CNV is currently a critical issue. There are many tools targeting on detecting CNV regions, constructing haplotype phases on CNV regions, or estimating the numerical copy numbers. However, none of them can do all of the three tasks at the same time. This paper presents a method based on Hidden Markov Model to detect parent specific copy number change on both chromosomes with signals from SNP arrays. A haplotype tree is constructed with dynamic branch merging to model the transition of the copy number status of the two alleles assessed at each SNP locus. The emission models are constructed for the genotypes formed with the two haplotypes. The proposed method can provide the segmentation points of the CNV regions as well as the haplotype phasing for the allelic status on each chromosome. The estimated copy numbers are provided as fractional numbers, which can accommodate the somatic mutation in cancer specimens that usually consist of heterogeneous cell populations. The algorithm is evaluated on simulated data and the previously published regions of CNV of the 270 HapMap individuals. The results were compared with five popular methods: PennCNV, genoCN, COKGEN, QuantiSNP and cnvHap. The application on oral cancer samples demonstrates how the proposed method can facilitate clinical association studies. The proposed algorithm exhibits comparable sensitivity of the CNV regions to the best algorithm in our genome-wide study and demonstrates the highest detection rate in SNP dense regions. In addition, we provide better haplotype phasing accuracy than similar approaches. The clinical association carried out with our fractional estimate of copy numbers in the cancer samples provides better detection power than that with integer copy number states.

原文英語
文章編號e96841
期刊PLoS ONE
9
發行號5
DOIs
出版狀態已出版 - 21 05 2014
對外發佈

UN SDG

此研究成果有助於以下永續發展目標

  1. SDG3 健康與福祉
    SDG3 健康與福祉

指紋

深入研究「HaplotypeCN: Copy number haplotype inference with hidden markov model and localized haplotype clustering」主題。共同形成了獨特的指紋。

引用此