TY - JOUR
T1 - Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments
AU - Lam, Tommy Tsan Yuk
AU - Chong, Yee Ling
AU - Shi, Mang
AU - Hon, Chung Chau
AU - Li, Jun
AU - Martin, Darren P.
AU - Tang, Julian Wei Tze
AU - Mok, Chee Keng
AU - Shih, Shin Ru
AU - Yip, Chi Wai
AU - Jiang, Jingwei
AU - Hui, Raymond Kin Hei
AU - Pybus, Oliver G.
AU - Holmes, Edward C.
AU - Leung, Frederick Chi Ching
PY - 2013/8
Y1 - 2013/8
N2 - Recombination plays an important role in shaping the genetic diversity of a number of DNA and RNA viruses. Although some recent studies have reported bioinformatic evidence of mosaic sequences in a variety of influenza A viruses, it remains controversial as to whether these represent bona fide natural recombination events or laboratory artifacts. Importantly, mosaic genome structures can create significant topological incongruence during phylogenetic analyses, which can mislead additional phylogeny-based molecular evolutionary analyses such as molecular clock dating, the detection of selection pressures and phylogeographic inference. As a result, there is a strong need for systematic screenings for mosaic structures within the influenza virus genome database. We used a combination of sequence-based and phylogeny-based methods to identify 388 mosaic influenza genomic segments, of which 332 are previously unreported and are significantly supported by phylogenetic methods. It is impossible, however, to ascertain whether these represent natural recombinants. To facilitate the future identification of recombinants, reference sets of non-recombinant sequences were selected for use in an automatic screening protocol for detecting mosaic sequences. Tests using real and simulated mosaic sequences indicate that our screening protocol is both sensitive (average >90%) and accurate (average >77%) enough to identify a range of different mosaic patterns. The relatively high prevalence of mosaic influenza virus sequences implies that efficient systematic screens, such as that proposed here, should be performed routinely to detect natural recombinant strains, potential laboratory artifacts, and sequencing contaminants either prior to sequences being deposited in GenBank or before they are used for phylogenetic analyses.
AB - Recombination plays an important role in shaping the genetic diversity of a number of DNA and RNA viruses. Although some recent studies have reported bioinformatic evidence of mosaic sequences in a variety of influenza A viruses, it remains controversial as to whether these represent bona fide natural recombination events or laboratory artifacts. Importantly, mosaic genome structures can create significant topological incongruence during phylogenetic analyses, which can mislead additional phylogeny-based molecular evolutionary analyses such as molecular clock dating, the detection of selection pressures and phylogeographic inference. As a result, there is a strong need for systematic screenings for mosaic structures within the influenza virus genome database. We used a combination of sequence-based and phylogeny-based methods to identify 388 mosaic influenza genomic segments, of which 332 are previously unreported and are significantly supported by phylogenetic methods. It is impossible, however, to ascertain whether these represent natural recombinants. To facilitate the future identification of recombinants, reference sets of non-recombinant sequences were selected for use in an automatic screening protocol for detecting mosaic sequences. Tests using real and simulated mosaic sequences indicate that our screening protocol is both sensitive (average >90%) and accurate (average >77%) enough to identify a range of different mosaic patterns. The relatively high prevalence of mosaic influenza virus sequences implies that efficient systematic screens, such as that proposed here, should be performed routinely to detect natural recombinant strains, potential laboratory artifacts, and sequencing contaminants either prior to sequences being deposited in GenBank or before they are used for phylogenetic analyses.
KW - Influenza A virus
KW - Mosaic pattern
KW - Mosaic screening protocol
KW - Recombination
UR - https://www.scopus.com/pages/publications/84880816395
U2 - 10.1016/j.meegid.2013.03.015
DO - 10.1016/j.meegid.2013.03.015
M3 - 文章
C2 - 23548803
AN - SCOPUS:84880816395
SN - 1567-1348
VL - 18
SP - 367
EP - 378
JO - Infection, Genetics and Evolution
JF - Infection, Genetics and Evolution
ER -