ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences

Yao Pei, Marcus Ho Hin Shum, Yunshi Liao, Vivian W. Leung, Yu Nong Gong, David K. Smith, Xiaole Yin, Yi Guan, Ruibang Luo, Tong Zhang, Tommy Tsan Yuk Lam*

*此作品的通信作者

研究成果: 期刊稿件文章同行評審

摘要

BACKGROUND: Emergence of antibiotic resistance in bacteria is an important threat to global health. Antibiotic resistance genes (ARGs) are some of the key components to define bacterial resistance and their spread in different environments. Identification of ARGs, particularly from high-throughput sequencing data of the specimens, is the state-of-the-art method for comprehensively monitoring their spread and evolution. Current computational methods to identify ARGs mainly rely on alignment-based sequence similarities with known ARGs. Such approaches are limited by choice of reference databases and may potentially miss novel ARGs. The similarity thresholds are usually simple and could not accommodate variations across different gene families and regions. It is also difficult to scale up when sequence data are increasing.

RESULTS: In this study, we developed ARGNet, a deep neural network that incorporates an unsupervised learning autoencoder model to identify ARGs and a multiclass classification convolutional neural network to classify ARGs that do not depend on sequence alignment. This approach enables a more efficient discovery of both known and novel ARGs. ARGNet accepts both amino acid and nucleotide sequences of variable lengths, from partial (30-50 aa; 100-150 nt) sequences to full-length protein or genes, allowing its application in both target sequencing and metagenomic sequencing. Our performance evaluation showed that ARGNet outperformed other deep learning models including DeepARG and HMD-ARG in most of the application scenarios especially quasi-negative test and the analysis of prediction consistency with phylogenetic tree. ARGNet has a reduced inference runtime by up to 57% relative to DeepARG.

CONCLUSIONS: ARGNet is flexible, efficient, and accurate at predicting a broad range of ARGs from the sequencing data. ARGNet is freely available at https://github.com/id-bioinfo/ARGNet , with an online service provided at https://ARGNet.hku.hk . Video Abstract.

原文英語
文章編號84
頁(從 - 到)84
期刊Microbiome
12
發行號1
DOIs
出版狀態已出版 - 09 05 2024

文獻附註

© 2024. The Author(s).

指紋

深入研究「ARGNet: using deep neural networks for robust identification and classification of antibiotic resistance genes from sequences」主題。共同形成了獨特的指紋。

引用此