TSSUNet-MB – ab initio identification of σ70 promoter transcription start sites in Escherichia coli using deep multitask learning

  • Chung En Ni
  • , Duy Phuong Doan
  • , Yen Jung Chiu
  • , Yen Hua Huang*
  • *Corresponding author for this work

Research output: Contribution to journalJournal Article peer-review

Abstract

Motivation: Computational promoter prediction (CPP) tools designed to classify prokaryotic promoter regions usually assume that a transcription start site (TSS) is located at a predefined position within each promoter region. Such CPP tools are sensitive to any positional shifting of the TSS in a windowed region, and they are unsuitable for determining the boundaries of prokaryotic promoters. Results: TSSUNet-MB is a deep learning model developed to identify the TSSs of σ70 promoters. Mononucleotide and bendability were used to encode input sequences. TSSUNet-MB outperforms other CPP tools when assessed using the sequences obtained from the neighborhood of real promoters. TSSUNet-MB achieved a sensitivity of 0.839 and specificity of 0.768 on sliding sequences, while other CPP tool cannot maintain both sensitivities and specificities in a compatible range. Furthermore, TSSUNet-MB can precisely predict the TSS position of σ70 promoter-containing regions with a 10-base accuracy of 77.6%. By leveraging the sliding window scanning approach, we further computed the confidence score of each predicted TSS, which allows for more accurately identifying TSS locations. Our results suggest that TSSUNet-MB is a robust tool for finding σ70 promoters and identifying TSSs.

Original languageEnglish
Article number107904
JournalComputational Biology and Chemistry
Volume105
DOIs
StatePublished - 08 2023
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2023 The Authors

Keywords

  • Prokaryotic transcription start site
  • Promoter
  • σ

Fingerprint

Dive into the research topics of 'TSSUNet-MB – ab initio identification of σ70 promoter transcription start sites in Escherichia coli using deep multitask learning'. Together they form a unique fingerprint.

Cite this