Exploiting deep neural network and long short-term memory method-ologies in bioacoustic classification of LPC-based features

Cihun Siyong Alex Gong*, Chih Hui Simon Su, Kuo Wei Chao, Yi Chu Chao, Chin Kai Su, Wei Hang Chiu

*Corresponding author for this work

Research output: Contribution to journalJournal Article peer-review

7 Scopus citations

Abstract

The research describes the recognition and classification of the acoustic characteristics of amphibians using deep learning of deep neural network (DNN) and long short-term memory (LSTM) for biological applications. First, original data is collected from 32 species of frogs and 3 species of toads commonly found in Taiwan. Secondly, two digital filtering algorithms, linear predictive coding (LPC) and Mel-frequency cepstral coefficient (MFCC), are respectively used to collect amphibian bioacoustic features and construct the datasets. In addition, principal component analysis (PCA) algorithm is applied to achieve dimensional reduction of the training model datasets. Next, the classification of amphibian bioacoustic features is accomplished through the use of DNN and LSTM. The Pytorch platform with a GPU processor (NVIDIA GeForce GTX 1050 Ti) realizes the calculation and recognition of the acoustic feature classification results. Based on above-mentioned two algorithms, the sound feature datasets are classified and effectively summarized in several classification result tables and graphs for presentation. The results of the classification experiment of the different features of bioacoustics are verified and discussed in detail. This research seeks to extract the optimal combination of the best recognition and classification algorithms in all experimental processes.

Original languageEnglish
Article numbere0259140
JournalPLoS ONE
Volume16
Issue number12 December
DOIs
StatePublished - 12 2021

Bibliographical note

Publisher Copyright:
© 2021 Gong et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Fingerprint

Dive into the research topics of 'Exploiting deep neural network and long short-term memory method-ologies in bioacoustic classification of LPC-based features'. Together they form a unique fingerprint.

Cite this