Bird sounds classification by combining PNCC and robust Mel-log filter bank features

Full metadata record
DC Field | Value | Language
dc.contributor.author | Badi, Alzahra | -
dc.contributor.author | Ko, Kyungdeuk | -
dc.contributor.author | Ko, Hanseok | -
dc.date.accessioned | 2021-09-01T21:53:02Z | -
dc.date.available | 2021-09-01T21:53:02Z | -
dc.date.created | 2021-06-19 | -
dc.date.issued | 2019-01 | -
dc.identifier.issn | 1225-4428 | -
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/68433 | -
dc.description.abstract | This paper proposes combining features to improve the classification accuracy of sounds in noisy environments using a CNN (Convolutional Neural Network). A robust log Mel-filter bank, obtained using a Wiener filter, and PNCCs (Power Normalized Cepstral Coefficients) are extracted to form a 2-dimensional feature that is used as input to the CNN. An eBird database is used to classify 43 bird species in their natural environment. To evaluate the performance of the combined feature in noisy environments, the database is augmented with 3 types of noise at 4 SNRs (Signal-to-Noise Ratios): 20 dB, 10 dB, 5 dB, and 0 dB. The combined feature is compared with the log Mel-filter bank, with and without the Wiener filter, and with the PNCCs. The combined feature outperforms the other features in clean environments, with a 1.34 % increase in overall average accuracy. Additionally, accuracy in noisy environments across the 4 SNR levels increases by 1.06 % and 0.65 % for shop and schoolyard noise backgrounds, respectively. | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | ACOUSTICAL SOC KOREA | -
dc.title | Bird sounds classification by combining PNCC and robust Mel-log filter bank features | -
dc.type | Article | -
dc.contributor.affiliatedAuthor | Ko, Hanseok | -
dc.identifier.doi | 10.7776/ASK.2019.38.1.039 | -
dc.identifier.scopusid | 2-s2.0-85079185086 | -
dc.identifier.wosid | 000457557300005 | -
dc.identifier.bibliographicCitation | JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, v.38, no.1, pp.39 - 46 | -
dc.relation.isPartOf | JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | -
dc.citation.title | JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | -
dc.citation.volume | 38 | -
dc.citation.number | 1 | -
dc.citation.startPage | 39 | -
dc.citation.endPage | 46 | -
dc.type.rims | ART | -
dc.type.docType | Article | -
dc.identifier.kciid | ART002434508 | -
dc.description.journalClass | 1 | -
dc.description.journalRegisteredClass | scopus | -
dc.description.journalRegisteredClass | kci | -
dc.relation.journalResearchArea | Acoustics | -
dc.relation.journalWebOfScienceCategory | Acoustics | -
dc.subject.keywordAuthor | Acoustic event recognition | -
dc.subject.keywordAuthor | Environmental sound classification | -
dc.subject.keywordAuthor | CNN (Convolutional Neural Network) | -
dc.subject.keywordAuthor | Wiener filter | -
dc.subject.keywordAuthor | PNCCs (Power Normalized Cepstral Coefficients) | -
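
The abstract describes augmenting the database with background noise at four SNRs (20 dB, 10 dB, 5 dB, 0 dB) but this record does not say how the mixing was done. The following is a minimal sketch of one common approach, scaling the noise so that the ratio of clean power to noise power matches a target SNR. It is an illustration under stated assumptions, not the authors' procedure: the function names (`mean_power`, `mix_at_snr`, `measured_snr_db`), the 16 kHz sample rate, and the synthetic tone and Gaussian noise standing in for bird recordings and real backgrounds are all hypothetical. It uses only the Python standard library to stay dependency-free.

```python
import math
import random


def mean_power(signal):
    """Average power (mean of squared samples) of a sequence of floats."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(clean, noise, snr_db):
    """Return clean + gain * noise, with gain chosen to hit the target SNR in dB.

    Assumes both inputs have the same length and the noise is not all zeros.
    """
    if len(clean) != len(noise):
        raise ValueError("clean and noise must have the same length")
    p_clean = mean_power(clean)
    p_noise = mean_power(noise)
    if p_noise == 0.0:
        raise ValueError("noise has zero power")
    # SNR_dB = 10 * log10(p_clean / (gain**2 * p_noise)), solved for gain.
    gain = math.sqrt(p_clean / (p_noise * 10.0 ** (snr_db / 10.0)))
    return [c + gain * n for c, n in zip(clean, noise)]


def measured_snr_db(clean, mixture):
    """SNR in dB of a mixture, given the clean signal it was built from."""
    residual = [m - c for m, c in zip(mixture, clean)]
    return 10.0 * math.log10(mean_power(clean) / mean_power(residual))


rng = random.Random(0)
# Hypothetical stand-ins: a 440 Hz tone for the bird call and Gaussian
# noise for the background, one second each at an assumed 16 kHz rate.
clean = [math.sin(2.0 * math.pi * 440.0 * t / 16000.0) for t in range(16000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(16000)]

# One noisy copy per SNR level listed in the abstract.
augmented = {snr: mix_at_snr(clean, noise, snr) for snr in (20, 10, 5, 0)}
```

Calling `measured_snr_db(clean, augmented[snr])` for each level is a quick check that the mixture lands on the requested SNR. In practice the same scaling would be applied per recording and per noise type before feature extraction.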
Appears in Collections
College of Engineering > School of Electrical Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Ko, Han seok
College of Engineering (School of Electrical Engineering)