Multi-band CNN architecture using adaptive frequency filter for acoustic event classification

Kim, Donghyeon; Park, Sangwook; Han, David K.; Ko, Hanseok

doi:10.1016/j.apacoust.2020.107579

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Multi-band CNN architecture using adaptive frequency filter for acoustic event classification

Authors: Kim, Donghyeon; Park, Sangwook; Han, David K.; Ko, Hanseok

Issue Date: 15-Jan-2021

Publisher: ELSEVIER SCI LTD

Keywords: Filter parameter training; Sub-band; Convolutional neural network; High energy frequency; Low energy feature vanishing

Citation: APPLIED ACOUSTICS, v.172

Indexed: SCIE
SCOPUS

Journal Title: APPLIED ACOUSTICS

Volume: 172

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/50108

DOI: 10.1016/j.apacoust.2020.107579

ISSN: 0003-682X

Abstract: Although Convolutional Neural Networks (CNNs) architecture based learning systems have shown impressive results in the performance of numerous classification tasks, their effectiveness has been limited in certain cases of acoustic based classification. This vulnerability is particularly evident in the acoustic event classification tasks using spectral features. For example, spectral based features may suffer from a typical normalization process when it is fed to a neural network for training since the magnitudes in high-frequency band are inadvertently attenuated even though they may yet contain useful discriminant features. Although some research efforts try to mitigate this problem by introducing a multi-band approach for attaining salient and stable features, it requires empirically preset frequency bands to separate the spectral features. Being heuristic, however, this process is difficult to ensure the consistency required for high correlation between manually separated features and good classification performance. In this paper, we propose a novel filter parameter modeling framework performing optimized frequency sub-band separation via CNN based end-to-end training for achieving high acoustic event classification performance. In particular, the filter response characteristics, namely, cut-off frequencies and damping ratio for roll off are considered as added learning parameters to the CNN architecture for the proposed end-to-end learning framework so that the filter's frequency response is optimized for producing salient features. The proposed training process is shown to not only automatically select the filter parameters for multi-band frequency separation but also guarantee high correlation between the resulting sub-band features and accurate classification performance. (C) 2020 Elsevier Ltd. All rights reserved.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Engineering > School of Electrical Engineering > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Ko, Han seok photo

Ko, Han seok: College of Engineering (School of Electrical Engineering)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :6,997,638; Today View :9,591

RSS_1.0 RSS_2.0 ATOM_1.0

145 Anam-ro, Seongbuk-gu, Seoul, 02841, Korea+82-2-3290-2963

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE