Temporal attention based animal sound classification
- Authors
- Kim, Jungmin; Lee, Younglo; Kim, Donghyeon; Ko, Hanseok
- Issue Date
- 2020
- Publisher
- ACOUSTICAL SOC KOREA
- Keywords
- Audio event classification; Convolution Neural Network (CNN); Self-attention; Gated Linear Unit (GLU)
- Citation
- JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, v.39, no.5, pp.406 - 413
- Indexed
- SCOPUS; KCI
- Journal Title
- JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA
- Volume
- 39
- Number
- 5
- Start Page
- 406
- End Page
- 413
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/59024
- DOI
- 10.7776/ASK.2020.39.5.406
- ISSN
- 1225-4428
- Abstract
- To improve the classification accuracy of bird and amphibian sounds, we use a Gated Linear Unit (GLU) and self-attention, which encourage the network to extract important features from the data and to pick out the relevant frames from the whole input sequence. We first convert the 1-D acoustic signal to a log-Mel spectrogram. The GLU then reduces undesirable components of the log-Mel spectrogram, such as background noise. Finally, we apply the proposed temporal self-attention to further improve classification accuracy. The data consist of 6 species of birds and 8 species of amphibians, including endangered species, in the natural environment. The proposed method achieves an accuracy of 91 % on the bird data and 93 % on the amphibian data, an improvement of about 6 % to 7 % over existing algorithms.
- Appears in Collections
- College of Engineering > School of Electrical Engineering > 1. Journal Articles
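The abstract describes a pipeline of log-Mel spectrogram, GLU gating, and temporal self-attention, but gives no architectural details. The following is a minimal NumPy sketch of the two named building blocks, not the authors' network. Everything specific in it is an assumption made for illustration: the random array standing in for a log-Mel spectrogram, the shapes `T`, `F`, `D`, the random weight matrices, the function names, and the choice of scaled dot-product attention followed by mean pooling over time.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def glu(x, w_lin, w_gate):
    """Gated linear unit: a linear path scaled elementwise by a sigmoid gate in (0, 1)."""
    return (x @ w_lin) * sigmoid(x @ w_gate)

def temporal_self_attention(h, w_q, w_k):
    """Scaled dot-product self-attention across time frames (assumed form).

    h: (T, D) frame features. Returns a clip-level embedding (D,) and a
    per-frame importance vector (T,) that sums to 1.
    """
    q, k = h @ w_q, h @ w_k
    scores = q @ k.T / np.sqrt(q.shape[-1])   # (T, T) frame-to-frame similarity
    weights = softmax(scores, axis=-1)        # each row sums to 1
    context = weights @ h                     # (T, D) attention-weighted frames
    frame_importance = weights.mean(axis=0)   # average attention each frame receives
    return context.mean(axis=0), frame_importance

# Hypothetical sizes: T frames, F mel bins, D hidden units.
rng = np.random.default_rng(0)
T, F, D = 100, 40, 16
log_mel = rng.normal(size=(T, F))             # placeholder for a real log-Mel spectrogram
w_lin = rng.normal(scale=0.1, size=(F, D))
w_gate = rng.normal(scale=0.1, size=(F, D))
w_q = rng.normal(scale=0.1, size=(D, D))
w_k = rng.normal(scale=0.1, size=(D, D))

gated = glu(log_mel, w_lin, w_gate)
clip_embedding, frame_importance = temporal_self_attention(gated, w_q, w_k)
print(clip_embedding.shape, frame_importance.shape)
```

In a trained model the gate would learn to suppress time-frequency regions dominated by background noise, and the attention weights would concentrate on frames containing the animal call; with random weights, as here, the sketch only demonstrates the shapes and the data flow.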