Dilated convolution and gated linear unit based sound event detection and tagging algorithm using weak label

Full metadata record
DC Field | Value | Language
dc.contributor.author | Park, Chungho | -
dc.contributor.author | Kim, Donghyun | -
dc.contributor.author | Ko, Hanseok | -
dc.date.accessioned | 2021-08-31T16:10:02Z | -
dc.date.available | 2021-08-31T16:10:02Z | -
dc.date.created | 2021-06-18 | -
dc.date.issued | 2020 | -
dc.identifier.issn | 1225-4428 | -
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/59028 | -
dc.description.abstract | In this paper, we propose a Dilated Convolution Gated Linear Unit (DCGLU) to mitigate the lack of sparsity and the small receptive field caused by the segmentation map extraction process in sound event detection with weak labels. With the advent of deep learning frameworks, segmentation map extraction approaches have shown improved performance in noisy environments. However, because the model is built without pooling operations, these methods must keep the feature map at its original size in order to extract the segmentation map. As a result, their performance deteriorates due to a lack of sparsity and a small receptive field. To mitigate these problems, we use a Gated Linear Unit (GLU) to control the flow of information and Dilated Convolutional Neural Networks (DCNNs) to enlarge the receptive field without additional learnable parameters. For performance evaluation, we use the URBAN-SED dataset and a bird sound dataset that we assembled ourselves. The experiments show that the proposed DCGLU model outperforms the baselines. In particular, our method is shown to be robust against natural sound noise at three Signal to Noise Ratio (SNR) levels (20 dB, 10 dB and 0 dB). | -
dc.language | Korean | -
dc.language.iso | ko | -
dc.publisher | ACOUSTICAL SOC KOREA | -
dc.subject | CLASSIFICATION | -
dc.title | Dilated convolution and gated linear unit based sound event detection and tagging algorithm using weak label | -
dc.type | Article | -
dc.contributor.affiliatedAuthor | Ko, Hanseok | -
dc.identifier.doi | 10.7776/ASK.2020.39.5.414 | -
dc.identifier.wosid | 000594710300005 | -
dc.identifier.bibliographicCitation | JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, v.39, no.5, pp.414 - 423 | -
dc.relation.isPartOf | JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | -
dc.citation.title | JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | -
dc.citation.volume | 39 | -
dc.citation.number | 5 | -
dc.citation.startPage | 414 | -
dc.citation.endPage | 423 | -
dc.type.rims | ART | -
dc.type.docType | Article | -
dc.identifier.kciid | ART002628514 | -
dc.description.journalClass | 1 | -
dc.description.journalRegisteredClass | scopus | -
dc.description.journalRegisteredClass | kci | -
dc.relation.journalResearchArea | Acoustics | -
dc.relation.journalWebOfScienceCategory | Acoustics | -
dc.subject.keywordPlus | CLASSIFICATION | -
dc.subject.keywordAuthor | Audio tagging | -
dc.subject.keywordAuthor | Sound event detection | -
dc.subject.keywordAuthor | Dilated convolution | -
dc.subject.keywordAuthor | Gated linear unit | -
dc.subject.keywordAuthor | T-f segmentation map | -
dc.subject.keywordAuthor | Weak label | -
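
The abstract's two ideas, dilation to widen the receptive field without extra parameters and a GLU to gate the information flow, can be illustrated with a small sketch. This is not the authors' DCGLU architecture: it is a pure-Python, one-dimensional toy under assumed settings (stride 1, odd kernel size, zero padding, kernel size 3, dilations 1, 2, 4), and every function name below is hypothetical.

```python
import math


def receptive_field(kernel_size, dilations):
    """Receptive field (in frames) of stacked stride-1 convolutions."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


def conv_param_count(kernel_size, in_channels, out_channels):
    """Weights plus biases of one convolution layer.

    Dilation is not an argument: it changes which inputs are read,
    not how many weights the layer has.
    """
    return kernel_size * in_channels * out_channels + out_channels


def dilated_conv1d(signal, kernel, dilation):
    """Zero-padded 1-D dilated convolution (cross-correlation form).

    Assumes an odd kernel length. The output has the same length as the
    input, i.e. the feature-map size is maintained without pooling.
    """
    k = len(kernel)
    center = (k - 1) // 2
    out = []
    for t in range(len(signal)):
        acc = 0.0
        for j in range(k):
            idx = t + (j - center) * dilation
            if 0 <= idx < len(signal):
                acc += kernel[j] * signal[idx]
        out.append(acc)
    return out


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def glu(linear, gate):
    """Gated linear unit: elementwise linear * sigmoid(gate)."""
    return [a * sigmoid(b) for a, b in zip(linear, gate)]


if __name__ == "__main__":
    # Three plain layers versus three dilated layers, same parameter count.
    print(receptive_field(3, [1, 1, 1]), receptive_field(3, [1, 2, 4]))
    # Gate one dilated-convolution branch with another (toy kernels).
    x = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
    features = dilated_conv1d(x, [1.0, 1.0, 1.0], dilation=2)
    gates = dilated_conv1d(x, [0.5, 0.0, -0.5], dilation=2)
    print(glu(features, gates))
```

With these assumed settings, three stacked kernel-3 layers see 7 frames without dilation and 15 frames with dilations 1, 2, 4, while `conv_param_count` is identical in both cases.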
Appears in Collections: College of Engineering > School of Electrical Engineering > 1. Journal Articles