음악검색을 위한 가변임계치 기반의 음성 질의 변환 기법A Threshold Adaptation based Voice Query Transcription Scheme for Music Retrieval
- Other Titles
- A Threshold Adaptation based Voice Query Transcription Scheme for Music Retrieval
- Authors
- 한병준; 노승민; 황인준
- Issue Date
- 2010
- Publisher
- 대한전기학회
- Keywords
- Query-by-humming; Audio signal analysis; Music transcription; Note onset detection
- Citation
- 전기학회논문지ABCD, v.59, no.2, pp.445 - 451
- Journal Title
- 전기학회논문지ABCD
- Volume
- 59
- Number
- 2
- Start Page
- 445
- End Page
- 451
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/117412
- ISSN
- 1229-2443
- Abstract
- This paper presents a threshold adaptation based voice query transcription scheme for music information retrieval. The proposed scheme analyzes monophonic voice signal and generates its transcription for diverse music retrieval applications. For accurate transcription, we propose several advanced features including (i) Energetic Feature eXtractor (EFX) for onset, peak, and transient area detection; (ii) Modified Windowed Average Energy (MWAE) for defining multiple small but coherent windows with local threshold values as offset detector; and finally (iii) Circular Average Magnitude Difference Function (CAMDF) for accurate acquisition of fundamental frequency (F0) of each frame.
In order to evaluate the performance of our proposed scheme, we implemented a prototype music transcription system called AMT2 (Automatic Music Transcriber version 2) and carried out various experiments. In the experiment, we used QBSH corpus [1], adapted in MIREX 2006 contest data set. Experimental result shows that our proposed scheme can improve the transcription performance.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > School of Electrical Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.