Decoding Imagined Speech Based on Deep Metric Learning for Intuitive BCI Communication

Lee, Dong-Yeon; Lee, Minji; Lee, Seong-Whan

doi:10.1109/TNSRE.2021.3096874

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Decoding Imagined Speech Based on Deep Metric Learning for Intuitive BCI Communication

Authors: Lee, Dong-Yeon; Lee, Minji; Lee, Seong-Whan

Issue Date: 2021

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Electroencephalography; Entropy; Feature extraction; Imagined speech; Neural networks; Scalability; Speech recognition; Training; brain-computer interface; deep metric learning; instantaneous frequency; spectral entropy

Citation: IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, v.29, pp.1363 - 1374

Indexed: SCIE
SCOPUS

Journal Title: IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING

Volume: 29

Start Page: 1363

End Page: 1374

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/138494

DOI: 10.1109/TNSRE.2021.3096874

ISSN: 1534-4320

Abstract: Imagined speech is a highly promising paradigm due to its intuitive application and multiclass scalability in the field of brain-computer interfaces. However, optimal feature extraction and classifiers have not yet been established. Furthermore, retraining still requires a large number of trials when new classes are added. The aim of this study is (i) to increase the classification performance for imagined speech and (ii) to apply a new class using a pretrained classifier with a small number of trials. We propose a novel framework based on deep metric learning that learns the distance by comparing the similarity between samples. We also applied the instantaneous frequency and spectral entropy used for speech signals to electroencephalography signals during imagined speech. The method was evaluated on two public datasets (6-class Coretto DB and 5-class BCI Competition DB). We achieved a 6-class accuracy of 45.00 +/- 3.13% and a 5-class accuracy of 48.10 +/- 3.68% using the proposed method, which significantly outperformed state-of-the-art methods. Additionally, we verified that the new class could be detected through incremental learning with a small number of trials. As a result, the average accuracy is 44.50 +/- 0.26% for Coretto DB and 47.12 +/- 0.27% for BCI Competition DB, which shows similar accuracy to baseline accuracy without incremental learning. Our results have shown that the accuracy can be greatly improved even with a small number of trials by selecting appropriate features from imagined speech. The proposed framework could be directly used to help construct an extensible intuitive communication system based on brain-computer interfaces.

Files in This Item: There are no files associated with this item.

Appears in Collections: Graduate School > Department of Artificial Intelligence > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Lee, Seong Whan photo

Lee, Seong Whan: 인공지능학과

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :9,537,565; Today View :1,979

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE