혐오와 대항: 혐오표현 탐지 모델 평가를 위한 대항표현 데이터셋 구축
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 박하율 | - |
dc.contributor.author | 박현아 | - |
dc.contributor.author | 송상헌 | - |
dc.date.accessioned | 2022-06-11T16:40:38Z | - |
dc.date.available | 2022-06-11T16:40:38Z | - |
dc.date.created | 2022-06-10 | - |
dc.date.issued | 2022 | - |
dc.identifier.issn | 1226-5691 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/142042 | - |
dc.description.abstract | This study argues that a Korean counter-speech dataset is necessary for ethical and effective hate speech detection research. Counter-speech is a response to online hate that aims to stop the spread of hate speech, and it is considered an alternative to deleting and blocking. Because counter-speech often employs offensive language or linguistic structures similar to those of hate speech, even state-of-the-art hate speech detection models usually classify it as hate speech. This false positive bias risks silencing the language of minorities and their allies. Nevertheless, Korean hate speech detection models have not been evaluated for this bias because no Korean counter-speech dataset exists. We therefore introduce the first Korean counter-speech dataset, annotated with target groups. We then tested a Korean hate speech detection model on our dataset and found that its accuracy dropped sharply, from 97.9% to 42.7%. | - |
dc.language | Korean | - |
dc.language.iso | ko | - |
dc.publisher | 담화·인지언어학회 (The Discourse and Cognitive Linguistics Society of Korea) | - |
dc.title | 혐오와 대항: 혐오표현 탐지 모델 평가를 위한 대항표현 데이터셋 구축 | - |
dc.title.alternative | Countering the hatred: The counter-speech dataset in Korean for evaluating hate speech detection models | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | 송상헌 | - |
dc.identifier.bibliographicCitation | 담화와 인지 (Discourse and Cognition), v.29, no.2, pp.1-23 | - |
dc.relation.isPartOf | 담화와 인지 (Discourse and Cognition) | - |
dc.citation.title | 담화와 인지 (Discourse and Cognition) | - |
dc.citation.volume | 29 | - |
dc.citation.number | 2 | - |
dc.citation.startPage | 1 | - |
dc.citation.endPage | 23 | - |
dc.type.rims | ART | - |
dc.identifier.kciid | ART002841065 | - |
dc.description.journalClass | 2 | - |
dc.description.journalRegisteredClass | kci | - |
dc.subject.keywordAuthor | hate speech detection | - |
dc.subject.keywordAuthor | counter-speech | - |
dc.subject.keywordAuthor | language model | - |
dc.subject.keywordAuthor | ethics in NLP | - |