혐오와 대항: 혐오표현 탐지 모델 평가를 위한 대항표현 데이터셋 구축

박하율; 박현아; 송상헌

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

혐오와 대항: 혐오표현 탐지 모델 평가를 위한 대항표현 데이터셋 구축

Full metadata record

DC Field	Value	Language
dc.contributor.author	박하율	-
dc.contributor.author	박현아	-
dc.contributor.author	송상헌	-
dc.date.accessioned	2022-06-11T16:40:38Z	-
dc.date.available	2022-06-11T16:40:38Z	-
dc.date.created	2022-06-10	-
dc.date.issued	2022	-
dc.identifier.issn	1226-5691	-
dc.identifier.uri	https://scholar.korea.ac.kr/handle/2021.sw.korea/142042	-
dc.description.abstract	This study argues for the necessity of a Korean counter-speech dataset for ethical and effective hate speech detection research. Counter-speech is a response to online hate in order to stop the spread of hate speech and is considered an alternative approach to deleting and blocking. However, since counter-speech often employs offensive language or linguistic structures similar to hate speech, even the state-of-the-art hate speech detection models usually classify it as hate speech. This false positive bias risks silencing the language of minorities and their allies. However, the evaluation of Korean hate speech detection models remains untouched due to the absence of a Korean counter-speech dataset. Thus, we introduce the first Korean counter-speech dataset with annotations about target groups. We then tested a Korean hate speech detection model with our dataset, revealing a significant drop in the model’s accuracy from 97.9% to 42.7%.	-
dc.language	Korean	-
dc.language.iso	ko	-
dc.publisher	담화·인지언어학회	-
dc.title	혐오와 대항: 혐오표현 탐지 모델 평가를 위한 대항표현 데이터셋 구축	-
dc.title.alternative	Countering the hatred: The counter-speech dataset in Korean for evaluating hate speech detection models	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	송상헌	-
dc.identifier.bibliographicCitation	담화와 인지, v.29, no.2, pp.1 - 23	-
dc.relation.isPartOf	담화와 인지	-
dc.citation.title	담화와 인지	-
dc.citation.volume	29	-
dc.citation.number	2	-
dc.citation.startPage	1	-
dc.citation.endPage	23	-
dc.type.rims	ART	-
dc.identifier.kciid	ART002841065	-
dc.description.journalClass	2	-
dc.description.journalRegisteredClass	kci	-
dc.subject.keywordAuthor	hate speech detection	-
dc.subject.keywordAuthor	counter-speech	-
dc.subject.keywordAuthor	language model	-
dc.subject.keywordAuthor	ethics in NLP	-

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Liberal Arts > Department of Linguistics > 1. Journal Articles

Show simple item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :8,469,166; Today View :45,580

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE