Phraseological Analysis of Learner Corpus Based on Language Model
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 송상헌 | - |
dc.date.accessioned | 2022-04-09T13:41:08Z | - |
dc.date.available | 2022-04-09T13:41:08Z | - |
dc.date.created | 2022-04-08 | - |
dc.date.issued | 2018-02 | - |
dc.identifier.issn | 12267430 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/139786 | - |
dc.description.abstract | The present study addresses how Englishexpressions produced by Korean native speakers are close to common expressions usedby English native speakers. To this end, this article provides a quantitative study of theYonsei English Learner Corpus using a skill set derived from computational linguistics. The focus of the current work is on a language model of English texts written by Koreanuniversity students. A language model refers to a collection of logarithmic N-gramsdescribed in the ARPA format, and this model serves to discriminate native-likesentences from awkward sentences. The present study compares a language modelacquired from an L2 corpus to the other language models acquired from two L1 corporain English: namely, English Gigaword and Europarl. The present study utilizes GeniaSentence Splitter to separate the sentences and SRILM to create the language models ina computationally tractable way. On the one hand, a deep analysis of N-grams ispresented. This analysis consists of two subtasks. First, the N-grams are tallied andevaluated using common metrics of computational linguistics. Second, as an evaluation ofthe language model, the perplexity of each language model is measured and comparedto a reference point drawn from five test data sources. On the other hand, an analysis | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | 한국언어정보학회 | - |
dc.title | Phraseological Analysis of Learner Corpus Based on Language Model | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | 송상헌 | - |
dc.identifier.bibliographicCitation | 언어와 정보, v.22, no.1, pp.123 - 152 | - |
dc.relation.isPartOf | 언어와 정보 | - |
dc.citation.title | 언어와 정보 | - |
dc.citation.volume | 22 | - |
dc.citation.number | 1 | - |
dc.citation.startPage | 123 | - |
dc.citation.endPage | 152 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 2 | - |
dc.description.journalRegisteredClass | kci | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.