A robust proposal generation method for text lines in natural scene images
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Fan, Kun | - |
dc.contributor.author | Baek, Seung Jun | - |
dc.date.accessioned | 2021-09-02T07:37:26Z | - |
dc.date.available | 2021-09-02T07:37:26Z | - |
dc.date.created | 2021-06-16 | - |
dc.date.issued | 2018-08-23 | - |
dc.identifier.issn | 0925-2312 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/73734 | - |
dc.description.abstract | Motivated by the success of object proposal generation methods for object detection, we propose a novel method for generating text line proposals from natural scene images. Our strategy is to detect text regions which we define as part of text lines containing a whole character or transitions between two adjacent characters. We observe that, if we scale text regions to a small and fixed size, their image gradients exhibit certain patterns irrespective of text shapes and language types. Based on this observation, we propose simple features which consist of means and standard deviations of image gradients to train a Random Forest so as to detect text regions over multiple image scales and color channels. Text regions are then merged into text line candidates which are ranked based on the Random Forest responses combined with the shapes of the candidates, e.g., horizontally elongated candidates are given higher scores, because they are more likely to contain texts. Even though our method is trained on English, our experiments demonstrate that it achieves high recall with a few thousand good quality proposals on four standard benchmarks, including multi-language datasets. Following the One-to-One and Many-to-One detection criteria, our method achieves 91.6%, 87.4%, 92.1% and 97.9% recall on the ICDAR 2013 Robust Reading Dataset, Street View Text Dataset, Pan's multilingual Dataset and Sampled KAIST Scene Text Dataset respectively, with an average of less than 1250 proposals. (c) 2018 Elsevier B.V. All rights reserved. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | ELSEVIER | - |
dc.subject | RECOGNITION | - |
dc.subject | EXTRACTION | - |
dc.subject | GRADIENTS | - |
dc.title | A robust proposal generation method for text lines in natural scene images | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Baek, Seung Jun | - |
dc.identifier.doi | 10.1016/j.neucom.2018.03.041 | - |
dc.identifier.scopusid | 2-s2.0-85046823519 | - |
dc.identifier.wosid | 000432492800004 | - |
dc.identifier.bibliographicCitation | NEUROCOMPUTING, v.304, pp.47 - 63 | - |
dc.relation.isPartOf | NEUROCOMPUTING | - |
dc.citation.title | NEUROCOMPUTING | - |
dc.citation.volume | 304 | - |
dc.citation.startPage | 47 | - |
dc.citation.endPage | 63 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.subject.keywordPlus | RECOGNITION | - |
dc.subject.keywordPlus | EXTRACTION | - |
dc.subject.keywordPlus | GRADIENTS | - |
dc.subject.keywordAuthor | Scene text detection | - |
dc.subject.keywordAuthor | Feature extraction | - |
dc.subject.keywordAuthor | Text line proposals | - |
dc.subject.keywordAuthor | Random Forest | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
145 Anam-ro, Seongbuk-gu, Seoul, 02841, Korea+82-2-3290-2963
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.