Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Parameter-free geometric document layout analysis

Full metadata record
DC Field Value Language
dc.contributor.authorLee, SW-
dc.contributor.authorRyu, DS-
dc.date.accessioned2021-09-09T08:48:56Z-
dc.date.available2021-09-09T08:48:56Z-
dc.date.created2021-06-19-
dc.date.issued2001-11-
dc.identifier.issn0162-8828-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/123629-
dc.description.abstractAutomatic transformation of paper documents into electronic documents requires geometric document layout analysis at the first stage. However, variations in character font sizes, text line spacing, and document layout structures have made it difficult to design a general-purpose document layout analysis algorithm for many years. The use of some parameters has therefore been unavoidable in previous methods. In this paper, we propose a parameter-free method for segmenting the document images into maximal homogeneous regions and identifying them as texts, images, tables, and ruling lines. A pyramidal quadtree structure is constructed for multiscale analysis and a periodicity measure is suggested to find a periodical attribute of text regions for page segmentation. To obtain robust page segmentation results, a confirmation procedure using texture analysis is applied to only ambiguous regions. Based on the proposed periodicity measure, multiscale analysis, and confirmation procedure, we could develop a robust method for geometric document layout analysis independent of character font sizes, text line spacing, and document layout structures. The proposed method was experimented with the document database from the University of Washington and the MediaTeam Document Database. The results of these tests have shown that the proposed method provides more accurate results than the previous ones.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherIEEE COMPUTER SOC-
dc.subjectSEGMENTATION-
dc.titleParameter-free geometric document layout analysis-
dc.typeArticle-
dc.contributor.affiliatedAuthorLee, SW-
dc.identifier.scopusid2-s2.0-0035510433-
dc.identifier.wosid000172108300003-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, v.23, no.11, pp.1240 - 1256-
dc.relation.isPartOfIEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE-
dc.citation.titleIEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE-
dc.citation.volume23-
dc.citation.number11-
dc.citation.startPage1240-
dc.citation.endPage1256-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordPlusSEGMENTATION-
dc.subject.keywordAuthorgeometric document layout analysis-
dc.subject.keywordAuthorparameter-free method-
dc.subject.keywordAuthorperiodicity estimation-
dc.subject.keywordAuthormultiscale analysis-
dc.subject.keywordAuthorpage segmentation-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Seong Whan photo

Lee, Seong Whan
인공지능학과
Read more

Altmetrics

Total Views & Downloads

BROWSE