Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Automatic generation of structured hyperdocuments from document images

Authors
Lee, JYPark, JSByun, HMoon, JLee, SW
Issue Date
2월-2002
Publisher
PERGAMON-ELSEVIER SCIENCE LTD
Keywords
structured hyperdocument; multi-column document; document conversion; document image understanding; logical structure analysis
Citation
PATTERN RECOGNITION, v.35, no.2, pp.485 - 503
Indexed
SCIE
SCOPUS
Journal Title
PATTERN RECOGNITION
Volume
35
Number
2
Start Page
485
End Page
503
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/124388
DOI
10.1016/S0031-3203(01)00026-7
ISSN
0031-3203
Abstract
As sharing documents through the World Wide Web has been recently and constantly increasing, the need for creating hyperdocuments to make them accessible and retrievable via the internet, in formats such as HTML and SGML/XML, has also been rapidly rising. Nevertheless, only a few works have been done on the conversion of paper documents into hyperdocuments. Moreover, most of these studies have concentrated on the direct conversion of single-column document images that include only text and image objects. In this paper, we propose two methods for converting complex multi-column document images into HTML documents, and a method for generating a structured table of contents page based on the logical structure analysis of the document image. Experiments with various kinds of multi-column document images show that, by using the proposed methods, their corresponding HTML documents can be generated in the same visual layout as that of the document images, and their structured table of contents page can be also produced with the hierarchically ordered section titles hyperlinked to the contents. (C) 2001 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Seong Whan photo

Lee, Seong Whan
인공지능학과
Read more

Altmetrics

Total Views & Downloads

BROWSE