Hyperdocument generation using OCR and icon detection

In this contribution we consider the construction of hyperdocuments; converting scanned paper documents into electronic hypertext. Hyperlink creation is automated by analyzing the structure and content of the scanned document. The focus is on hyperlinks between the text and labels in a picture. A number of tools for such hyperlink detection are described. Practical results are presented.

[1]  Atsuhiro Takasu,et al.  A collaborative supporting method between document processing and hypertext construction , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[2]  Andreas Myka,et al.  Using electronic facsimiles of documents for automatic reconstruction of underlying hypertext structures , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[3]  Rainer Hoch,et al.  From paper to office document standard representation , 1992, Computer.

[4]  Mahesh Viswanathan,et al.  A prototype document image analysis system for technical journals , 1992, Computer.

[5]  Marcel Worring,et al.  An ODA/Dexter Hyperdocument System with Automated Link Definition , 1997, VDB.

[6]  Haruo Asada,et al.  Major components of a complete text reading system , 1992 .

[7]  Ian R. Campbell-Grant Introducing ODA , 1991 .