Towards a canonical and structured representation of PDF documents through reverse engineering
[1] David F. Brailsford, et al. Document analysis of PDF files: methods, results and implications, 1995.
[2] Anjo Anjewierden, et al. Automatic indexing of documents with ontologies, 2001.
[3] David F. Brailsford, et al. Creating reusable well-structured PDF as a sequence of component object graphic (COG) elements, 2003, DocEng '03.
[4] Maurizio Rigamonti, et al. Xed: a new tool for extracting hidden structures from electronic documents, 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings.
[5] Anjo Anjewierden. AIDAS: incremental logical structure discovery in PDF documents, 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.
[6] Jian Fan, et al. Layout and Content Extraction for PDF Documents, 2004, Document Analysis Systems.