Sample-based Collection and Adjustment Algorithm of Rules for Metadata Extraction on Business Documents

[1]  Kentaro Torisawa, et al. Finding Specification Pages from the Web, 2006.

[2]  Yasuto Ishitani. Logical structure analysis of document images based on emergent computation, 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition (ICDAR '99).

[3]  Michelangelo Ceci, et al. Machine learning methods for automatically processing historical documents: from paper acquisition to XML transformation, 2004, First International Workshop on Document Image Analysis for Libraries.

[4]  Janusz Wnek. Machine Learning of Generalized Document Templates for Data Extraction, 2002, Document Analysis Systems.

[5]  Kristen Maria Summers. Near-wordless document structure classification, 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[6]  John C. Handley, et al. Document understanding system using stochastic context-free grammars, 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[7]  Thomas M. Breuel, et al. Bibliographic Meta-Data Extraction Using Probabilistic Finite State Transducers, 2007.

[8]  Abdel Belaïd, et al. Logical structure recognition of scientific bibliographic references, 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[9]  Suzanne Liebowitz Taylor, et al. Extraction of data from preprinted forms, 2007, Machine Vision and Applications.

[10]  Sung-Bae Cho, et al. Geometric Structure Analysis of Document Images: A Knowledge-Based Approach, 2000, IEEE Trans. Pattern Anal. Mach. Intell.