Decision-Based Specification and Comparison of Table Recognition Algorithms

We wished to compare the Handley [2] and Hu et al. [4]. table structure recognition algorithms and the complete set of table cell hypotheses they each generated, including any rejected in the final result. Rebuilding the systems using procedural code that transformed data structures for interpretations in-place would not have achieved this goal. Initially we translated the strategies to a formal model-based (specifically grammarbased) framework. A well designed model-driven system (such as DMOS by Couasnon [1]) makes it easier to observe and record decision making, and can be programmed succinctly by a model specification. However, we found mapping the sequence of operations in the strategies to a model based description was difficult, and the formal system required frequent and substantial reconfiguration in order to incorporate unanticipated requirements.

[1]  Richard Zanibbi,et al.  Recognition Tasks Are Imitation Games , 2005, ICAPR.

[2]  John K. Ousterhout,et al.  Tcl and the Tk Toolkit , 1994 .

[3]  Daniel P. Lopresti Exploiting WWW Resources in Experimental Document Analysis Research , 2002, Document Analysis Systems.

[4]  Richard Zanibbi,et al.  The Recognition Strategy Language , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[5]  John Langford,et al.  Search-based structured prediction , 2009, Machine Learning.

[6]  Masayuki Mukunoki,et al.  Table form document analysis based on the document structure grammar , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[7]  Richard Zanibbi,et al.  Historical recall and precision: summarizing generated hypotheses , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[8]  Andrew D. Bagdanov Style characterization of machine printed texts , 2004 .

[9]  Matthew Hurst Layout and language: an efficient algorithm for detecting text blocks based on spatial and linguistic evidence , 2000, IS&T/SPIE Electronic Imaging.

[10]  David W. Binkley,et al.  Program slicing , 2008, 2008 Frontiers of Software Maintenance.

[11]  Thomas W. Reps,et al.  The use of program dependence graphs in software engineering , 1992, International Conference on Software Engineering.

[12]  RecognitionBruce A. DraperDept Learning Control Strategies for Object Recognition , 1996 .

[13]  James R. Cordy,et al.  The TXL source transformation language , 2006, Sci. Comput. Program..

[14]  Bruce A. Draper,et al.  ADORE: Adaptive Object Recognition , 1999, ICVS.

[15]  Kevin A. Schneider,et al.  Source transformation in software engineering using the TXL transformation system , 2002, Inf. Softw. Technol..

[16]  Atsuhiro Takasu,et al.  A document understanding method for database construction of an electronic library , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[17]  Daniel Marcu,et al.  Learning as search optimization: approximate large margin methods for structured prediction , 2005, ICML.

[18]  H. Chertkow,et al.  Semantic memory , 2002, Current neurology and neuroscience reports.

[19]  Robert M. Haralick,et al.  Document image understanding: geometric and logical layout , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Edward R. Dougherty,et al.  Electronic Imaging Technology , 1999 .

[21]  Katsushi Ikeuchi,et al.  Symbolic visual learning , 1997 .

[22]  Piero Mussio,et al.  Metareasoning in the determination of image interpretation strategies , 1994, Pattern Recognit. Lett..

[23]  Edward H. Shortliffe,et al.  EMYCIN: A Knowledge Engineer’s Tool for Constructing Rule-Based Expert Systems , 2005 .

[24]  Robert M. Haralick,et al.  CD-ROM document database standard , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[25]  J. Cordy,et al.  A Survey of Table Recognition : Models , Observations , Transformations , and Inferences , 2003 .

[26]  A. Colman Game Theory and its Applications: In the Social and Biological Sciences , 1995 .

[27]  Richard Zanibbi,et al.  Recognizing Mathematical Expressions Using Tree Transformation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Daniel P. Lopresti,et al.  A Tabular Survey of Automated Table Processing , 1999, GREC.

[29]  Richard Zanibbi,et al.  A language for specifying and comparing table recognition strategies , 2005 .

[30]  Jonathan J. Hull,et al.  Document Recognition IV , 1997 .

[31]  Daniel P. Lopresti,et al.  Table structure recognition and its evaluation , 2000, IS&T/SPIE Electronic Imaging.

[32]  Daniel P. Lopresti,et al.  Why table ground-truthing is hard , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[33]  Xinxin Wang,et al.  Tabular Abstraction, Editing, and Formatting , 1996 .

[34]  Daniel P. Lopresti,et al.  Evaluating the performance of table processing algorithms , 2002, International Journal on Document Analysis and Recognition.

[35]  David W. Embley,et al.  Table-processing paradigms: a research survey , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[36]  Yalin Wang,et al.  Detecting Tables in HTML Documents , 2002, Document Analysis Systems.

[37]  John C. Handley Table analysis for multi-line cell identifica-tion , 2001 .

[38]  Thomas Kieninger,et al.  Applying the T-Recs table recognition system to the business letter domain , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[39]  George Nagy,et al.  Twenty Years of Document Image Analysis in PAMI , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Luís Torgo,et al.  Design of an end-to-end method to extract information from tables , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[41]  Robert M. Haralick,et al.  Document structure analysis and performance evaluation , 1999 .

[42]  Matthew Hurst Towards a theory of tables , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[43]  Daniel P. Lopresti,et al.  Evaluating document analysis results via graph probing , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[44]  Bertrand Coüasnon DMOS: a generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table structures recognition systems , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[45]  Bertrand Coüasnon DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[46]  John C. Handley,et al.  Table analysis for multiline cell identification , 2000, IS&T/SPIE Electronic Imaging.

[47]  Matthew Hurst,et al.  Layout and Language: Challenges for Table Understanding on the Web , 2001 .

[48]  Francesca Cesarini,et al.  Trainable table location in document images , 2002, Object recognition supported by user interaction for service robots.