Identifying table boundaries in digital documents via sparse line detection
暂无分享,去创建一个
[1] W. Bruce Croft,et al. Table extraction using conditional random fields , 2003, DG.O.
[2] Andrew McCallum,et al. Efficiently Inducing Features of Conditional Random Fields , 2002, UAI.
[3] Kun Bai,et al. TableSeer: automatic table metadata extraction and searching in digital libraries , 2007, JCDL '07.
[4] David A. Landgrebe,et al. A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..
[5] Yalin Wang,et al. Detecting Tables in HTML Documents , 2002, Document Analysis Systems.
[6] Thomas Kieninger,et al. Applying the T-Recs table recognition system to the business letter domain , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.
[7] Jun'ichi Tsujii,et al. A method to integrate tables of the World Wide Web , 2001 .
[8] Fernando Pereira,et al. Shallow Parsing with Conditional Random Fields , 2003, NAACL.
[9] Zijian Zheng,et al. Naive Bayesian Classifier Committees , 1998, ECML.
[10] Wei Li,et al. Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons , 2003, CoNLL.
[11] Yalin Wang,et al. A machine learning based approach for table detection on the web , 2002, WWW '02.
[12] Hwee Tou Ng,et al. Learning to Recognize Tables in Free Text , 1999, ACL.
[13] Thomas G Kieninger,et al. Table structure recognition based on robust block segmentation , 1998, Electronic Imaging.
[14] H.S. Baird,et al. A retargetable table reader , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.
[15] Jiwon Shin,et al. Table Recognition and Evaluation , 2005 .
[16] Hsin-Hsi Chen,et al. Mining Tables from Large Scale HTML Texts , 2000, COLING.
[17] Matthew Hurst,et al. Layout and Language: Challenges for Table Understanding on the Web , 2001 .
[18] Jian Fan,et al. Layout and Content Extraction for PDF Documents , 2004, Document Analysis Systems.
[19] Robert M. Haralick,et al. Recursive X-Y cut using bounding boxes of connected components , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.
[20] Wolfgang Gatterbauer,et al. Using visual cues for extraction of tabular data from arbitrary HTML documents , 2005, WWW '05.
[21] J. Cordy,et al. A Survey of Table Recognition : Models , Observations , Transformations , and Inferences , 2003 .
[22] Kun Bai,et al. Improving the Table Boundary Detection in PDFs by Fixing the Sequence Error of the Sparse Lines , 2009, 2009 10th International Conference on Document Analysis and Recognition.
[23] Jianying Hu,et al. Flexible Web document analysis for delivery to narrow-bandwidth devices , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.
[24] Katharina Kaiser,et al. pdf2table: A Method to Extract Table Information from PDF Files , 2005, IICAI.
[25] Yalin Wang,et al. Automatic table ground truth generation and a background-analysis-based table structure extraction method , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.
[26] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.