Identification et structuration hié rarchique des titres dans les documents HTML Structuration hié rarchique des titres
暂无分享,去创建一个
[1] Andreas Stolcke,et al. Hidden Markov Model} Induction by Bayesian Model Merging , 1992, NIPS.
[2] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.
[3] I. V. Ramakrishnan,et al. Automatic discovery of semantic structures in HTML documents , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..
[4] Jie Zou,et al. Structure and content analysis for html medical articles: a hidden markov model approach , 2007, DocEng '07.
[6] Daniel Marcu,et al. From Local to Global Coherence: A Bottom-Up Approach to Text Planning , 1997, AAAI/IAAI.
[7] Shuming Shi,et al. Title extraction from bodies of HTML documents and its application to web page retrieval , 2005, SIGIR '05.
[8] Keiichiro Hoashi,et al. Robust web page segmentation for mobile terminal using content-distances and page layout information , 2007, WWW '07.
[9] Wei-Ying Ma,et al. Learning block importance models for web pages , 2004, WWW '04.
[10] Shuming Shi,et al. Web page title extraction and its application , 2007, Inf. Process. Manag..
[11] H. P. Edmundson,et al. New Methods in Automatic Extracting , 1969, JACM.
[12] HongJiang Zhang,et al. HTML page analysis based on visual cues , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.