A Decision Support Method for Finding Appropriate Information on the Web Documents

Today, the Web has been expanded dramatically and hence, looking up desired information in a vast ocean of available data is a difficult task for users. So, we need methods which using a targeted search, help users in making decisions for choosing the appropriate documents according the desired content. In presented information retrieval technique, web documents are introduced to the user as search results. To resolve this problem can be used semantic extraction. That conclusion is valid for extraction if related subject pages identified initially. Semantic extraction ontology is one of these methods. This paper puts to evaluation the extent of relationship between a Semi structured HTML and ontology using some statistical techniques. Then with calculate the density of the document and compared with the expected density ontology in an acceptable limitation, documents related with ontology predicted. Then with calculate the density of the document and compared with the expected density ontology in an acceptable limitation, documents related with ontology predicted. If calculations for the two cases of expected value with density and view value are within the required range, then ontology would be related. According to experimental Results within a 99% reliable range, shows that the recommended method's ability to achieve value recall 100% and precision 83% is able.

[1]  J. Wolfowitz,et al.  An Introduction to the Theory of Statistics , 1951, Nature.

[2]  L. Hogben Introduction to the Theory of Statistics , 1951 .

[3]  E. Ziegel Regression: A Second Course in Statistics , 1988 .

[4]  Alberto O. Mendelzon,et al.  WebOQL: restructuring documents, databases and Webs , 1998, Proceedings 14th International Conference on Data Engineering.

[5]  David W. Embley,et al.  Ontology-based extraction and structuring of information from data-rich unstructured documents , 1998, CIKM '98.

[6]  David W. Embley,et al.  Record-boundary discovery in Web documents , 1999, SIGMOD '99.

[7]  David W. Embley,et al.  Ontology Suitability for Uncertain Extraction of Information from Multi-Record Web Documents , 1999, Datenbank Rundbr..

[8]  David W. Embley,et al.  Recognizing Ontology-Applicable Multiple-Record Web Documents , 2001, ER.

[9]  Hong Chen,et al.  Odaies: ontology-driven adaptive Web information extraction system , 2003, IEEE/WIC International Conference on Intelligent Agent Technology, 2003. IAT 2003..

[10]  Yihong Ding SEMIAUTOMATIC GENERATION OF RESILIENT DATA-EXTRACTION ONTOLOGIES , 2003 .

[11]  David W. Embley,et al.  Query Rewriting for Extracting Data Behind HTML Forms , 2004, ER.

[12]  D. Embley,et al.  Querying Disjunctive Databases in Polynomial Time ∗ , 2004 .

[13]  Automating the Extraction of Domain Specific Information from the Web — a Case Study for the Genealogical Domain , 2004 .

[14]  David W. Embley,et al.  Grouping search-engine returned citations for person-name queries , 2004, WIDM '04.

[15]  Alan E. Wessman A Framework for Extraction Plans and Heuristics in an Ontology-Based Data-Extraction System , 2005 .

[16]  Yuanqiu Zhou Generating Data-Extraction Ontologies By Example , 2005 .

[17]  Yihong Ding Study of Design Issues on an Automated Semantic Annotation System , 2005 .

[18]  Mark S. Vickers,et al.  Ontology-Based Free-Form Query Processing for the Semantic Web , 2006 .

[19]  Cui Tao,et al.  Automatic Creation of Web Services from Extraction Ontologies , 2006, ER.

[20]  David W. Embley,et al.  Ontology aware software service agents: meeting ordinary user needs on the semantic web , 2007 .

[21]  D. Embley,et al.  Automatic Generation of Ontologies from Canonicalized Web Tables , 2008 .

[22]  Stephen Lynn Automating Mini-Ontology Generation from Canonical Tables , 2008 .

[23]  Tsvi Kuflik,et al.  Manually vs semiautomatic domain specific ontology building , 2008 .

[24]  Martin Hepp,et al.  Reusing ontologies and language components for ontology generation , 2010, Data Knowl. Eng..