Text Classification Techniques in Oil Industry Applications

The development of automatic methods to produce usable structured information from unstructured text sources is extremely valuable to the oil and gas industry. A structured resource would allow researches and industry professionals to write relatively simple queries to retrieve all the information regards transcriptions of any accident. Instead of the thousands of abstracts provided by querying the unstructured corpus, the queries on structured corpus would result in a few hundred well-formed results.

[1]  Shih-Hung Wu,et al.  Text Categorization Using Automatically Acquired Domain Ontology , 2003 .

[2]  Stephan Bloehdorn,et al.  Text classification by boosting weak learners based on terms and concepts , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[3]  Amit P. Sheth,et al.  Semantic Content Management for Enterprises and the Web , 2001 .

[4]  Alan F. Smeaton,et al.  Ontology-Based MEDLINE Document Classification , 2007, BIRD.

[5]  Amit P. Sheth,et al.  Semantic Enhancement Engine: A Modular Document Enhancement Platform for Semantic Applications over Heterogeneous Content , 2002 .

[6]  Masoud Nikravesh,et al.  Enhancing the Power of the Internet , 2004 .

[7]  Jun Fang,et al.  Ontology-Based Automatic Classification and Ranking for Web Documents , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[8]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[9]  Evgeniy Gabrilovich,et al.  Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge , 2006, AAAI.

[10]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[11]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[12]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[13]  Vipul Kashyap,et al.  Relationships at the Heart of Semantic Web: Modeling, Discovering, and Exploiting Complex Semantic Relationships , 2004 .

[14]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[15]  Céline Rouveirol,et al.  Machine Learning: ECML-98 , 1998, Lecture Notes in Computer Science.

[16]  Amit P. Sheth,et al.  Altering document term vectors for classification: ontologies as expectations of co-occurrence , 2007, WWW '07.