Uncertain XML documents classification using Extreme Learning Machine

Driven by the emerging network data exchange and storage, XML documents classification has become increasingly important. Most existing representation model and conventional learning algorithm are defined on certain XML documents. However, in many real-world applications, XML datasets contain inherent uncertainty, which brings greater challenges to classification problem. In this paper, we propose a novel solution to classify uncertain XML documents, including uncertain XML documents representation and two uncertain learning algorithms based on Extreme Learning Machine. Experimental results show that our approaches exhibit prominent performance for uncertain XML documents classification problem.

[1]  Guang-Bin Huang,et al.  Extreme learning machine: a new learning scheme of feedforward neural networks , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[2]  Lei Zhao,et al.  Neural modeling of vapor compression refrigeration cycle with extreme learning machine , 2014, Neurocomputing.

[3]  Yi Zhao,et al.  A protein secondary structure prediction framework based on the Extreme Learning Machine , 2008, Neurocomputing.

[4]  Yehoshua Sagiv,et al.  Matching Twigs in Probabilistic XML , 2007, VLDB.

[5]  Jianxin Li,et al.  Matching Top-k Answers of Twig Patterns in Probabilistic XML , 2010, DASFAA.

[6]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[7]  Xin Bi,et al.  XML document classification based on ELM , 2011, Neurocomputing.

[8]  Ye Yuan,et al.  Extreme learning machine for classification over uncertain data , 2014, Neurocomputing.

[9]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[10]  Hongming Zhou,et al.  Optimization method based extreme learning machine for classification , 2010, Neurocomputing.

[11]  Jeffrey Xu Yu,et al.  Query ranking in probabilistic XML data , 2009, EDBT '09.

[12]  H. V. Jagadish,et al.  ProTDB: Probabilistic Data in XML , 2002, VLDB.

[13]  Jeffrey Xu Yu,et al.  Efficient processing of top-k twig queries over probabilistic XML data , 2011, World Wide Web.

[14]  Ge Yu,et al.  Breast tumor detection in digital mammography based on extreme learning machine , 2014, Neurocomputing.

[15]  Siqi Liu,et al.  Boosting Twig Joins in Probabilistic XML , 2011, DEXA.

[16]  Xin Bi,et al.  Probability based voting extreme learning machine for multiclass XML documents classification , 2013, World Wide Web.

[17]  Tianyou Chai,et al.  Burning state recognition of rotary kiln using ELMs with heterogeneous features , 2013, Neurocomputing.

[18]  Serge Abiteboul,et al.  On the expressiveness of probabilistic XML models , 2009, The VLDB Journal.

[19]  Yuan Lan,et al.  An extreme learning machine approach for speaker recognition , 2012, Neural Computing and Applications.

[20]  Dong Han,et al.  Semantic concept detection for video based on extreme learning machine , 2013, Neurocomputing.

[21]  Serge Abiteboul,et al.  On the complexity of managing probabilistic XML data , 2007, PODS '07.

[22]  Yawen Li,et al.  Holistically Twig Matching in Probabilistic XML , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[23]  Biao Wang,et al.  Update strategy based on region classification using ELM for mobile object index , 2012, Soft Comput..

[24]  Jian Liu,et al.  Querying and ranking incomplete twigs in probabilistic XML , 2013, World Wide Web.

[25]  Jianxin Li,et al.  Top-k keyword search over probabilistic XML data , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[26]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[27]  Yehoshua Sagiv,et al.  Query efficiency in probabilistic XML models , 2008, SIGMOD Conference.

[28]  Jianxin Li,et al.  ELCA evaluation for keyword search on probabilistic XML data , 2012, World Wide Web.

[29]  Peter Buneman,et al.  Semistructured data , 1997, PODS.

[30]  Yehoshua Sagiv,et al.  Query evaluation over probabilistic XML , 2009, The VLDB Journal.

[31]  Jianxin Li,et al.  Quasi-SLCA Based Keyword Query Processing over Probabilistic XML Data , 2013, IEEE Transactions on Knowledge and Data Engineering.

[32]  Hongming Zhou,et al.  Credit risk evaluation with extreme learning machine , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[33]  Chen Xiaoou,et al.  A semi-structured document model for text mining , 2002 .