Description Logic Based Extended Predictive Model Markup Language EPMML

Predictive Model Markup Language PMML is currently used as standardized description language for data mining model by more and more DMG members.However,different experience of data mining products providers,constant development of data mining techniques,and PMML containing lots of language elements inevitably lead to inconsistency problems in PMML based data mining metadata.Considering this problem,a description logic SOIN is designed in this research.Its syntax and semantics are analyzed.An extended predictive model markup language EPMML is then proposed based on the SOIN infrastructure,the language elements are designed in detail.EPMML based data mining metadata can be transformed into knowledge base of SOIN,and then potential semantic inconsistency problems in the metadata can be automatically discovered by knowledge reasoning upon the knowledge base.Illustrations in the reasoning engineer Racer validate the well-formedness,well expressibility and reasoning efficiency of EPMML.