JNCC2: The Java Implementation Of Naive Credal Classifier 2

JNCC2 implements the naive credal classifier 2 (NCC2). This is an extension of naive Bayes to imprecise probabilities that aims at delivering robust classifications also when dealing with small or incomplete data sets. Robustness is achieved by delivering set-valued classifications (that is, returning multiple classes) on the instances for which (i) the learning set is not informative enough to smooth the effect of choice of the prior density or (ii) the uncertainty arising from missing data prevents the reliable indication of a single class. JNCC2 is released under the GNU GPL license.

[1]  Marco Zaffalon,et al.  Statistical inference of the naive credal classifier , 2001, ISIPTA.

[2]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[3]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[4]  Daniel F. Heitjan,et al.  Ignorability, Sufficiency and Ancillarity , 1997 .

[5]  M. Zaffalon Credal Classification for Mining Environmental Data , 2002 .

[6]  Marco Zaffalon,et al.  Naive Credal Classifier 2: a robust approach to classification for small and incomplete data sets , 2007 .

[7]  P. Walley Statistical Reasoning with Imprecise Probabilities , 1990 .

[8]  C. Manski Partial Identification of Probability Distributions , 2003 .

[9]  Marco Zaffalon,et al.  Conservative Rules for Predictive Inference with Incomplete Data , 2005, ISIPTA.

[10]  Marco Zaffalon,et al.  Learning Reliable Classifiers From Small or Incomplete Data Sets: The Naive Credal Classifier 2 , 2008, J. Mach. Learn. Res..

[11]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[12]  Marco Zaffalon,et al.  Reliable diagnoses of dementia by the naive credal classifier inferred from incomplete cognitive data , 2003, Artif. Intell. Medicine.

[13]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[14]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.