Predicting carcinoid heart disease with the noisy-threshold classifier

OBJECTIVE To predict the development of carcinoid heart disease (CHD), which is a life-threatening complication of certain neuroendocrine tumors. To this end, a novel type of Bayesian classifier, known as the noisy-threshold classifier, is applied. MATERIALS AND METHODS Fifty-four cases of patients that suffered from a low-grade midgut carcinoid tumor, of which 22 patients developed CHD, were obtained from the Netherlands Cancer Institute (NKI). Eleven attributes that are known at admission have been used to classify whether the patient develops CHD. Classification accuracy and area under the receiver operating characteristics (ROC) curve of the noisy-threshold classifier are compared with those of the naive-Bayes classifier, logistic regression, the decision-tree learning algorithm C4.5, and a decision rule, as formulated by an expert physician. RESULTS The noisy-threshold classifier showed the best classification accuracy of 72% correctly classified cases, although differences were significant only for logistic regression and C4.5. An area under the ROC curve of 0.66 was attained for the noisy-threshold classifier, and equaled that of the physician's decision-rule. CONCLUSIONS The noisy-threshold classifier performed favorably to other state-of-the-art classification algorithms, and equally well as a decision-rule that was formulated by the physician. Furthermore, the semantics of the noisy-threshold classifier make it a useful machine learning technique in domains where multiple causes influence a common effect.

[1]  Nevin Lianwen Zhang,et al.  Exploiting Causal Independence in Bayesian Network Inference , 1996, J. Artif. Intell. Res..

[2]  Johann Eder,et al.  Logic and Databases , 1992, Advanced Topics in Artificial Intelligence.

[3]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[4]  Ron Kohavi,et al.  Bias Plus Variance Decomposition for Zero-One Loss Functions , 1996, ICML.

[5]  Mehran Sahami,et al.  Learning Limited Dependence Bayesian Classifiers , 1996, KDD.

[6]  Steven Salzberg,et al.  On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach , 1997, Data Mining and Knowledge Discovery.

[7]  Jirí Vomlel,et al.  Exploiting Functional Dependence in Bayesian Network Inference , 2002, UAI.

[8]  E. Shortliffe,et al.  An analysis of physician attitudes regarding computer-based clinical consultation systems. , 1981, Computers and biomedical research, an international journal.

[9]  Tom Heskes,et al.  EM Algorithm for Symmetric Causal Independence Models , 2006, ECML.

[10]  D. G. Swain Computer aided diagnosis of acute abdominal pain , 1986 .

[11]  R. S. Ledley,et al.  Probability, Logic and Medical Diagnosis. , 1959 .

[12]  D. Bamber The area above the ordinal dominance graph and the area below the receiver operating characteristic graph , 1975 .

[13]  G. Sutton Computer aided diagnosis of acute abdominal pain , 1986, British medical journal.

[14]  Ingo Wegener,et al.  The complexity of Boolean functions , 1987 .

[15]  R S LEDLEY,et al.  Reasoning foundations of medical diagnosis; symbolic logic, probability, and value theory aid our understanding of how physicians reason. , 1959, Science.

[16]  Carmen Lacave,et al.  A review of explanation methods for Bayesian networks , 2002, The Knowledge Engineering Review.

[17]  Babs G Taal,et al.  Carcinoid heart disease. , 2003, The New England journal of medicine.

[18]  L. L. Cam,et al.  An approximation theorem for the Poisson binomial distribution. , 1960 .

[19]  R. Bouillon,et al.  Chromogranin A: its clinical value as marker of neuroendocrine tumours , 1998, European journal of clinical investigation.

[20]  B. Taal,et al.  Metastatic carcinoid tumors: a clinical review. , 2005, The oncologist.

[21]  James P. Egan,et al.  Signal detection theory and ROC analysis , 1975 .

[22]  José Mira Mira,et al.  NasoNet, modeling the spread of nasopharyngeal cancer with networks of probabilistic events in discrete time , 2002, Artif. Intell. Medicine.

[23]  D. J. Spiegelhalter,et al.  Statistical and Knowledge‐Based Approaches to Clinical Decision‐Support Systems, with an Application in Gastroenterology , 1984 .

[24]  Mark Kidd,et al.  Carcinoid Tumors and Fibrosis: An Association with No Explanation , 2004, The American Journal of Gastroenterology.

[25]  Gregory M. Provan,et al.  Knowledge Engineering for Large Belief Networks , 1994, UAI.

[26]  Herbert B. Enderton,et al.  A mathematical introduction to logic , 1972 .

[27]  Peter J. F. Lucas,et al.  Bayesian Network Modelling by Qualitative Patterns , 2002, ECAI.

[28]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[29]  Peter J. F. Lucas,et al.  Bayesian network modelling through qualitative patterns , 2005, Artif. Intell..

[30]  Peter J. F. Lucas,et al.  Using Background Knowledge to Construct Bayesian Classifiers for Data-Poor Domains , 2004, SGAI Conf..

[31]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[32]  Herbert B. Enderton A mathematical introduction to logic / Herbert B. Enderton , 1972 .

[33]  Mehryar Mohri,et al.  AUC Optimization vs. Error Rate Minimization , 2003, NIPS.

[34]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[35]  R. Reiter On Closed World Data Bases , 1987, Logic and Data Bases.

[36]  Rex B. Kline,et al.  Principles and Practice of Structural Equation Modeling , 1998 .

[37]  David Heckerman,et al.  A New Look at Causal Independence , 1994, UAI.

[38]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[39]  A. Edwards,et al.  The Meaning of Binomial Distribution , 1960, Nature.

[40]  Peter J. F. Lucas,et al.  Noisy Threshold Functions for Modelling Causal Independence in Bayesian Networks , 2006 .

[41]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[42]  Francisco Javier Díez,et al.  Networks of probabilistic events in discrete time , 2002, Int. J. Approx. Reason..

[43]  P. Deas Notes of a Case of Spontaneous Fracture of the Humerus and Femur, Resulting from Degeneration of the Bones , 1877, British medical journal.

[44]  E. D. de Vries,et al.  Carcinoid heart disease. , 2003, The New England journal of medicine.

[45]  Francisco Javier Díez,et al.  Parameter adjustment in Bayes networks. The generalized noisy OR-gate , 1993, UAI.

[46]  Ian Witten,et al.  Data Mining , 2000 .