New statistical learning methods for chemical toxicity data analysis

[1]  R. Davies Hypothesis testing when a nuisance parameter is present only under the alternative , 1977 .

[2]  David R. Musicant,et al.  Successive overrelaxation for support vector machines , 1999, IEEE Trans. Neural Networks.

[3]  S. Geisler,et al.  A Mitochondrial Target Sequence Polymorphism in Manganese Superoxide Dismutase Predicts Inferior Survival in Breast Cancer Patients Treated with Cyclophosphamide , 2009, Clinical Cancer Research.

[4]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[5]  James C. Bezdek,et al.  Decision templates for multiple classifier fusion: an experimental comparison , 2001, Pattern Recognit..

[6]  Dimitris Karlis,et al.  Choosing Initial Values for the EM Algorithm for Finite Mixtures , 2003, Comput. Stat. Data Anal..

[7]  Bogdan Gabrys,et al.  Ridge regression ensemble for toxicity prediction , 2010, ICCS.

[8]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[9]  Anne M. P. Canuto,et al.  A Class-Based Feature Selection Method for Ensemble Systems , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[10]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[11]  S. Lee,et al.  Semiparametric Estimation of a Binary Response Model with a Change-Point Due to a Covariate Threshold , 2007 .

[12]  Kurt Hornik,et al.  Support Vector Machines in R , 2006 .

[13]  Emilio Benfenati,et al.  A QSAR Study of Avian Oral Toxicity using Support Vector Machines and Genetic Algorithms , 2006 .

[14]  Roberto Pastor-Barriuso,et al.  Transition models for change‐point estimation in logistic regression , 2003, Statistics in medicine.

[15]  K. Matthews,et al.  Improving the performance of physiologic hot flash measures with support vector machines. , 2009, Psychophysiology.

[16]  J. Kruskal TOWARD A PRACTICAL METHOD WHICH HELPS UNCOVER THE STRUCTURE OF A SET OF MULTIVARIATE OBSERVATIONS BY FINDING THE LINEAR TRANSFORMATION WHICH OPTIMIZES A NEW “INDEX OF CONDENSATION” , 1969 .

[17]  Alexander J. Smola,et al.  Support Vector Method for Function Approximation, Regression Estimation and Signal Processing , 1996, NIPS.

[18]  Tianzi Jiang,et al.  A combinational feature selection and ensemble neural network method for classification of gene expression data , 2004, BMC Bioinformatics.

[19]  D. Opitz,et al.  Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..

[20]  Adrian E. Raftery,et al.  CHANGE POINT AND CHANGE CURVE MODELING IN STOCHASTIC PROCESSES AND SPATIAL STATISTICS , 2007 .

[21]  Ian H. Witten,et al.  Stacked generalization: when does it work? , 1997, IJCAI 1997.

[22]  A. Formann,et al.  Sensitivity to initial values in full non-parametric maximum-likelihood estimation of the two-parameter logistic model. , 2011, The British journal of mathematical and statistical psychology.

[23]  J. Friedman,et al.  PROJECTION PURSUIT DENSITY ESTIMATION , 1984 .

[24]  Muin J. Khoury,et al.  Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes , 2010, BMC Medical Informatics Decis. Mak..

[25]  C. Chu,et al.  Kernel-Type Estimators of Jump Points and Values of a Regression Function , 1993 .

[26]  H. Chernoff,et al.  ESTIMATING THE CURRENT MEAN OF A NORMAL DISTRIBUTION WHICH IS SUBJECTED TO CHANGES IN TIME , 1964 .

[27]  Lars Kai Hansen,et al.  Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[29]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[30]  E. Guallar,et al.  Use of two-segmented logistic regression to estimate change-points in epidemiologic studies. , 1998, American journal of epidemiology.

[31]  Xiaogang Su,et al.  Subgroup Analysis via Recursive Partitioning , 2009 .

[32]  Georgios Paliouras,et al.  Combining Information Extraction Systems Using Voting and Stacked Generalization , 2005, J. Mach. Learn. Res..

[33]  R. Clemen Combining forecasts: A review and annotated bibliography , 1989 .

[34]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Ludmila I. Kuncheva,et al.  Relationships between combination methods and measures of diversity in combining classifiers , 2002, Inf. Fusion.

[36]  Thibault Helleputte,et al.  Robust biomarker identification for cancer diagnosis with ensemble feature selection methods , 2010, Bioinform..

[37]  Artem Cherkasov Inductive QSAR Descriptors. Distinguishing Compounds with Antibacterial Activity by Artificial Neural Networks , 2005 .

[38]  O. Pons Estimation in a Cox regression model with a change-point according to a threshold in a covariate , 2003 .

[39]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[40]  Yasuo Amemiya,et al.  Latent class regression on latent factors. , 2005, Biostatistics.

[41]  Manfred Opper,et al.  An Approximate Analytical Approach to Resampling Averages , 2003, J. Mach. Learn. Res..

[42]  H. Müller CHANGE-POINTS IN NONPARAMETRIC REGRESSION ANALYSIS' , 1992 .

[43]  D. Andrews,et al.  Optimal Tests When a Nuisance Parameter Is Present Only Under the Alternative , 1992 .

[44]  M. Kosorok Introduction to Empirical Processes and Semiparametric Inference , 2008 .

[45]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[46]  M. Pazzani,et al.  Error Reduction through Learning Multiple Descriptions , 1996, Machine Learning.

[47]  Eric Bauer,et al.  An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants , 1999, Machine Learning.

[48]  Mia K. Markey,et al.  A machine learning perspective on the development of clinical decision support systems utilizing mass spectra of blood samples , 2006, J. Biomed. Informatics.

[49]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[50]  D. Andrews Testing When a Parameter Is on the Boundary of the Maintained Hypothesis , 2001 .

[51]  X. Y. Zhang,et al.  Application of support vector machine (SVM) for prediction toxic activity of different data sets. , 2006, Toxicology.

[52]  Linear Estimators in Change Point Problems , 1994 .

[53]  Stephen R. Johnson,et al.  The Trouble with QSAR (or How I Learned To Stop Worrying and Embrace Fallacy) , 2008, J. Chem. Inf. Model..

[54]  Chih-Jen Lin,et al.  Newton's Method for Large Bound-Constrained Optimization Problems , 1999, SIAM J. Optim..

[55]  John W. Tukey,et al.  A Projection Pursuit Algorithm for Exploratory Data Analysis , 1974, IEEE Transactions on Computers.

[56]  C. Manski MAXIMUM SCORE ESTIMATION OF THE STOCHASTIC UTILITY MODEL OF CHOICE , 1975 .

[57]  Mu Zhu,et al.  Kernels and Ensembles , 2007, 0712.1027.

[58]  David W. Opitz,et al.  Feature Selection for Ensembles , 1999, AAAI/IAAI.

[59]  J. Friedman,et al.  Projection Pursuit Regression , 1981 .

[60]  Thomas G. Dietterich Machine-Learning Research , 1997, AI Mag..

[61]  C. Manski Semiparametric analysis of discrete response: Asymptotic properties of the maximum score estimator , 1985 .

[62]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[63]  Lin Ma,et al.  Empirical analysis of support vector machine ensemble classifiers , 2009, Expert Syst. Appl..

[64]  Mee Young Park,et al.  Penalized logistic regression for detecting gene interactions. , 2008, Biostatistics.

[65]  Ting Chen,et al.  Ensemble Feature Selection: Consistent Descriptor Subsets for Multiple QSAR Models , 2007, J. Chem. Inf. Model..