Using Undersampling with Ensemble Learning to Identify Factors Contributing to Preterm Birth
暂无分享,去创建一个
David Kaeli | Shi Dong | John Meeker | Guangyu Li | Zlatan Feric | April Z. Gu | Jennifer Dy | Ingrid Y. Padilla | Chieh Wu | Jennifer G. Dy | Carmen Velez Vega | Zaira Rosario | Jose Cordero | Akram Alshawabkeh | Chieh-Tsai Wu | I. Padilla | A. Alshawabkeh | D. Kaeli | J. Meeker | J. Dy | A. Gu | Z. Rosario | J. Cordero | Z. Feric | Guangyu Li | C. V. Vega | Shi Dong | Zaira Rosario
[1] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..
[2] Qiang Wang,et al. Missing categorical data imputation approach based on similarity , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).
[3] Peter A. Flach,et al. Machine Learning - The Art and Science of Algorithms that Make Sense of Data , 2012 .
[4] Xiangyu Li,et al. A Hybrid Approach to Identifying Key Factors in Environmental Health Studies , 2018, 2018 IEEE International Conference on Big Data (Big Data).
[5] M. Kenward,et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls , 2009, BMJ : British Medical Journal.
[6] Magnus Rattray,et al. Making sense of big data in health research: Towards an EU action plan , 2016, Genome Medicine.
[7] Ki-Yeol Kim,et al. Reuse of imputed data in microarray analysis increases imputation efficiency , 2004, BMC Bioinformatics.
[8] Chao Chen,et al. Using Random Forest to Learn Imbalanced Data , 2004 .
[9] I. Kohane,et al. Big Data and Machine Learning in Health Care. , 2018, JAMA.
[10] Zhi-Hua Zhou,et al. Exploratory Under-Sampling for Class-Imbalance Learning , 2006, Sixth International Conference on Data Mining (ICDM'06).
[11] Carmine Zoccali,et al. Multiple imputation: dealing with missing data. , 2013, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.
[12] Thibault Helleputte,et al. Robust biomarker identification for cancer diagnosis with ensemble feature selection methods , 2010, Bioinform..
[13] David R. Kaeli,et al. An Efficient Data Management Framework for Puerto Rico Testsite for Exploring Contamination Threats (PROTECT) , 2018, 2018 IEEE International Conference on Big Data (Big Data).
[14] Mohamed Limam,et al. Ensemble feature selection for high dimensional data: a new method and a comparative study , 2017, Advances in Data Analysis and Classification.
[15] Noel A. Card,et al. Best practices for missing data management in counseling psychology. , 2010, Journal of counseling psychology.
[16] Susan M. Bridges,et al. An Ensemble Method for Identifying Robust Features for Biomarker Discovery , 2008 .
[17] Yu Zheng,et al. U-Air: when urban air quality inference meets big data , 2013, KDD.
[18] Nitesh V. Chawla,et al. SMOTEBoost: Improving Prediction of the Minority Class in Boosting , 2003, PKDD.
[19] R. Little. Missing-Data Adjustments in Large Surveys , 1988 .
[20] Taghi M. Khoshgoftaar,et al. RUSBoost: A Hybrid Approach to Alleviating Class Imbalance , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.
[21] Ying Chen,et al. IBM Watson: How Cognitive Computing Can Be Applied to Big Data Challenges in Life Sciences Research. , 2016, Clinical therapeutics.