论文信息 - Pre-Processing: A Data Preparation Step - 字舞流文

Pre-Processing: A Data Preparation Step

Jugal K. Kalita | Keshab Nath | Dhruba Kumar Bhattacharyya | Swarup Roy | Pooja Sharma | J. Kalita | D. Bhattacharyya | Pooja Sharma | Swarup Roy | Keshab Nath

[1] John Quackenbush,et al. Smooth Quantile Normalization , 2016, bioRxiv.

[2] Jugal K. Kalita,et al. CoBi: Pattern Based Co-Regulated Biclustering of Gene Expression Data , 2013, Pattern Recognit. Lett..

[3] Hyun Kang. The prevention and handling of the missing data , 2013, Korean journal of anesthesiology.

[4] Rafael C. Jimenez,et al. Teaching the Fundamentals of Biological Data Integration Using Classroom Games , 2012, PLoS Comput. Biol..

[5] Jugal K. Kalita,et al. Discretization in gene expression data analysis: a selected survey , 2012, CCSEIT '12.

[6] Giovanni Parmigiani,et al. Integrating diverse genomic data using gene sets , 2011, Genome Biology.

[7] Frank Emmert-Streib,et al. Pathway Analysis of Expression Data: Deciphering Functional Building Blocks of Complex Diseases , 2011, PLoS Comput. Biol..

[8] Gary D Bader,et al. Bringing order to protein disorder through comparative genomics and genetic interactions , 2011, Genome Biology.

[9] Roderick J A Little,et al. A Review of Hot Deck Imputation for Survey Non‐response , 2010, International statistical review = Revue internationale de statistique.

[10] Craig K. Enders,et al. An introduction to modern missing data analyses. , 2010, Journal of school psychology.

[11] Song Wang,et al. OFFD: Optimal Flexible Frequency Discretization for Naïve Bayes Classification , 2009, ADMA.

[12] Luis González Abril,et al. Ameva: An autonomous discretization algorithm , 2009, Expert Syst. Appl..

[13] Katherine G. Herbert,et al. Biological data cleaning: a case study , 2007, Int. J. Inf. Qual..

[14] Ramón Díaz-Uriarte,et al. Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[15] T. H. Bø,et al. LSimpute: accurate estimation of missing values in microarray data with least squares methods. , 2004, Nucleic acids research.

[16] Lukasz A. Kurgan,et al. CAIM discretization algorithm , 2004, IEEE Transactions on Knowledge and Data Engineering.

[17] Shin Ishii,et al. A Bayesian missing value estimation method for gene expression profile data , 2003, Bioinform..

[18] Terence P. Speed,et al. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[19] John Quackenbush. Microarray data normalization and transformation , 2002, Nature Genetics.

[20] Huan Liu,et al. Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[21] Tok Wang Ling,et al. A knowledge-based approach for duplicate elimination in data cleaning , 2001, Inf. Syst..

[22] Huan Liu,et al. Feature Selection via Discretization , 1997, IEEE Trans. Knowl. Data Eng..

[23] T. Moon. The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[24] P W Lavori,et al. A multiple imputation strategy for clinical trials with truncation of patient data. , 1995, Statistics in medicine.

[25] Ron Kohavi,et al. Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[26] Andrew K. C. Wong,et al. Class-Dependent Discretization for Inductive Learning from Continuous and Mixed-Mode Data , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[27] Usama M. Fayyad,et al. Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[28] Jason Catlett,et al. On Changing Continuous Attributes into Ordered Discrete Attributes , 1991, EWSL.

[29] Andrew K. C. Wong,et al. Information synthesis based on hierarchical maximum entropy discretization , 1990, J. Exp. Theor. Artif. Intell..

[30] J. Ross Quinlan,et al. Induction of Decision Trees , 1986, Machine Learning.

[31] Rajashree Dash,et al. Comparative Analysis of Supervised and Unsupervised Discretization Techniques , 2011 .

[32] Geoffrey I. Webb,et al. Discretization for naive-Bayes learning: managing discretization bias and variance , 2008, Machine Learning.

[33] Michael Q. Zhang,et al. DWE: Discriminating Word Enumerator , 2005, Bioinform..

[34] Gustavo E. A. P. A. Batista,et al. A Study of K-Nearest Neighbour as an Imputation Method , 2002, HIS.

[35] H. Do,et al. Data Cleaning: Problems and Current Approaches. , 2000 .

[36] H. Hotelling. Analysis of a complex of statistical variables into principal components. , 1933 .