Pre-Processing: A Data Preparation Step

[1]  John Quackenbush,et al.  Smooth Quantile Normalization , 2016, bioRxiv.

[2]  Jugal K. Kalita,et al.  CoBi: Pattern Based Co-Regulated Biclustering of Gene Expression Data , 2013, Pattern Recognit. Lett..

[3]  Hyun Kang The prevention and handling of the missing data , 2013, Korean journal of anesthesiology.

[4]  Rafael C. Jimenez,et al.  Teaching the Fundamentals of Biological Data Integration Using Classroom Games , 2012, PLoS Comput. Biol..

[5]  Jugal K. Kalita,et al.  Discretization in gene expression data analysis: a selected survey , 2012, CCSEIT '12.

[6]  Giovanni Parmigiani,et al.  Integrating diverse genomic data using gene sets , 2011, Genome Biology.

[7]  Frank Emmert-Streib,et al.  Pathway Analysis of Expression Data: Deciphering Functional Building Blocks of Complex Diseases , 2011, PLoS Comput. Biol..

[8]  Gary D Bader,et al.  Bringing order to protein disorder through comparative genomics and genetic interactions , 2011, Genome Biology.

[9]  Roderick J A Little,et al.  A Review of Hot Deck Imputation for Survey Non‐response , 2010, International statistical review = Revue internationale de statistique.

[10]  Craig K. Enders,et al.  An introduction to modern missing data analyses. , 2010, Journal of school psychology.

[11]  Song Wang,et al.  OFFD: Optimal Flexible Frequency Discretization for Naïve Bayes Classification , 2009, ADMA.

[12]  Luis González Abril,et al.  Ameva: An autonomous discretization algorithm , 2009, Expert Syst. Appl..

[13]  Katherine G. Herbert,et al.  Biological data cleaning: a case study , 2007, Int. J. Inf. Qual..

[14]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[15]  T. H. Bø,et al.  LSimpute: accurate estimation of missing values in microarray data with least squares methods. , 2004, Nucleic acids research.

[16]  Lukasz A. Kurgan,et al.  CAIM discretization algorithm , 2004, IEEE Transactions on Knowledge and Data Engineering.

[17]  Shin Ishii,et al.  A Bayesian missing value estimation method for gene expression profile data , 2003, Bioinform..

[18]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[19]  John Quackenbush Microarray data normalization and transformation , 2002, Nature Genetics.

[20]  Huan Liu,et al.  Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[21]  Tok Wang Ling,et al.  A knowledge-based approach for duplicate elimination in data cleaning , 2001, Inf. Syst..

[22]  Huan Liu,et al.  Feature Selection via Discretization , 1997, IEEE Trans. Knowl. Data Eng..

[23]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[24]  P W Lavori,et al.  A multiple imputation strategy for clinical trials with truncation of patient data. , 1995, Statistics in medicine.

[25]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[26]  Andrew K. C. Wong,et al.  Class-Dependent Discretization for Inductive Learning from Continuous and Mixed-Mode Data , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[28]  Jason Catlett,et al.  On Changing Continuous Attributes into Ordered Discrete Attributes , 1991, EWSL.

[29]  Andrew K. C. Wong,et al.  Information synthesis based on hierarchical maximum entropy discretization , 1990, J. Exp. Theor. Artif. Intell..

[30]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[31]  Rajashree Dash,et al.  Comparative Analysis of Supervised and Unsupervised Discretization Techniques , 2011 .

[32]  Geoffrey I. Webb,et al.  Discretization for naive-Bayes learning: managing discretization bias and variance , 2008, Machine Learning.

[33]  Michael Q. Zhang,et al.  DWE: Discriminating Word Enumerator , 2005, Bioinform..

[34]  Gustavo E. A. P. A. Batista,et al.  A Study of K-Nearest Neighbour as an Imputation Method , 2002, HIS.

[35]  H. Do,et al.  Data Cleaning: Problems and Current Approaches. , 2000 .

[36]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .