Preprocessing implementation for microarray (PRIM): an efficient method for processing cDNA microarray data.

cDNA microarray technology is useful for systematically analyzing the expression profiles of thousands of genes at once. Although many useful results inferred by using this technology and a hierarchical clustering method for statistical analysis have been confirmed using other methods, there are still questions about the reproducibility of the data. We have therefore developed a data processing method that very efficiently extracts reproducible data from the result of duplicate experiments. It is designed to automatically filter the raw results obtained from cDNA microarray image-analysis software. We optimize the threshold value for filtering the data by using the product of N and R, where N is the ratio of the number of spots that passed the filtering vs. the total number of spots, and R is the correlation coefficient for results obtained in the duplicate experiments. Using this method to process mouse tissue expression profile data that contain 1,881,600 points of analysis, we obtained clustered results more reasonable than those obtained using previously reported filtering methods.