OVERVIEW This document is a detailed reference guide for the Statistical Algorithms used in the analysis of GeneChip ® expression data. The guide focuses on how they work, what calculations and approaches they comprise, and how the tunable parameters are designed. Additional references are provided for additional information. It is important to understand how a GeneChip ® array is designed when considering the most appropriate approaches for its analysis. A GeneChip ® probe array cons ists of a number of probe cells where each probe cell contains a unique probe. Probes are tiled in probe pairs as a Perfect Match (PM) and a Mismatch (MM). The sequence for PM and MM are the same, except for a change to the Watson-Crick complement in the middle of the MM probe sequence. A probe set consists of a series of probe pairs and represents an expressed transcript. The statistical algorithms provide the following data outputs: Output Descriptions Signal A measure of the abundance of a transcript. The number of probe pairs in the probe set. The number of probe pairs in the probe set used in the Detection call.
[1]
D. Wolfe,et al.
Nonparametric Statistical Methods.
,
1974
.
[2]
D. Rubin,et al.
Statistical Analysis with Missing Data
,
1988
.
[3]
F. Wilcoxon.
Individual Comparisons by Ranking Methods
,
1945
.
[4]
Wei-Min Liu,et al.
Robust estimators for expression analysis
,
2002,
Bioinform..
[5]
Wei-Min Liu,et al.
Analysis of high density expression microarrays with signed-rank call algorithms
,
2002,
Bioinform..
[6]
Wei-Min Liu,et al.
Rank-based algorithms for anlaysis of microarrays
,
2001,
SPIE BiOS.
[7]
F. Mosteller,et al.
Understanding robust and exploratory data analysis
,
1985
.