Rigorous benchmarking in reasonable time
[1] Jacob Cohen. The earth is round (p < .05) , 1994 .
[2] David J. Lilja,et al. Measuring computer performance : A practitioner's guide , 2000 .
[3] Lizy Kurian John,et al. Efficiently Evaluating Speedup Using Sampled Processor Simulation , 2004, IEEE Computer Architecture Letters.
[4] E. C. Fieller. SOME PROBLEMS IN INTERVAL ESTIMATION , 1954 .
[5] R. Coe,et al. It's the Effect Size, Stupid: What effect size is and why it is important , 2012 .
[6] Lieven Eeckhout,et al. Java performance evaluation through rigorous replay compilation , 2008, OOPSLA.
[7] Matthew Arnold,et al. Online feedback-directed optimization of Java , 2002, OOPSLA '02.
[8] V. Guiard,et al. The robustness of parametric statistical methods , 2004 .
[9] Karl J. Friston,et al. Variance Components , 2003 .
[10] Bruce Thompson,et al. Computing and Interpreting Effect Sizes , 2004 .
[11] Petr Tuma,et al. Precise Regression Benchmarking with Random Effects: Improving Mono Benchmark Results , 2006, EPEW.
[12] Toshio Nakatani,et al. Replay compilation: improving debuggability of a just-in-time compiler , 2006, OOPSLA '06.
[13] S. R. Searle,et al. Generalized, Linear, and Mixed Models , 2005 .
[14] Anirban DasGupta,et al. Robustness of Standard Confidence Intervals for Location Parameters Under Departure from Normality , 1995 .
[15] Dayong Gu,et al. Code Layout as a Source of Noise in JVM Performance , 2005, Stud. Inform. Univ..
[16] Amer Diwan,et al. The DaCapo benchmarks: Java benchmarking development and analysis , 2006, OOPSLA '06.
[17] Raj Jain,et al. The art of computer systems performance analysis - techniques for experimental design, measurement, simulation, and modeling , 1991, Wiley professional computing.
[18] Emery D. Berger,et al. STABILIZER: statistically sound performance evaluation , 2013, ASPLOS '13.
[19] Matthias Hauswirth,et al. Producing wrong data without doing anything obviously wrong! , 2009, ASPLOS.
[20] I. Cuthill,et al. Effect size, confidence interval and statistical significance: a practical guide for biologists , 2007, Biological reviews of the Cambridge Philosophical Society.
[21] Scott E. Maxwell,et al. Designing Experiments and Analyzing Data: A Model Comparison Perspective , 1990 .
[22] N. Schenker,et al. Overlapping confidence intervals or standard error intervals: What do they mean in terms of statistical significance? , 2003, Journal of insect science.
[23] R. Royall. The Effect of Sample Size on the Meaning of Significance Tests , 1986 .
[24] Lieven Eeckhout,et al. Statistically rigorous Java performance evaluation , 2007, OOPSLA.