A Comparison of Simultaneous Confidence Intervals to Identify Handwritten Digits

This paper evaluates the use of several known simultaneous confidence interval methods for the automated recognition of handwritten digits from data in a well-known handwriting database. Contained in this database are handwritten digits, 0 through 9, that were obtained from 42,000 participants' writing samples. The objective of the analyses is to utilize statistical testing procedures that can be easily automated by a computer to recognize which digit was written by a subject. The methodologies discussed in this paper are designed to be sensitive to Type I errors and will control an overall measure of these errors, called the Familywise Error Rate. The procedures were constructed based off of a training portion of the data set, then applied and validated on the remaining testing portion of the data.

[1]  Iljoo Kim,et al.  Personal Financial Aggregation and Social Media Mining: A New Framework for Actionable Financial Business Intelligence (AFBI) , 2014, Int. J. Bus. Intell. Res..

[2]  Traci J. Hess,et al.  Understanding Decision-Making in Data Warehousing and Related Decision Support Systems: An Explanatory Study of a Customer Relationship Management Application , 2002, Inf. Resour. Manag. J..

[3]  Z. Šidák Rectangular Confidence Regions for the Means of Multivariate Normal Distributions , 1967 .

[4]  B. Holland,et al.  An Improved Sequentially Rejective Bonferroni Test Procedure , 1987 .

[5]  John Wang,et al.  Encyclopedia of Business Analytics and Optimization , 2018 .

[6]  Roy Huber,et al.  Handwriting Identification: Facts and Fundamentals , 1999 .

[7]  Padhraic Smyth,et al.  Clustering Sequences with Hidden Markov Models , 1996, NIPS.

[8]  Jerzy Sas,et al.  Semi-automatic Handwritten Word Segmentation Based on Character Width Approximation Via Maximum Likelihood Method and Regression Model , 2008, Computer Recognition Systems 2.

[9]  GuptaVipul,et al.  Personal Financial Aggregation and Social Media Mining , 2014 .

[10]  A. Tamhane,et al.  Multiple Comparison Procedures , 1989 .

[11]  M. Raisinghani Business Intelligence in the Digital Economy: Opportunities, Limitations and Risks , 2004 .

[12]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[13]  Y. Hochberg A sharper Bonferroni procedure for multiple tests of significance , 1988 .

[14]  Moshe Kam,et al.  Writer identification using hand-printed and non-hand-printed questioned documents. , 2003, Journal of forensic sciences.

[15]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[16]  Pietro G. Morasso,et al.  Recognition experiments of cursive dynamic handwriting with self-organizing networks , 1993, Pattern Recognit..

[17]  Robert G. Isaac,et al.  National Intellectual Capital Stocks and Organizational Cultures: A Comparison of Lebanon and Iran , 2010 .

[18]  S. Sarkar,et al.  The Simes Method for Multiple Hypothesis Testing with Positively Dependent Test Statistics , 1997 .

[19]  Donald T. Gantz,et al.  Construction and evaluation of classifiers for forensic document analysis , 2010, 1004.0678.

[20]  S. Sarkar Some probability inequalities for ordered $\rm MTP\sb 2$ random variables: a proof of the Simes conjecture , 1998 .

[21]  Thierry Paquet,et al.  Unsupervised writer adaptation applied to handwritten text recognition , 2004, Pattern Recognit..

[22]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[23]  Dorothy Miller Improving Business Intelligence: The Six Sigma Way , 2010, Int. J. Bus. Intell. Res..