Feature selection for aiding glass forensic evidence analysis

The evaluation of glass evidence in forensic science is an important issue. Traditionally, this has depended on the comparison of the physical and chemical attributes of an unknown fragment with a control fragment. A high degree of discrimination between glass fragments is now achievable due to advances in analytical capabilities. A random effects model using two levels of hierarchical nesting is applied to the calculation of a likelihood ratio (LR) as a solution to the problem of comparison between two sets of replicated continuous observations where it is unknown whether the sets of measurements shared a common origin. Replicate measurements from a population of such measurements allow the calculation of both within-group and between-group variances. Univariate normal kernel estimation procedures have been used for this, where the between-group distribution is considered to be non-normal. However, the choice of variable for use in LR estimation is critical to the quality of LR produced. This paper investigates the use of feature selection for the purpose of selecting the variable for estimation without the need for expert knowledge. Results are recorded for several selectors using normal, exponential, adaptive and biweight kernel estimation techniques. Misclassification rates for the LR estimators are used to measure performance. The experiments performed reveal the capability of the proposed approach for this task.

[1]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[2]  Qiang Shen,et al.  Rough set-aided keyword reduction for text categorization , 2001, Appl. Artif. Intell..

[3]  Huan Liu,et al.  Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[4]  Cezary Z. Janikow,et al.  Fuzzy decision trees: issues and methods , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[5]  U. Höhle Quotients with respect to similarity relations , 1988 .

[6]  Hiroshi Motoda,et al.  Book Review: Computational Methods of Feature Selection , 2007, The IEEE intelligent informatics bulletin.

[7]  Masoud Nikravesh,et al.  Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing) , 2006 .

[8]  R. Koons,et al.  The Forensic Significance of Glass Composition and Refractive Index Measurements , 1999 .

[9]  Allan H. Seheult,et al.  On a problem in forensic science , 1978 .

[10]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[11]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[12]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[13]  William H. Press,et al.  Numerical recipes in C , 2002 .

[14]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[15]  Roman Słowiński,et al.  Intelligent Decision Support , 1992, Theory and Decision Library.

[16]  Bart Kosko,et al.  Fuzzy entropy and conditioning , 1986, Inf. Sci..

[17]  Qiang Shen,et al.  Computational Intelligence and Feature Selection - Rough and Fuzzy Approaches , 2008, IEEE Press series on computational intelligence.

[18]  D. Hawkins Multivariate Statistics: A Practical Approach , 1990 .

[19]  Claude Roux,et al.  Statistics and the Evaluation of Evidence for Forensic Scientists, by Colin G. G. Aitken and Franco Taroni 2nd edition. John Wiley and Sons, 2004. , 2006 .

[20]  Qiang Shen,et al.  Fuzzy-Rough Sets Assisted Attribute Selection , 2007, IEEE Transactions on Fuzzy Systems.

[21]  Didier Dubois,et al.  Putting Rough Sets and Fuzzy Sets Together , 1992, Intelligent Decision Support.

[22]  Qiang Shen,et al.  A rough-fuzzy approach for generating classification rules , 2002, Pattern Recognit..

[23]  Franco Taroni,et al.  Statistics and the Evaluation of Evidence for Forensic Scientists , 2004 .

[24]  Matthew P. Wand,et al.  Kernel Smoothing , 1995 .

[25]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[26]  Hung Son Nguyen,et al.  A View on Rough Set Concept Approximations , 2003, Fundam. Informaticae.

[27]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[28]  Philip J. Stone,et al.  Experiments in induction , 1966 .

[29]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[30]  R. Koons,et al.  Interpretation of glass composition measurements: the effects of match criteria on discrimination capability. , 2002, Journal of forensic sciences.

[31]  C. Aitken,et al.  A Two‐Level Model for Evidence Evaluation , 2007, Journal of forensic sciences.

[32]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2022 .

[33]  Qiang Shen,et al.  The evaluation of evidence for exponentially distributed data , 2007, Comput. Stat. Data Anal..

[34]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[35]  Qiang Shen,et al.  Semantics-preserving dimensionality reduction: rough and fuzzy-rough-based approaches , 2004, IEEE Transactions on Knowledge and Data Engineering.

[36]  H. Riedwyl,et al.  Multivariate Statistics: A Practical Approach , 1988 .