An experimental study of employing visual appearance as a phenotype

Visual and non-visual data are often related through complex, indirect links, thus making the prediction of one from the other difficult. Examples include the partially- understood connections between firing of VI neurons and visual stimuli, the coupling between recorded speech and video of the corresponding lip movements, and the attempts to infer criminal intentions from surveillance videos. In this study, we explore the exploitation of the visual/non-visual relation between genetic sequences and visual appearance. This exploitation is currently considered infeasible due to the many hidden variables and unknown factors involved, the considerable variability and noise that exist in images and the high-dimensionality of the data. Despite the difficulties, we show convincing evidence that the application of correlations between genotype and visual phenotype for identification is feasible with current technologies. To this end, we employ sensitive forced- matching tests, that can accurately detect correlations between data sets. These tests are used to compare the performance of several existing algorithms, as well as novel ones that we have designed for the task.

[1]  R. Ward,et al.  DNA barcoding Australia's fish species , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[2]  P. Hebert,et al.  DNA barcoding for effective biodiversity assessment of a hyperdiverse arthropod group: the ants of Madagascar , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[3]  R. Sturm,et al.  Eye colour: portals into pigmentation genes and ancestry. , 2004, Trends in genetics : TIG.

[4]  P. Hebert,et al.  Comprehensive DNA barcode coverage of North American birds , 2007, Molecular ecology notes.

[5]  F. Cohen,et al.  Co-evolution of proteins with their interaction partners. , 2000, Journal of molecular biology.

[6]  Robert R. Sokal,et al.  A statistical method for evaluating systematic relationships , 1958 .

[7]  Frederick R Adler,et al.  Genetic basis for systems of skeletal quantitative traits: Principal component analysis of the canid skeleton , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Michael J. Lyons,et al.  Perception psychology: Visual structure of a Japanese Zen garden , 2002, Nature.

[9]  C. Bustamante,et al.  A Single IGF1 Allele Is a Major Determinant of Small Size in Dogs , 2007, Science.

[10]  Tijl De Bie,et al.  A metamorphosis of Canonical Correlation Analysis into multivariate maximum margin learning , 2007, ESANN.

[11]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[13]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[14]  Dahua Lin,et al.  Inter-modality Face Recognition , 2006, ECCV.

[15]  David R. Hardoon,et al.  KCCA for different level precision in content-based image retrieval , 2003 .

[16]  W. Brown,et al.  Rapid evolution of animal mitochondrial DNA. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Bruce A. Draper,et al.  The CSU Face Identification Evaluation System , 2005, Machine Vision and Applications.

[18]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[19]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[20]  H. Vinod Canonical ridge and econometrics of joint production , 1976 .

[21]  Lior Wolf,et al.  Learning over Sets using Kernel Principal Angles , 2003, J. Mach. Learn. Res..

[22]  P. Hebert,et al.  bold: The Barcode of Life Data System (http://www.barcodinglife.org) , 2007, Molecular ecology notes.

[23]  Frédéric Jurie,et al.  Sampling Strategies for Bag-of-Features Image Classification , 2006, ECCV.