Discrete Graph Hashing

Hashing has emerged as a popular technique for fast nearest neighbor search in gigantic databases. In particular, learning-based hashing has received considerable attention due to its appealing storage and search efficiency. However, the performance of most unsupervised learning-based hashing methods deteriorates rapidly as the hash code length increases. We argue that the degraded performance is due to the inferior optimization procedures used to obtain discrete binary codes. This paper presents a graph-based unsupervised hashing model that preserves the neighborhood structure of massive data in a discrete code space. We cast the graph hashing problem into a discrete optimization framework that directly learns the binary codes. A tractable alternating maximization algorithm is then proposed to deal explicitly with the discrete constraints, yielding high-quality codes that capture the local neighborhoods well. Extensive experiments performed on four large datasets with up to one million samples show that our discrete-optimization-based graph hashing method obtains superior search accuracy over state-of-the-art unsupervised hashing methods, especially for longer codes.
