Energy efficient cosine similarity measures according to a convex cost function

We propose a new family of vector similarity measures. Each measure is associated with a convex cost function. Given two vectors, we determine the surface normals of the convex function at the vectors. The angle between the two surface normals is the similarity measure. Convex cost function can be the negative entropy function, total variation (TV) function and filtered variation function constructed from wavelets. The convex cost functions need not to be differentiable everywhere. In general, we need to compute the gradient of the cost function to compute the surface normals. If the gradient does not exist at a given vector, it is possible to use the sub-gradients and the normal producing the smallest angle between the two vectors is used to compute the similarity measure. The proposed measures are compared experimentally to other nonlinear similarity measures and the ordinary cosine similarity measure. The TV-based vector product is more energy efficient than the ordinary inner product because it does not require any multiplications.

[1]  A. Enis Cetin,et al.  L1 norm based multiplication-free cosine similarity measures for big data analysis , 2014, 2014 International Workshop on Computational Intelligence for Multimedia Understanding (IWCIM).

[2]  A. Enis Çetin,et al.  Target detection in SAR images using codifference and directional filters , 2010, Defense + Commercial Sensing.

[3]  L. Bregman The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming , 1967 .

[4]  Moses Charikar,et al.  Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[5]  Éva Tardos,et al.  Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields , 1999, 40th Annual Symposium on Foundations of Computer Science (Cat. No.99CB37039).

[6]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[7]  Inderjit S. Dhillon,et al.  Clustering with Bregman Divergences , 2005, J. Mach. Learn. Res..

[8]  A. Enis Çetin,et al.  Covariance matrix-based fire and flame detection method in video , 2012, Machine Vision and Applications.

[9]  A. Enis Çetin,et al.  A multiplication-free framework for signal processing and applications in biomedical image analysis , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  A. Enis Çetin,et al.  Image Description Using a Multiplier-Less Operator , 2009, IEEE Signal Processing Letters.

[11]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[12]  Ashish Goel,et al.  Dimension independent similarity computation , 2012, J. Mach. Learn. Res..

[13]  Mario Fritz,et al.  On the Significance of Real-World Conditions for Material Classification , 2004, ECCV.

[14]  Juan Enrique Ramos,et al.  Using TF-IDF to Determine Word Relevance in Document Queries , 2003 .

[15]  Denis J. Dean,et al.  Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables , 1999 .

[16]  Richard Baraniuk,et al.  The Dual-tree Complex Wavelet Transform , 2007 .

[17]  Y. Censor,et al.  An iterative row-action method for interval convex programming , 1981 .

[18]  P. Baldi,et al.  Searching for exotic particles in high-energy physics with deep learning , 2014, Nature Communications.

[19]  A. Cetin Reconstruction of signals from Fourier transform samples , 1989 .

[20]  Onur Yorulmaz,et al.  Detection of fungal damaged popcorn using image property covariance features , 2012 .

[21]  A. Enis Çetin,et al.  Compressive sensing using the modified entropy functional , 2014, Digit. Signal Process..