Learning Similarity with Fuzzy Functions of Adaptable Complexity

A common approach in database queries involves the multi-dimensional representation of objects by a set of features. These features are compared to the query representation and then combined together to produce a total similarity metric. In this paper we introduce a novel technique for similarity learning within features (attributes) by manipulating fuzzy membership functions (FMFs) of different complexity. Our approach is based on a gradual complexity increase adaptable to problem requirements. The underlying idea is that less adaptable functions will act as approximations for more complex ones. We begin by interpolating a set of planes in the training dataset and due to linearity we get a fast first impression of the underlying complexity. We proceed to interpolate two asymmetrical sigmoidal functions whose initial approximations are calculated from the plane properties. If satisfactory accuracy is not achieved we provide advanced modeling capabilities by investigating FMFs parameters and convolving their output with additional functions.

[1]  Robert B. Ash,et al.  Information Theory , 2020, The SAGE International Encyclopedia of Mass Media and Society.

[2]  Dimitris Papadias,et al.  Processing fuzzy spatial queries: a configuration similarity approach , 1999, Int. J. Geogr. Inf. Sci..

[3]  Bart Kosko,et al.  Neural fuzzy agents that learn a user's preference map , 1997, Proceedings of ADL '97 Forum on Research and Technology. Advances in Digital Libraries.

[4]  E. L. Harder,et al.  The Institute of Electrical and Electronics Engineers, Inc. , 2019, 2019 IEEE International Conference on Software Architecture Companion (ICSA-C).

[5]  Dimitrios Gunopulos,et al.  Robust similarity measures for mobile object trajectories , 2002, Proceedings. 13th International Workshop on Database and Expert Systems Applications.

[6]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[7]  C. Pappis,et al.  A comparative assessment of measures of similarity of fuzzy values , 1993 .

[8]  Simone Santini,et al.  Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[10]  Morton Nadler,et al.  Pattern recognition engineering , 1993 .

[11]  Y. Wang,et al.  Learning Feature Weights for Similarity Measures using Genetic Algorithms , 1998 .

[12]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[13]  Bruce G. Batchelor,et al.  Pattern Recognition: Ideas in Practice , 1978 .

[14]  Alberto O. Mendelzon,et al.  Similarity-based queries for time series data , 1997, SIGMOD '97.

[15]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[16]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[17]  Joo-Hwee Lim,et al.  Learning Similarity Matching in Multimedia Content-Based Retrieval , 2001, IEEE Trans. Knowl. Data Eng..

[18]  Thomas Mandl Tolerant Information Retrieval with Backpropagation Networks , 2000, Neural Computing & Applications.

[19]  Fatos T. Yarman-Vural,et al.  Learning similarity space , 2002, Proceedings. International Conference on Image Processing.

[20]  Tony R. Martinez,et al.  An Integrated Instance‐Based Learning Algorithm , 2000, Comput. Intell..

[21]  Hans-Peter Kriegel,et al.  S3: similarity search in CAD database systems , 1997, SIGMOD '97.