An Approach to Math-Similarity Search

The distinctive structural syntax of mathematical expressions, and the many ways in which expressions can be semantically equivalent, make it difficult for a keyword-based text search engine to meet users’ search needs. Many existing math search solutions focus on exact search, in which notational matching determines the relevance rank, while structural similarity and mathematical semantics are often overlooked or inadequately addressed. An important research question is how to find, effectively and efficiently, math expressions that are similar to a user’s query, and how to rank the hits by similarity. This paper focuses on (1) conceptualizing similarity between mathematical expressions, (2) defining metrics to measure math similarity, (3) using those metrics for math-similarity search, and (4) evaluating performance to validate the advantage of the proposed math-similarity search. Our results show that math-similarity search outperforms keyword-based math search.
