Probabilistic correlation-based similarity measure of unstructured records
暂无分享,去创建一个
[1] Karen Spärck Jones. Index term weighting , 1973, Inf. Storage Retr..
[2] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.
[3] William W. Cohen. Integration of heterogeneous databases without common domains using queries based on textual similarity , 1998, SIGMOD '98.
[4] Gonzalo Navarro,et al. A guided tour to approximate string matching , 2001, CSUR.
[5] Esko Ukkonen,et al. Approximate String Matching with q-grams and Maximal Matches , 1992, Theor. Comput. Sci..
[6] Gerard Salton,et al. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .
[7] Stephen E. Robertson,et al. Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.
[8] C. J. van Rijsbergen,et al. Information Retrieval , 1979, Encyclopedia of GIS.
[9] Luis Gravano,et al. Text joins in an RDBMS for web data integration , 2003, WWW '03.