Incremental Rapidly Grouping Aggregation Method for Similar Web News Headline

[1]  Richard Millham,et al.  The Comparative Analysis of Smith-Waterman Algorithm with Jaro-Winkler Algorithm for the Detection of Duplicate Health Related Records , 2018, 2018 International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD).

[2]  Hau-San Wong,et al.  Locality-Sensitive Term Weighting for Short Text Clustering , 2017, ICONIP.

[3]  Guoliang Li,et al.  An Efficient Partition Based Method for Exact Set Similarity Joins , 2015, Proc. VLDB Endow..

[4]  Xuan Zhou,et al.  Architecting Big Data: Challenges, Studies and Forecasts: Architecting Big Data: Challenges, Studies and Forecasts , 2011 .

[5]  Jun Yan,et al.  Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding , 2019, Data Intelligence.

[6]  Hui Zhang,et al.  Experimental explorations on short text topic mining between LDA and NMF based Schemes , 2019, Knowl. Based Syst..

[7]  Enrico Motta,et al.  Integration of Semantically Annotated Data by the KnoFuss Architecture , 2008, EKAW.

[8]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[9]  Thomas G. Szymanski,et al.  A fast algorithm for computing longest common subsequences , 1977, CACM.

[10]  Jun Ye,et al.  Cosine similarity measures for intuitionistic fuzzy sets and their applications , 2011, Math. Comput. Model..

[11]  Patrick A. V. Hall,et al.  Approximate String Matching , 1994, Encyclopedia of Algorithms.

[12]  Wael Hassan Gomaa,et al.  A Survey of Text Similarity Approaches , 2013 .

[13]  Peng Wang,et al.  Self-Taught Convolutional Neural Networks for Short Text Clustering , 2017, Neural Networks.

[14]  Diana Inkpen,et al.  Semantic text similarity using corpus-based word similarity and string similarity , 2008, ACM Trans. Knowl. Discov. Data.

[15]  Hau-San Wong,et al.  Corpus-based topic diffusion for short text clustering , 2018, Neurocomputing.

[16]  Matthew A. Jaro,et al.  Probabilistic linkage of large public health data files. , 1995, Statistics in medicine.

[17]  Chien-Hung Liu,et al.  Applying VSM and LCS to develop an integrated text retrieval mechanism , 2012, Expert Syst. Appl..

[18]  J. L. Rana,et al.  Text Document Clustering based on Phrase Similarity using Affinity Propagation , 2013 .

[19]  Matthew A. Jaro,et al.  Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, Florida , 1989 .