Using a semisupervised fuzzy clustering process for identity identification in digital libraries

This paper introduces a new semisupervised fuzzy algorithm that makes use of must-link and cannot-link constraints. These constraints are applied to the process of finding the optimum α-cut of a dendrogram. We have applied this method to identity identification in digital libraries.

[1]  Berthier A. Ribeiro-Neto,et al.  Using web information for author name disambiguation , 2009, JCDL '09.

[2]  Claire Cardie,et al.  Clustering with Instance-Level Constraints , 2000, AAAI/IAAI.

[3]  A. Bonaert Introduction to the theory of Fuzzy subsets , 1977, Proceedings of the IEEE.

[4]  Jussara M. Almeida,et al.  A tool for generating synthetic authorship records for evaluating author name disambiguation methods , 2012, Inf. Sci..

[5]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[6]  Cheng Li,et al.  Two supervised learning approaches for name disambiguation in author citations , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[7]  C. Lee Giles,et al.  Efficient Name Disambiguation for Large-Scale Databases , 2006, PKDD.

[8]  Antonio F. Gómez-Skarmeta,et al.  On the use of hierarchical clustering in fuzzy modeling , 1996, Int. J. Approx. Reason..

[9]  Jan-Ming Ho,et al.  Disambiguating authors in citations on the web and authorship correlations , 2012, Expert Syst. Appl..

[10]  C. Lee Giles,et al.  Disambiguating authors in academic publications using random forests , 2009, JCDL '09.

[11]  Ian Davidson,et al.  Reveling in Constraints , 2009, ACM Queue.

[12]  M. Amparo Vila,et al.  An automatic data mining authority control system: A first approach , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[13]  Witold Pedrycz,et al.  Fuzzy Clustering With Viewpoints , 2010, IEEE Transactions on Fuzzy Systems.

[14]  Marcos André Gonçalves,et al.  An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations , 2010 .

[15]  M. Amparo Vila,et al.  An automatic system for identifying authorities in digital libraries , 2013, Expert Syst. Appl..