Statistical supervised meta-ensemble algorithm for medical record linkage
