Map to humans and reduce error: crowdsourcing for deduplication applied to digital libraries
Wolfgang Nejdl | Mihai Georgescu | Claudiu S. Firan | Julien Gaugaz | Dang Duc Pham
[1] Luis von Ahn, et al. Human computation, 2009, 2009 46th ACM/IEEE Design Automation Conference.
[2] Gianluca Demartini, et al. ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking, 2012, WWW.
[3] Panagiotis G. Ipeirotis, et al. Quality management on Amazon Mechanical Turk, 2010, HCOMP '10.
[4] Salvatore J. Stolfo, et al. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem, 1998, Data Mining and Knowledge Discovery.
[5] Peter Fankhauser, et al. From Web Data to Entities and Back, 2010, CAiSE.
[6] Irwin King, et al. A Survey of Human Computation Systems, 2009, 2009 International Conference on Computational Science and Engineering.
[7] Anuradha Bhamidipaty, et al. Interactive deduplication using active learning, 2002, KDD.
[8] Jayant Madhavan, et al. Reference reconciliation in complex information spaces, 2005, SIGMOD '05.
[9] A. P. Dawid, et al. Maximum Likelihood Estimation of Observer Error-Rates Using the EM Algorithm, 1979.
[10] Foster J. Provost, et al. Why label when you can search?: alternatives to active learning for applying human resources to build classification models under extreme class imbalance, 2010, KDD.
[11] Jennifer Widom, et al. Swoosh: a generic approach to entity resolution, 2008, The VLDB Journal.
[12] Adam Tauman Kalai, et al. Adaptively Learning the Crowd Kernel, 2011, ICML.
[13] Gerardo Hermosillo, et al. Learning From Crowds, 2010, J. Mach. Learn. Res.
[14] David A. Cohn, et al. Improving generalization with active learning, 1994, Machine Learning.
[15] Martin Lukasiewycz, et al. Opt4J: a modular framework for meta-heuristic optimization, 2011, GECCO '11.
[16] Panagiotis G. Ipeirotis, et al. Get another label? improving data quality and data mining using multiple, noisy labelers, 2008, KDD.
[17] Claudia Niederée, et al. Probabilistic Entity Linkage for Heterogeneous Information Spaces, 2008, CAiSE.
[18] Ahmed K. Elmagarmid, et al. Duplicate Record Detection: A Survey, 2007, IEEE Transactions on Knowledge and Data Engineering.