论文信息 - Data Mining: The Next Generation (Data Mining: Die nächste Generation)

Data Mining: The Next Generation (Data Mining: Die nächste Generation)

Summary Data Mining has enjoyed great popularity in recent years, with advances in both research and commercialization. The first generation of data mining research and development has yielded several commercially available systems, both stand-alone and integrated with database systems, produced scalable versions of algorithms for many classical data mining problems and introduced novel pattern discovery problems. In July 2004 researchers from a variety of backgrounds assembled at the Dagstuhl Conference Center in Germany for a workshop to re-assess the current directions of the field, to identify critical problems that require attention, and to discuss ways to increase the flow of ideas across the different disciplines that Data Mining has brought together. The workshop did not seek to draw up an agenda for the field of data mining. Rather, it offers the participants' perspective on two technical directions – compositionality and privacy – and describes some important application challenges which drove the discussion.

Rakesh Agrawal | Johann-Christoph Freytag | Raghu Ramakrishnan

[1] Yehuda Lindell,et al. Privacy Preserving Data Mining , 2002, Journal of Cryptology.

[2] J. Blake,et al. Creating the Gene Ontology Resource : Design and Implementation The Gene Ontology Consortium 2 , 2001 .

[3] George T. Duncan,et al. Enhancing Access to Microdata while Protecting Confidentiality: Prospects for the Future , 1991 .

[4] Ivan P. Fellegi,et al. On the Question of Statistical Confidentiality , 1972 .

[5] Sang Joon Kim,et al. A Mathematical Theory of Communication , 2006 .

[6] I. Jonassen,et al. Predicting gene regulatory elements in silico on a genomic scale. , 1998, Genome research.

[7] Ulf Leser,et al. Systematic feature evaluation for gene name recognition , 2005, BMC Bioinformatics.

[8] S. Fienberg,et al. Bounding Entries in Multi-way Contingency Tables Given a Set of Marginal Totals , 2003 .

[9] Jeremy Buhler,et al. Designing seeds for similarity search in genomic DNA , 2003, RECOMB '03.

[10] T. Jenssen,et al. A literature network of human genes for high-throughput analysis of gene expression , 2001, Nature Genetics.

[11] Joel D. Martin,et al. PreBIND and Textomy – mining the biomedical literature for protein-protein interactions using a support vector machine , 2003, BMC Bioinformatics.