The Application of Information Fusion and Extraction in Maize Seed Breeding

It has been the irresistible trend of agriculture informationization which uses information technologies to treat enormous data and finds out potential useful rules to direct the development and reformation of agriculture. Aiming at the specific application of maize seed breeding, this paper effectively integrates several data mining technologies and presents a new method called CA to analyze the whole maize information. The algorithm achieves transverse dimension reduction by combining PCA and other methods, and also achieves longitudinal dimension reduction by improving CURE and k-means. The decision-tree method of CA algorithm introduces three different classifiers in order to enhance the accuracy of trees. By comparing the results of improved algorithm with traditional methods, we can find that the new algorithm is better in performance and degree of parallelism.

[1]  Qing Chen,et al.  The Application of Improved Decision Tree Algorithm in Data Mining of Employment Rate: Evidence from China , 2009, 2009 First International Workshop on Database Technology and Applications.

[2]  Guan Zhong-ren Research of clustering algorithm based on K-means , 2009 .

[3]  DU Zi-ping Combined optimization decision tree algorithm suitable for large scale data-base , 2009 .

[4]  Yongqing Zheng,et al.  A New Decision Tree Algorithm Based on Rough Set Theory , 2009, 2009 Asia-Pacific Conference on Information Processing.

[5]  Yan Jia,et al.  Stream Event Detection: A Unified Framework for Mining Outlier, Change and Burst Simultaneously over Data Stream , 2007 .

[6]  Dezhao Chen,et al.  An Enhanced ART2 Neural Network for Clustering Analysis , 2008, First International Workshop on Knowledge Discovery and Data Mining (WKDD 2008).

[7]  Gang Yu,et al.  A novel method for pruning decision trees , 2009, 2009 International Conference on Machine Learning and Cybernetics.

[8]  Hui Shao,et al.  Privacy Preserving C4.5 Algorithm over Vertically Distributed Datasets , 2009, 2009 International Conference on Networks Security, Wireless Communications and Trusted Computing.

[9]  Christophe Marsala A fuzzy decision tree based approach to characterize medical data , 2009, 2009 IEEE International Conference on Fuzzy Systems.

[10]  T. Aslanidis,et al.  CUZ: An Improved Clustering Algorithm , 2008, 2008 IEEE 8th International Conference on Computer and Information Technology Workshops.

[11]  Rafal A. Angryk,et al.  GDClust: A Graph-Based Document Clustering Technique , 2007 .

[12]  Chen Xiang-tao An Improved ID3 Algorithm of Decision Trees , 2009 .