Probability on Graphical Structure: A Knowledge-Based Agricultural Case

This paper provides a rich framework to estimate the causal relationship among eighteen features (related to the product type and classification) on an agronomy study by using Bayesian Networks, which are a type of probabilistic graphical model. Thereby, with this class of models, we aimed to classify and identify the complaints based on corn seed commercialization. Simulation studies were used to compare both adopted algorithms, K2 and PC, and their hybrid version. These studies indicate excellent classification performance, given the knowledge of the network structure. After the estimated Directed Acyclic Graph, three features (Brand, Germination percentage, and Amount of commercialized bags) were evidenced as Impacting factors in the complaints based on corn seed commercialization.

[1]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[2]  Diego Colombo,et al.  Order-independent constraint-based causal structure learning , 2012, J. Mach. Learn. Res..

[3]  Marek J. Druzdzel,et al.  A Hybrid Anytime Algorithm for the Construction of Causal Models From Sparse Data , 1999, UAI.

[4]  Carter T. Butts,et al.  network: A Package for Managing Relational Data in R , 2008 .

[5]  Marco Scutari,et al.  Learning Bayesian Networks with the bnlearn R Package , 2009, 0908.3817.

[6]  Peter J. F. Lucas,et al.  Markov Equivalence in Bayesian Networks , 2004 .

[7]  Jose Miguel Puerta,et al.  Learning Bayesian networks by hill climbing: efficient methods based on progressive restriction of the neighborhood , 2010, Data Mining and Knowledge Discovery.

[8]  Richard E. Neapolitan,et al.  Learning Bayesian networks , 2007, KDD '07.

[9]  Bor-Wen Cheng,et al.  A case study in solving customer complaints based on the 8Ds method and Kano model , 2010 .

[10]  Yong Shi,et al.  An alternative approach for the classification of orange varieties based on near infrared spectroscopy , 2013 .

[11]  Francisco Louzada,et al.  Reliability-centered maintenance: analyzing failure in harvest sugarcane machine using some generalizations of the Weibull distribution , 2017, 1712.03304.

[12]  Martina Morris,et al.  A statnet Tutorial. , 2008, Journal of statistical software.

[13]  Mohiuddin Ahmed,et al.  Deep Learning: Hope or Hype , 2020 .

[14]  W. Deming Quality, productivity, and competitive position , 1982 .

[15]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[16]  Joaquín Abellán,et al.  Some Variations on the PC Algorithm , 2006, Probabilistic Graphical Models.

[17]  C. Granger Investigating Causal Relations by Econometric Models and Cross-Spectral Methods , 1969 .

[18]  Ernesto Estrada,et al.  The Structure of Complex Networks: Theory and Applications , 2011 .

[19]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[20]  Judea Pearl,et al.  Convince: A Conversational Inference Consolidation Engine , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[22]  A. Paterson,et al.  Global agricultural intensification during climate change: a role for genomics , 2015, Plant biotechnology journal.

[23]  C. Sims Money, Income, and Causality , 1972 .

[24]  Steffen L. Lauritzen,et al.  Graphical models in R , 1996 .

[25]  Kevin B. Korb,et al.  Bayesian Artificial Intelligence , 2004, Computer science and data analysis series.

[26]  A. Raftery,et al.  World population stabilization unlikely this century , 2014, Science.

[27]  L. Breiman Arcing classifier (with discussion and a rejoinder by the author) , 1998 .

[28]  James M. Tien,et al.  Internet of Things, Real-Time Decision Making, and Artificial Intelligence , 2017, Annals of Data Science.

[29]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[30]  Dimitris Kugiumtzis,et al.  Evaluation of Granger Causality Measures for Constructing Networks from Multivariate Time Series , 2019, Entropy.

[31]  Allan Tucker,et al.  Learning Bayesian networks from big data with greedy search: computational complexity and efficient implementation , 2018, Statistics and Computing.

[32]  H. Akaike A new look at the statistical model identification , 1974 .

[33]  Manoj Kumar,et al.  Bayesian Inference for Rayleigh Distribution Under Step-Stress Partially Accelerated Test with Progressive Type-II Censoring with Binomial Removal , 2019, Annals of Data Science.

[34]  Shyam Visweswaran,et al.  Learning genetic epistasis using Bayesian network scoring criteria , 2011, BMC Bioinformatics.

[35]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[36]  G. Schwarz Estimating the Dimension of a Model , 1978 .