Data mining to estimate broiler mortality when exposed to heat wave

Heat waves usually result in losses of animal production since they are exposed to thermal stress inducing an increase in mortality and consequent economical losses. Animal science and meteorological databases from the last years contain enough data in the poultry production business to allow the modeling of mortality losses due to heat wave incidence. This research analyzes a database of broiler production associated to climatic data, using data mining techniques such as attribute selection and data classification (decision tree) to model the impact of heat wave incidence on broiler mortality. The temperature and humidity index (THI) was used for screening environmental data. The data mining techniques allowed the development of three comprehensible models for estimating specifically high mortality during broiler production. Two models yielded a classification accuracy of 89.3% by using Principal Component Analysis (PCA) and Wrapper feature selection approaches. Both models obtained a class precision of 0.83 for classifying high mortality. When the feature selection was made by the domain experts, the model accuracy reached 85.7%, while the class precision of high mortality was 0.76. Meteorological data and the calculated THI from meteorological stations were helpful to select the range of harmful environmental conditions for broilers 29 and 42 days old. The data mining techniques were useful for building animal production models.

[1]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[2]  F. Owens,et al.  Chronic heat stress and respiratory alkalosis: occurrence and treatment in broiler chicks. , 1985, Poultry science.

[3]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[4]  G T Tabler,et al.  Feed and water consumption, growth, and mortality of male broilers. , 1994, Poultry science.

[5]  S. Yahav,et al.  Physiological responses of chickens and turkeys to relative humidity during exposure to high ambient temperature , 1995 .

[6]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[7]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[8]  Usama M. Fayyad,et al.  Data mining and KDD: Promise and challenges , 1997, Future Gener. Comput. Syst..

[9]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[10]  Thomas Reinartz,et al.  CRISP-DM 1.0: Step-by-step data mining guide , 2000 .

[11]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[12]  Alan Keller Gomes Análise do conhecimento extraído de classificadores simbólicos utilizando medidas de avaliação e de interessabilidade. , 2002 .

[13]  Filippo Menczer,et al.  Feature selection in data mining , 2003 .

[14]  H. Xin,et al.  Temperature-Humidity-Velocity Index for Market-size Broilers , 2003 .

[15]  Gary D. Schnitkey,et al.  ECONOMIC LOSSES FROM HEAT STRESS BY US LIVESTOCK INDUSTRIES , 2003 .

[16]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[17]  L. Newberry,et al.  Mortality Patterns Associated with Commercial Broiler Production , 2004 .

[18]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[19]  Peter D. Kemp,et al.  Modelling the productivity of naturalised pasture in the North Island, New Zealand: a decision tree approach , 2005 .

[20]  Olac Fuentes,et al.  Using C4.5 as variable selection criterion in classification tasks , 2005 .

[21]  H. J. Chepete,et al.  PRODUCTION PERFORMANCE AND TEMPERATURE-HUMIDITY INDEX OF COBB 500 BROILERS REARED IN OPEN-SIDED NATURALLY VENTILATED HOUSES IN BOTSWANA , 2005 .