Computer Information Systems and Industrial Management

Statistically, since Pearson, data are recorded as matrices of size n p, where rows contain n subjects (individuals, cases), and columns are values of the p variables (attributes) characterizing the subjects. When performing traditional multivariate analysis of the recorded data, the crucial problem is: should all the p recorded variables be taken for the analysis; may be less of them will be sufficient and some of them are not relevant, or even an impediment. The old saying: “the more the better” has become questionable nowadays: too many non-relevant variables may be disturbing by introducing some random effects into the data. The problem to solve is composite. I will consider it in the context of regression or classification analysis, when dealing with directly recorded ‘variables’ (no ‘features’ derived from them). I will concentrate on group of methods referred to as Collective Intelligence (contains, among others, Ensemble Learning, Decision trees and Random Forests). Specifically, I will concentrate on the Random Forests (RFs) methodology. RFs offer some non-conventional indices of importance of variables in the context of regression and clustering. They work directly on original variables (not on new features derived from them). They can work on mixed type variables, that is quantitative (numeric) or qualitative (categorial). They work without assumption on the probability distribution of the variables. They yield an internal unbiased estimate of the generalization error. It has been shown that RFs are resistant to outliers, however not all of them are universally consistent. I intend to show – on real data examples – how all this works in practice. A Data-Driven Approach Towards Forecasting Generalized Mid-Term Energy Requirement for Industrial Sector Users of Smart Grid

[1]  Michel Diaz,et al.  Petri Nets: Fundamental Models, Verification and Applications , 2009 .

[2]  Wil vanderAalst,et al.  Workflow Management: Models, Methods, and Systems , 2004 .

[3]  Praveen Raghavan,et al.  ASIP (Application Specific Instruction-domain-set Processor) microcomputer with extremely low power consumption , 2006 .

[4]  Henri E. Bal,et al.  Distributed Shared Memory Management for Java , 1999 .

[5]  Marco Dorigo,et al.  Distributed Optimization by Ant Colonies , 1992 .

[6]  Walid Mohamed Aly,et al.  Performance Analysis and Tuning for Parallelization of Ant Colony Optimization by Using OpenMP , 2015, CISIM.

[7]  Jan Martinovic,et al.  HyperLoom: A Platform for Defining and Executing Scientific Pipelines in Distributed Environments , 2018, PARMA-DITAM '18.

[8]  James Jerome O'Brien CPM In Construction Management , 1984 .

[9]  Douglas C. Schmidt,et al.  WreckWatch: Automatic Traffic Accident Detection and Notification with Smartphones , 2011, Mob. Networks Appl..

[10]  Joseph Pasquale,et al.  An analysis and improvement of congestion control in the CoAP Internet-of-Things protocol , 2016, 2016 13th IEEE Annual Consumer Communications & Networking Conference (CCNC).

[11]  M. Bister,et al.  Automated segmentation of cardiac MR images , 1989, [1989] Proceedings. Computers in Cardiology.

[12]  Kees M. van Hee,et al.  The Right Timing: Reflections on the Modeling and Analysis of Time , 2013, Petri Nets.

[13]  C. Vidya Lakshmi,et al.  Automatic Accident Detection via Embedded GSM message interface with Sensor Technology , 2012 .

[14]  Jiří Dvorský,et al.  Flexible Neural Trees - Parallel Learning on HPC , 2016, ACSS.

[15]  Khalid Saeed,et al.  The Conceptual Approach of System for Automatic Vehicle Accident Detection and Searching for Life Signs of Casualties , 2018, ACSS.

[16]  August Betzler,et al.  CoAP congestion control for the internet of things , 2016, IEEE Communications Magazine.

[17]  Norman Jones,et al.  Vehicle Crash Mechanics , 2002 .

[18]  Miodrag Potkonjak,et al.  MediaBench: a tool for evaluating and synthesizing multimedia and communications systems , 1997, Proceedings of 30th Annual International Symposium on Microarchitecture.

[19]  Haimd M. Ali,et al.  Car Accident Detection and Notification System using Smartphone , 2017 .

[20]  Jan Vargovský Ant Colony Optimization Algorithm for Vehicle Routing Problem , 2017 .

[21]  Diederik Verkest,et al.  Coffee: COmpiler Framework for Energy-Aware Exploration , 2008, HiPEAC.

[22]  Md. Mamun Ibne Reaz,et al.  Kalman filtered GPS accelerometerbased accident detection and location system , 2014 .

[23]  Sabela Ramos,et al.  Java in the High Performance Computing arena: Research, practice and experience , 2013, Sci. Comput. Program..

[24]  M. Niezgoda,et al.  Collision detection algorithms in the ecall system , 2015 .

[25]  Dino Mandrioli,et al.  Modeling Time in Computing , 2012, Monographs in Theoretical Computer Science. An EATCS Series.

[26]  Fernando Rosa-Velardo,et al.  Dynamic Networks of Timed Petri Nets , 2014, Petri Nets.

[27]  Li Jiao,et al.  Property-Preserving Petri Net Process Algebra in Software Engineering , 2012 .