The Big Data are increasing exponential every year so that data became very complex and difficult to be processed. To resolve this problem, data management and analysis offer opportunities to improve decisions in critical development areas such as: meteorology, medicine, finance, sociology or internet. But, classical statistics programs encounter their limits in processing large data-sets, so that introduction of such programs in non-sql database applications is required. Existing large-scale processing data-sets frameworks does not provide statistics tools to reduce the complexity of the large data-sets to meaningful results. More, nowadays statistics have meanings in context of predictions, forecasting and estimation requiring non-linear regressions to define the complex equations of such systems. Non-linear regressions offer the best solution for our complex time-series application where observational data are modeled by non-linear functions and multiple independent variables. Our analytic application is based on data came from BTWord serial application, that collected public trackers to obtain information about the performance, scalability and reliability of BitTorrent. We show how descriptive, inductive and non-linear regression statistics may be integrated in our map-reduce application to generate statistics about evolution in time of BitTorrent network.
[1]
Grey Giddins,et al.
Statistics
,
2016,
The Journal of hand surgery, European volume.
[2]
A. Iosup,et al.
Parallel and Distributed Systems Report Series On Assessing Measurement Accuracy in BitTorrent Peer-to-Peer File-Sharing Networks
,
2009
.
[3]
Henri P. Gavin,et al.
The Levenberg-Marquardt method for nonlinear least squares curve-fitting problems c ©
,
2013
.
[4]
Inferential Role and the Ideal of Deductive Logic
,
2009
.
[5]
Kaj Madsen,et al.
Methods for Non-Linear Least Squares Problems
,
1999
.
[6]
Lotfi A. Zadeh,et al.
Please Scroll down for Article International Journal of General Systems Fuzzy Sets and Systems* Fuzzy Sets and Systems*
,
2022
.
[7]
Alexandru Iosup,et al.
BTWorld: towards observing the global BitTorrent file-sharing network
,
2010,
HPDC '10.
[8]
W. P. Bowen,et al.
Nonlinear regression using spreadsheets.
,
1995,
Trends in pharmacological sciences.
[10]
Mikel Izal,et al.
Dissecting BitTorrent: Five Months in a Torrent's Lifetime
,
2004,
PAM.
[11]
David R. Cox,et al.
The Oxford Dictionary of Statistical Terms
,
2006
.