Sentiment-Based Prediction of Alternative Cryptocurrency Price Fluctuations Using Gradient Boosting Tree Model

In this paper, we analyze Twitter signals as a proxy for user sentiment to predict the price fluctuations of a small-cap alternative cryptocurrency called \emph{ZClassic}. We extracted tweets on an hourly basis over a period of 3.5 weeks and classified each tweet as positive, neutral, or negative. We then compiled these tweets into two hourly sentiment indices: an unweighted index and a weighted index that gives larger weight to retweets. These two indices, alongside the raw hourly counts of positive, negative, and neutral tweets, were paired with $\sim 400$ data points of hourly pricing data to train an Extreme Gradient Boosting Regression Tree Model. Price predictions produced by this model were compared to historical price data; the predictions had a 0.81 correlation with the testing data, which is statistically significant at the $p < 0.0001$ level. Our model is the first academic proof of concept that social media platforms such as Twitter can serve as powerful social signals for predicting price movements in the highly speculative alternative cryptocurrency, or "alt-coin", market.
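
The hourly aggregation described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the record layout, the function name `hourly_sentiment_indices`, the +1/0/-1 scoring, and the weighting `1 + retweets` are all hypothetical, since the abstract only states that the weighted index gives larger weight to retweets.

```python
from collections import defaultdict
from datetime import datetime

# Assumed scoring: positive = +1, neutral = 0, negative = -1.
SCORE = {"pos": 1, "neu": 0, "neg": -1}


def hourly_sentiment_indices(tweets):
    """Aggregate labelled tweets into per-hour counts and two indices.

    `tweets` is an iterable of (timestamp, label, retweet_count) tuples,
    where label is one of "pos", "neu", "neg". This layout is hypothetical.
    """
    buckets = defaultdict(
        lambda: {"pos": 0, "neu": 0, "neg": 0, "unweighted": 0, "weighted": 0}
    )
    for timestamp, label, retweets in tweets:
        # Truncate the timestamp to the start of its hour.
        hour = timestamp.replace(minute=0, second=0, microsecond=0)
        row = buckets[hour]
        row[label] += 1                                # raw sentiment counts
        row["unweighted"] += SCORE[label]              # every tweet counts once
        row["weighted"] += SCORE[label] * (1 + retweets)  # assumed weighting
    return dict(buckets)


if __name__ == "__main__":
    sample = [
        (datetime(2018, 1, 1, 10, 5), "pos", 3),
        (datetime(2018, 1, 1, 10, 40), "neg", 0),
        (datetime(2018, 1, 1, 11, 15), "neu", 5),
    ]
    for hour, row in sorted(hourly_sentiment_indices(sample).items()):
        print(hour.isoformat(), row)
```

Each hourly row (three counts plus two indices) would then be joined with that hour's price to form one training example for a gradient-boosted regressor such as XGBoost.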
