Predicting The Cricket Match Outcome Using Crowd Opinions On Social Networks: A Comparative Study Of Machine Learning Methods

Social media has become a platform of first choice where one can express his/her feelings with freedom. The sports and matches being played are also discussed on social media such as Twitter. In this article, efforts are made to investigate the feasibility of using collective knowledge obtained from microposts posted on Twitter to predict the winner of a Cricket match. For predictions, we use three different methods that depend on the total number of tweets before the game for each team, fans sentiments toward each team and fans score predictions on Twitter. By combining these three methods, we classify winning team prediction in a Cricket game before the start of game. Our results are promising enough to be used for winning team forecast. Furthermore, the effectiveness of supervised learning algorithms is evaluated where Support Vector Machine (SVM) has shown advantage over other classifiers.

[1]  P. S. Gill,et al.  Modelling and simulation for one‐day cricket , 2009 .

[2]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[3]  Michael Bailey,et al.  Predicting the Match Outcome in One Day International Cricket Matches, while the Game is in Progress. , 2006, Journal of sports science & medicine.

[4]  Gianluca Morelli,et al.  A comparison of different classification methods , 2013 .

[5]  Winning the Coin Toss and the Home Team Advantage in One-day International Cricket Matches , 2004 .

[6]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[7]  Erik Cambria,et al.  A concept-level approach to the analysis of online review helpfulness , 2016, Comput. Hum. Behav..

[8]  Qaiser Abbas,et al.  Comparative Study of Feature Selection Approaches for Urdu Text Categorization , 2015 .

[9]  Sameem Abdul Kareem,et al.  Identifying ISI‐indexed articles by their lexical usage: A text analysis approach , 2015, J. Assoc. Inf. Sci. Technol..

[10]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[11]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[12]  Amal Kaluarachchi,et al.  CricAI: A classification based tool to predict the outcome in ODI cricket , 2010, 2010 Fifth International Conference on Information and Automation for Sustainability.

[13]  Michael Vitale,et al.  The Wisdom of Crowds , 2015, Cell.

[14]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[15]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[16]  Raza Ul-Mustafa,et al.  Finding Healthcare Issues with Search Engine Queries and Social Network Data , 2017, Int. J. Semantic Web Inf. Syst..

[17]  R. G. Raj,et al.  A Preliminary Investigation of User Perception and Behavioral Intention for Different Review Types: Customers and Designers Perspective , 2014, TheScientificWorldJournal.

[18]  Steven Skiena,et al.  The Wisdom of Bookies? Sentiment Analysis Versus. the NFL Point Spread , 2010, ICWSM.

[19]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[20]  Laks V. S. Lakshmanan,et al.  Auto-play: A Data Mining Approach to ODI Cricket Simulation and Prediction , 2014, SDM.

[21]  Ananda Bandulasiri,et al.  Predicting the Winner in One Day International Cricket , 2008 .

[22]  P. Allsopp,et al.  Rating teams and analysing outcomes in one‐day and test cricket , 2004 .

[23]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[24]  A. J. Lewis,et al.  A fair method for resetting the target in interrupted one-day cricket matches , 1998, J. Oper. Res. Soc..

[25]  Ram Gopal Raj,et al.  INFORMATION DISSEMINATION AND STORAGE FOR TELE-TEXT BASED CONVERSATIONAL SYSTEMS' LEARNING , 1970 .

[26]  Noah A. Smith,et al.  Predicting the NFL using Twitter , 2013, MLSA@PKDD/ECML.

[27]  Ram Gopal Raj,et al.  An application of case-based reasoning with machine learning for forensic autopsy , 2014, Expert Syst. Appl..

[28]  Basit Shahzad,et al.  When are Tweets Better Valued? An Empirical Study , 2014, J. Univers. Comput. Sci..

[29]  Roi Blanco,et al.  TwitterPaul: Extracting and Aggregating Twitter Predictions , 2012, ArXiv.

[30]  Wesley De Neve,et al.  Beating the bookmakers: leveraging statistics and Twitter microposts for predicting soccer results , 2014, KDD 2014.

[31]  Benyuan Liu,et al.  Predicting Flu Trends using Twitter data , 2011, 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[32]  K. A. A. D. Raj,et al.  Application of Association Rule Mining: A case study on team India , 2013, 2013 International Conference on Computer Communication and Informatics.

[33]  Håvard Rue,et al.  Prediction and retrospective analysis of soccer matches in a league , 2000 .

[34]  Ram Gopal Raj,et al.  The artificial neural network for solar radiation prediction and designing solar systems: a systematic literature review , 2015 .