Social networks produce enormous quantities of data. Twitter, a microblogging network, has over 230 million active users who post over 500 million tweets every day. We propose to analyze public data from Twitter to predict crime rates. Crime rates have increased in recent years. Although crime stoppers use various techniques to reduce crime rates, no previous approach has used the language usage (offensive vs. non-offensive) in tweets as a source of information for predicting crime rates. In this paper, we hypothesize that analyzing the language usage in tweets is a valid measure for predicting crime rates in cities. Tweets were collected for a period of 3 months in Houston and New York City by restricting the collection by geographic longitude and latitude. Further, tweets regarding crime events in the two cities were collected to verify the validity of the prediction algorithm. We used a Support Vector Machine (SVM) classifier to build a model that predicts crime rates from tweets. Finally, we report the validity of the prediction algorithm in predicting crime rates in cities.
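As an illustration of restricting a collection by longitude and latitude, the following minimal sketch assigns geotagged tweets to a city with a rectangular bounding box. The box coordinates are rough, illustrative values and not the ones used in this work; the names `BoundingBox`, `CITY_BOXES`, `assign_city`, `group_by_city` and the tweet keys `text`, `lat`, `lon` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    """Axis-aligned latitude/longitude rectangle, in decimal degrees."""
    south: float
    west: float
    north: float
    east: float

    def contains(self, lat: float, lon: float) -> bool:
        # A point is inside when both coordinates fall within the box edges.
        return self.south <= lat <= self.north and self.west <= lon <= self.east


# Approximate, illustrative boxes (assumption); NOT the coordinates from the paper.
CITY_BOXES = {
    "Houston": BoundingBox(south=29.5, west=-95.8, north=30.1, east=-95.0),
    "New York City": BoundingBox(south=40.5, west=-74.3, north=40.9, east=-73.7),
}


def assign_city(lat, lon):
    """Return the name of the first city whose box contains the point, else None."""
    for city, box in CITY_BOXES.items():
        if box.contains(lat, lon):
            return city
    return None


def group_by_city(tweets):
    """Group tweet texts by city.

    `tweets` is an iterable of dicts with hypothetical keys 'text', 'lat', 'lon'.
    Tweets outside every box are dropped.
    """
    grouped = {city: [] for city in CITY_BOXES}
    for tweet in tweets:
        city = assign_city(tweet["lat"], tweet["lon"])
        if city is not None:
            grouped[city].append(tweet["text"])
    return grouped
```

The grouped texts could then be labeled as offensive or non-offensive and passed to a classifier such as an SVM; that step is omitted here because its features and training data are not specified above.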
[1] Johan Bollen, et al. Twitter mood predicts the stock market. J. Comput. Sci., 2010.
[2] Ana-Maria Popescu, et al. A Machine Learning Approach to Twitter User Classification. ICWSM, 2011.
[3] Hosung Park, et al. What is Twitter, a social network or a news media? WWW '10, 2010.
[4] Andrea L. Bertozzi, et al. Topic Time Series Analysis of Microblogs. 2016.
[5] Max Bramer, et al. Review of Current Crime Prediction Techniques. SGAI Conf., 2006.
[6] Wael Khreich, et al. A Survey of Techniques for Event Detection in Twitter. Comput. Intell., 2015.
[7] Thomas J. Lampoltshammer, et al. Exploring Twitter to Analyze the Public’s Reaction Patterns to Recently Reported Homicides in London. PloS one, 2015.
[8] Ian H. Witten, et al. The WEKA data mining software: an update. SKDD, 2009.
[9] Xu Yao, et al. Criminal Detection Based on Social Network Analysis. SKG, 2012.