Micro-blogging sites such as Facebook, Twitter, Google+ present a nice opportunity for targeting advertisements that are contextually related to the microblog content. By virtue of the sparse and noisy text makes identifying the microblogs suitable for advertising a very hard problem. In this work, we approach the problem of identifying the microblogs that could be targeted for advertisements as a two-step classification approach. In the first pass, microblogs suitable for advertising are identified. Next, in the second pass, we build a model to find the sentiment of the advertisable microblog. The systems use features derived from the Part-of-speech tags, the tweet content and uses external resources such as query logs and n-gram dictionaries from previously labeled data.This work aims at providing a thorough insight into the problem and analyzing various features to assess which features contribute the most towards identifying the tweets that can be targeted for advertisements.
[1]
John Blitzer,et al.
Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification
,
2007,
ACL.
[2]
Owen Rambow,et al.
Sentiment Analysis of Twitter Data
,
2011
.
[3]
Vasudeva Varma,et al.
Pattern based keyword extraction for contextual advertising
,
2010,
CIKM '10.
[4]
George A. Miller,et al.
WordNet: A Lexical Database for English
,
1995,
HLT.
[5]
Andrei Z. Broder,et al.
To swing or not to swing: learning when (not) to advertise
,
2008,
CIKM '08.
[6]
Joshua Goodman,et al.
Finding advertising keywords on web pages
,
2006,
WWW '06.
[7]
Patrick Paroubek,et al.
Twitter as a Corpus for Sentiment Analysis and Opinion Mining
,
2010,
LREC.