Ensemble decision tree algorithms are well known for good prediction accuracy in most cases, but little research has been done on applying ensemble methods to improve personalization in behavioral targeting for online advertising. In behavioral targeting, the best ad is matched to a user based on the user's past activities and demographics. At present, most models used in behavioral targeting are some form of linear model. Our goal in this paper is to analyze and understand the effect of ensemble techniques on large-scale advertising data. Among the main challenges of such data are sparse features and high dimensionality, which make it hard for any single model to perform best. The ensemble method explored in this paper is a random forest based regression algorithm, which combines multiple decision trees to produce a more robust model with reduced variance as well as reduced bias. In online advertising it is also imperative to learn in an online fashion (while the advertising campaign is running), because customers want the most for their money as early as possible and the lifetime of such advertisements is short. Some form of exploration vs. exploitation technique is therefore also required in the system. Our contributions in this paper are threefold. First, we develop a new technique to determine optimal parameters of the random forest algorithm. Second, we present a comparative analysis of random forest vs. logistic regression. Third, we combine ensemble decision tree algorithms with bandit algorithms to produce around 17% improvement in click-through rate (CTR) over random selection.
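To make the combination of an ensemble predictor with exploration vs. exploitation concrete, the following is a minimal sketch, not the method of this paper. It assumes an epsilon-greedy policy (the abstract does not say which bandit algorithm is used) and stands in for trained regression trees with hypothetical stub functions. The names `Predictor`, `make_stub_tree`, `ensemble_ctr`, and `choose_ad`, as well as the ad identifiers and CTR values, are illustrative assumptions.

```python
import random
from statistics import mean
from typing import Callable, Dict, Sequence

# Hypothetical type: a "tree" is any function mapping (user features, ad id)
# to a predicted click-through rate. A real system would use trained trees.
Predictor = Callable[[Dict[str, float], str], float]


def make_stub_tree(bias: float) -> Predictor:
    """Build a stand-in predictor with made-up CTRs shifted by a per-tree bias."""
    base = {"ad_a": 0.02, "ad_b": 0.05, "ad_c": 0.03}  # illustrative values only

    def predict(user: Dict[str, float], ad: str) -> float:
        return base[ad] + bias

    return predict


def ensemble_ctr(trees: Sequence[Predictor], user: Dict[str, float], ad: str) -> float:
    """Average the per-tree predictions, as a regression forest does."""
    return mean(tree(user, ad) for tree in trees)


def choose_ad(
    trees: Sequence[Predictor],
    user: Dict[str, float],
    ads: Sequence[str],
    epsilon: float,
    rng: random.Random,
) -> str:
    """Epsilon-greedy (assumed policy): explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.choice(list(ads))  # explore: pick an ad uniformly at random
    # exploit: pick the ad with the highest ensemble-predicted CTR
    return max(ads, key=lambda ad: ensemble_ctr(trees, user, ad))


trees = [make_stub_tree(b) for b in (-0.01, 0.0, 0.01)]
ads = ["ad_a", "ad_b", "ad_c"]
user = {"age_bucket": 3.0}  # hypothetical feature vector
picked = choose_ad(trees, user, ads, epsilon=0.1, rng=random.Random(0))
```

Averaging over trees is what damps the variance of any single tree's prediction, while a nonzero `epsilon` keeps collecting feedback on ads the current model underrates, which matters when campaigns are short-lived.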