A cloud-based intelligent TV program recommendation system

In recent years, both cloud computing technology and digital TV services have matured significantly, increasing the demand for higher-quality TV services. In this paper, cloud computing technology is used to build a recommendation system for digital TV programs, and the Hadoop Fair Scheduler is used to improve processing performance. Viewing histories are collected through an electronic program guide and then processed with K-means clustering, term frequency/inverse document frequency (TF-IDF), and k-nearest neighbor (kNN) algorithms to group the audience into clusters and to identify popular TV programs for each cluster. The proposed system can process massive amounts of user data in real time and can easily be scaled up.
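The processing pipeline described above can be illustrated with a minimal single-machine sketch. The abstract does not state how the three algorithms are combined, so the arrangement here is an assumption: each viewer's history is weighted with TF-IDF (viewers as documents, programs as terms), viewers are grouped with K-means, popular programs are counted per cluster, and a new viewer is assigned to a cluster by a kNN vote. All function names and the toy data are hypothetical, and the sketch does not use Hadoop or the Fair Scheduler.

```python
import math
from collections import Counter

def fit_idf(histories):
    """Inverse document frequency per program; each viewer history is one document."""
    n = len(histories)
    df = Counter(p for h in histories for p in set(h))
    return {p: math.log(n / df[p]) for p in sorted(df)}

def vectorize(history, idf):
    """TF-IDF vector of one viewing history over the fitted program vocabulary."""
    counts = Counter(history)
    total = len(history) or 1
    return [counts[p] / total * weight for p, weight in idf.items()]

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, iterations=20):
    """Plain Lloyd's K-means; the first k vectors seed the centroids (assumption)."""
    centroids = [list(v) for v in vectors[:k]]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        labels = [min(range(k), key=lambda c: sq_dist(v, centroids[c]))
                  for v in vectors]
        for c in range(k):
            members = [v for v, label in zip(vectors, labels) if label == c]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def popular_programs(histories, labels, cluster, top_n=2):
    """Most frequently watched programs among the viewers of one cluster."""
    counts = Counter(p for h, label in zip(histories, labels)
                     if label == cluster for p in h)
    return [p for p, _ in counts.most_common(top_n)]

def knn_cluster(vector, vectors, labels, k=3):
    """Assign a new viewer to the majority cluster of its k nearest viewers."""
    nearest = sorted(range(len(vectors)),
                     key=lambda i: sq_dist(vector, vectors[i]))[:k]
    return Counter(labels[i] for i in nearest).most_common(1)[0][0]

# Toy viewing histories (hypothetical data, one list per viewer).
histories = [
    ["news", "news", "sports"],
    ["drama", "drama", "movie"],
    ["news", "sports", "sports"],
    ["drama", "movie", "movie"],
]
idf = fit_idf(histories)
vectors = [vectorize(h, idf) for h in histories]
labels = kmeans(vectors, k=2)

new_viewer = vectorize(["sports", "news", "sports", "movie"], idf)
cluster = knn_cluster(new_viewer, vectors, labels)
print(labels, cluster, popular_programs(histories, labels, cluster))
```

In a deployment like the one the abstract describes, each of these steps would presumably run as a distributed job over far larger histories; the sketch only shows the data flow between the stages.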
