Unbiased Decision Tree Model for User's QoE in Imbalanced Dataset

Nowadays, Internet Protocol Television (IPTV) is gradually replacing the traditional TV. IPTV Users require better experience. Therefore, media providers are interested in finding the key factors which influence the Quality of Experience (QoE), and it is necessary to find a model to predict the QoE. In this paper, we discuss the relationship between the status of IPTV set-top box and user's QoE. There is not a uniform standard to measure or improve user's QoE in IPTV, so we combine the status data from IPTV set-top box with user's complaints, selecting the appropriate model and using it for predicting user's QoE. As the data from IPTV set-top box is imbalance, the traditional algorithm does not perform well in terms of predicting user's QoE. To solve this problem, we propose the unbiased decision tree model to deal with the imbalance dataset. First of all, we clean the dataset. Then, we select important features influencing QoE by the feature selection technology. Finally, we compare CART model and the unbiased decision tree model. We demonstrate that the unbiased decision tree model performs well in the imbalance dataset and achieve a high accuracy.

[1]  Yonggang Wen,et al.  Distributed Wireless Video Scheduling With Delayed Control Information , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Hsiao-Hwa Chen,et al.  Energy-Spectrum Efficiency Tradeoff for Video Streaming over Mobile Ad Hoc Networks , 2013, IEEE Journal on Selected Areas in Communications.

[3]  Seong Gon Choi,et al.  A study on a QoS/QoE correlation model for QoE evaluation on IPTV service , 2010, 2010 The 12th International Conference on Advanced Communication Technology (ICACT).

[4]  Ben Y. Zhao,et al.  Understanding user behavior in large-scale video-on-demand systems , 2006, EuroSys.

[5]  Yipeng Zhou,et al.  Video Browsing - A Study of User Behavior in Online VoD Services , 2013, 2013 22nd International Conference on Computer Communication and Networks (ICCCN).

[6]  Zhi-Hua Zhou,et al.  Exploratory Under-Sampling for Class-Imbalance Learning , 2006, Sixth International Conference on Data Mining (ICDM'06).

[7]  Kong Xiang,et al.  Survey on Models and Evaluation of Quality of Experience , 2012 .

[8]  Kevin Kok Wai Wong,et al.  Enhancing classification performance of multi-class imbalanced data using the OAA-DB algorithm , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[9]  Seetha Hari,et al.  Learning From Imbalanced Data , 2019, Advances in Computer and Electrical Engineering.

[10]  Jie Hu,et al.  Survey on Models and Evaluation of Quality of Experience: Survey on Models and Evaluation of Quality of Experience , 2012 .

[11]  Gustavo E. A. P. A. Batista,et al.  A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.

[12]  Fu Yun-fei,et al.  Cart-based Land Use/cover Classification of Remote Sensing Images , 2005 .

[13]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[14]  Zhi-Hua Zhou,et al.  Exploratory Under-Sampling for Class-Imbalance Learning , 2006, ICDM.

[15]  Mohsen Guizani,et al.  Impact of Execution Time on Adaptive Wireless Video Scheduling , 2014, IEEE Journal on Selected Areas in Communications.

[16]  Ramesh K. Sitaraman,et al.  Video Stream Quality Impacts Viewer Behavior: Inferring Causality Using Quasi-Experimental Designs , 2012, IEEE/ACM Transactions on Networking.

[17]  Srinivasan Seshan,et al.  Developing a predictive model of quality of experience for internet video , 2013, SIGCOMM.

[18]  Vyas Sekar,et al.  Understanding the impact of video quality on user engagement , 2011, SIGCOMM.