Perceived Audiovisual Quality Modelling based on Decison Trees, Genetic Programming and Neural Networks

Our objective is to build machine learning based models that predict audiovisual quality directly from a set of correlated parameters that are extracted from a target quality dataset. We have used the bitstream version of the INRS audiovisual quality dataset that reflects contemporary real-time configurations for video frame rate, video quantization, noise reduction parameters and network packet loss rate. We have utilized this dataset to build bitstream perceived quality estimation models based on the Random Forests, Bagging, Deep Learning and Genetic Programming methods. We have taken an empirical approach and have generated models varying from very simple to the most complex depending on the number of features used from the quality dataset. Random Forests and Bagging models have overall generated the most accurate results in terms of RMSE and Pearson correlation coefficient values. Deep Learning and Genetic Programming based bitstream models have also achieved good results but that high performance was observed only with a limited range of features. We have also obtained the epsilon-insensitive RMSE values for each model and have computed the significance of the difference between the correlation coefficients. Overall we conclude that computing the bitstream information is worth the effort it takes to generate and helps to build more accurate models for real-time communications. However, it is useful only for the deployment of the right algorithms with the carefully selected subset of the features. The dataset and tools that have been developed during this research are publicly available for research and development purposes.

[1]  Chengwen Xing,et al.  A real-time QoE methodology for AMR codec voice in mobile network , 2014, Science China Information Sciences.

[2]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[3]  Methods , metrics and procedures for statistical evaluation , qualification and comparison of objective quality prediction models , 2013 .

[4]  Ofer Hadar,et al.  Real Time Video Quality Representation Classification of Encrypted HTTP Adaptive Video Streaming - the Case of Safari , 2016, ArXiv.

[5]  Alexander Raake,et al.  IP-Based Mobile and Fixed Network Audiovisual Media Services , 2011, IEEE Signal Processing Magazine.

[6]  Haiqing Du,et al.  Research on relationship between QoE and QoS based on BP Neural Network , 2009, 2009 IEEE International Conference on Network Infrastructure and Digital Content.

[7]  Alexander B. Bordetsky,et al.  A Feedback Control Model for Managing Quality of Service in Multimedia Communications , 2001, Telecommun. Syst..

[8]  Abdelhamid Mellouk,et al.  Survey on machine learning-based QoE-QoS correlation models , 2014, 2014 International Conference on Computing, Management and Telecommunications (ComManTel).

[9]  Marie-Neige Garcia Parametric Packet-Based Audiovisual Quality Model for Iptv Services , 2014 .

[10]  Hani Yehia,et al.  A concise review of the quality of experience assessment for video streaming , 2015, Comput. Commun..

[11]  Andrew Perkis Quality of experience (QoE) in multimedia applications , 2013 .

[12]  Riccardo Poli,et al.  A Field Guide to Genetic Programming , 2008 .

[13]  Scott A. Hissam,et al.  Assessing QoS trade-offs for real-time video , 2013, 2013 IEEE 14th International Symposium on "A World of Wireless, Mobile and Multimedia Networks" (WoWMoM).

[14]  Peter Schelkens,et al.  Qualinet White Paper on Definitions of Quality of Experience , 2013 .

[15]  Marcus Barkowsky,et al.  The Influence of Subjects and Environment on Audiovisual Subjective Tests: An International Study , 2012, IEEE Journal of Selected Topics in Signal Processing.

[16]  Michael D. Schmidt,et al.  Symbolic Regression of Implicit Equations , 2010 .

[17]  Dragan Kukolj,et al.  A reduced-reference parametric model for audiovisual quality of IPTV services , 2013, 2013 Fifth International Workshop on Quality of Multimedia Experience (QoMEX).

[18]  Federica Battisti,et al.  A study on the effects of quality of service parameters on perceived video quality , 2014, 2014 5th European Workshop on Visual Information Processing (EUVIP).

[19]  Christian Keimel,et al.  The TUM high definition video datasets , 2012, 2012 Fourth International Workshop on Quality of Multimedia Experience.

[20]  Judith Redi,et al.  Supporting visual quality assessment with machine learning , 2013, EURASIP Journal on Image and Video Processing.

[21]  Benjamin Belmudez Audiovisual Quality Assessment and Prediction for Videotelephony , 2014 .

[22]  A. Mellouk,et al.  Empirical study based on machine learning approach to assess the QoS/QoE correlation , 2012, 2012 17th European Conference on Networks and Optical Communications.

[23]  Jean-Charles Grégoire,et al.  INRS Audiovisual Quality Dataset , 2016, ACM Multimedia.

[24]  Abdelhamid Mellouk,et al.  A brief synthesis of QoS-QoE methodologies , 2011, 2011 10th International Symposium on Programming and Systems.

[25]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[26]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[27]  Jean-Charles Grégoire,et al.  Multimedia Communication Quality Assessment Testbeds , 2016, ArXiv.

[28]  Stuart J. Russell,et al.  Online bagging and boosting , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.

[29]  Achim Zeileis,et al.  Bias in random forest variable importance measures: Illustrations, sources and a solution , 2007, BMC Bioinformatics.

[30]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[31]  M. J. Bayarri,et al.  Calibration of ρ Values for Testing Precise Null Hypotheses , 2001 .

[32]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[33]  Jean-Charles Grégoire,et al.  Machine Learning--Based Parametric Audiovisual Quality Prediction Models for Real-Time Communications , 2017, ACM Trans. Multim. Comput. Commun. Appl..

[34]  Lingfen Sun,et al.  Audiovisual Quality Estimation for Video Calls in Wireless Applications , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[35]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[36]  A. Comrie Comparing Neural Networks and Regression Models for Ozone Forecasting , 1997 .

[37]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.