Soft Computing in Data Science: 5th International Conference, SCDS 2019, Iizuka, Japan, August 28–29, 2019, Proceedings

The Voynich Manuscript (referred to as ‘VMS’) can be considered as one of the oldest puzzles which remained unsolved until now. While some say VMS is genuine, others say that it is just a hoax. In this research, we propose three methods to analyze VMS and verify the effectiveness of them. The reason for using this manuscript is that no one on this planet knows the meaning of the text of VMS, so we can tackle research without being biased. We analyze from the viewpoint of character frequency and entropy of VMS. Statistical analysis of word frequency already exists, in contrast, we adopt analysis based on a character unit. In that respect, there is diversity from other research. We stated the following three hypotheses as H1 through H3. H1: VMS is a Ciphertext. H2: VMS is close to a programming language. H3: VMS is close to a natural language. The experimental results demonstrate that our methods are valid and efficient. The methods can be applied to all text sources and different classes of them can be distinguished. The possibility that VMS is likely a Ciphertext can be rejected. This mysterious manuscript can be concluded as a meaningful human art, not a hoax. Moreover, the VMS is not encrypted. (However, we are not able to exclude being encoded.)

[1]  Jean-Michel Poggi,et al.  PM10 forecasting using clusterwise regression , 2011 .

[2]  J. Chow,et al.  A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile , 2008 .

[3]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[4]  Dedy Dwi Prastyo,et al.  Model Selection in Feedforward Neural Networks for Forecasting Inflow and Outflow in Indonesia , 2017 .

[6]  R M Harrison,et al.  Particulate matter in the atmosphere: which particle properties are important for its effects on health? , 2000, The Science of the total environment.

[7]  Yunpeng Wang,et al.  Long short-term memory neural network for traffic speed prediction using remote microwave sensor data , 2015 .

[8]  O. Nelles,et al.  An Introduction to Optimization , 1996, IEEE Antennas and Propagation Magazine.

[9]  Gautami Tripathi FEATURE SELECTION AND CLASSIFICATION APPROACH FOR SENTIMENT ANALYSIS , 2015 .

[10]  Pushpak Bhattacharyya,et al.  Feature selection and ensemble construction: A two-step method for aspect based sentiment analysis , 2017, Knowl. Based Syst..

[11]  Maigeng Zhou,et al.  The Effect of Air Pollution on Mortality in China: Evidence from the 2008 Beijing Olympic Games , 2015 .

[12]  Kyoungok Kim,et al.  An improved semi-supervised dimensionality reduction using feature weighting: Application to sentiment analysis , 2018, Expert Syst. Appl..

[13]  J. Manjourides,et al.  Cognitive impacts of ambient air pollution in the National Social Health and Aging Project (NSHAP) cohort. , 2016, Environment international.

[14]  Huang Zou,et al.  Sentiment Classification Using Machine Learning Techniques with Syntax Features , 2015, 2015 International Conference on Computational Science and Computational Intelligence (CSCI).

[15]  João Francisco Valiati,et al.  Document-level sentiment classification: An empirical comparison between SVM and ANN , 2013, Expert Syst. Appl..

[16]  Guy Pujolle,et al.  A Long Short-Term Memory Recurrent Neural Network Framework for Network Traffic Matrix Prediction , 2017, ArXiv.

[17]  S. Lee,et al.  A domain transferable lexicon set for Twitter sentiment analysis using a supervised machine learning approach , 2018, Expert Syst. Appl..

[18]  Suhartono Suhartono,et al.  New Procedures for Model Selection in Feedforward Neural Networks for Time Series Forecasting , 2008 .

[19]  Claudia Diamantini,et al.  A Negation Handling Technique for Sentiment Analysis , 2016, 2016 International Conference on Collaboration Technologies and Systems (CTS).

[20]  Ji Meng Loh,et al.  Learning-Based Method with Valence Shifters for Sentiment Analysis , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[21]  Vo Ngoc Phu,et al.  Sentiment classification using Enhanced Contextual Valence Shifters , 2014, 2014 International Conference on Asian Language Processing (IALP).

[22]  Mustafa Mat Deris,et al.  Effect of negation in sentiment analysis , 2016, 2016 Sixth International Conference on Innovative Computing Technology (INTECH).

[23]  J. Hooyberghs,et al.  A neural network forecast for daily average PM10 concentrations in Belgium , 2005 .

[24]  Shouming Zhong,et al.  Time Series Forecasting Using a Hybrid RBF Neural Network and AR Model Based On Binomial Smoothing , 2011 .

[25]  Ahmed Tealab,et al.  Time series forecasting using artificial neural networks methodologies: A systematic review , 2018, Future Computing and Informatics Journal.

[26]  Rati Wongsathan,et al.  A Hybrid ARIMA and Neural Networks Model for PM-10 Pollution Estimation: The Case of Chiang Mai City Moat Area , 2016 .

[27]  Rob J Hyndman,et al.  Forecasting without significance tests ? , 2008 .

[28]  J. Armstrong Significance Tests Harm Progress in Forecasting , 2007 .

[29]  Evangelos Spiliotis,et al.  The M4 Competition: Results, findings, conclusion and way forward , 2018, International Journal of Forecasting.

[30]  Magnus Sahlgren,et al.  The Distributional Hypothesis , 2008 .

[31]  Hamid Taheri Shahraiyni,et al.  Statistical Modeling Approaches for PM10 Prediction in Urban Areas; A Review of 21st-Century Studies , 2016 .

[32]  Yacine Ouzrout,et al.  Negation Handling in Sentiment Analysis at Sentence Level , 2017, J. Comput..

[33]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[34]  Steven C. Wheelwright,et al.  Forecasting methods and applications. , 1979 .

[35]  Stefan Lessmann,et al.  A comparative study of LSTM neural networks in forecasting day-ahead global horizontal irradiance with satellite data , 2018 .

[36]  N. D. Lewis,et al.  Deep Learning Made Easy with R: A Gentle Introduction For Data Science , 2016 .

[37]  William W. S. Wei,et al.  Time series analysis - univariate and multivariate methods , 1989 .

[38]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.