Method for Determining Linguometric Coefficient Dynamics of Ukrainian Text Content Authorship

The article describes the peculiarities of linguometry information technologies usage to determine the linguometric coefficients dynamics of the text content authorship. The linguistic and statistical analysis of the author texts within a certain time period takes advantage of the text content-monitoring based on the NLP methods to determine the set of stop words and to study n-grams. The latter is used in the methods of linguometry and stylometry to determine the linguometric coefficients dynamics of the ownership of the analyzed text to a specific author in percentage points. There is proposed a formal approach to the definition of the author’s style of the Ukrainian text in the article. The experimental results of the proposed method for determining the ownership of the analyzed text to a particular author upon the availability of the reference text fragment are obtained. The study was conducted on the basis of the Ukrainian scientific texts of a technical area.

[1]  L. Nedostup,et al.  The software complex development for modeling and optimizing of processes of radio-engineering equipment quality providing at the stage of manufacture , 2012, Proceedings of International Conference on Modern Problem of Radio Engineering, Telecommunications and Computer Science.

[2]  Христина Ігорівна Микіч,et al.  Research of uncertainties in situational awareness systems and methods of their processing , 2016 .

[3]  Olena Vovk,et al.  UNCERTAINTY REDUCTION IN BIG DATA CATALOGUE FOR INFORMATION PRODUCT QUALITY EVALUATION , 2018 .

[4]  Natalia Lotoshynska,et al.  Single-frame image super-resolution based on singular square matrix operator , 2017, 2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON).

[5]  Bamshad Mobasher,et al.  Data Mining for Web Personalization , 2007, The Adaptive Web.

[6]  Lyubomyr Chyrun,et al.  Distance learning method for modern youth promotion and involvement in independent scientific researches , 2016, 2016 IEEE First International Conference on Data Stream Mining & Processing (DSMP).

[7]  Ivan Izonin,et al.  Image Superresolution via Divergence Matrix and Automatic Detection of Crossover , 2016 .

[8]  P. Kravets,et al.  Fuzzy logic controller for embedded systems , 2009, 2009 5th International Conference on Perspective Technologies and Methods in MEMS Design.

[9]  Yuriy Syerov,et al.  Web-community ontological representation using intelligent dataspace analyzing agent , 2009, 2009 10th International Conference - The Experience of Designing and Application of CAD Systems in Microelectronics.

[10]  Iryna Khomytska,et al.  The Method of Statistical Analysis of the Scientific, Colloquial, Belles-Lettres and Newspaper Styles on the Phonological Level , 2017, CSIT.

[11]  Vasyl Teslyuk,et al.  Development and Implementation of the Technical Accident Prevention Subsystem for the Smart Home System , 2018 .

[12]  Ivan Izonin,et al.  Learning-Based Image Scaling Using Neural-Like Structure of Geometric Transformation Paradigm , 2018 .

[13]  Pamela Kostur,et al.  Managing Enterprise Content: A Unified Content Strategy , 2002 .

[14]  Erin Smith Crabb,et al.  Using Structural Topic Modeling to Detect Events and Cluster Twitter Users in the Ukrainian Crisis , 2015, HCI.

[15]  Vasyl Lytvyn,et al.  Method of functioning of intelligent agents, designed to solve action planning problems based on ontological approach , 2017 .

[16]  Olga Lozynska,et al.  Mathematical Method of Translation into Ukrainian Sign Language Based on Ontologies , 2017 .

[17]  V. Vysotska,et al.  Analysis and Evaluation of Risks in Electronic Commerce , 2007, 2007 9th International Conference - The Experience of Designing and Applications of CAD Systems in Microelectronics.

[18]  Olga Lozynska,et al.  Linguistic models of assistive computer technologies for cognition and communication , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[19]  Taras Basyuk The main reasons of attendance falling of internet resource , 2015, 2015 Xth International Scientific and Technical Conference "Computer Sciences and Information Technologies" (CSIT).

[20]  O. Chernukha,et al.  Mathematical modeling of random concentracion field and its second moments in semispace with erlangian distributions of layerd inclusions , 2016 .

[21]  Natalya Shakhovska,et al.  Application of algorithms of classification for uncertainty reduction , 2013 .

[22]  Di Cai,et al.  Sentiment Analysis of Polish Texts , 2012 .

[23]  Lyubomyr Chyrun,et al.  Features of e-learning realization using virtual research laboratory , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[24]  Iryna Khomytska,et al.  Specifics of phonostatistical structure of the scientific style in English style system , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[25]  Yevhen Burov,et al.  Uncertainty in situational awareness systems , 2016, 2016 13th International Conference on Modern Problems of Radio Engineering, Telecommunications and Computer Science (TCSET).

[26]  Olga Lozynska,et al.  Information system for translation into ukrainian sign language on mobile devices , 2017, 2017 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies (CSIT).

[27]  Vasyl Lytvyn,et al.  The method of formation of the status of personality understanding based on the content analysis , 2016 .

[28]  Dumitru Ciobanu,et al.  Web Content Mining , 2012 .

[29]  Susan McKeever Understanding Web content management systems: evolution, lifecycle and market , 2003, Ind. Manag. Data Syst..

[30]  Yevhen Burov,et al.  Algebraic model for knowledge representation in situational awareness systems , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[31]  O. Nadobko,et al.  The results of software complex OPTAN use for modeling and optimization of standard engineering processes of printed circuit boards manufacturing , 2012, Proceedings of International Conference on Modern Problem of Radio Engineering, Telecommunications and Computer Science.

[32]  Anjali Ganesh Jivani,et al.  A Comparative Study of Stemming Algorithms , 2011 .

[33]  Tetiana Shestakevych,et al.  The Model of Data Analysis of the Psychophysiological Survey Results , 2017 .

[34]  Petro Kravets,et al.  The control agent with fuzzy logic , 2010, 2010 Proceedings of VIth International Conference on Perspective Technologies and Methods in MEMS Design.

[35]  Vasyl Lytvyn,et al.  ANALYSIS OF STATISTICAL METHODS FOR STABLE COMBINATIONS DETERMINATION OF KEYWORDS IDENTIFICATION , 2018 .

[36]  Volodymyr Stepashko,et al.  Advances in Intelligent Systems and Computing II , 2017 .

[37]  Olena Vovk,et al.  The Method of Big Data Processing for Distance Educational System , 2017 .

[38]  Yanchun Zhang,et al.  Web Content Mining , 2011 .

[39]  Oksana Markiv,et al.  Linguistic Comparison Quality Evaluation of Web-Site Content with Tourism Documentation Objects , 2017 .

[40]  P. Kravets,et al.  The Game Method for Orthonormal Systems Construction , 2007, 2007 9th International Conference - The Experience of Designing and Applications of CAD Systems in Microelectronics.

[41]  Vasyl Lytvyn,et al.  Development of a method for the recognition of author’s style in the Ukrainian language texts based on linguometry, stylemetry and glottochronology , 2017 .

[42]  Dmytro Peleshko,et al.  Video-based Flame Detection using LBP-based Descriptor: Influences of Classifiers Variety on Detection Efficiency , 2017 .

[43]  Vasyl Lytvyn,et al.  Development of a method for determining the keywords in the slavic language texts based on the technology of web mining , 2017 .