Method of Textual Information Authorship Analysis Based on Stylometry

The paper dwells on the peculiarities of stylometry technologies usage to determine the style of the author publications. Statistical linguistic analysis of the author's text allows taking advantage of text content monitoring based on Porter stemmer and NLP methods to determine the set of stop words. The latter is used in the methods of stylometry to determine the ownership of the analyzed text to a specific author in percentage points. There is proposed a formal approach to the definition of the author's style of the Ukrainian text in the article. The experimental results of the proposed method for determining the ownership of the analyzed text to a particular author upon the availability of the reference text fragment are obtained. The study was conducted on the basis of the Ukrainian scientific texts of a technical area.

[1]  Lyubomyr Chyrun,et al.  Distance learning method for modern youth promotion and involvement in independent scientific researches , 2016, 2016 IEEE First International Conference on Data Stream Mining & Processing (DSMP).

[2]  Vasyl Lytvyn The similarity metric of scientific papers summaries on the basis of adaptive ontologies , 2011, Perspective Technologies and Methods in MEMS Design.

[3]  Ivan Izonin,et al.  Image Superresolution via Divergence Matrix and Automatic Detection of Crossover , 2016 .

[4]  P. Kravets,et al.  The Game Method for Orthonormal Systems Construction , 2007, 2007 9th International Conference - The Experience of Designing and Applications of CAD Systems in Microelectronics.

[5]  Lyubomyr Chyrun,et al.  Peculiarities of content forming and analysis in internet newspaper covering music news , 2017, 2017 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies (CSIT).

[6]  Di Cai,et al.  Sentiment Analysis of Polish Texts , 2012 .

[7]  Volodymyr Riznyk,et al.  Multi-modular Optimum Coding Systems Based on Remarkable Geometric Properties of Space , 2017, CSIT.

[8]  Vasyl Teslyuk,et al.  Development and Implementation of the Technical Accident Prevention Subsystem for the Smart Home System , 2018 .

[9]  Lyubomyr Chyrun,et al.  Method of Integration and Content Management of the Information Resources Network , 2017 .

[10]  Ivan Izonin,et al.  Learning-Based Image Scaling Using Neural-Like Structure of Geometric Transformation Paradigm , 2018 .

[11]  Vasyl Lytvyn,et al.  Content linguistic analysis methods for textual documents classification , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[12]  L. Nedostup,et al.  The software complex development for modeling and optimizing of processes of radio-engineering equipment quality providing at the stage of manufacture , 2012, Proceedings of International Conference on Modern Problem of Radio Engineering, Telecommunications and Computer Science.

[13]  Erin Smith Crabb,et al.  Using Structural Topic Modeling to Detect Events and Cluster Twitter Users in the Ukrainian Crisis , 2015, HCI.

[14]  Vasyl Lytvyn,et al.  Method of functioning of intelligent agents, designed to solve action planning problems based on ontological approach , 2017 .

[15]  Yevhen Burov,et al.  Algebraic Framework for Knowledge Processing in Systems with Situational Awareness , 2017 .

[16]  Taras Basyuk The main reasons of attendance falling of internet resource , 2015, 2015 Xth International Scientific and Technical Conference "Computer Sciences and Information Technologies" (CSIT).

[17]  O. Chernukha,et al.  Mathematical modeling of random concentracion field and its second moments in semispace with erlangian distributions of layerd inclusions , 2016 .

[18]  Lyubomyr Chyrun,et al.  The commercial content digest formation and distributional process , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[19]  Iryna Khomytska,et al.  Specifics of phonostatistical structure of the scientific style in English style system , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[20]  Olga Lozynska,et al.  Information system for translation into ukrainian sign language on mobile devices , 2017, 2017 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies (CSIT).

[21]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[22]  Lyubomyr Chyrun,et al.  Intelligent Systems Design of Distance Learning Realization for Modern Youth Promotion and Involvement in Independent Scientific Researches , 2017, CSIT.

[23]  Vasyl Lytvyn,et al.  The risk management modelling in multi project environment , 2017, 2017 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies (CSIT).

[24]  O. Nadobko,et al.  The results of software complex OPTAN use for modeling and optimization of standard engineering processes of printed circuit boards manufacturing , 2012, Proceedings of International Conference on Modern Problem of Radio Engineering, Telecommunications and Computer Science.

[25]  L. Chyrun,et al.  Analysis features of information resources processing , 2015, 2015 Xth International Scientific and Technical Conference "Computer Sciences and Information Technologies" (CSIT).

[26]  Dumitru Ciobanu,et al.  Web Content Mining , 2012 .

[27]  Vasyl Lytvyn,et al.  Designing architecture of electronic content commerce system , 2015, 2015 Xth International Scientific and Technical Conference "Computer Sciences and Information Technologies" (CSIT).

[28]  V. Vysotska,et al.  Analysis and Evaluation of Risks in Electronic Commerce , 2007, 2007 9th International Conference - The Experience of Designing and Applications of CAD Systems in Microelectronics.

[29]  Olga Lozynska,et al.  Linguistic models of assistive computer technologies for cognition and communication , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[30]  Petro Kravets,et al.  The control agent with fuzzy logic , 2010, 2010 Proceedings of VIth International Conference on Perspective Technologies and Methods in MEMS Design.

[31]  Olga Lozynska,et al.  Mathematical Method of Translation into Ukrainian Sign Language Based on Ontologies , 2017 .

[32]  Oksana Markiv,et al.  Linguistic Comparison Quality Evaluation of Web-Site Content with Tourism Documentation Objects , 2017 .

[33]  Христина Ігорівна Микіч,et al.  Research of uncertainties in situational awareness systems and methods of their processing , 2016 .

[34]  Anjali Ganesh Jivani,et al.  A Comparative Study of Stemming Algorithms , 2011 .

[35]  L. Chyrun,et al.  Information technology of processing information resources in electronic content commerce systems , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[36]  Vasyl Lytvyn,et al.  The method of formation of the status of personality understanding based on the content analysis , 2016 .

[37]  Yevhen Burov,et al.  Algebraic model for knowledge representation in situational awareness systems , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[38]  Iryna Khomytska,et al.  The Method of Statistical Analysis of the Scientific, Colloquial, Belles-Lettres and Newspaper Styles on the Phonological Level , 2017, CSIT.

[39]  Lyubomyr Chyrun,et al.  Features of e-learning realization using virtual research laboratory , 2016, 2016 XIth International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT).

[40]  Yevhen Burov,et al.  Uncertainty in situational awareness systems , 2016, 2016 13th International Conference on Modern Problems of Radio Engineering, Telecommunications and Computer Science (TCSET).

[41]  Yevhen Burov,et al.  Information resources processing using linguistic analysis of textual content , 2017, 2017 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS).

[42]  Victoria Vysotska,et al.  Linguistic analysis of textual commercial content for information resources processing , 2016, 2016 13th International Conference on Modern Problems of Radio Engineering, Telecommunications and Computer Science (TCSET).

[43]  Vasyl Lytvyn,et al.  Development of a method for the recognition of author’s style in the Ukrainian language texts based on linguometry, stylemetry and glottochronology , 2017 .

[44]  Dmytro Peleshko,et al.  Video-based Flame Detection using LBP-based Descriptor: Influences of Classifiers Variety on Detection Efficiency , 2017 .

[45]  Vasyl Lytvyn,et al.  Development of a method for determining the keywords in the slavic language texts based on the technology of web mining , 2017 .

[46]  Yevhen Burov,et al.  The Contextual Search Method Based on Domain Thesaurus , 2017 .

[47]  Bamshad Mobasher,et al.  Data Mining for Web Personalization , 2007, The Adaptive Web.

[48]  Vasyl Lytvyn,et al.  Classification Methods of Text Documents Using Ontology Based Approach , 2017 .

[49]  Shestakevych Tetiana,et al.  The method of education format ascertaining in program system of inclusive education support , 2017, 2017 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies (CSIT).