Natural Language Processing System Applied in Public Health for Assessment of an Automatic Analysis of Patterns Generator

Nowadays, there are many scientific articles referring to any topic like medicine, technology, economics, finance, and so on. These articles are better known as papers, they represent the evaluation and interpretation of different arguments, showing results of scientific interest. At the end, most of these are published in magazines, books, journals, etc. Due to the fact that these papers are created with a higher frequency it is feasible to analyse how people write in the same domain. At the level of structure and with the help of graphs some of the results that can be found are: groups of words that are used (to determine if they come from a specific vocabulary), most common grammatical categories, most repeated words in a domain, patterns found, and frequency of patterns found. This research has been created to fulfil these needs. A domain of public health has been selected and it is composed of 800 papers about different topics referring to genetics such as mutations, genetic deafness, DNA, trinucleotide, suppressor genes, among others; and an ontology of public health has been used to provide the basis of the study.