Collection of HerbalMedicinalProperty relation extracted from texts

This research aims to collect the extracted HerbalMedicinalProperty relations from downloaded herbal-plant documents for creating the herbal-medicinal-property-network based representation. An HerbalMedicinalProperty relation is a semantic relation between one herbal-plant-component concept and several herbal-medicinal-property-concept expressions on texts and vice versa. An herbal-plant-component occurrence is a noun-phrase expression and each herbal-medicinal-property- concept occurrence is an event expression by a verb-phrase of EDU (an Elementary Discourse Unit or a simple sentence). The herbal-medicinal-property-network based representation benefits a recommendation system of solving health-problems on web-boards. The research has two main problems: 1) how to extract HerbalMedicinalProperty relations from the documents, and 2) how to collect the HerbalMedicinalProperty relations for creating the herbal-medicinal-property-network based representation. Therefore, we propose applying a co-occurrence of N-Words (or N-Word-Co) including N-Word-Co size learning on the verb phrase to identify several medicinal-property-concept EDU occurrences over the documents after the linguistic phenomena has been applied to solve the herbal-plant-component concepts. The extracted HerbalMedicinalProperty relations are then collected as a matrix of herbal-plant names, herbal-plant components, and herbal-medicinal properties for creating the herbal-medicinal-property-network based representation. The research results provide the high precision of the HerbalMedicinalProperty-relation extraction from the documents.