Detecting proper nouns in indonesian-language translation of the quran using a guided method

Abstract Proper nouns (often abbreviated PN or NNP) are a class of words important in labelling and subsequent text processing, especially in natural language processing (NLP). Name entity recognition (NER) is one study that requires PN. The lack of labeled data for Indonesian text, especially the PN label, may be attributed for the lack of NER research in the Indonesian language. This study aims to detect PN in Indonesian-language translations of the Quran guided by deriving location information from the Quran as its source text. In the Indonesian language, PN are written using initial capital letters, which are used to determine and guide PN location. This article proposes that PN in Indonesian-language translations of the Quran can be determined based on PoS Information of Quranic text by developing a certain production rule. The results of this research showed that the proposed method has promising results for further research.