STEMMER FOR "BASA SUNDA"
暂无分享,去创建一个
Stemming usually be used to remove suffixes from given word(s). In this paper, we used stemming algorithm to remove suffixes from word in " basa Sunda", the second biggest local language in Indonesia. Although the "basa Sunda" is common language in Indonesia especially in Jawa Barat, we didn't find any reference about it. We begin our re search by develop a software for the stemming process in order to begin milestone project of the Natural Language Processing for basa Sunda.
[1] M. F. Porter,et al. An algorithm for suffix stripping , 1997 .
[2] Tomek Strzalkowski,et al. Robust Text Processing in Automated Information Retrieval , 1994, ANLP.
[3] Hugh E. Williams,et al. Stemming Indonesian , 2005, ACSC.
[4] Hugh E. Williams,et al. Stemming Indonesian: A confix-stripping approach , 2007, TALIP.