A New Semantic Lexicon and Similarity Measure in Bangla

The Mental Lexicon (ML) refers to the organization of lexical entries of a language in the human mind.A clear knowledge of the structure of ML will help us to understand how the human brain processes language. The knowledge of semantic association among the words in ML is essential to many applications. Although, there are works on the representation of lexical entries based on their semantic association in the form of a lexicon in English and other languages, such works of Bangla is in a nascent stage. In this paper, we have proposed a distinct lexical organization based on semantic association between Bangla words which can be accessed efficiently by different applications. We have developed a novel approach of measuring the semantic similarity between words and verified it against user study. Further, a GUI has been designed for easy and efficient access.

[1]  P. Resnik Selection and information: a class-based approach to lexical relationships , 1993 .

[2]  Josef Ruppenhofer,et al.  FrameNet II: Extended theory and practice , 2006 .

[3]  John Murphy,et al.  Using WordNet as a Knowledge Base for Measuring Semantic Similarity between Words , 1994 .

[4]  Mritunjoy Chandra Saha Roy,et al.  Semi-automatic building of wordnet for Bangla , 2008 .

[5]  A. Tversky Features of Similarity , 1977 .

[6]  Danny Jones,et al.  Words in the mind: An introduction to the mental lexicon , 2004, Machine Translation.

[7]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[8]  Jordan L. Boyd-Graber,et al.  Adding dense, weighted connections to WordNet , 2005 .

[9]  Tong Wang,et al.  Refining the Notions of Depth and Density in WordNet-based Semantic Similarity Measures , 2011, EMNLP.

[10]  David McLean,et al.  An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources , 2003, IEEE Trans. Knowl. Data Eng..

[11]  R. H. Seashore,et al.  The measurement of individual differences in English vocabularies , 1940 .

[12]  Jin H. Kim,et al.  A Model of Knowledge Based Information Retrieval with Hierarchical Concept Graph , 1990, J. Documentation.

[13]  ROBERT H. SEASHORE,et al.  THE MEASUREMENT OF INDIVIDUAL DIFFERENCES IN GENERAL ENGLISH VOCABULARIES , 2004 .

[14]  Mitsuaki Yoneyama,et al.  Mental Lexicon , 2013 .

[15]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[16]  Philip Resnik,et al.  Semantic Classes and Syntactic Ambiguity , 1993, HLT.

[17]  Sivaji Bandyopadhyay,et al.  SemanticNet-Perception of Human Pragmatics , 2010 .

[18]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[19]  Myoung-Ho Kim,et al.  Information Retrieval Based on Conceptual Distance in is-a Hierarchies , 1993, J. Documentation.

[20]  Michael Healy,et al.  Theory and Applications of Ontology: Computer Applications , 2010 .