A Proposed Method Using the Semantic Similarity of WordNet 3.1 to Handle the Ambiguity to Apply in Social Media Text

Semantic similarity between two concepts is widely used in natural language processing. In this article, we propose a method that uses WordNet 3.1 to determine similarity based on feature combinations. The work focuses on overcoming ambiguity in social media text by selecting informative features that improve semantic representation. Social media is the research domain, and the study is limited to a political dataset. A feature-based method is applied to predict the outcome and improve the performance of the proposed method, depending on factors related to the fidelity, continuity, and balance of knowledge sources in WordNet 3.1. Existing semantic similarity measures among words suffer from insufficient and unbalanced features. To address this, this study presents a feature-based semantic similarity measure in WordNet 3.1 that determines the similarity between two concepts or words from the selected features; it is also known as a “noun” and “is-a” relation-based method. We evaluate the proposed method on the data set of Agirre et al. [1] (AG203) and compare its results, across three kinds of methods (taxonomy relation, non-taxonomy, and glosses), with those of related studies. Although correlation with human judgments is subjective and tends to be low, our results show a better correlation. Experimental results show that the new method significantly outperforms other existing computational methods, with the following results: r = 0.73%, p = 0.69%, m = 0.71% and nonzero = 0.95%.
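As a rough illustration of what a feature-based measure over “is-a” relations can look like, the sketch below treats each noun's chain of ancestors as its feature set and scores two nouns by the overlap of those sets. It is not the method proposed in this article: the toy taxonomy IS_A, the function names, and the choice of a Jaccard ratio are all assumptions made for illustration, and no WordNet 3.1 data is used.

```python
# Toy "is-a" taxonomy (hypothetical; NOT taken from WordNet 3.1).
# Each key is a noun and each value is its direct parent concept.
IS_A = {
    "dog": "canine",
    "wolf": "canine",
    "canine": "mammal",
    "cat": "feline",
    "feline": "mammal",
    "mammal": "animal",
    "animal": "entity",
    "car": "vehicle",
    "vehicle": "entity",
}


def features(concept):
    """Return the concept together with all of its is-a ancestors."""
    feats = {concept}
    while concept in IS_A:
        concept = IS_A[concept]
        feats.add(concept)
    return feats


def feature_similarity(a, b):
    """Jaccard overlap of the two ancestor feature sets (assumed measure)."""
    fa, fb = features(a), features(b)
    return len(fa & fb) / len(fa | fb)


if __name__ == "__main__":
    for pair in [("dog", "wolf"), ("dog", "cat"), ("dog", "car")]:
        print(pair, round(feature_similarity(*pair), 3))
```

In this toy setting, nouns that share a deeper common ancestor score higher than nouns that meet only at the root. A measure such as the one described in the abstract would additionally draw on non-taxonomic relations and glosses, which this sketch leaves out.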

[1] Eneko Agirre, et al. A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches, 2009, NAACL.

[2] Said Jadid Abdul Kadir, et al. Binary Optimization Using Hybrid Grey Wolf Optimization for Feature Selection, 2019, IEEE Access.

[3] David McLean, et al. An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources, 2003, IEEE Trans. Knowl. Data Eng.

[4] P. N. Girija, et al. Restoring The Missing Features of the Corrupted Speech using Linear Interpolation Methods, 2017.

[5] Ted Pedersen, et al. WordNet::Similarity - Measuring the Relatedness of Concepts, 2004, NAACL.

[6] Ummi Zakiah Zainodin, et al. Weighting-based semantic similarity measure based on topological parameters in semantic taxonomy, 2018, Nat. Lang. Eng.

[7] David Sánchez, et al. Ontology-based information content computation, 2011, Knowl. Based Syst.

[8] Ted Pedersen, et al. UMND1: Unsupervised Word Sense Disambiguation Using Contextual Semantic Relatedness, 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[9] David Sánchez, et al. Content annotation for the semantic web: an automatic web-based approach, 2011, Knowledge and Information Systems.

[10] Abdelmajid Ben Hamadou, et al. Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness, 2015, Applied Intelligence.

[11] Roy Rada, et al. Development and application of a metric on semantic nets, 1989, IEEE Trans. Syst. Man Cybern.

[12] Abdelmajid Ben Hamadou, et al. Ontology-based approach for measuring semantic similarity, 2014, Eng. Appl. Artif. Intell.

[13] Ahmad Abdollahzadeh Barforoush, et al. A new word sense similarity measure in wordnet, 2008, 2008 International Multiconference on Computer Science and Information Technology.

[14] Yong Tang, et al. Feature-based approaches to semantic similarity assessment of concepts using Wikipedia, 2015, Inf. Process. Manag.

[15] Taha H. Rassem, et al. Pattern-Matching Based for Arabic Question Answering: A Challenge Perspective, 2018.

[16] Mohamed Ali Hadj Taieb, et al. Derivation of "is a" taxonomy from Wikipedia Category Graph, 2016, Eng. Appl. Artif. Intell.

[17] Taha H. Rassem, et al. Combined Support Vector Machine and Pattern Matching for Arabic Islamic Hadith Question Classification System, 2018, Advances in Intelligent Systems and Computing.

[18] David Sánchez, et al. A semantic similarity method based on information content exploiting multiple ontologies, 2013, Expert Syst. Appl.

[19] Martha Palmer, et al. Verb Semantics and Lexical Selection, 1994, ACL.

[20] Pu Li, et al. A graph-based semantic relatedness assessment method combining wikipedia features, 2017, Eng. Appl. Artif. Intell.

[21] David Sánchez, et al. A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain, 2014, J. Biomed. Informatics.

[22] Lailatul Qadri Zakaria, et al. Question classification using support vector machine and pattern matching, 2016.