Knowledge-Based Construction of Confusion Matrices for Multi-Label Classification Algorithms using Semantic Similarity Measures

Multi-label classification algorithms have so far been evaluated using statistical methods that ignore the semantics of the classes involved and rely entirely on abstract computations such as Bayesian reasoning. Several efforts are currently under way to develop ontology-based methods for better assessment of supervised classification algorithms. In this paper, we define a novel approach that aligns expected labels with predicted labels in multi-label classification using ontology-driven, feature-based semantic similarity measures. We then use this approach to develop a method for building precise confusion matrices, enabling more effective evaluation of multi-label classification algorithms.
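To make the idea concrete, the following is a minimal sketch of how such an alignment could feed a confusion matrix. It is not the authors' method: the toy taxonomy `PARENT`, the Jaccard overlap of ancestor sets used as the feature-based similarity, the greedy pairing, the `threshold` parameter, and the `NONE` placeholder are all assumptions introduced here for illustration.

```python
from collections import defaultdict

# Hypothetical toy taxonomy (child -> parent); not taken from the paper.
PARENT = {
    "pneumonia": "lung_disease",
    "bronchitis": "lung_disease",
    "lung_disease": "disease",
    "cardiomegaly": "heart_disease",
    "heart_disease": "disease",
    "disease": None,
}

NONE = "<none>"  # placeholder row/column for unmatched labels (assumption)


def features(label):
    """Feature set of a label: the label itself plus all of its ancestors."""
    feats = set()
    while label is not None:
        feats.add(label)
        label = PARENT[label]
    return feats


def similarity(a, b):
    """Jaccard overlap of feature sets, one simple feature-based measure."""
    fa, fb = features(a), features(b)
    return len(fa & fb) / len(fa | fb)


def align(expected, predicted, threshold=0.5):
    """Greedily pair expected and predicted labels by descending similarity."""
    candidates = sorted(
        ((similarity(e, p), e, p) for e in expected for p in predicted),
        key=lambda t: (-t[0], t[1], t[2]),
    )
    pairs, used_e, used_p = [], set(), set()
    for score, e, p in candidates:
        if score < threshold:
            break  # remaining candidates are too dissimilar to align
        if e not in used_e and p not in used_p:
            pairs.append((e, p))
            used_e.add(e)
            used_p.add(p)
    missed = sorted(set(expected) - used_e)      # expected, never predicted
    spurious = sorted(set(predicted) - used_p)   # predicted, never expected
    return pairs, missed, spurious


def confusion_matrix(samples, threshold=0.5):
    """Count (expected, predicted) cells over (expected, predicted) label sets."""
    counts = defaultdict(int)
    for expected, predicted in samples:
        pairs, missed, spurious = align(expected, predicted, threshold)
        for e, p in pairs:
            counts[(e, p)] += 1
        for e in missed:
            counts[(e, NONE)] += 1
        for p in spurious:
            counts[(NONE, p)] += 1
    return dict(counts)


samples = [
    (["pneumonia", "cardiomegaly"], ["bronchitis", "cardiomegaly"]),
    (["pneumonia"], ["cardiomegaly"]),
]
matrix = confusion_matrix(samples)
```

In this sketch, a predicted `bronchitis` is charged to the expected `pneumonia` cell because the two share most of their ancestors, whereas `pneumonia` and `cardiomegaly` fall below the threshold and are recorded as a missed label and a spurious label respectively. A production version would likely replace the greedy pairing with an optimal assignment and the toy taxonomy with a real ontology.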
