Using deep learning for short text understanding

Classifying short texts to one category or clustering semantically related texts is challenging, and the importance of both is growing due to the rise of microblogging platforms, digital news feeds, and the like. We can accomplish this classifying and clustering with the help of a deep neural network which produces compact binary representations of a short text, and can assign the same category to texts that have similar binary representations. But problems arise when there is little contextual information on the short texts, which makes it difficult for the deep neural network to produce similar binary codes for semantically related texts. We propose to address this issue using semantic enrichment. This is accomplished by taking the nouns, and verbs used in the short texts and generating the concepts and co-occurring words with the help of those terms. The nouns are used to generate concepts within the given short text, whereas the verbs are used to prune the ambiguous context (if any) present in the text. The enriched text then goes through a deep neural network to produce a prediction label for that short text representing it’s category.

[1]  Dongwoo Kim,et al.  Context-Dependent Conceptualization , 2013, IJCAI.

[2]  Luis Gravano,et al.  Categorizing web queries according to geographical locality , 2003, CIKM '03.

[3]  Qiang Yang,et al.  Query enrichment for web-query classification , 2006, TOIS.

[4]  Geoffrey E. Hinton,et al.  Semantic hashing , 2009, Int. J. Approx. Reason..

[5]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[6]  Nan Sun,et al.  Exploiting internal and external semantics for the clustering of short texts using world knowledge , 2009, CIKM.

[7]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[8]  Haixun Wang,et al.  Short Text Conceptualization Using a Probabilistic Knowledgebase , 2011, IJCAI.

[9]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[10]  Patrick van der Smagt,et al.  Introduction to neural networks , 1995, The Lancet.

[11]  A. Kitchen,et al.  Knowledge based systems in artificial intelligence , 1985, Proceedings of the IEEE.

[12]  Ophir Frieder,et al.  Automatic web query classification using labeled and unlabeled training data , 2005, SIGIR '05.

[13]  Mehran Sahami,et al.  A web-based kernel function for measuring the similarity of short text snippets , 2006, WWW '06.

[14]  Douglas B. Lenat,et al.  Knowledge-based systems in artificial intelligence , 1981 .

[15]  Mario Jarmasz,et al.  Roget's Thesaurus as a Lexical Resource for Natural Language Processing , 2012, ArXiv.

[16]  Somnath Banerjee,et al.  Clustering short texts using wikipedia , 2007, SIGIR.

[17]  In-Ho Kang,et al.  Query type classification for web document retrieval , 2003, SIGIR.

[18]  Haixun Wang,et al.  Understanding Short Texts through Semantic Enrichment and Hashing , 2016, IEEE Transactions on Knowledge and Data Engineering.

[19]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[20]  Christopher Meek,et al.  Improving Similarity Measures for Short Segments of Text , 2007, AAAI.

[21]  J. Giles Internet encyclopaedias go head to head , 2005, Nature.

[22]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[23]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[24]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[25]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[26]  Evgeniy Gabrilovich,et al.  Feature Generation for Text Categorization Using World Knowledge , 2005, IJCAI.

[27]  Ramanathan V. Guha,et al.  Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project , 1990 .

[28]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[29]  Haixun Wang,et al.  Probase: a probabilistic taxonomy for text understanding , 2012, SIGMOD Conference.

[30]  Ramanathan V. Guha,et al.  Building large knowledge-based systems , 1989 .

[31]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .