Deep Learning Based Abstractive Turkish News Summarization

With the increase of knowledge, there is a need for summarization systems that will direct the person to the area they are interested in without any waste of time. In this work, Turkish news headlines have been predicted by using encoder-decoder model from deep learning methods. Abstraction based text summarization method has been used during the generation of headlines. The system has been trained with recurrent neural networks by developing encoder-decoder model. The word embeddings of the words in news texts have been generated by using FastText that is very commonly used model in the literature recently. The system has been tested separately by training the first sentence, first two sentences and full-text of each news. The success of the system is measured by ROUGE score and semantic similarity score. According to the experimental results, it has been observed the model trained with full-text of news outperforms among the other models.

[1]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[2]  A. Burak Inner,et al.  Detecting similar news by summarizing Turkish news , 2018, 2018 26th Signal Processing and Communications Applications Conference (SIU).

[3]  Ilyas Cicekli,et al.  Generic text summarization for Turkish , 2009, 2009 24th International Symposium on Computer and Information Sciences.

[4]  Mirella Lapata,et al.  Ranking Sentences for Extractive Summarization with Reinforcement Learning , 2018, NAACL.

[5]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[6]  Prakhar Gupta,et al.  Learning Word Vectors for 157 Languages , 2018, LREC.

[7]  Markus Freitag,et al.  Beam Search Strategies for Neural Machine Translation , 2017, NMT@ACL.

[8]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[9]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[10]  Lakhmi C. Jain,et al.  Recurrent Neural Networks: Design and Applications , 1999 .

[11]  Bowen Zhou,et al.  Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.

[12]  Nilgun Guler Bayazit,et al.  Automatic summarization of Turkish documents using non-negative matrix factorization , 2011, 2011 International Symposium on Innovations in Intelligent Systems and Applications.

[13]  Krys J. Kochut,et al.  Text Summarization Techniques: A Brief Survey , 2017, International Journal of Advanced Computer Science and Applications.

[14]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.