A sentiment analysis model with multi-head self-attention and Tree-LSTM

In natural language processing tasks, we often need to extract information from tree topologies: sentence structure can be represented by a dependency tree or a constituency tree. A standard LSTM handles sequential information (equivalent to a linear list) but cannot process tree-structured data, which motivates the Tree-LSTM. This model also uses multi-head self-attention, whose main purpose is to reduce computation and improve parallel efficiency without harming model quality: it avoids the heavy computation and large parameter counts of CNNs and the inability of RNNs to compute in parallel, while retaining parallel computation and the ability to capture long-distance dependencies. The model combines multi-head self-attention with a Tree-LSTM and uses maxout neurons at the output position. Its accuracy on SST is 89%.
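
To make the tree-structured component concrete, below is a minimal sketch of a Child-Sum Tree-LSTM cell in the style of Tai et al. (2015), which generalizes the LSTM from linear lists to trees by summing child hidden states and applying a separate forget gate per child. The class name `TreeLSTMCell` and the tensor layout are illustrative assumptions, not the paper's exact implementation.

```python
import torch
import torch.nn as nn

class TreeLSTMCell(nn.Module):
    """Child-Sum Tree-LSTM cell (illustrative sketch)."""

    def __init__(self, input_dim: int, hidden_dim: int):
        super().__init__()
        # Input, output, and update gates operate on the summed child state.
        self.W_iou = nn.Linear(input_dim, 3 * hidden_dim)
        self.U_iou = nn.Linear(hidden_dim, 3 * hidden_dim, bias=False)
        # The forget gate is computed separately for each child.
        self.W_f = nn.Linear(input_dim, hidden_dim)
        self.U_f = nn.Linear(hidden_dim, hidden_dim, bias=False)

    def forward(self, x, child_h, child_c):
        # x: (input_dim,); child_h, child_c: (num_children, hidden_dim)
        # For leaf nodes, pass a single all-zero child state.
        h_sum = child_h.sum(dim=0)  # child-sum aggregation
        i, o, u = torch.chunk(self.W_iou(x) + self.U_iou(h_sum), 3)
        i, o, u = torch.sigmoid(i), torch.sigmoid(o), torch.tanh(u)
        # One forget gate per child, gating that child's cell state.
        f = torch.sigmoid(self.W_f(x) + self.U_f(child_h))
        c = i * u + (f * child_c).sum(dim=0)
        h = o * torch.tanh(c)
        return h, c
```

In practice the cell is applied bottom-up over the dependency or constituency tree, so each node's state summarizes its entire subtree.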
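The multi-head self-attention component can be sketched as standard scaled dot-product attention (Vaswani et al., 2017). Because every position attends to every other position in a single matrix multiplication, the computation parallelizes fully and the path length between any two tokens is constant, which is the parallelism and long-distance benefit the section describes. Dimensions and names here are illustrative.

```python
import math
import torch
import torch.nn as nn

class MultiHeadSelfAttention(nn.Module):
    """Scaled dot-product multi-head self-attention (illustrative sketch)."""

    def __init__(self, d_model: int, num_heads: int):
        super().__init__()
        assert d_model % num_heads == 0
        self.num_heads = num_heads
        self.d_head = d_model // num_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)  # fused Q, K, V projection
        self.out = nn.Linear(d_model, d_model)

    def forward(self, x):
        # x: (batch, seq_len, d_model)
        b, t, d = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Split into heads: (batch, num_heads, seq_len, d_head)
        split = lambda z: z.view(b, t, self.num_heads, self.d_head).transpose(1, 2)
        q, k, v = split(q), split(k), split(v)
        # All pairwise interactions are computed at once, in parallel.
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.d_head)
        attn = scores.softmax(dim=-1)
        out = (attn @ v).transpose(1, 2).reshape(b, t, d)
        return self.out(out)
```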
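Finally, the output position uses maxout neurons (Goodfellow et al., 2013): each unit computes several affine maps of its input and outputs their piecewise maximum, giving a learned, convex activation. A minimal sketch follows; `num_pieces` and the class name are assumptions for illustration.

```python
import torch
import torch.nn as nn

class Maxout(nn.Module):
    """Maxout layer: max over num_pieces affine maps (illustrative sketch)."""

    def __init__(self, in_dim: int, out_dim: int, num_pieces: int = 4):
        super().__init__()
        # One affine map per piece, stacked into a single linear layer.
        self.linear = nn.Linear(in_dim, out_dim * num_pieces)
        self.out_dim, self.num_pieces = out_dim, num_pieces

    def forward(self, x):
        # x: (..., in_dim) -> (..., out_dim)
        z = self.linear(x)
        z = z.view(*x.shape[:-1], self.out_dim, self.num_pieces)
        return z.max(dim=-1).values
```

A sentiment classifier would place this layer between the combined attention/Tree-LSTM representation and a softmax over sentiment labels.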