Lightweight, Dynamic Graph Convolutional Networks for AMR-to-Text Generation

AMR-to-text generation transduces Abstract Meaning Representation (AMR) graphs into natural-language text. A key challenge in this task is to learn effective graph representations efficiently. Graph Convolutional Networks (GCNs) have previously been used to encode input AMRs; however, vanilla GCNs follow a local (first-order) information aggregation scheme and therefore cannot capture non-local information. Capturing such richer interactions consequently requires larger and deeper GCN models. In this paper, we introduce a dynamic fusion mechanism and propose Lightweight Dynamic Graph Convolutional Networks (LDGCNs), which capture richer non-local interactions by synthesizing higher-order information from the input graphs. We further develop two novel parameter-saving strategies, based on group graph convolutions and weight-tied convolutions, to reduce memory usage and model complexity. With these strategies, we can train a model with fewer parameters while maintaining model capacity. Experiments demonstrate that LDGCNs outperform state-of-the-art models on two benchmark datasets for AMR-to-text generation with significantly fewer parameters.
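
To make the three ideas in the abstract concrete, the sketch below shows a minimal, self-contained layer in their spirit: each node fuses 1st- through K-th-order neighborhood aggregations with input-dependent gates (dynamic fusion), a single projection matrix is reused across all orders (weight tying), and a small helper illustrates group graph convolutions. The function names, the sigmoid gating form, and all shapes are illustrative assumptions for exposition, not the authors' exact formulation.

```python
import numpy as np

def normalize_adj(A):
    """Row-normalize an adjacency matrix after adding self-loops."""
    A = A + np.eye(A.shape[0])
    return A / np.maximum(A.sum(axis=1, keepdims=True), 1e-8)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def ldgcn_layer(H, A, W, W_gate, K=3):
    """Dynamically fused graph convolution (illustrative sketch).

    H:      node features, shape (n, d)
    A:      adjacency matrix, shape (n, n)
    W:      projection shared by all orders (weight tying), shape (d, d)
    W_gate: produces one fusion gate per order, shape (d, K)
    K:      highest neighborhood order (k-hop) to synthesize
    """
    A_hat = normalize_adj(A)
    # Aggregate 1st- to K-th-order neighborhoods with one tied projection.
    hops, A_k = [], A_hat
    for _ in range(K):
        hops.append(A_k @ H @ W)
        A_k = A_k @ A_hat
    # Input-dependent ("dynamic") gates weight each order per node.
    gates = sigmoid(H @ W_gate)                      # (n, K)
    fused = sum(gates[:, k:k + 1] * hops[k] for k in range(K))
    return np.maximum(fused, 0.0)                    # ReLU

def group_graph_conv(H, A_hat, Ws):
    """Group graph convolution: split the d channels into len(Ws) groups,
    convolve each group with its own (d/g, d/g) matrix, then concatenate.
    This cuts projection parameters from d*d to d*d/g."""
    chunks = np.split(H, len(Ws), axis=1)
    return np.concatenate(
        [A_hat @ c @ Wi for c, Wi in zip(chunks, Ws)], axis=1)

# Toy usage on a 4-node path graph with 8-dim features.
rng = np.random.default_rng(0)
A = np.zeros((4, 4)); A[[0, 1, 2], [1, 2, 3]] = 1.0; A += A.T
H = rng.normal(size=(4, 8))
out = ldgcn_layer(H, A, 0.1 * rng.normal(size=(8, 8)),
                  0.1 * rng.normal(size=(8, 3)))
out_g = group_graph_conv(H, normalize_adj(A),
                         [0.1 * rng.normal(size=(4, 4)) for _ in range(2)])
print(out.shape, out_g.shape)  # (4, 8) (4, 8)
```

Note the parameter accounting under these assumptions: weight tying reuses one W across all K orders instead of K separate matrices, and grouping divides the per-layer projection cost by the number of groups; together, these are what make such a network "lightweight".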
