TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages

Recently, the structural reading comprehension (SRC) task on web pages has attracted increasing research interests. Although previous SRC work has leveraged extra information such as HTML tags or XPaths, the informative topology of web pages is not effectively exploited. In this work, we propose a Topological Information Enhanced model (TIE), which transforms the token-level task into a tag-level task by introducing a two-stage process (i.e. node locating and answer refining). Based on that, TIE integrates Graph Attention Network (GAT) and Pre-trained Language Model (PLM) to leverage the topological information of both logical structures and spatial structures. Experimental results demonstrate that our model outperforms strong baselines and achieves state-of-the-art performances on the web-based SRC benchmark WebSRC at the time of writing. The code of TIE will be publicly available at https://github.com/X-LANCE/TIE.

[1]  Yongbin Li,et al.  S^2SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers , 2022, FINDINGS.

[2]  Furu Wei,et al.  MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding , 2021, ACL.

[3]  Kai Yu,et al.  Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL , 2021, FINDINGS.

[4]  Kai Yu,et al.  LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations , 2021, ACL.

[5]  Thomas Muller,et al.  DoT: An efficient Double Transformer for NLP tasks with tables , 2021, FINDINGS.

[6]  Nicolas Rodolfo Fauceglia,et al.  Capturing Row and Column Semantics in Transformer Based Question Answering over Tables , 2021, NAACL.

[7]  Isil Dillig,et al.  Web question answering with neurosymbolic program synthesis , 2021, PLDI.

[8]  Kai Yu,et al.  ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser , 2021, NAACL.

[9]  Kai Yu,et al.  LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching , 2021, AAAI.

[10]  Lu Chen,et al.  WebSRC: A Dataset for Web-Based Structural Reading Comprehension , 2021, Conference on Empirical Methods in Natural Language Processing.

[11]  William W. Cohen,et al.  Open Question Answering over Tables and Text , 2020, ICLR.

[12]  Ming Gong,et al.  A Graph Representation of Semi-structured Data for Web Question Answering , 2020, COLING.

[13]  Kai Yu,et al.  Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[14]  Lu Chen,et al.  Neural Graph Matching Networks for Chinese Short Text Matching , 2020, ACL.

[15]  Lu Chen,et al.  Line Graph Enhanced AMR-to-Text Generation with Mix-Order Graph Attention Networks , 2020, ACL.

[16]  Jianjun Hu,et al.  A Survey on Machine Reading Comprehension: Tasks, Evaluation Metrics, and Benchmark Datasets , 2020, Applied Sciences.

[17]  Jian Pei,et al.  Mining Implicit Relevance Feedback from User Behavior for Web Question Answering , 2020, KDD.

[18]  Xin Luna Dong,et al.  ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages , 2020, ACL.

[19]  Wenhu Chen,et al.  HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual Data , 2020, FINDINGS.

[20]  Kai Yu,et al.  Efficient Context and Schema Fusion Networks for Multi-Domain Dialogue State Tracking , 2020, FINDINGS.

[21]  Chi Wang,et al.  Schema-Guided Multi-Domain Dialogue State Tracking with Graph Attention Neural Networks , 2020, AAAI.

[22]  Xiaodong Liu,et al.  RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers , 2019, ACL.

[23]  Tao Gui,et al.  A Lexicon-Based Graph Neural Network for Chinese NER , 2019, EMNLP.

[24]  Jens Lehmann,et al.  LC-QuAD 2.0: A Large Dataset for Complex Question Answering over Wikidata and DBpedia , 2019, SEMWEB.

[25]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[26]  Ali Farhadi,et al.  OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Kai Yu,et al.  AgentGraph: Toward Universal Dialogue Management With Structured Deep Reinforcement Learning , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[28]  Xueqi Cheng,et al.  Controlling Risk of Web Question Answering , 2019, SIGIR.

[29]  Jonathan Berant,et al.  Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing , 2019, ACL.

[30]  Yuan Luo,et al.  Graph Convolutional Networks for Text Classification , 2018, AAAI.

[31]  Danqi Chen,et al.  CoQA: A Conversational Question Answering Challenge , 2018, TACL.

[32]  Frank Hutter,et al.  Decoupled Weight Decay Regularization , 2017, ICLR.

[33]  Yoshua Bengio,et al.  HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering , 2018, EMNLP.

[34]  Lu Chen,et al.  Structured Dialogue Policy with Graph Neural Networks , 2018, COLING.

[35]  Tao Yu,et al.  TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation , 2018, NAACL.

[36]  Zhi Chen,et al.  Policy Adaptation for Deep Reinforcement Learning-Based Dialogue Management , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[37]  Jonathan Berant,et al.  The Web as a Knowledge-Base for Answering Complex Questions , 2018, NAACL.

[38]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[39]  Qi Wu,et al.  FVQA: Fact-Based Visual Question Answering , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Eunsol Choi,et al.  TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension , 2017, ACL.

[41]  Guokun Lai,et al.  RACE: Large-scale ReAding Comprehension Dataset From Examinations , 2017, EMNLP.

[42]  Tiejun Zhao,et al.  Constraint-Based Question Answering with Knowledge Graph , 2016, COLING.

[43]  Ming-Wei Chang,et al.  The Value of Semantic Parse Labeling for Knowledge Base Question Answering , 2016, ACL.

[44]  Jian Zhang,et al.  SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.

[45]  Percy Liang,et al.  Compositional Semantic Parsing on Semi-Structured Tables , 2015, ACL.

[46]  Margaret Mitchell,et al.  VQA: Visual Question Answering , 2015, International Journal of Computer Vision.

[47]  Wei Zhang,et al.  Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.

[48]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[49]  Ah Chung Tsoi,et al.  The Graph Neural Network Model , 2009, IEEE Transactions on Neural Networks.