Natural language text describes the nature of people's internal representation of space. It is investigated that 80% of unstructured text has location expressions e.g. place names and spatial relations. In the past few years, text has become a most important geospatial data resource as well as survey, map, satellite images and GPS. The most previous research focused on the recognition of place names in text and its integration with map services. Spatial relations play an important role in the fields of spatial data modelling, spatial query, spatial analysis, spatial reasoning and map generalization. Spatial relations in text are described in natural language with qualitative spatial expressions including place names, spatial terms, prepositions, verbs and so on. And these expressions are combined with certain syntactic patterns to represent their semantic functions. An instance of spatial relation in text can be simply formalized as (C1, P1, C2, P2, C3), where P1 and P2 are place names, and C1, C2 and C3 are the context. Support Vector Machine (SVM) is a pattern recognition method popularly used in information extraction from text. This paper investigates the extraction of spatial relations based on SVM model which can implement the recognition of spatial expressions and their classification synchronously. For the SVM model, a set of feature vectors are specified, such as lexical tokens, spatial terms, syntactic structures and geographical feature types of place names, and a multi-label classifier is presented to solve the multi-classification problem. Finally, an experimental evaluation is explored in a Chinese annotation corpus. This study proves that spatial terms are important indicators for identification of spatial relations in text. However, there is serious ambiguity of their classification. Therefore, integration of much more context information could potentially improve the performance of extraction of spatial relations in text.
[1]
Yuan Zheng-wu.
Study on geographical entity recognition in GIS
,
2008
.
[2]
Betsy Palkowsky.
A New Approach to Information Discovery
,
2005
.
[3]
Lv Guonian.
Natural-language Spatial Relations and Their Applications in GIS
,
2007
.
[4]
Gumwon Hong.
Relation Extraction Using Support Vector Machine
,
2005,
IJCNLP.
[5]
M. Egenhofer.
Categorizing Binary Topological Relations Between Regions, Lines, and Points in Geographic Databases
,
1998
.
[6]
Xu Jun.
Formalizing the Natural-language Descriptions about the Spatial Relations between Linear Geographic Objects
,
2007
.
[7]
Scott Miller,et al.
A Novel Use of Statistical Parsing to Extract Information from Text
,
2000,
ANLP.
[8]
Albert K. W. Yeung,et al.
Concepts And Techniques Of Geographic Information Systems
,
2002
.
[9]
Razvan C. Bunescu,et al.
Subsequence Kernels for Relation Extraction
,
2005,
NIPS.
[10]
Jian Su,et al.
Exploring Various Knowledge in Relation Extraction
,
2005,
ACL.
[11]
Xueying Zhang,et al.
Rule-Based Extraction of Spatial Relations in Natural Language Text
,
2009,
2009 International Conference on Computational Intelligence and Software Engineering.
[12]
Guonian Lv,et al.
EXTRACTION AND VISUALIZATION OF GEOGRAPHICAL NAMES IN TEXT
,
2009
.
[13]
Betsy Palkowsky.
A New Approach to Information Discovery—Geography Really Does Matter
,
2005
.
[14]
Xueying Zhang,et al.
Annotation of Spatial Relations in Natural Language
,
2009,
2009 International Conference on Environmental Science and Information Application Technology.
[15]
Chongjun Yang,et al.
Spatial concept extraction based on spatial semantic role in natural language
,
2005
.