skip to main content
10.1145/2675354.2675702acmconferencesArticle/Chapter ViewAbstractPublication PagesgirConference Proceedingsconference-collections
research-article

Building a corpus of spatial relational expressions extracted from web documents

Published: 04 November 2014 Publication History

Abstract

Spatial language, despite decades of research, still poses substantial challenges for automated systems, for instance in geographic information retrieval or human-robot interaction. We describe an approach to building a corpus of natural language expressions extracted from web documents for analyzing and modeling spatial relational expressions (SRE). The unique characteristic of this corpus is that it is built around georeferenced triplets, with each triplet containing two entities (including their latitude/longitude coordinates) related by a spatial expression such as near. While the approach is still experimental, our first results are promising, in that we believe they will form the foundation for a comprehensive contextualized model for interpreting spatial natural language expressions. For the time being, we are focusing on a single domain, hotel reviews. This domain restriction allowed us to implement a proof-of-concept that this approach, with advances in natural language technologies, will indeed deliver a comprehensive corpus. The potential to collect larger corpora, and associated challenges, is discussed.

References

[1]
J. A. Bateman. Language and space: a two-level semantic approach based on principles of ontological engineering. International Journal of Speech Technology, 13:29--48, 2010.
[2]
J. Brennan and E. Martin. Spatial proximity is more than just a distance measure. International Journal of Human-Computer Studies, 70(1):88--106, 2012.
[3]
M. Burigo and K. Coventry. Context Affects Scale Selection for Proximity Terms. Spatial Cognition & Computation, 10(4):292--312, 2010.
[4]
K. R. Coventry and S. Garrod. Saying, Seeing, and Acting: The Psychological Semantics of Spatial Prepositions. Psychology Press, Hove, 2004.
[5]
K. R. Coventry and S. Garrod. Towards a classification of extra-geometric influences on the comprehension of spatial prepositions. In L. A. Carlson and E. van der Zee, editors, Functional features in language and space. Oxford UP, 2004.
[6]
C. Derungs and R. Purves. Where's near? Using web n-grams to explore spatial relations. In GIScience 2014, to appear.
[7]
S. Dutta. Qualitative spatial resoning: A semi-quantitative approach using fuzzy logic. In A. Buchmann, O. Günther, T. R. Smith, and Y. F. Wang, editors, Design and implementation of large spatial datebases, pages 345--364. Springer, 1990.
[8]
G. Edwards and B. Moulin. Toward the simulation of spatial mental images using the voronoi model. In P. Olivier and K.-P. Gapp, editors, Representation and processing of spatial expressions, pages 163--184. Lawrence Erlbaum, 1998.
[9]
A. U. Frank. Qualitative spatial reasoning about distances and directions in geographic space. Journal of Visual Languages and Computing, 3:343--371, 1992.
[10]
M. Gahegan. Proximity operators for qualitative spatial reasoning. In A. U. Frank and W. Kuhn, editors, Spatial Information Theory. 1995.
[11]
K.-P. Gapp. Object localization: Selection of optimal reference objects. In A. U. Frank and W. Kuhn, editors, Spatial Information Theory. Springer, 1995.
[12]
M. M. Hall and C. B. Jones. Cultural and language influences on the interpretation of spatial prepositions. In Proceedings of the GI Forum 2012, 2012.
[13]
A. Herskovits. Language and Spatial Cognition: An Interdisciplinary Study of the Representation of the Prepositions in English. Cambridge University Press, Cambrigde and England, 1986.
[14]
D. Kemmerer. "Near" and "far" in language and perception. Cognition, 73(1):35--63, 1999.
[15]
D. Kettani and B. Moulin. A spatial model based on the notions of spatial conceptual map and of object's influence area. In C. Freksa and D. M. Mark, editors, Spatial Information Theory. Cognitive and Computational Foundations of Geographic Information Science, pages 401--416. Springer, Berlin, 1999.
[16]
A. Khan, M. Vasardani, and S. Winter. Extracting spatial information from place descriptions. In Proc. of the First ACM SIGSPATIAL International Workshop on Computational Models of Place, 2013.
[17]
P. Kordjamshidi, J. Hois, M. van Otterlo, and M.-F. Moens. Learning to interpret spatial natural language in terms of qualitative spatial relations. In T. Tenbrink, J. Wiener, and C. Claramunt, editors, Series Explorations in Language and Space. 2013.
[18]
M. Lapata and F. Keller. The web as a baseline: Evaluating the performance of unsupervised web-based models for a range of NLP tasks. In Proceedings of the Human Language Technology conference of the North American Chapter of the Association for Computational Linguistics, pages 121--128.
[19]
G. A. Miller and P. N. Johnson-Laird. Language and perception. Cambridge University Press, 1976.
[20]
R. Moratz, K. Fischer, and T. Tenbrink. Cognitive modelling of spatial reference for human-robot interaction. In IEEE International Conference on Tools with Artificial Intelligence, pages 222--228. 2000.
[21]
D. G. Morrow and H. H. Clark. Interpreting words in spatial descriptions. Language and Cognitive Processes, 3(4):275--291, 1988.
[22]
S. Schockaert, M. D. Cock, and E. Kerre. Location approximation for local search services using natural language hints. International Journal of Geographical Information Science, 22:315--336, 2008.
[23]
G. Skoumas, D. Pfoser, and T. K. Anastasios. On quantifying qualitative geospatial data: a probabilistic approach. In Proceedings of the 2nd GEOGROWD Workshop, pages 71--78, 2013.
[24]
K. Stock and C. Cialone. Universality, Language-Variability and Individuality: Defining Linguistic Building Blocks for Spatial Relations. In M. Egenhofer, N. Giudice, R. Moratz, and M. Worboys, editors, Spatial Information Theory. 10th International Conference, COSIT 2011. Proceedings, pages 391--412. Springer, 2011.
[25]
L. Talmy. Force dynamics in language and cognition. Cognitive Science, 12(1):49--100, 1988.
[26]
F. Twaroch, R. Purves, and C. Jones. Stability of qualitative spatial relations between vernacular regions mined from web data. In Proceedings of Workshop on Geographic Information on the Internet, 2009.
[27]
S. Xu and A. Klippel. Developing nearness models from geocoding spatial entities in a news corpus. In GIScience 2012, extended abstracts. 2012.
[28]
X. Yao and J.-C. Thill. How far Is too far? -- A statistical approach to context-contingent proximity modeling. Transactions in GIS, 9(2):157--178, 2005.

Cited By

View all
  • (2022)Disambiguating spatial prepositions: The case of geo‐spatial sense detectionTransactions in GIS10.1111/tgis.1297626:6(2621-2650)Online publication date: 6-Sep-2022
  • (2021)Extraction and Visualization of Tourist Attraction Semantics from Travel BlogsISPRS International Journal of Geo-Information10.3390/ijgi1010071010:10(710)Online publication date: 18-Oct-2021
  • (2021)Detecting geospatial location descriptions in natural language textInternational Journal of Geographical Information Science10.1080/13658816.2021.198744136:3(547-584)Online publication date: 22-Dec-2021
  • Show More Cited By

Index Terms

  1. Building a corpus of spatial relational expressions extracted from web documents

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GIR '14: Proceedings of the 8th Workshop on Geographic Information Retrieval
    November 2014
    94 pages
    ISBN:9781450331357
    DOI:10.1145/2675354
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 November 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. corpus building
    2. information retrieval
    3. proximity
    4. spatial language
    5. spatial relations

    Qualifiers

    • Research-article

    Conference

    SIGSPATIAL '14
    Sponsor:
    • University of North Texas
    • Microsoft
    • ORACLE
    • Facebook
    • SIGSPATIAL

    Acceptance Rates

    GIR '14 Paper Acceptance Rate 11 of 15 submissions, 73%;
    Overall Acceptance Rate 46 of 61 submissions, 75%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Disambiguating spatial prepositions: The case of geo‐spatial sense detectionTransactions in GIS10.1111/tgis.1297626:6(2621-2650)Online publication date: 6-Sep-2022
    • (2021)Extraction and Visualization of Tourist Attraction Semantics from Travel BlogsISPRS International Journal of Geo-Information10.3390/ijgi1010071010:10(710)Online publication date: 18-Oct-2021
    • (2021)Detecting geospatial location descriptions in natural language textInternational Journal of Geographical Information Science10.1080/13658816.2021.198744136:3(547-584)Online publication date: 22-Dec-2021
    • (2020)A Review of Geospatial Semantic Information Modeling and Elicitation ApproachesISPRS International Journal of Geo-Information10.3390/ijgi90301469:3(146)Online publication date: 1-Mar-2020
    • (2020)Nearness as context-dependent expression: an integrative review of modeling, measurement and contextual propertiesSpatial Cognition & Computation10.1080/13875868.2020.1754832(1-73)Online publication date: 22-May-2020
    • (2019)Geographic Entity Relationship Extraction Model Based on Piecewise Convolution of Residual NetworkProceedings of the 3rd International Conference on Machine Learning and Soft Computing10.1145/3310986.3311025(160-165)Online publication date: 25-Jan-2019
    • (2018)Fictive motion in the context of mountaineeringSpatial Cognition & Computation10.1080/13875868.2018.143164618:4(259-284)Online publication date: 23-Feb-2018
    • (2016)A distantly supervised method for extracting spatio-temporal information from textProceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems10.1145/2996913.2996967(1-4)Online publication date: 31-Oct-2016
    • (2016)Location Estimation Using Crowdsourced Spatial RelationsACM Transactions on Spatial Algorithms and Systems10.1145/28947452:2(1-23)Online publication date: 21-Jun-2016
    • (2016)Automated Geocoding of Textual Documents: A Survey of Current ApproachesTransactions in GIS10.1111/tgis.1221221:1(3-38)Online publication date: 17-Jun-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media