Skip to main content
Log in

JS4Geo: a canonical JSON Schema for geographic data suitable to NoSQL databases

  • Published:
GeoInformatica Aims and scope Submit manuscript

Abstract

The large volume and variety of data produced in the current Big Data era lead companies to seek solutions for the efficient data management. Within this context, NoSQL databases rise as a better alternative to the traditional relational databases, mainly in terms of scalability and availability of data. A usual feature of NoSQL databases is to be schemaless, i.e., they do not impose a schema or have a flexible schema. This is interesting for systems that deal with complex data, such as GIS. However, the lack of a schema becomes a problem when applications need to perform processes such as data validation, data integration, or data interoperability, as there is no pattern for schema representation in NoSQL databases. On the other hand, the JSON language stands out as a standard for representing and exchanging data in document NoSQL databases, and JSON Schema is a schema representation language for JSON documents that it is also leading to become a standard. However, it does not include spatial data types. From this limitation, this paper proposes an extension to JSON Schema, called JS4Geo, that allows the definition of schemas for geographic data. We demonstrate that JS4Geo is able to represent schemas of any NoSQL data model, as well as other standards for geographic data, like GML and KML. We also present a case study that shows how a data integration system can benefit of JS4Geo to define local schemas for geographic datasets and generate an integrated global schema.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23

Similar content being viewed by others

Notes

  1. https://github.com/geojson/schema

  2. More details about JSON and related standards are given in the next section.

  3. We assume JSON as the format for representing document instances given its acceptance by almost all NoSQL document DBMSs.

  4. https://www.json.org

  5. http://json-schema.org

  6. https://github.com/geojson/schema

  7. http://schemastore.org/json/

  8. N/A means Not Applicable.

  9. https://goessner.net/articles/JsonPath/

  10. http://www2.portoalegre.rs.gov.br/spm/

  11. http://datapoa.com.br/

  12. http://www.nosbairros.com.br/utilidade/delegacia-porto-alegres.htm

  13. http://www2.portoalegre.rs.gov.br/sms/default.php?p_secao=917

  14. https://www.qgis.org/en/site/

  15. https://en.wikipedia.org/wiki/Geohash

References

  1. Afsari K, Eastman CM, Castro-Lacouture D (2017) Javascript Object Notation (JSON) data serialization for IFC schema in web-based BIM data exchange. Autom Constr 77:24–51

    Article  Google Scholar 

  2. Angles R, Gutiérrez C (2008) Survey of graph database models. ACM Comput Surv 40(1):1:1–1:39

    Article  Google Scholar 

  3. Bordogna G, Psaila G (2018) Why we need a novel framework to integrate and transform heterogeneous multi-source geo-referenced information: the j-CO proposal. Computer Science &, Information Technology (CS & IT) 8(13):41–60

    Article  Google Scholar 

  4. Cattell R (2011) Scalable sql and nosql data stores. SIGMOD Rec, 39, 4

  5. CouchDB Apache CouchDB (2019). http://couchdb.apache.org. Accessed 24-Jul-2019

  6. Cure O, Hecht R, Duc C L, Lamolle M (2011) Data Integration over noSQL Stores Using Access Path Based Mappings. In: Proceedings of the 22nd International Conference on Database and Expert Systems Applications, DEXA. Springer, Berlin, pp 481–495

  7. Frozza A A, Mello RDS, da Costa FDS (2018) An Approach for Schema Extraction of JSON and Extended JSON Document Collections. In: IEEE International conference on information reuse and integration (IRI). IEEE, pp 356–363

  8. Golshan B, Halevy A, Mihaila G, Tan WC (2017) data integration: After the teenage years. In: Proceedings of the 36th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems - PODS ’17. ACM Press, New York, pp 101–106

  9. IETF The GeoJSON Format (2019). http://geojson.org/. Accessed 24-Jul-2019

  10. ISO ISO/IEC 14977:1996 - Extended BNF (1996). http://www.iso.org/iso/catalogue_detail?csnumber=26153. Accessed 01-Feb-2019

  11. JSON SCHEMA JSON Schema (2019). http://json-schema.org/. Accessed 24-Jul-2019

  12. Kapsammer E, Kusel A, Mitsch S, Pröll B, Retschitzegger W, Schwinger W, Schömböck J, Wimmer M, Wischenbart M, Lechner S (2012) User profile integration made easy - Model-driven extraction and transformation of social network schemas. In: Proceedings of the 21st annual conference on world wide web companion, WWW, pp 939–948

  13. Karpov V Mongoose NPM package (2017). https://www.npmjs.com/package/mongoose. Accessed 24-Jul-2019

  14. Kaur K, Rani R (2013) Modeling and querying data in noSQL databases. In: IEEE International conference on big data, pp 1–7. IEEE

  15. Khalfi B, De Runz C, Faiz S, Herman A (2017) A New Methodology for Storing Consistent Fuzzy Geospatial Data in Big Data Environment. IEEE Transactions on Big Data pp 1–1

  16. Klettke M, Storl U, Scherzinger S (2015) Schema Extraction and Structural Outlier Detection for JSON-based noSQL Data Stores. In: BTW, LNI, vol 241, pp 425–444. GI

  17. Kresse W, Danko DM (2012) Springer handbook of geographic information. Springer

  18. Lanza J, Sanchez L, Gomez D, Elsaleh T, Steinke R, Cirillo F (2016) A Proof-of-Concept for Semantically Interoperable Federation of IoT Experimentation Facilities. Sensors 16(7):1006

    Article  Google Scholar 

  19. Ledoux H, Ohori KA, Kumar K, Dukai B, Labetski A, Vitalis S (2019) CityJSON: a compact and easy-to-use encoding of the CityGML data model. Open Geospatial Data, Software and Standards 4(4):1–12. https://doi.org/10.1186/s40965-019-0064-0

    Article  Google Scholar 

  20. Lerario A, Varasano A, Turi S D, Maiellaro N (2017) Smart Tirana. Sustainability 9(12):2338

    Article  Google Scholar 

  21. Lomotey R K, Deters R (2014) Towards knowledge discovery in big data. In: Proceedings - IEEE 8th International Symposium on Service Oriented System Engineering, SOSE

  22. Mesiti M, Valtolina S (2014) Towards a user-friendly loading system for the analysis of big data in the internet of things. In: Proceedings - IEEE 38th Annual International Computers, Software and Applications Conference Workshops, COMPSACW 2014

  23. MongoDB MongoDB: Open Source Document Database (2019). https://www.mongodb.com. Accessed 01-Feb-2019

  24. Murray C et al Oracle ®; Spatial and Graph Developer’s Guide 12c Release 1 (12.1). Tech. rep., Oracle (2017). https://docs.oracle.com/database/121/SPATL/E49172-07.pdf

  25. NoSQL NoSQL Databases (2019). http://nosql-database.org. Accessed 24-Jul-2019

  26. OGC Coordinate Transformation Service (2001). https://www.opengeospatial.org/standards/ct. Accessed 20-Mar-2019

  27. OGC Geography Markup Language — OGC (2019). https://www.opengeospatial.org/standards/gml. Accessed 24-Jul-2019

  28. OGC Keyhole Markup Language — KML (2019). http://www.opengeospatial.org/standards/kml. Accessed 01-Feb-2019

  29. OGC Simple Feature Access - Part 1: Common Architecture — OGC (2019). http://www.opengeospatial.org/standards/sfa. Accessed 01-Feb-2019

  30. Open Geospatial Consortium O CityGML — OGC (2012). https://www.ogc.org/standards/citygml

  31. Pezoa F, Reutter J L, Suarez F, Ugarte M, Vrgoč D (2016) Foundations of JSON Schema. In: 25Th int. Conf. on world wide web, WWW, pp 263–273. WWW Steering Committee

  32. Pourabbas E (2014) Geographical Information Systems: Trends and Technologies. A science publishers book Taylor & Francis

  33. Ruckstieß T Mongodb-schema NPM package (2017). https://www.npmjs.com/package/mongodb-schema. Accessed 24-Jul-2019

  34. Ruiz D S, Morales S F, Molina J G (2015) Inferring versioned schemas from nosql databases and its applications. Lect Notes Comput Sci 9381:467–480

    Article  Google Scholar 

  35. Sadalage P J, Fowler M (2013) noSQL distilled : a brief guide to the emerging world of polyglot persistence, 1st edn Addison-Wesley

  36. Saltor F, Castellanos M, García-solaco M (1991) Suitability of data models as canonical models for federated databases. SIGMOD Record 20(4):44–48

    Article  Google Scholar 

  37. Scavuzzo M, Di Nitto E, Ceri S (2014) Interoperable data migration between nosql columnar databases. In: 18Th IEEE international enterprise distributed object computing conference, EDOC 2014, ulm, germany, september 1–2, 2014, pp 154–162

  38. Sheth A P, Larson J A (1990) Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Comput Surv 22(3):183–236

    Article  Google Scholar 

  39. Sotres P, Santana J R, Sanchez L, Lanza J, Munoz L (2017) Practical Lessons from the Deployment and Management of a Smart City Internet-of-Things Infrastructure: The SmartSantander Testbed Case. IEEE Access 5:14309–14322

    Article  Google Scholar 

  40. T. Bray E The JavaScript Object Notation (JSON) Data Interchange Format - RFC 7159 (2014). Accessed 22-Oct-2018

  41. Torun A (2002) using schema and data integration technique to integrate spatial and Non-Spatial data: Developing populated places DB of turkey. In: Symposium on geospatial theory, processing and applications, pp 1–6. Ottawa (CA)

Download references

Acknowledgements

This research was partially financed by the Instituto Federal Catarinense (IFC) and by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) - Brazil - Finance Code 001.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Angelo A. Frozza.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Frozza, A.A., Mello, R.d.S. JS4Geo: a canonical JSON Schema for geographic data suitable to NoSQL databases. Geoinformatica 24, 987–1019 (2020). https://doi.org/10.1007/s10707-020-00415-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10707-020-00415-w

Keywords

Navigation