Abstract
The large volume and variety of data produced in the current Big Data era lead companies to seek solutions for the efficient data management. Within this context, NoSQL databases rise as a better alternative to the traditional relational databases, mainly in terms of scalability and availability of data. A usual feature of NoSQL databases is to be schemaless, i.e., they do not impose a schema or have a flexible schema. This is interesting for systems that deal with complex data, such as GIS. However, the lack of a schema becomes a problem when applications need to perform processes such as data validation, data integration, or data interoperability, as there is no pattern for schema representation in NoSQL databases. On the other hand, the JSON language stands out as a standard for representing and exchanging data in document NoSQL databases, and JSON Schema is a schema representation language for JSON documents that it is also leading to become a standard. However, it does not include spatial data types. From this limitation, this paper proposes an extension to JSON Schema, called JS4Geo, that allows the definition of schemas for geographic data. We demonstrate that JS4Geo is able to represent schemas of any NoSQL data model, as well as other standards for geographic data, like GML and KML. We also present a case study that shows how a data integration system can benefit of JS4Geo to define local schemas for geographic datasets and generate an integrated global schema.
Similar content being viewed by others
Notes
More details about JSON and related standards are given in the next section.
We assume JSON as the format for representing document instances given its acceptance by almost all NoSQL document DBMSs.
N/A means Not Applicable.
References
Afsari K, Eastman CM, Castro-Lacouture D (2017) Javascript Object Notation (JSON) data serialization for IFC schema in web-based BIM data exchange. Autom Constr 77:24–51
Angles R, Gutiérrez C (2008) Survey of graph database models. ACM Comput Surv 40(1):1:1–1:39
Bordogna G, Psaila G (2018) Why we need a novel framework to integrate and transform heterogeneous multi-source geo-referenced information: the j-CO proposal. Computer Science &, Information Technology (CS & IT) 8(13):41–60
Cattell R (2011) Scalable sql and nosql data stores. SIGMOD Rec, 39, 4
CouchDB Apache CouchDB (2019). http://couchdb.apache.org. Accessed 24-Jul-2019
Cure O, Hecht R, Duc C L, Lamolle M (2011) Data Integration over noSQL Stores Using Access Path Based Mappings. In: Proceedings of the 22nd International Conference on Database and Expert Systems Applications, DEXA. Springer, Berlin, pp 481–495
Frozza A A, Mello RDS, da Costa FDS (2018) An Approach for Schema Extraction of JSON and Extended JSON Document Collections. In: IEEE International conference on information reuse and integration (IRI). IEEE, pp 356–363
Golshan B, Halevy A, Mihaila G, Tan WC (2017) data integration: After the teenage years. In: Proceedings of the 36th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems - PODS ’17. ACM Press, New York, pp 101–106
IETF The GeoJSON Format (2019). http://geojson.org/. Accessed 24-Jul-2019
ISO ISO/IEC 14977:1996 - Extended BNF (1996). http://www.iso.org/iso/catalogue_detail?csnumber=26153. Accessed 01-Feb-2019
JSON SCHEMA JSON Schema (2019). http://json-schema.org/. Accessed 24-Jul-2019
Kapsammer E, Kusel A, Mitsch S, Pröll B, Retschitzegger W, Schwinger W, Schömböck J, Wimmer M, Wischenbart M, Lechner S (2012) User profile integration made easy - Model-driven extraction and transformation of social network schemas. In: Proceedings of the 21st annual conference on world wide web companion, WWW, pp 939–948
Karpov V Mongoose NPM package (2017). https://www.npmjs.com/package/mongoose. Accessed 24-Jul-2019
Kaur K, Rani R (2013) Modeling and querying data in noSQL databases. In: IEEE International conference on big data, pp 1–7. IEEE
Khalfi B, De Runz C, Faiz S, Herman A (2017) A New Methodology for Storing Consistent Fuzzy Geospatial Data in Big Data Environment. IEEE Transactions on Big Data pp 1–1
Klettke M, Storl U, Scherzinger S (2015) Schema Extraction and Structural Outlier Detection for JSON-based noSQL Data Stores. In: BTW, LNI, vol 241, pp 425–444. GI
Kresse W, Danko DM (2012) Springer handbook of geographic information. Springer
Lanza J, Sanchez L, Gomez D, Elsaleh T, Steinke R, Cirillo F (2016) A Proof-of-Concept for Semantically Interoperable Federation of IoT Experimentation Facilities. Sensors 16(7):1006
Ledoux H, Ohori KA, Kumar K, Dukai B, Labetski A, Vitalis S (2019) CityJSON: a compact and easy-to-use encoding of the CityGML data model. Open Geospatial Data, Software and Standards 4(4):1–12. https://doi.org/10.1186/s40965-019-0064-0
Lerario A, Varasano A, Turi S D, Maiellaro N (2017) Smart Tirana. Sustainability 9(12):2338
Lomotey R K, Deters R (2014) Towards knowledge discovery in big data. In: Proceedings - IEEE 8th International Symposium on Service Oriented System Engineering, SOSE
Mesiti M, Valtolina S (2014) Towards a user-friendly loading system for the analysis of big data in the internet of things. In: Proceedings - IEEE 38th Annual International Computers, Software and Applications Conference Workshops, COMPSACW 2014
MongoDB MongoDB: Open Source Document Database (2019). https://www.mongodb.com. Accessed 01-Feb-2019
Murray C et al Oracle ®; Spatial and Graph Developer’s Guide 12c Release 1 (12.1). Tech. rep., Oracle (2017). https://docs.oracle.com/database/121/SPATL/E49172-07.pdf
NoSQL NoSQL Databases (2019). http://nosql-database.org. Accessed 24-Jul-2019
OGC Coordinate Transformation Service (2001). https://www.opengeospatial.org/standards/ct. Accessed 20-Mar-2019
OGC Geography Markup Language — OGC (2019). https://www.opengeospatial.org/standards/gml. Accessed 24-Jul-2019
OGC Keyhole Markup Language — KML (2019). http://www.opengeospatial.org/standards/kml. Accessed 01-Feb-2019
OGC Simple Feature Access - Part 1: Common Architecture — OGC (2019). http://www.opengeospatial.org/standards/sfa. Accessed 01-Feb-2019
Open Geospatial Consortium O CityGML — OGC (2012). https://www.ogc.org/standards/citygml
Pezoa F, Reutter J L, Suarez F, Ugarte M, Vrgoč D (2016) Foundations of JSON Schema. In: 25Th int. Conf. on world wide web, WWW, pp 263–273. WWW Steering Committee
Pourabbas E (2014) Geographical Information Systems: Trends and Technologies. A science publishers book Taylor & Francis
Ruckstieß T Mongodb-schema NPM package (2017). https://www.npmjs.com/package/mongodb-schema. Accessed 24-Jul-2019
Ruiz D S, Morales S F, Molina J G (2015) Inferring versioned schemas from nosql databases and its applications. Lect Notes Comput Sci 9381:467–480
Sadalage P J, Fowler M (2013) noSQL distilled : a brief guide to the emerging world of polyglot persistence, 1st edn Addison-Wesley
Saltor F, Castellanos M, García-solaco M (1991) Suitability of data models as canonical models for federated databases. SIGMOD Record 20(4):44–48
Scavuzzo M, Di Nitto E, Ceri S (2014) Interoperable data migration between nosql columnar databases. In: 18Th IEEE international enterprise distributed object computing conference, EDOC 2014, ulm, germany, september 1–2, 2014, pp 154–162
Sheth A P, Larson J A (1990) Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Comput Surv 22(3):183–236
Sotres P, Santana J R, Sanchez L, Lanza J, Munoz L (2017) Practical Lessons from the Deployment and Management of a Smart City Internet-of-Things Infrastructure: The SmartSantander Testbed Case. IEEE Access 5:14309–14322
T. Bray E The JavaScript Object Notation (JSON) Data Interchange Format - RFC 7159 (2014). Accessed 22-Oct-2018
Torun A (2002) using schema and data integration technique to integrate spatial and Non-Spatial data: Developing populated places DB of turkey. In: Symposium on geospatial theory, processing and applications, pp 1–6. Ottawa (CA)
Acknowledgements
This research was partially financed by the Instituto Federal Catarinense (IFC) and by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) - Brazil - Finance Code 001.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Frozza, A.A., Mello, R.d.S. JS4Geo: a canonical JSON Schema for geographic data suitable to NoSQL databases. Geoinformatica 24, 987–1019 (2020). https://doi.org/10.1007/s10707-020-00415-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10707-020-00415-w