Abstract
Similarity calculations for zircons are vital to topical issues in sedimentology, such as provenance analysis, dating of sediment and identification of geotectonic effects. In general, zircon data is stored in a table where each column represents a key-value pair. According to the semantics of the keys, multiple tables are merged to extract data for analyzing the variability of single feature. However, there are conflicts between the different indicators due to sedimentation, which leads to inaccuracy of similarity. Moreover, unknown and semantically ambiguous keys are not recognized by the knowledge base, which results in the inefficiency of aggregating key-value pairs. Therefore, this paper proposed a Fast Much zircon (FM-zircon) framework that combines natural language processing (NLP) and multidimensional scaling (MDS) for calculating the similarity of zircons. First, NLP classifies keys by extracting semantic features. After the key-value pairs with the same key are fused, MDS is implemented to calculate multiple features. Ultimately, the results are represented in a visual representation To evaluate the performance, experiments were performed with zircon tables, that showed the good performance of FM-zircon.
Similar content being viewed by others
References
Ahmed E (2008) Resource capability discovery and description management system for bioinformatics data and service integration - an experiment with gene regulatory networks. In: 2008 11th international conference on computer and information technology. pp 56–61. https://doi.org/10.1109/ICCITECHN.2008.4802991
Berant J, Deutch D, Globerson A, Milo T, Wolfson T (2019) Explaining queries over web tables to non-experts. In: 2019 IEEE 35th international conference on data engineering (ICDE). pp 1570–1573. https://doi.org/10.1109/ICDE.2019.00144
Bindeman IN, Melnik OE (2016) Zircon survival, rebirth and recycling during crustal melting, magma crystallization, and mixing based on numerical modelling. Journal of Petrology 57(3):437–460. https://doi.org/10.1093/petrology/egw013
Bruand E, Storey C, Fowler M (2014) Accessory mineral chemistry of high Ba-Sr granites from Northern Scotland: Constraints on petrogenesis and records of whole-rock signature. Journal of Petrology 55(8):1619–1651. https://doi.org/10.1093/petrology/egu037
Chamiran K, Rukshan A, Thayasivam U (2020) Automating web table columns to knowledge base mapping using translation embedding. In: 2020 IEEE 14th international conference on semantic computing (ICSC). pp 150–153. https://doi.org/10.1109/ICSC.2020.00029
Delaigle A, Hall P, Meister A (2008) On deconvolution with repeated measurements. The Annals of Statistics 36:665–685
Eslahi Y, Bhardwaj A, Rosso P, Stockinger K, Cudré-Mauroux PP (2020) Annotating Web tables through knowledge bases: A context-based approach. In: 2020 7th swiss conference on data science (SDS). pp 29–34. https://doi.org/10.1109/SDS49233.2020.00013
Guo S, Fang C, Lin J, Wang Z (2020) A configurable FPGA accelerator of Bi-LSTM inference with structured sparsity. In: 2020 IEEE 33rd international system-on-chip conference (SOCC). pp 174–179. https://doi.org/10.1109/SOCC49529.2020.9524784
Guo H, Liu T, Liu F, Li Y, Hu W (2021) Chinese text classification model based on bert and capsule network structure. In: 2021 7th IEEE Intl conference on big data security on cloud (BigDataSecurity), IEEE Intl conference on high performance and smart computing, (HPSC) and IEEE Intl conference on intelligent data and security (IDS). pp 105–110. https://doi.org/10.1109/BigDataSecurityHPSCIDS52275.2021.00029
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Computation 9(8):1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Istiake Sunny MA, Maswood MMS, Alharbi AG (2020) Deep learning-based stock price prediction using LSTM and Bi-Directional LSTM model. In: 2020 2nd novel intelligent and leading emerging sciences conference (NILES). pp 87–92. https://doi.org/10.1109/NILES50944.2020.9257950
Liu Q, Gong Y (2005) The application of virtual strategy table base in regional stability control. In: 2005 IEEE/PES transmission and distribution conference and exposition: Asia and pacific. pp 1–5. https://doi.org/10.1109/TDC.2005.1547088
Liu H, Xie L (2021) Research on sarcasm detection of news headlines based on Bert-LSTM. In: 2021 IEEE international conference on emergency science and information technology (ICESIT). pp 89–92. https://doi.org/10.1109/ICESIT53460.2021.9696851
Luzuriaga J, Munoz E, Rosales-Mendez H, Hogan A (2021) Merging web tables for relation extraction with knowledge graphs. In: IEEE transactions on knowledge and data engineering. https://doi.org/10.1109/TKDE.2021.3101479
Nandakwang J, Chongstitvatana P (2016) Extract semantic web knowledge from Wikipedia tables and lists. In: 2016 8th international conference on knowledge and smart technology (KST). pp 108–113. https://doi.org/10.1109/KST.2016.7440520
Nemchin AA, Pidgeon RT (1997) Evolution of the darling range batholith, Yilgarn Craton, Western Australia: a SHRIMP Zircon Study. Journal of Petrology 38(5):625–649. https://doi.org/10.1093/petroj/38.5.625
Rao Q, Yu B, He K, Feng B (2019) Regularization and iterative initialization of softmax for fast training of convolutional neural networks. 2019 international joint conference on neural networks (IJCNN). pp. 1–8. https://doi.org/10.1109/IJCNN.2019.8852459
Saylor JE, Quade J, Dettman DL (2009) The late miocene through present paleoelevation history of Southwestern Tibet. American Journal of Science 309:1–42
Shaik Z, Ilievski F, Morstatter F (2021) Analyzing race and citizenship bias in Wikidata. In: 2021 IEEE 18th international conference on mobile Ad Hoc and smart systems (MASS). pp 665–666. https://doi.org/10.1109/MASS52906.2021.00099
Sharman GR, Malkowski MA (2020) Needles in a haystack: Detrital zircon UPb ages and the maximum depositional age of modern global sediment. Earth-Sci Rev 203. https://doi.org/10.1016/j.earscirev.2020.103109
Van Lankvelt A, Schneider DA, Biczok J, McFarlane CRM, Hattori K (2016) Decoding Zircon geochronology of igneous and alteration events based on chemical and microstructural features: A study from the western superior province. Canada. Journal of Petrology 57(7):1309–1334. https://doi.org/10.1093/petrology/egw041
Vermeesch P (2012) On the Visualisation of detrital age distributions. Chemical Geology 312:190–194. https://doi.org/10.1016/j.chemgeo.2012.04.021
Wang M, Nebel O, Wang CY (2016) The flaw in the crustal ‘Zircon archive’: Mixed Hf isotope signatures record progressive contamination of late-stage liquid in mafic-ultramafic layered intrusions. Journal of Petrology 57(1):27–52. https://doi.org/10.1093/petrology/egv072
Watts KE, John DA, Colgan JP, Henry CD, Bindeman IN, Schmitt AK (2016) Probing the volcanic-plutonic connection and the genesis of crystal-rich rhyolite in a deeply dissected supervolcano in the Nevada Great Basin: Source of the late eocene caetano tuff. Journal of Petrology 57(8):1599–1644. https://doi.org/10.1093/petrology/egw051
Wilson AH, Zeh A, Gerdes A (2017) In Situ Sr isotopes in plagioclase and trace element systematics in the lowest part of the Eastern Bushveld complex: dynamic processes in an evolving Magma Chamber. Journal of Petrology 58(2):327–360. https://doi.org/10.1093/petrology/egx018
Xu Y, Yu Z, Mao C, Wang Y, Guo J (2010) Entity answer extraction of web table. In: 2010 seventh international conference on fuzzy systems and knowledge discovery. pp 2465–2468. https://doi.org/10.1109/FSKD.2010.5569791
Yan W, Ma H, Yang Z (2020) A general framework of knowledge-based coaching system with application in table tennis training. In: 2020 39th Chinese control conference (CCC). pp 2902–2907. https://doi.org/10.23919/CCC50068.2020.9188412
Yongvanich N, Jitpagdee T, Chukaew B, Papathe S (2019) Yellow ceramic pigments from amorphous nanosized oxides using rice husk and Zircon. In: 2019 IEEE 14th international conference on nano/micro engineered and molecular systems (NEMS). pp 225–228. https://doi.org/10.1109/NEMS.2019.8915653
Yurin AY, Dorodnykh NO, Shigarov AO (2021) Semi-automated formalization and representation of the engineering knowledge extracted from spreadsheet data. IEEE Access 9:157468–157481. https://doi.org/10.1109/ACCESS.2021.3130172
Acknowledgements
This research is supported by the National key research and development program of China (International Technology Cooperation Project) under grant no.2021YFE0104400, the National Natural Science Foundation of China under grant no.41975183.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hu, X., Hu, Z., Jiang, J. et al. Character embedding-based Bi-LSTM for Zircon similarity calculation with clustering. Earth Sci Inform 15, 1417–1425 (2022). https://doi.org/10.1007/s12145-022-00847-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-022-00847-y