Abstract
Geo-referenced textual data has been the subject of multiple investigations, by providing opportunities to better understand certain phenomena according to the content that is shared, either on-line such as social networks, blogs, and news; or through repositories such as scientific research articles, geo-referenced virtual books, among others. However, the characteristics of this information are studied, analyzed and processed separately, either through its textual components or its geo-spatial components, which offers a separate understanding of the results.
In this paper, we propose an integration of textual and geo-spatial components from the pre-processing phase to the visualization stage, As a part of the Document Mapping process based on the phases of the Knowledge Discovery in Databases (KDD). Achieving two main results (1) minimize the problems that arise in the visual phase, such as data occlusion and (2) provide a more detailed understanding between the textual relationships of the data when plotted in a geo-spatial map.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aggarwal, C.C., Zhai, C.: A survey of text clustering algorithms. In: Aggarwal, C., Zhai, C. (eds.) Mining Text Data, pp. 77–128. Springer, Boston (2012). https://doi.org/10.1007/978-1-4614-3223-4_4
Bao, J., Zheng, Y., Mokbel, M.F.: Location-based and preference-aware recommendation using sparse geo-social networking data. In: Proceedings of the 20th International Conference on Advances in Geographic Information Systems, pp. 199–208. ACM (2012)
Cong, G., Feng, K., Zhao, K.: Querying and mining geo-textual data for exploration: challenges and opportunities. In: IEEE 32nd International Conference on Data Engineering Workshops (ICDEW), 2016, pp. 165–168. IEEE (2016)
De Oliveira, M.C.F., Levkowitz, H.: From visual data exploration to visual data mining: a survey. IEEE Trans. Vis. Comput. Graph. 9(3), 378–394 (2003)
Doytsher, Y., Galon, B., Kanza, Y.: Querying geo-social data by bridging spatial networks and social networks. In: Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Location Based Social Networks, pp. 39–46. ACM (2010)
Dykes, J., MacEachren, A.M., Kraak, M.J.: Moving geovisualization toward support for group work. In: Exploring Geovisualization, p. 445 (2005)
Jain, S.: Real-Time Social Network Data Mining for Predicting the Path for a Disaster (2015)
Lee, C.-H., Yang, H.-C., Wang, S.-H.: A location based text mining approach for geospatial data mining. In: Fourth International Conference on Innovative Computing, Information and Control (ICICIC), 2009, pp. 1172–1175. IEEE (2009)
Liu, S., Cui, W., Yingcai, W., Liu, M.: A survey on information visualization: recent advances and challenges. Vis. Comput. 30(12), 1373–1393 (2014)
Ltifi, H., Ayed, M.B., Alimi, A.M., Lepreux, S.: Survey of information visualization techniques for exploitation in KDD. In: IEEE/ACS International Conference on Computer Systems and Applications, 2009, AICCSA 2009, pp. 218–225. IEEE (2009)
Lwin, K.K., Zettsu, K., Sugiura, K.: Geovisualization and correlation analysis between geotagged Twitter and JMA rainfall data: case of heavy rain disaster in Hiroshima. In: 2015 2nd IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM), pp. 71–76. IEEE (2015)
Mislove, A., Lehmann, S., Ahn, Y.-Y., Onnela, J.-P., Rosenquist, J.N.: Understanding the demographics of twitter users. In: ICWSM, 11:5th (2011)
Mohamed, E.B., Ltifi, H., Ayed, M.B.: Integration of temporal data visualization techniques in a KDD-based DSS application in the medical field. J. Netw. Innovative Comput. 2, 061–070 (2014)
Paulovich, F.V., Oliveira, M.C.F., Minghim, R.: The projection explorer: a flexible tool for projection-based multidimensional visualization. In: XX Brazilian Symposium on Computer Graphics and Image Processing, 2007, SIBGRAPI 2007, pp. 27–36. IEEE (2007)
Paulovich, F.V., Minghim, R.: Text map explorer: a tool to create and explore document maps. In: Tenth International Conference on Information Visualisation (IV 2006), pp. 245–251. IEEE (2006)
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceedings of the 19th International Conference on World Wide Web, pp. 851–860. ACM (2010)
Tsou, M.-H., Yang, J.-A., Lusher, D., Han, S., Spitzberg, B., Gawron, J.M., Gupta, D., An, L.: Mapping social activities and concepts with social media (Twitter) and web search engines (Yahoo and Bing): a case study in 2012 US presidential election. Cartography Geogr. Inf. Sci. 40(4), 337–348 (2013)
Wüthrich, B.: Knowledge Discovery in Databases (1995)
Zhao, K., Liu, Y., Yuan, Q., Chen, L., Chen, Z., Cong, G.: Towards personalized maps: mining user preferences from geo-textual data. Proc. VLDB Endow. 9(13), 1545–1548 (2016)
Acknowledgements
The present work was achieved thanks to the joint work with my advisor, for her persistence and tenacity at the moment of sharing her teachings with me, to my distinguished teachers who have forged knowledge from the first day of classes, whom with nobility and enthusiasm influenced as an example in me and my colleagues in the master’s degree in computer science; also thanks to CONCYTEC, FONDECYT and Cienciactiva for the support and opportunities provided that made this work possible.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Gomez, F., Iquira, D., Cuadros, A.M. (2018). Application of the KDD Process for the Visualization of Integrated Geo-Referenced Textual Data from the Pre-processing Phase. In: R. Luaces, M., Karimipour, F. (eds) Web and Wireless Geographical Information Systems. W2GIS 2018. Lecture Notes in Computer Science(), vol 10819. Springer, Cham. https://doi.org/10.1007/978-3-319-90053-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-90053-7_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-90052-0
Online ISBN: 978-3-319-90053-7
eBook Packages: Computer ScienceComputer Science (R0)