Abstract
The development of artificial intelligence and the global emergence of big data have provided access to data from different fields. However, while the reuse and sharing of data resources is vital for cost cutting, the data potentially reflects the design intent of those who design and obtain the data. It is necessary to establish a mechanism to quantify the data quality by sharing information regarding who, for what purpose, and how the target data was acquired. In this study, we discuss the methodology to observe and digitize unobserved events and propose the concept of data origination. Further, we introduce two tools to realize and support data origination: variable quest and TEEDA. Moreover, we explain the limitations of the current approach in achieving the data origination and discuss the approaches to overcome them.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rabinovich, E., Cheon, S.: Expanding horizons and deepening understanding via the use of secondary data sources. J. Bus. Logist. 32(4), 303–316 (2011)
Ellram, M.L., Tate, L.W.: The use of secondary data in purchasing and supply management research. J. Purch. Supply Manag. 22(4), 250–254 (2016)
Manyika, J., Chui, M., Groves, P., Farrell, D., Kuiken, S.V., Doshi, E.A.: Open data: Unlocking innovation and performance with liquid information, McKinsey Global Institute (2013)
Balazinska, M., Howe, B., Suciu, D.: Data markets in the cloud: an opportunity for the database community. VLDB Endowment 4(12), 1482–1485 (2011)
Stahl, F., Schomm, F., Vossen, G.: Data marketplaces: an emerging species. Frontiers in Artificial Intelligence and Applications, pp. 145–158 (2014)
Liang, F., Yu, W., An, D., Yang, Q., Fu, X., Zhao, W.: A survey on big data market: pricing, trading and protection. IEEE Access 6, 15132–15154 IEEE (2018)
Spiekermann, M.: Data marketplaces: trends and monetisation of data goods. Intereconomics 54(4), 208–216 (2019)
Silver, N.: Coronavirus Case Counts Are Meaningless, FiveThirtyEight. https://fivethirtyeight.com/features/coronavirus-case-counts-are-meaningless/. Accessed 12 Oct 2020
Hayashi, T., Uehara, N., Hase, D., and Ohsawa, Y.: Data Requests and Scenarios for Data Design of Unobserved Events in Corona-related Confusion Using TEEDA, arXiv:2009.04035 (2020)
Boisot, M., Canals, A.: Data, Information and knowledge: have we got it right? J. Evol. Econ. 14, 43–67 (2004)
Ohsawa, Y., McBurney, P.: Chance Discovery. Springer, Heidelberg (2003)
Maeno, Y., Ohsawa, Y.: Human-Computer Interactive Annealing for Discovering Invisible Dark Events. IEEE Trans. Industr. Electron. 54(2), 1184–1192 (2007)
Maeno, Y., Ohsawa, Y.: Intuitive visualization of the intelligence for the run-down of terrorist wire-pullers, arXiv:0805.3972 (2008)
Ohsawa, Y.: Detection of Earthquake Risks with KeyGraph, Chance Discovery, Springer-Verlag Berlin Heidelberg, 339–350, (2003).
Babbie, E.R.: The basics of social research (7th Edition), Cengage Learning, (2016).
Hayashi, T., Ohsawa, Y.: Understanding the structural characteristics of data platforms using metadata and a network approach. IEEE Access 8, 35469–35481 (2020)
Ohsawa, Y., Kido, H., Hayashi, T., Liu, C.: Data jackets for synthesizing values in the market of data. Procedia Comput. Sci. 22, 709–716 (2013)
Hayashi, T., Ohsawa, Y.: VARIABLE QUEST: network visualization of variable labels unifying co-occurrence graphs. In: ICDM Workshops, pp. 577–583 (2017)
Hayashi, T., Ohsawa, Y.: Inferring variable labels using outlines of data in data jackets by considering similarity and co-occurrence. Int. J. Data Sci. Anal. 6(4), 351–361 (2018)
Hayashi, T., Ohsawa, Y.: TEEDA: an interactive platform for matching data providers and users in the data marketplace. Information 11(4), 218 (2020)
Pfitzmann, A, Hansen, M.: A terminology for talking about privacy by data minimization: anonymity, unlinkability, undetectability, unobservability, pseudonymity, and identity management (2010). https://dud.inf.tu-dresden.de/Anon_Terminology.shtml. Accessed 12 Oct 2020
Acknowledgement
This study was supported by JSPS KAKENHI (JP20H02384), the “Startup Research Program for Post-Corona Society” of Academic Strategy Office, School of Engineering, the University of Tokyo, and the Artificial Intelligence Research Promotion Foundation. We wish to thank Editage for providing English language editing.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hayashi, T., Ohsawa, Y. (2021). Data Origination: Human-Centered Approach for Design, Acquisition, and Utilization of Data. In: Abraham, A., et al. Proceedings of the 12th International Conference on Soft Computing and Pattern Recognition (SoCPaR 2020). SoCPaR 2020. Advances in Intelligent Systems and Computing, vol 1383. Springer, Cham. https://doi.org/10.1007/978-3-030-73689-7_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-73689-7_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73688-0
Online ISBN: 978-3-030-73689-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)