Abstract
Industry 4.0, characterized by the development of automation and data exchanging technologies, has contributed to an increase in the volume of data, generated from various data sources, with great speed and variety. Organizations need to collect, store, process, and analyse this data in order to extract meaningful insights from these vast amounts of data. By overcoming these challenges imposed by what is currently known as Big Data, organizations take a step towards optimizing business processes. This paper proposes a Big Data Analytics architecture as an artefact for the integration of historical data - from the organizational business processes - and predictive data - obtained by the use of Machine Learning models -, providing an advanced data analytics environment for decision support. To support data integration in a Big Data Warehouse, a data modelling method is also proposed. These proposals were implemented and validated with a demonstration case in a multinational organization, Bosch Car Multimedia in Braga. The obtained results highlight the ability to take advantage of large amounts of historical data enhanced with predictions that support complex decision support scenarios.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Wang, L., Alexander, C.A.: Machine learning in big data. Int. J. Math. Eng. Manag. Sci. 1, 52–66 (2016)
Alswedani, S., Saleh, M.: Big data analytics: importance, challenges, categories, techniques, and tools. J. Adv. Trends Comput. Sci. Eng. 9, 5384–5392 (2020)
Alsghaier, H.: The importance of big data analytics in business: a case study. Am. J. Softw. Eng. Appl. 6, 111–115 (2017)
Rialti, R., Marzi, G., Caputo, A., Mayah, K.A.: Achieving strategic flexibility in the era of big data: the importance of knowledge management and ambidexterity. Manag. Decis. 58, 1585–1600 (2020)
Gao, R.X., Wang, L., Helu, M., Teti, R.: Big data analytics for smart factories of the future. CIRP Ann. 69, 668–692 (2020)
Papageorgiou, L., Eleni, P., Raftopoulou, S., Mantaiou, M., Megalooikonomou, V., Vlachakis, D.: Genomic big data hitting the storage bottleneck. EMBnet J. 24, e910 (2018)
Chavalier, M., El Malki, M., Kopliku, A., Teste, O., Tournier, R.: Document-oriented data warehouses: models and extended cuboids, extended cuboids in oriented document. In: Proceedings - Conference on Research Challenges in Information Science, August 2016
Cuzzocrea, A., Song, I.Y., Davis, K.C.: Analytics over large-scale multidimensional data: the big data revolution! In: Conference on Information and Knowledge Management (2011)
Santos, M.Y., Costa, C.: Big data: concepts, warehousing and analytics. River (2020)
Vaisman, A., Zimányi, E.: Data warehouses: next challenges. In: Aufaure, M.-A., Zimányi, E. (eds.) eBISS 2011. LNBIP, vol. 96, pp. 1–26. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-27358-2_1
Costa, C., Santos, M.Y.: Evaluating several design patterns and trends in big data warehousing systems. In: Krogstie, J., Reijers, H.A. (eds.) CAiSE 2018. LNCS, vol. 10816, pp. 459–473. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91563-0_28
Elshawi, R., Sakr, S., Talia, D., Trunfio, P.: Big data systems meet machine learning challenges: towards big data science as a service. Big Data Res. 14, 1–11 (2018)
Syafrudin, M., Alfian, G., Fitriyani, N.L., Rhee, J.: Performance analysis of IoT-based sensor, big data processing, and machine learning model for real-time monitoring system in automotive manufacturing. Sensors 18, 2946 (2018)
Lee, J., Ardakani, H.D., Yang, S., Bagheri, B.: Industrial big data analytics and cyber-physical systems for future maintenance & service innovation. Procedia CIRP 38, 3–7 (2015)
Baldominos, A., Albacete, E., Saez, Y., Isasi, P.: A scalable machine learning online service for big data real-time analysis. In: 2014 IEEE Computational Intelligence in Big Data (2014)
Krishnamoorthy, R., Udhayakumar, K.: Futuristic bigdata framework with optimization techniques for wind energy resource assessment and management in smart grid. In: 2021 7th International Conference on Electrical Energy Systems (ICEES), pp. 507–514 (2021)
Montoya-Torres, J.R., Moreno, S., Guerrero, W.J., Mejía, G.: Big data analytics and intelligent transportation systems. IFAC-PapersOnLine 54, 216–220 (2021)
Cai, L., Zhu, Y.: The challenges of data quality and data quality assessment in the big data era. Data Sci. J. 14, 1683–1470 (2015)
Dehghani, Z.: How to move beyond a monolithic data lake to a distributed data mesh (2019)
Project Jupyter: Project Jupyter | Home. https://jupyter.org/. Accessed 19 July 2021
Spark.apache.org: Spark SQL and DataFrames - Spark 1.5.2 Documentation. https://spark.apache.org/docs/latest/sql-programming-guide.html. Accessed 19 July 2021
PySpark Documentation — PySpark 3.1.2 documentation. https://spark.apache.org/docs/latest/api/python/. Accessed 19 July 2021
Ribeiro, D., Matos, L.M., Cortez, P., Moreira, G., Pilastri, A.: A comparison of anomaly detection methods for industrial screw tightening. In: Gervasi, O., et al. (eds.) ICCSA 2021. LNCS, vol. 12950, pp. 485–500. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-86960-1_34
Liu, F.T., Ting, K.M., Zhou, Z.H.: Isolation forest. In: Proceedings - IEEE International Conference on Data Mining, ICDM, pp. 413–422 (2008)
Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006)
Alla, S., Adari, S.K.: Traditional Methods of Anomaly Detection. Apress, Berkeley (2019)
Acknowledgements
This work has been supported by FCT – Fundação para a Ciên-cia e Tecnologia within the Project Scope: UIDB/00319/2020, the Doctoral scholarships PD/BDE/135100/2017 and PD/BDE/135105/2017, and European Structural and Investment Funds in the FEDER component, through the Operational Competitiveness and Internationalization Programme (COMPETE 2020) [Project nº 039479; Funding Reference: POCI-01-0247-FEDER-039479]. The authors also wish to thank the automotive electronics company staff involved with this project for providing the data and valuable domain feedback. This paper uses icons made by Freepik, from www.flaticon.com.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Galvão, J. et al. (2022). Bosch’s Industry 4.0 Advanced Data Analytics: Historical and Predictive Data Integration for Decision Support. In: Guizzardi, R., Ralyté, J., Franch, X. (eds) Research Challenges in Information Science. RCIS 2022. Lecture Notes in Business Information Processing, vol 446. Springer, Cham. https://doi.org/10.1007/978-3-031-05760-1_34
Download citation
DOI: https://doi.org/10.1007/978-3-031-05760-1_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05759-5
Online ISBN: 978-3-031-05760-1
eBook Packages: Computer ScienceComputer Science (R0)