Skip to main content

A Big Data Platform for Industrial Enterprise Asset Value Enablers

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 800))

Abstract

The growing ubiquity of IoT, along with bigger steps towards full digitalization in the manufacturing industry, makes it easier to constantly monitor equipment activity and implement predictive maintenance approaches. Big Data solutions are best suited to process the large amounts of data generated through monitorization – additionally, they also allow for processing of unstructured data, such as documents used in not fully-digitalized processes. This paper describes the creation of a small Hadoop cluster, without high-availability, its integration in the InValue architecture and the processes through which it was populated with historical data from a relational warehouse. The degree of parallelization on the data ingestion tasks and its effect on performance were evaluated for the different kinds of datasets that are currently being used for batch data processing.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Chui, M., Loffler, M., Robert, R.: The internet of things. Mckinsey Q. (2) (2010)

    Google Scholar 

  2. Mobley, R.K.: An Introduction to Predictive Maintenance. Elsevier Science, New York (2002)

    Google Scholar 

  3. McAfee, A., Brynjolfsson, E.: Big data: the management revolution. Harvard Bus. Rev. 90(10), 60–68 (2012)

    Google Scholar 

  4. Cohen, J., Dolan, B., Dunlap, M., Hellerstein, J.M., Welton, C.: MAD skills: new analysis practices for big data. Proc. VLDB Endowment 2(2), 1481–1492 (2009)

    Article  Google Scholar 

  5. Al-Noukari, M., Al-Hussan, W.: Using data mining techniques for predicting future car market demand; DCX case study. In: 3rd International Conference on Information and Communication Technologies: From Theory to Applications, ICTTA 2008, pp. 1–5. IEEE (2008)

    Google Scholar 

  6. Chon, S.H., Slaney, M., Berger, J.: Predicting success from music sales data: a statistical and adaptive approach. In: Proceedings of the 1st ACM Workshop on Audio and Music Computing Multimedia, pp. 83–88. ACM, October 2006

    Google Scholar 

  7. Martens, D., Provost, F., Clark, J., de Fortuny, E.J.: Mining massive fine-grained behavior data to improve predictive analytics. MIS Q. 40(4) (2016)

    Article  Google Scholar 

  8. InValuePt. InValuePT - Home (2017). http://www.invalue.com.pt/. Accessed 01 Feb 2018

  9. Canito, A., et al.: An architecture for proactive maintenance in the machinery industry. In: International Symposium on Ambient Intelligence. Springer (2017)

    Google Scholar 

  10. O’Donovan, P., Leahy, K., Bruton, K., O’Sullivan, D.T.: Big data in manufacturing: a systematic mapping study. J. Big Data 2(1), 20 (2015)

    Article  Google Scholar 

  11. The Apache Software Foundation. Welcome to Apache Hadoop (2018). http://hadoop.apache.org/. Accessed 25 Jan 2018

  12. IBM: What is the Hadoop Distributed File System (HDFS)? https://www-01.ibm.com/software/data/infosphere/hadoop/hdfs/

  13. Borthakur, D.: HDFS Architecture Guide (2013). https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html. Accessed 05 Feb 2018

  14. Cloudera. Cluster Hosts and Role Assignments (2018). https://www.cloudera.com/documentation/enterprise/latest/topics/cm_ig_host_allocations.html. Accessed 05 Feb 2018

  15. Thusoo, A., Sen Sarma, J., Jain, N., Shao, Z., Chakka, P., Anthony, S., Liu, H., Wyckoff, P., Murthy, R.: Hive: a warehousing solution over a map-reduce framework. Proc. VLDB Endowment 2(2), 1626–1629 (2009)

    Article  Google Scholar 

  16. Vavilapalli, V.K., Murthy, A.C., Douglas, C., Agarwal, S., Konar, M., Evans, R., Graves, T., Lowe, J., Shah, H., Seth, S., Saha, B.: Apache hadoop yarn: yet another resource negotiator. In: Proceedings of the 4th Annual Symposium on Cloud Computing, p. 5. ACM, October 2013

    Google Scholar 

  17. Ting, K., Cecho, J.J.: Apache Sqoop Cookbook. O’Reilly Media, Sebastopol (2013)

    Google Scholar 

  18. Islam, M., Huang, A.K., Battisha, M., Chiang, M., Srinivasan, S., Peters, C., Neumann, A., Abdelnur, A.: Oozie: towards a scalable workflow management system for hadoop. In: Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, p. 4. ACM, May 2012

    Google Scholar 

  19. Apache ZooKeeper: What is zookeeper (2014). http://zookeeper.apache.org. Accessed 01 Feb 2018

  20. Bittorf, M.K.A.B.V., Bobrovytsky, T., Erickson, C.C.A.C.J., Hecht, M.G.D., Kuff, M.J.I.J.L., Leblang, D.K.A., Robinson, N.L.I.P.H., Rus, D.R.S., Wanderman, J.R.D.T.S., Yoder, M.M.: Impala: a modern, open-source SQL engine for Hadoop. In: Proceedings of the 7th Biennial Conference on Innovative Data Systems Research (2015)

    Google Scholar 

  21. Garg, N.: Apache Kafka. Packt Publishing Ltd. (2013)

    Google Scholar 

  22. Fernandes, M., Canito, A., Bolón, V., Conceição, L., Praça, I., Marreiros, G.: Predictive Maintenance in the Metallurgical Industry: data analysis and feature selection. In: World Conference on Information Systems and Technologies, pp. 478–489. Springer, Cham (2018)

    Google Scholar 

  23. White, T.: Hadoop: The Definitive Guide. O’Reilly Media, Inc. (2012)

    Google Scholar 

  24. Groover, M., Malaska, T., Seidman, J., Saphira, G.: Hadoop Application Architectures: Designing Real-World Big Data Applications. O’Reilly Media, Inc. (2015)

    Google Scholar 

Download references

Acknowledgments

The present work has been developed under the EUREKA - ITEA2 Project INVALUE (ITEA-13015), INVALUE Project (ANI|P2020 17990), and has received funding from FEDER Funds through NORTE2020 program and from National Funds through FCT under the project UID/EEA/00760/2013.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alda Canito .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Canito, A., Fernandes, M., Conceição, L., Praça, I., Marreiros, G. (2019). A Big Data Platform for Industrial Enterprise Asset Value Enablers. In: De La Prieta, F., Omatu, S., Fernández-Caballero, A. (eds) Distributed Computing and Artificial Intelligence, 15th International Conference. DCAI 2018. Advances in Intelligent Systems and Computing, vol 800. Springer, Cham. https://doi.org/10.1007/978-3-319-94649-8_18

Download citation

Publish with us

Policies and ethics