Skip to main content

High-Performance Computation in Big Data Analytics

  • Conference paper
  • First Online:
Intelligent Systems Design and Applications (ISDA 2022)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 646))

  • 196 Accesses

Abstract

For many years, big data analytics has relied on High-Performance Computing (HPC) for an efficient analysis. Today data is growing at an accelerated pace, so new types of high-performance computing will be required to access historically unprecedented volumes of data. To identify patterns and new insights, high-performance data analytics combines high-performance computing (HPC) with data analytics. The technique of quickly evaluating exceptionally big data sets to identify insights is known as high-performance data analytics. This is accomplished by utilising high-performance computing's parallel processing to execute strong analytic tools. For government and commercial companies that need to integrate high-performance computing with data-intensive analysis, high-performance data analytics infrastructure is a new and rapidly rising sector. As we can see natural hazards are unpredicted and affects our day to day work unexpected rainfall. So climate prediction becomes a need for nowadays. For this a large amount of database is needed for storing, maintaining, processing the datasets for performing the prediction in well manner. Separately, big data computing and HPC has progressed throughout time. These two techniques are becoming increasingly dependent on one another for data management and algorithms due to the growth of data and the requirement for machine learning algorithms. For instance, public clouds like Microsoft Azure are enabling artificial intelligence algorithms on big datasets by deploying large-scale Graphical processing Unit (GPU) deployments in HPC clusters and adding high-performance computing instances with InfiniBand. Understanding the evolution of HPC systems and big data help to define the important differences, as well as the goals and architectures that support them. Big data systems have benefitted data management, data querying, and streaming applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Tulasi, B., Wagh, R.S., Balaji, S.: High-performance computing and big data analytics– paradigms and challenges. Int. J. Comput. Appl. 116(2) (2015)

    Google Scholar 

  2. Asaadi, H., Khaldi, D., Chapman, B.: A Comparative survey of the HPC and big data paradigms: analysis and experiments. In: IEEE International Conference on Cluster Computing (CLUSTER), pp. 423–432 (2016). https://doi.org/10.1109/CLUSTER.2016.21

  3. Big Data Meets High-Performance Computing”, Intel® Enterprise Edition for Lustre* software and Hadoop combine to bring big data analytics to high-performance computing configurations. https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/big-data-meets-high-performance-computing-white-paper.pdf

  4. The convergence of HPC and Big Data: What does it mean for HPC sysadmins?. https://insidehpc.com/2019/02/the-convergence-of-hpc-and-bigdata-what-does-it-mean-for-hpc-sysadmins/

  5. Anderson, M., et al.: Bridging the gap between HPC and big data frameworks. In: Proceedings of the VLDB Endowment, vol. 10, issue No.- 8, pp. 901–912 (2017). https://doi.org/10.14778/3090163.3090168

  6. Muniswamaiah, M., Agerwala, T., Tappert, C.: Big data in cloud computing review and opportunities. Int. J. Comput. Sci. Inform. Technol. 11(4) (2019)

    Google Scholar 

  7. Najafabadi, M.M., Villanustre, F., Khoshgoftaar, T.M., Seliya, N., Wald, R., Muharemagic, E.: Deep learning applications and challenges in big data analytics. J. Big Data 2(1), 1–21 (2015). https://doi.org/10.1186/s40537-014-0007-7

    Article  Google Scholar 

  8. Kumari, S., Muthulakshmi, P.: Transformative effects of big data on advanced data analytics: open issues and critical challenges. J. Comput. Sci. 18(6), 463–479 (2022). https://doi.org/10.3844/jcssp.2022.463.479

  9. Kumari, S., Vani, V., Malik, S., Tyagi, A.K., Reddy, S.: Analysis of text mining tools in disease prediction. In: Abraham, A., Hanne, T., Castillo, O., Gandhi, N., Nogueira Rios, T., Hong, T.-P. (eds.) HIS 2020. AISC, vol. 1375, pp. 546–564. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-73050-5_55

    Chapter  Google Scholar 

  10. Mishra, S., Tyagi, A.K.: The role of machine learning techniques in internet of things-based cloud applications. In: Pal, S., De, D., Buyya, R. (eds.) Artificial Intelligence-based Internet of Things Systems, Internet of Things (Technology, Communications and Computing). Springer, Cham (2022). https://doi.org/10.1007/978-3-030-87059-1_4

  11. Varsha, R., Nair, S.M., Tyagi, A.K., Aswathy, S.U., RadhaKrishnan, R.: The future with advanced analytics: a sequential analysis of the disruptive technology’s scope. In: Abraham, A., Hanne, T., Castillo, O., Gandhi, N., Nogueira Rios, T., Hong, T.-P. (eds.) HIS 2020. AISC, vol. 1375, pp. 565–579. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-73050-5_56

    Chapter  Google Scholar 

  12. Tyagi, A.K., Rekha, G.: Challenges of applying deep learning in real-world applications. Book: Challenges and Applications for Implementing Machine Learning in Computer Vision, IGI Global 2020, pp. 92–118 (2020). https://doi.org/10.4018/978-1-7998-0182-5.ch004

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shabnam Kumari .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kumari, S., Muthulakshmi, P. (2023). High-Performance Computation in Big Data Analytics. In: Abraham, A., Pllana, S., Casalino, G., Ma, K., Bajaj, A. (eds) Intelligent Systems Design and Applications. ISDA 2022. Lecture Notes in Networks and Systems, vol 646. Springer, Cham. https://doi.org/10.1007/978-3-031-27440-4_52

Download citation

Publish with us

Policies and ethics