Development of Large-Scale Scientific Cyberinfrastructure and the Growing Opportunity to Democratize Access to Platforms and Data

Distributed, Ambient and Pervasive Interactions (HCII 2023)

Abstract

As researchers across scientific domains rapidly adopt advanced scientific computing methodologies, access to advanced cyberinfrastructure (CI) becomes a critical requirement in scientific discovery. Lowering the entry barriers to CI is a crucial challenge in interdisciplinary sciences requiring frictionless software integration, data sharing from many distributed sites, and access to heterogeneous computing platforms. In this paper, we explore how the challenge is not merely a factor of availability and affordability of computing, network, and storage technologies but rather the result of insufficient interfaces with an increasingly heterogeneous mix of computing technologies and data sources. With more distributed computation and data, scientists, educators, and students must invest their time and effort in coordinating data access and movements, often penalizing their scientific research. Investments in the interfaces’ software stack are necessary to help scientists, educators, and students across domains take advantage of advanced computational methods. To this end, we propose developing a science data fabric as the standard scientific discovery interface that seamlessly manages data dependencies within scientific workflows and CI.

Notes

  1. https://www.top500.org/

  2. https://www.thequilt.net/

Acknowledgment

This research was supported by the National Science Foundation (NSF) under grant numbers #1841758, #2028923, #2103845, and #2138811; the Advanced Cyberinfrastructure Coordination Ecosystem: Services and Support (ACCESS) program, under allocation TG-CIS210128; Chameleon Cloud under allocation CHI-210923; and IBM through a Shared University Research Award.

Author information

Corresponding author

Correspondence to Jakob Luettgau.

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Luettgau, J., Scorzelli, G., Pascucci, V., Taufer, M. (2023). Development of Large-Scale Scientific Cyberinfrastructure and the Growing Opportunity to Democratize Access to Platforms and Data. In: Streitz, N.A., Konomi, S. (eds) Distributed, Ambient and Pervasive Interactions. HCII 2023. Lecture Notes in Computer Science, vol 14036. Springer, Cham. https://doi.org/10.1007/978-3-031-34668-2_25

  • DOI: https://doi.org/10.1007/978-3-031-34668-2_25

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-34667-5

  • Online ISBN: 978-3-031-34668-2

  • eBook Packages: Computer Science, Computer Science (R0)
