skip to main content
10.1145/3125719.3125722acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article
Public Access

Request aggregation, caching, and forwarding strategies for improving large climate data distribution with NDN: a case study

Published: 26 September 2017 Publication History

Abstract

Scientific domains such as Climate Science, High Energy Particle Physics (HEP) and others, routinely generate and manage petabytes of data, projected to rise into exabytes [26]. The sheer volume and long life of the data stress IP networking and traditional content distribution networks mechanisms. Thus, each scientific domain typically designs, develops, implements, deploys and maintains its own data management and distribution system, often duplicating functionality. Supporting various incarnations of similar software is wasteful, prone to bugs, and results in an ecosystem of one-off solutions.
In this paper, we present the first trace-driven study that investigates NDN in the context of a scientific application domain. Our contribution is threefold. First, we analyze a three-year climate data server log and characterize data access patterns to expose important variables such as cache size. Second, using an approximated topology derived from the log, we replay log requests in real-time over an NDN simulator to evaluate how NDN improves traffic flows through aggregation and caching. Finally, we implement a simple, nearest-replica NDN forwarding strategy and evaluate how NDN can improve scientific content delivery.

References

[1]
CDN Pricing,https://www.maxcdn.com/blog/cdn-framework-step-2/.
[2]
Climate Science's Globally Distributed Infrastructure. https://nci.org.au/wp-content/uploads/2016/11/Williams-Trenham-2016-AGU-Fall-Meeting-ESGF.pdf.
[3]
ESNet. https://fasterdata.es.net.
[4]
Globus, www.globus.org.
[5]
https://aws.amazon.com/s3/pricirig/.
[6]
Microsoft Azure CDN Pricing, https://azure.microsoft.com/en-us/pricing/details/cdn/.
[7]
NDN Memory Usage Problem. http://www.lists.cs.ucla.edu/pipermail/ndnsim/2015-February/001682.html.
[8]
Source code for simulation scenarios. https://github.com/susmit85/icn17-simulation-scenario.
[9]
TCP Tuning at ESNet. https://fasterdata.es.net/assets/fasterdata/JT-201010.pdf.
[10]
Åkesson, T. The atlas experiment at the cern large hadron collider.
[11]
Cinquini, L., Crichton, D., Mattmann, C., Harney, J., Shipman, G., Wang, F., Ananthakrishnan, R., Miller, N., Denvil, S., Morgan, M., et al. The earth system grid federation: An open infrastructure for access to distributed geospatial data. Future Generation Computer Systems 36 (2014), 400--417.
[12]
Collaboration, L. D. E. S., et al. Large synoptic survey telescope: dark energy science collaboration. arXiv preprint arXiv:1211.0310 (2012).
[13]
Dabirmoghaddam, A., Barijough, M. M., and Garcia-Luna-Aceves, J. Understanding optimal caching and opportunistic caching at the edge of information-centric networks. In Proceedings of the 1st international conference on Information-centric networking (2014), ACM, pp. 47--56.
[14]
Dabirmoghaddam, A., Dehghan, M., and Garcia-Luna-Aceves, J. Characterizing interest aggregation in content-centric networks. In IFIP Networking Conference (IFIP Networking) and Workshops, 2016 (2016), IEEE, pp. 449--457.
[15]
Dehghan, M., Jiang, B., Dabirmoghaddam, A., and Towsley, D. On the analysis of caches with pending interest tables. In Proceedings of the 2nd International Conference on Information-centric Networking (2015), ACM, pp. 69--78.
[16]
Dewdney, P. E., Hall, P. J., Schilizzi, R. T., and Lazio, T. J. L. The square kilometre array. Proceedings of the IEEE 97, 8 (2009), 1482--1496.
[17]
Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E. Overview of the coupled model intercomparison project phase 6 (cmip6) experimental design and organization. Geoscientific Model Development 9, 5 (2016), 1937--1958.
[18]
Fan, C., Shannigrahi, S., DiBenedetto, S., Olschanowsky, C., Papadopoulos, C., and Newman, H. Managing scientific data with named data networking. In Proceedings of the Fifth International Workshop on Network-Aware Data Management (2015), ACM, p. 1.
[19]
Fayazbakhsh, S. K., Lin, Y., Tootoonchian, A., Ghodsi, A., Koponen, T., Maggs, B., Ng, K., Sekar, V., and Shenker, S. Less pain, most of the gain: Incrementally deployable icn. In ACM SIGCOMM Computer Communication Review (2013), vol. 43, ACM, pp. 147--158.
[20]
Guok, C., Robertson, D., Thompson, M., Lee, J., Tierney, B., and Johnston, W. Intra and interdomain circuit provisioning using the oscars reservation system. In Broadband Communications, Networks and Systems, 2006. BROADNETS 2006. 3rd International Conference on (2006), IEEE, pp. 1--8.
[21]
Imbrenda, C., Muscariello, L., and Rossi, D. Analyzing cacheable traffic in isp access networks for micro cdn applications via content-centric networking. In Proceedings of the 1st international conference on Information-centric networking (2014), ACM, pp. 57--66.
[22]
Mastorakis, S., Afanasyev, A., Moiseenko, I., and Zhang, L. ndnsim 2.0: A new version of the ndn simulator for ns-3. NDN, Technical Report NDN-0028 (2015).
[23]
MaxMind, L. Geoip, 2006.
[24]
Olmos, F., Kauffmann, B., Simonian, A., and Carlinet, Y. Catalog dynamics: Impact of content publishing and perishing on the performance of a lru cache. In Teletraffic Congress (ITC), 2014 26th International (2014), IEEE, pp. 1--9.
[25]
Olschanowsky, C., Shannigrahi, S., and Papadopoulos, C. Supporting climate research using named data networking. In Local & Metropolitan Area Networks (LANMAN), 2014 IEEE 20th International Workshop on (2014), IEEE, pp. 1--6.
[26]
Overpeck, J. T., Meehl, G. A., Bony, S., and Easterling, D. R. Climate data challenges in the 21st century. science 331, 6018 (2011), 700--702.
[27]
Ozmutlu, H. C., Spink, A., and Ozmutlu, S. Analysis of large data logs: an application of poisson sampling on excite web queries. Information processing & management 38, 4 (2002), 473--490.
[28]
Ohara, R. B., and Kotze, D. J. Do not log-transform count data. Methods in Ecology and Evolution 1, 2 (2010), 118--122.
[29]
Ren, Y., Li, J., Shi, S., Li, L., Wang, G., and Zhang, B. Congestion control in named data networking-a survey. Computer Communications 86 (2016), 1--11.
[30]
Schneider, K., Yi, C., Zhang, B., and Zhang, L. A practical congestion control scheme for named data networking. In Proceedings of the 2016 conference on 3rd ACM Conference on Information-Centric Networking (2016), ACM, pp. 21--30.
[31]
Shannigrahi, S., Papadopoulos, C., Yeh, E., Newman, H., Barczyk, A. J., Liu, R., Sim, A., Mughal, A., Monga, I., Vlimant, J.-R., et al. Named data networking in climate research and hep applications. In Journal of Physics: Conference Series (2015), vol. 664, IOP Publishing, p. 052033.
[32]
So, W., Narayanan, A., Oran, D., and Stapp, M. Named data networking on a router: forwarding at 20gbps and beyond. In ACM SIGCOMM Computer Communication Review (2013), vol. 43, ACM, pp. 495--496.
[33]
Song, T., Yuan, H., Crowley, P., and Zhang, B. Scalable name-based packet forwarding: From millions to billions. In Proceedings of the 2nd International Conference on Information-centric Networking (2015), ACM, pp. 19--28.
[34]
Strand, G. Community earth system model data management: Policies and challenges. Procedia Computer Science 4 (2011), 558--566.
[35]
Taylor, K. E., Stouffer, R. J., and Meehl, G. A. An overview of cmip5 and the experiment design. Bulletin of the American Meteorological Society 93, 4 (2012), 485--498.
[36]
Tortelli, M., Grieco, L. A., and Boggia, G. Performance assessment of routing strategies in named data networking. In IEEE ICNP (2013).
[37]
Wang, Y., Li, Z., Tyson, G., Uhlig, S., and Xie, G. Optimal cache allocation for content-centric networking. In Network Protocols (ICNP), 2013 21st IEEE International Conference on (2013), IEEE, pp. 1--10.
[38]
Zhang, L., Afanasyev, A., Burke, J., Jacobson, V., Crowley, P., Papadopoulos, C., Wang, L., Zhang, B., et al. Named data networking. ACM SIGCOMM Computer Communication Review 44, 3 (2014), 66--73.

Cited By

View all
  • (2025)An Efficient Multipath-Based Caching Strategy for Information-Centric NetworksElectronics10.3390/electronics1403043914:3(439)Online publication date: 22-Jan-2025
  • (2024)Towards named data networking technology: Emerging applications, use cases, and challenges for secure data communicationFuture Generation Computer Systems10.1016/j.future.2023.09.031151(12-31)Online publication date: Feb-2024
  • (2023)Capture and Analysis of Traffic Traces on a Wide-Area NDN TestbedProceedings of the 10th ACM Conference on Information-Centric Networking10.1145/3623565.3623707(101-108)Online publication date: 9-Oct-2023
  • Show More Cited By

Index Terms

  1. Request aggregation, caching, and forwarding strategies for improving large climate data distribution with NDN: a case study

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          ICN '17: Proceedings of the 4th ACM Conference on Information-Centric Networking
          September 2017
          239 pages
          ISBN:9781450351225
          DOI:10.1145/3125719
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 26 September 2017

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. NDN
          2. information centric networking
          3. large scientific data
          4. named data networking
          5. network simulations
          6. network strategies

          Qualifiers

          • Research-article

          Funding Sources

          Conference

          ICN '17
          Sponsor:

          Acceptance Rates

          Overall Acceptance Rate 133 of 482 submissions, 28%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)63
          • Downloads (Last 6 weeks)14
          Reflects downloads up to 19 Feb 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2025)An Efficient Multipath-Based Caching Strategy for Information-Centric NetworksElectronics10.3390/electronics1403043914:3(439)Online publication date: 22-Jan-2025
          • (2024)Towards named data networking technology: Emerging applications, use cases, and challenges for secure data communicationFuture Generation Computer Systems10.1016/j.future.2023.09.031151(12-31)Online publication date: Feb-2024
          • (2023)Capture and Analysis of Traffic Traces on a Wide-Area NDN TestbedProceedings of the 10th ACM Conference on Information-Centric Networking10.1145/3623565.3623707(101-108)Online publication date: 9-Oct-2023
          • (2023)Investigating the Impact of Barrier Adjustment and Priority Caching on Improving Quality of Service in Named Data Vehicular Sensor Networks2023 4th International Conference on Computing and Communication Systems (I3CS)10.1109/I3CS58314.2023.10127292(1-8)Online publication date: 16-Mar-2023
          • (2023)Big Data, Transmission Errors, and the Internet2023 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume (DSN-S)10.1109/DSN-S58398.2023.00040(142-145)Online publication date: Jun-2023
          • (2022)N-DISEProceedings of the 9th ACM Conference on Information-Centric Networking10.1145/3517212.3558087(103-113)Online publication date: 6-Sep-2022
          • (2022)Routing Scalability in Named Data Networking2022 4th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N)10.1109/ICAC3N56670.2022.10074549(1749-1753)Online publication date: 16-Dec-2022
          • (2021)Joint Request Aggregation and Content Caching at the Edge via Named Data Networking2021 29th Iranian Conference on Electrical Engineering (ICEE)10.1109/ICEE52715.2021.9544137(601-606)Online publication date: 18-May-2021
          • (2020)What's in a Name?Proceedings of the 7th ACM Conference on Information-Centric Networking10.1145/3405656.3418717(12-23)Online publication date: 22-Sep-2020
          • (2020)Named Data Networking for Content Delivery Network Workflows2020 IEEE 9th International Conference on Cloud Networking (CloudNet)10.1109/CloudNet51028.2020.9335806(1-7)Online publication date: 9-Nov-2020
          • Show More Cited By

          View Options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Login options

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media