Skip to main content

TorqueDB: Distributed Querying of Time-Series Data from Edge-local Storage

  • Conference paper
  • First Online:
Euro-Par 2020: Parallel Processing (Euro-Par 2020)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12247))

Included in the following conference series:

Abstract

The rapid growth in edge computing devices as part of Internet of Things (IoT) allows real-time access to time-series data from 1000’s of sensors. Such observations are often queried to optimize the health of the infrastructure. Recently, edge storage systems allow us to retain data on the edge rather than moving them centrally to the cloud. However, such systems do not support flexible querying over the data spread across 10–100’s of devices. There is also a lack of distributed time-series databases that can run on the edge devices. Here, we propose TorqueDB, a distributed query engine over time-series data that operates on edge and fog resources. TorqueDB leverages our prior work on ElfStore, a distributed edge-local file store, and InfluxDB, a time-series database, to enable temporal queries to be decomposed and executed across multiple fog and edge devices. Interestingly, we move data into InfluxDB on-demand while retaining the durable data within ElfStore for use by other applications. We also design a cost model that maximizes parallel movement and execution of the queries across resources, and utilizes caching. Our experiments on a real edge, fog and cloud deployment show that TorqueDB performs comparable to InfluxDB on a cloud VM for a smart city query workload, but without the associated monetary costs.

Supported by the DST ICPS Program, Government of India.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.influxdata.com/products/influxdb-overview/.

  2. 2.

    http://datacanvas.org/sense-your-city/.

References

  1. Abadi, D., et al.: The seattle report on database research. SIGMOD Rec. 48(4), 44–53 (2020). https://doi.org/10.1145/3385658.3385668

    Article  Google Scholar 

  2. Georgiou, Z., Symeonides, M., Trihinas, D., Pallis, G., Dikaiakos, M.D.: Streamsight: a query-driven framework for streaming analytics in edge computing. In: IEEE International Conference on Utility and Cloud Computing (UCC) (2018). https://doi.org/10.1109/UCC.2018.00023

  3. Ghosh, R., Simmhan, Y.: Distributed scheduling of event analytics across edge and cloud. ACM Trans. Cyber-Phys. Syst. 2(4), 1–28 (2018). https://doi.org/10.1145/3140256

    Article  Google Scholar 

  4. Grunert, H., Heuer, A.: Rewriting complex queries from cloud to fog under capability constraints to protect the users’ privacy. Open J. Internet Things (OJIOT) 3(1), 31–45 (2017)

    Google Scholar 

  5. Gupta, H., Xu, Z., Ramachandran, U.: Datafog: towards a holistic data management platform for the IoT age at the network edge. In: USENIX HotEdge (2018)

    Google Scholar 

  6. Liu, R., Yuan, J.: Benchmarking time series databases with IoTDB-benchmark for IoT scenarios. Technical Report. arXiv:1901.08304, arXiv (2019)

  7. Malensek, M., Pallickara, S.L., Pallickara, S.: Hermes: federating fog and cloud domains to support query evaluations in continuous sensing environments. IEEE Cloud Comput. 4(2), 54–62 (2017). https://doi.org/10.1109/MCC.2017.26

    Article  Google Scholar 

  8. Martinviita, M.: Time series database in Industrial IoT and its testing tool. Master’s thesis, University of Oulu (2018)

    Google Scholar 

  9. Monga, S.K., Ramachandra, S.K., Simmhan, Y.: Elfstore: a resilient data storage service for federated edge and fog resources. In: IEEE International Conference on Web Services (ICWS) (2019). https://doi.org/10.1109/ICWS.2019.00062

  10. Nagato, T., Tsutano, T., Kamada, T., Takaki, Y., Ohta, C.: Distributed key-value storage for edge computing and its explicit data distribution method. IEICE Trans. Commun. (2019). https://doi.org/10.1587/transcom.2019CPP0007

    Article  Google Scholar 

  11. Patel, P., Ali, M.I., Sheth, A.: On using the intelligent edge for IoT analytics. IEEE Intell. Syst. 32(5), 64–69 (2017). https://doi.org/10.1109/MIS.2017.3711653

    Article  Google Scholar 

  12. Schultz-Moller, N.P., Migliavacca, M., Pietzuch, P.: Distributed complex event processing with query rewriting. In: ACM International Conference on Distributed Event-Based Systems (DEBS) (2009). https://doi.org/10.1145/1619258.1619264

  13. Simmhan, Y.: Big data and fog computing. In: Sakr, S., Zomaya, A.Y. (eds.) Encyclopedia of Big Data Technologies. Springer (2019). https://doi.org/10.1007/978-3-319-63962-8_41-1

  14. Varshney, P., Simmhan, Y.: Characterizing application scheduling on edge, fog, and cloud computing resources. Softw.: Pract. Experience 50(5), 558–595 (2019). https://doi.org/10.1002/spe.2699

    Article  Google Scholar 

  15. Zhang, W., et al.: LSTM-based analysis of industrial IoT equipment. IEEE Access 6, 23551–23560 (2018). https://doi.org/10.1109/ACCESS.2018.2825538

  16. Zhou, Z.B., Zhao, D., Xu, X., Du, C., Sun, H.: Periodic query optimization leveraging popularity-based caching in wireless sensor networks for industrial IoT Applications. Mob. Netw. Appl. 20(2), 124–136 (2014). https://doi.org/10.1007/s11036-014-0545-4

    Article  Google Scholar 

Download references

Acknowledgment

The authors thank members of the DREAM:Lab, IISc, including Aakash Khochare, Shriram Ramesh and Sheshadri KR, for their assistance with the design, development and experiments of TorqueDB. This research was supported by grants from the DST ICPS program.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yogesh Simmhan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Garg, D., Shirolkar, P., Shukla, A., Simmhan, Y. (2020). TorqueDB: Distributed Querying of Time-Series Data from Edge-local Storage. In: Malawski, M., Rzadca, K. (eds) Euro-Par 2020: Parallel Processing. Euro-Par 2020. Lecture Notes in Computer Science(), vol 12247. Springer, Cham. https://doi.org/10.1007/978-3-030-57675-2_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-57675-2_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-57674-5

  • Online ISBN: 978-3-030-57675-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics