Skip to main content

Fuzzy Join as a Preparation Step for the Analysis of Training Data

  • Conference paper
  • First Online:
Beyond Databases, Architectures and Structures. Paving the Road to Smart Data Processing and Analysis (BDAS 2019)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1018))

Abstract

Analysis of training data has become an inseparable part of sports preparation not only for professional athletes but also for sports enthusiasts and sports amateurs. Nowadays, smart wearables and IoT devices allow monitoring of various parameters of our physiology and activity. The intensity and effectiveness of the activity and values of some physiology parameters may depend on weather conditions in particular days. Therefore, for efficient analysis of training data, it is important to align training data to weather sensor data. In this paper, we show how this process can be performed with the use of the fuzzy join technique, which allows to combine data points shifted in time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Afrati, F.N., Sarma, A.D., Menestrina, D., Parameswaran, A., Ullman, J.D.: Fuzzy joins using MapReduce. In: 2012 IEEE 28th International Conference on Data Engineering, pp. 498–509 (04 2012). https://doi.org/10.1109/ICDE.2012.66

  2. Deng, D., Li, G., Hao, S., Wang, J., Feng, J.: MassJoin: a MapReduce - based method for scalable string similarity joins. In: 2014 IEEE 30th International Conference on Data Engineering, pp. 340–351 (03 2014)

    Google Scholar 

  3. Dhahbi, W.: Seasonal weather conditions affect training program efficiency and physical performance among special forces trainees: a long-term follow-up study. PLoS ONE 13(10) (10 2018). https://doi.org/10.1371/journal.pone.0206088

  4. El Helou, N., Tafflet, M., Berthelot, G., Tolaini, J., Marc, A., Guillaume, M., Hausswirth, C., Toussaint, J.F.: Impact of environmental parameters on marathon running performance. PLoS ONE (05 2012). https://doi.org/10.1371/journal.pone.0037407

  5. Ely, M.R., Cheuvront, S.N., Roberts, W.O., Montain, S.J.: Impact of weather on marathon-running performance. Med. Sci. Sports Exerc. 39(3), 487–493 (2007)

    Article  Google Scholar 

  6. Khorasani, E.S., Cremeens, M., Zhao, Z.: Implementation of scalable fuzzy relational operations in MapReduce. Soft Comput. 22(9), 3061–3075 (2018). https://doi.org/10.1007/s00500-017-2561-3

    Article  Google Scholar 

  7. Kimmett, B., Srinivasan, V., Thomo, A.: Fuzzy joins in MapReduce: an experimental study. In: Proceedings of VLDB Endow, pp. 1514–1517 (08 2015). https://doi.org/10.14778/2824032.2824049

  8. Lara, O.D., Labrador, M.A.: A survey on human activity recognition using wearable sensors. IEEE Commun. Surv. Tutorials 15(3), 1192–1209 (2013)

    Article  Google Scholar 

  9. Małysiak-Mrozek, B., Lipińska, A., Mrozek, D.: Fuzzy join for flexible combining big data lakes in cyber-physical systems. IEEE Access 6, 69545–69558 (2018). https://doi.org/10.1109/ACCESS.2018.2879829

    Article  Google Scholar 

  10. Małysiak-Mrozek, B., Stabla, M., Mrozek, D.: Soft and declarative fishing of information in big data lake. IEEE Trans. Fuzzy Syst. 26(5), 2732–2747 (2018). https://doi.org/10.1109/TFUZZ.2018.2812157

    Article  Google Scholar 

  11. Małysiak-Mrozek, B., Baron, T., Mrozek, D.: Spark-IDPP: high-throughput and scalable prediction of intrinsically disordered protein regions with Spark clusters on the Cloud. Cluster Comput., November 2018. https://doi.org/10.1007/s10586-018-2857-9

  12. Mrozek, D., Daniłowicz, P., Małysiak-Mrozek, B.: HDInsight4PSi: boosting performance of 3D protein structure similarity searching with HDInsight clusters in Microsoft Azure cloud. Inf. Sci. (2016). https://doi.org/10.1016/j.ins.2016.02.029

  13. Mrozek, Dariusz: Scalable Big Data Analytics for Protein Bioinformatics. CB, vol. 28. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98839-9

    Book  MATH  Google Scholar 

  14. Mukhopadhyay, S.C.: Wearable sensors for human activity monitoring : a review. IEEE Sensors J. 15(3), 1321–1327 (2015)

    Article  Google Scholar 

  15. Revathi Pulichintha Harshitha, S., Narramneni, P., Raghavee, N.S.: Body sensor using internet of things (IoT). ARPN J. Eng. Appl. Sci. 13(8) (2018)

    Google Scholar 

  16. Toh, W.Y., Tan, Y.K., Koh, W.S., Siek, L.: Autonomous wearable sensor nodes with flexible energy harvesting. IEEE Sensors J. 14, 2299–2306 (2014)

    Article  Google Scholar 

  17. Vihma, T.: Effects of weather on the performance of marathon runners. Int. J. Biometeorol. 54(3), 297–306 (2010). https://doi.org/10.1007/s00484-009-0280-x

    Article  Google Scholar 

  18. Yamato, Y.: Proposal of vital data analysis platform using wearable sensor. In: Proceedings of the 5th IIAE International Conference on Industrial Application Engineering (2017)

    Google Scholar 

  19. Yan, C., Zhao, X., Zhang, Q., Huang, Y.: Efficient string similarity join in multi-core and distributed systems. PLoS ONE 12(3), 1–16 (2017)

    Google Scholar 

  20. Zadeh, L.A.: Fuzzy sets. Inf. Control 8, 338–353 (1965)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by Microsoft Research within Microsoft Azure for Research Award grant, pro-quality grant for highly scored publications or issued patents of the Rector of the Silesian University of Technology, Gliwice, Poland (grant No 02/020/RGJ19/0167), and partially, by Statutory Research funds of Institute of Informatics, Silesian University of Technology, Gliwice, Poland (grant No BK/204/RAU2/2019).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Anna Wachowicz or Dariusz Mrozek .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wachowicz, A., Mrozek, D. (2019). Fuzzy Join as a Preparation Step for the Analysis of Training Data. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds) Beyond Databases, Architectures and Structures. Paving the Road to Smart Data Processing and Analysis. BDAS 2019. Communications in Computer and Information Science, vol 1018. Springer, Cham. https://doi.org/10.1007/978-3-030-19093-4_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-19093-4_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-19092-7

  • Online ISBN: 978-3-030-19093-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics