Skip to main content
Log in

A parallel online trajectory compression approach for supporting big data workflow

  • Published:
Computing Aims and scope Submit manuscript

Abstract

Nowadays with booming of sensor technology, location big data exhibit as high complexity, massive volume, real-time and stream-based characteristic. The current workflow systems are facing the challenge hardly to efficiently process the real-time location big data like trajectory stream. Online compression method is an available solution to preprocess these trajectory data in order to speed up the processing of big data workflow. However, the current online compression methods are in a serial execution that are hard to fast compress massive real-time original trajectory data. Aiming at this problem, we employ the multi-core and many-core approaches to accelerate a representative online trajectory compression method SQUISH-E. First a parallel version of SQUISH-E is proposed. PSQUISH-E used a data parallel scheme based on overlap technique and OpenMP to achieve the implementation over multiple-core CPUs. For further reducing compression time, we combine iteration method and GPU Hyper-Q feature to develop GPU-aided PSQUISH-E algorithm called as G-PSQUISH-E. The experimental results showed that (1) the data parallel scheme based on overlap can reach a similar SED error as the SQUISH-E (2) the proposed PSQUISH-E running on multi-core CPU achieved 3.8 times acceleration effect, and (3) G-PSQUISH-E further accelerated the effect of about 3 times compared with PSQUISH-E.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Bowers S, Workflow S (2012) Provenance, and data modeling challenges and approaches. J Data Semant 1:19–30. doi:10.1007/s13740-012-0004-y

    Article  Google Scholar 

  2. Bryant RE (2011) Data-intensive scalable computing for scientific applications. Comput Sci Eng 13(6):25–33

    Article  Google Scholar 

  3. Chapman B, Jost G, van der Pas R (2007) OpenMP: portable shared memory parallel programming. MIT Press, Cambridge

    Google Scholar 

  4. Chen CLP, Zhang C-Y (2014) Data-intensive applications, challenges, techniques and technologies: a survey on big data. Inf Sci 275:314–347

    Article  Google Scholar 

  5. Chen Y, Wang L, Li F, Bo D, Choo K-KR, Hassan H, Qin W (2017) Air quality data clustering using EPLS method. Inf Fusion 36:225–232

    Article  Google Scholar 

  6. Davidson SB, Freire J (2008) Provenance and scientific workflows: challenges and opportunities. In: SIGMOD08, June 9C12, Vancouver, BC, Canada, ACM 978-1-60558-102-6/08/06

  7. Diaz J, Muñoz-Caro C, Niño A (2012) A survey of parallel programming modelsand tools in the multi and many-core era. IEEE Trans Parallel Distrib Syst 23:1369–1386

    Article  Google Scholar 

  8. Douglas DH, Peucker TK (1973) Algorithms for the reduction of the number of points required to represent a line or its caricature. Can Cartogr 10:112–122

    Article  Google Scholar 

  9. Gudmundsson J, Katajainen J, Merrick D, Ong C, Wolle T (2007) Compressing spatio-temporal trajectories. LNCS 4835:763–775

    MathSciNet  MATH  Google Scholar 

  10. Guo C, Fang Y, Liu JN, Wan Y (2013) Study on social awareness computation methods for location-based services. J Comput Res Dev 50(12):2531–2542

    Google Scholar 

  11. Huang F, Tao J, Xiang Y, Liu P, Dong L, Wang L (2017) Parallel compressive sampling matching pursuit algorithm for compressed sensing signal reconstruction with OpenCL. J Syst Archit Embed Syst Des 72:51–60

    Article  Google Scholar 

  12. Lange R, Drr F, Rothermel K (2011) Efficient real-time trajectory tracking. VLDB J 20:671–694

    Article  Google Scholar 

  13. Liu J, Zhao K, Sommer P, Shang S, Kusy B, Lee J-G, Jurdak R (2016) A novel framework for online amnesic trajectory compression in resource constrained environments. IEEE Trans Knowl Data Eng 28:2827–2841

    Article  Google Scholar 

  14. Liu J, Zhao K, Sommer P, Shang S, Kusy B, Jurdak R (2015) Bounded quadrant system: error-bounded trajectory compression on the go. In: The IEEE international conference on data engineering (ICDE), pp 987–998

  15. Ma Y, Haiping W, Wang L, Huang B, Ranjan R, Zomaya AY, Jie W (2015) Remote sensing big data computing: challenges and opportunities. Future Gen Comput Syst 51:47–60

    Article  Google Scholar 

  16. Meratnia N, de By RA (2004) Spatiotemporal compression techniques for moving point objects. LNCS 2992:765–782

    Google Scholar 

  17. Meratnia N, de By RA (2004) Spatiotemporal compression techniques for moving point objects. In: International conference on extending database technology (EDBT), pp 765–782

  18. Meratnia N, de By RA (2004) Spatiotemporal compression techniques for moving point objects. In: Proceedings of the 9th international conference on extending database technology (EDBT), pp 765–782

  19. Miao Y, Wang L, Liu D, Ma Y, Zhang W, Chen L (2015) A Web 2.0-based science gateway for massive remote sensing image processing. Concurr Comput Pract Exp 27(9):2489–2501

    Article  Google Scholar 

  20. Muckell J et al (2011) SQUISH: an online approach for GPS trajectory compression. In: Proceedings of the 2nd international conference on computing for geospatial research & applications. ACM

  21. Muckell J, Olsen PW Jr, Hwang J-H, Lawson CT, Ravi SS (2014) Compression of trajectory data: a comprehensive evaluation and new approach. Geoinformatica 18:435–460

    Article  Google Scholar 

  22. Popa IS, Zeitouni K, Oria V, Kharrat A (2014) Spatio-temporal compression of trajectories in road networks. Geoinformatica, vol, preprint

  23. Quercia D, Lathia N, Calabrese F, Di Lorenzo G, Crowcroft J (2010) Recommending social events from mobile phone location data (PDF). In: 2010 IEEE international conference on data mining, p 971. doi:10.1109/ICDM.2010.152. ISBN 978-1-4244-9131-5

  24. Song W, Liu P, Wang L (2016) Sparse representation-based correlation analysis of non-stationary spatiotemporal big data. Int J Digit Earth 9(9):892–913

    Article  Google Scholar 

  25. Trajcevski G, Cao H, Scheuermanny P, Wolfsonz O, Vaccaro D (2006) On-line data reduction and the quality of history in moving objects databases. In: ACM international workshop on data engineering for wireless and mobile access (MobiDE), pp 19–26

  26. Tuning CUDA applications for Kepler (2015)

  27. Vitter JS (1985) Random sampling with a reservoir. ACM TOMS 11:37–57

    Article  MathSciNet  MATH  Google Scholar 

  28. Wang L, Ke L, Liu P, Ranjan R, Chen L (2014) IK-SVD: dictionary learning for spatial big data via incremental atom update. Comput Sci Eng 16(4):41–52

    Article  Google Scholar 

  29. Wang L, Geng H, Liu P, Ke L, Kolodziej J, Ranjan R, Zomaya AY (2015) Particle swarm optimization based dictionary learning for remote sensing big data. Knowl Based Syst 79:43–50

    Article  Google Scholar 

  30. Wang L, Song W, Liu P (2016) Link the remote sensing big data to the image features via wavelet transformation. Clust Comput 19(2):793–810

    Article  Google Scholar 

  31. Wang L, Zhang J, Liu P, Choo K-KR, Huang F (2017) Spectral-spatial multi-feature-based deep learning for hyperspectral remote sensing image classification. Soft Comput 21(1):213–221

    Article  Google Scholar 

  32. Yuan J, Zheng Y, Xie X, Sun G (2011) Driving with knowledge from the physical world. In: KDD, pp 949–960

  33. Zheng Y, Xie X, Ma WY (2010) Geolife: a collaborative social networking service among user, location and trajectory. IEEE Data Eng Bull 33:32–40

    Google Scholar 

Download references

Acknowledgements

This work was supported in part by National Science and Technology Major Project of the Ministry of Science and Technology of China (2016ZX05014-003), the China Postdoctoral Science Foundation (2014M552112), the Fundamental Research Funds for the National University, China University of Geosciences (Wuhan) (No. 1610491B24).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Ze Deng or Jing Zhu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Han, W., Deng, Z., Chu, J. et al. A parallel online trajectory compression approach for supporting big data workflow. Computing 100, 3–20 (2018). https://doi.org/10.1007/s00607-017-0563-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-017-0563-8

Keywords

Mathematics Subject Classification

Navigation