A system based on Hadoop for radar data analysis

Yang, Chi; Yang, Xiaomin; Yang, Feng

doi:10.1007/s12652-018-0980-3

A system based on Hadoop for radar data analysis

Original Research
Published: 22 August 2018

Volume 10, pages 3899–3913, (2019)
Cite this article

Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Chi Yang¹,
Xiaomin Yang¹ &
Feng Yang²

435 Accesses
3 Citations
Explore all metrics

A Correction to this article was published on 01 September 2020

This article has been updated

Abstract

Nowadays, radar technique is widely used in many applications, such as electronic warfare, weather prediction, navigation, and self-driving car. Large amounts of radar data has been generated by the wide use of radar technique. Analyzing radar data has a quite important role in daily life, as well as in military. Finding the frequent sequences in radar data is significant for radar data analysis. However, traditional analysis systems using standalones cannot process big data due to the four features of big data, namely, volume, velocity, variety and value. Many distributed frameworks are promising for processing large scale data sets, such as Hadoop and Spark. Therefore, to deal with the problem of finding frequent sequences from large amounts of radar data, we built a system based on Hadoop and Spark. With the combination of Hadoop and Spark, we can store big data, as well as analyze big radar data more easily. In the proposed system, Hadoop distributed file system offers stable data storage, and Spark offers efficient in-memory calculation. In this paper, a three-node Hadoop–Spark cluster was built to perform the distributed data mining algorithm. Additionally, to make the analysis of radar data accurate, we proposed ideas of preprocessing radar data and post processing mining results. Experimental results show that the system we proposed can analyze the large amounts of radar data efficiently and accurately.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A data structure perspective to the RDD-based Apriori algorithm on Spark

Article 07 August 2019

SWEclat: a frequent itemset mining algorithm over streaming data using Spark Streaming

Article Open access 04 February 2020

Multi-agent distributed data mining approach for classifying meteorology data: case study on Iran’s synoptic weather stations

Article 15 May 2017

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Artificial Intelligence

Change history

01 September 2020
The below affiliation for author Chi Yang was missed to be included in the original article.

References

Achar A, Ibrahim A, Sastry PS (2013) Pattern-growth based frequent serial episode discovery. Data Knowl Eng 87(9):91–108
Article Google Scholar
Agrawal R, Srikant R (1995) Mining sequential patterns. Proc Eleventh Inter Conf Data Eng 31(6):3–14
Article Google Scholar
Daszykowski M, Walczak B, Massart DL (2002) Looking for natural patterns in analytical data. 2. Tracing local density with optics. J Chem Inf Comput Sci 42(3):500
Article Google Scholar
Dean J, Ghemawat S (2010) Mapreduce: a flexible data processing tool. Commun ACM 53(1):72–77
Article Google Scholar
Ester M (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96), pp 226–231
Fournier-Viger P, Wu CW, Gomariz A, Tseng VS (2014) VMSP: efficient vertical mining of maximal sequential patterns. In: Canadian conference on artificial intelligence, pp 109–120
Garcagil D, Ramrezgallego S, Garca S, Herrera F (2017) A comparison on scalability for batch big data processing on apache spark and apache flink. Big Data Anal 2(1):1
Article Google Scholar
Ghazi MR, Gangodkar D (2015) Hadoop, mapreduce and HDFS: a developers perspective. Proc Comput Sci 48:45–50
Article Google Scholar
Merrill IS (2001) Introduction to radar systems. McGraw-Hill, Boston, pp 607–609
Google Scholar
Ji X, Bailey J, Dong G (2007) Mining minimal distinguishing subsequence patterns with gap constraints. Knowl Inf Syst 11(3):259–286
Article Google Scholar
Jian L, Yang X, Zhou Z, Zhou K, Liu K (2018) Multi-scale image fusion through rolling guidance filter. Future Gener Comput Syst 83:1399–1420
Article Google Scholar
Jiang QB, Hong guang MA, Xi A (2006) A novel nonmatched detection algorithm of radar pulse signal. Mod Radar 28(11):36–40
Google Scholar
Koliopoulos AK, Yiapanis P, Tekiner F, Nenadic G, Keane J (2015) A parallel distributed weka framework for big data mining using spark. 9–16
Lin X, Wang P, Wu B (2014) Log analysis in cloud computing environment with hadoop and spark. In: IEEE international conference on broadband network and multimedia technology, pp 273–276
Malewicz G, Austern MH, Bik AJC, Dehnert JC, Horn I, Leiser N, Czajkowski G (2010) Pregel: a system for large-scale graph processing. In: ACM SIGMOD international conference on management of data, pp 135–146
Meng X, Bradley J, Yavuz B, Sparks E, Venkataraman S, Liu D, Freeman J, Tsai D, Amde M, Owen S (2015) Mllib: machine learning in apache spark. J Mach Learn Res 17(1):1235–1241
MathSciNet MATH Google Scholar
Pei J, Han J, Mortazaviasl B, Pinto H, Chen Q, Dayal U, Hsu MC (2001) Prefixspan: mining sequential patterns efficiently by prefix-projected pattern growth. In: International conference on data engineering, p 215
Rathore MMU, Ahmad A, Paul A (2017) Hadoop-based intelligent care system (hics): analytical approach for big data in iot. ACM Trans Internet Technol 2016(1):1–24
Article Google Scholar
Shi J, Qiu Y, Minhas UF, Jiao L, Wang C, Reinwald B (2015) Clash of the titans: mapreduce vs. spark for large scale data analytics. Proc Vldb Endow 8(13):2110–2121
Article Google Scholar
Shvachko KV (2010) HDFS scalability: the limits to growth. USENIX Mag 35:6–16
Google Scholar
Srikant R, Agrawal R (1996) Mining sequential patterns: generalizations and performance improvements. In: International conference on extending database technology, Springer, pp 1–17
White T, Cutting D (2012) Hadoop : the definitive guide, vol 215(11). Oreilly Media Inc, Gravenstein Highway North, pp 1–4
Google Scholar
Yang X, Wu W, Liu K, Kim PW, Sangaiah AK, Jeon G (2018) Multi-semi-couple super-resolution method for edge computing. IEEE Access PP(99):1–1
Google Scholar
Zaharia M, Chowdhury M, Franklin MJ, Shenker S, Stoica I (2010) Spark: cluster computing with working sets. In: Usenix conference on hot topics in cloud computing, pp 10–10

Download references

Acknowledgements

The research in our paper is sponsored by National Natural Science Foundation of China (Nos. 61701327, 61711540303, 61771378), and Science Foundation of Sichuan Science and Technology Department (No. 2018GZ0178).

Author information

Authors and Affiliations

College of Electronics and Information Engineering, Sichuan University, Chengdu, 610064, Sichuan, China
Chi Yang & Xiaomin Yang
Science and Technology on Electronic Information Control Laboratory, Chengdu, 610036, China
Feng Yang

Authors

Chi Yang
View author publications
You can also search for this author inPubMed Google Scholar
Xiaomin Yang
View author publications
You can also search for this author inPubMed Google Scholar
Feng Yang
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Xiaomin Yang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, C., Yang, X. & Yang, F. A system based on Hadoop for radar data analysis. J Ambient Intell Human Comput 10, 3899–3913 (2019). https://doi.org/10.1007/s12652-018-0980-3

Download citation

Received: 31 March 2018
Accepted: 11 August 2018
Published: 22 August 2018
Issue Date: October 2019
DOI: https://doi.org/10.1007/s12652-018-0980-3

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A system based on Hadoop for radar data analysis

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A data structure perspective to the RDD-based Apriori algorithm on Spark

SWEclat: a frequent itemset mining algorithm over streaming data using Spark Streaming

Multi-agent distributed data mining approach for classifying meteorology data: case study on Iran’s synoptic weather stations

Explore related subjects

Change history

01 September 2020

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now