Big data-based prediction of terrorist attacks

https://doi.org/10.1016/j.compeleceng.2019.05.013Get rights and content

Abstract

An optimised hybrid classifier is proposed for the prediction of terrorist attacks. Hybrid classifier is designed using big data. It puts forward a framework that includes data collection, preprocessing, hybrid classification mining, and classifier testing as a single unit in predicting the terrorist attacks. The genetic algorithm is used to optimise the weight of each single classifier to improve the prediction accuracy of the hybrid classifier. The results reveal that the hybrid classifier is superior to the single classifier in prediction accuracy.

Introduction

The shadow of terrorist attacks has loomed globally since the 9–11 attacks. Although countries have stepped up their efforts to prevent and control them, terrorist attacks have not been far from people's lives. The Global Terrorism Database (GTD) shows that a total of 25,903 terrorist attacks took place around the world between 2000 and 2012; the average is approximately 2000 per year and more than 5 times per day [1]. There are many underlying correlations and laws in the numbers associated with terrorist attacks. If these hidden phenomena can effectively guide the construction of anti-terrorism early warning systems, they can help solve the difficult problems in counter-terrorism decision-making and conduct regular identification and prediction. This has become a research topic in many fields of informatics, as it improves the risk management and accurate warning of terrorist attacks through the use of classification technology to mine large data about terrorist attacks in depth and minimise the unknown risk of terrorist attacks.

The University of Maryland built the GTD as an open-source collection that includes data sets from the 1970s to 2014 on global terrorist activity. The database has collected and combed approximately 140,000 terrorist incidents worldwide from 1970 to 2014 and includes information about the timing, location, use of weapons and targets, number of casualties, and identifiable responsible parties. The average terrorist attack contains up to 45 messages, where the one with the most has more than 120. GTD provides researchers with comprehensive, reliable, and open-source data, and helps uncover the underlying structure behind terrorist attacks. For example, reference [2] is a data source based on the GTD's listed terrorist attacks in India that analyses the type and number of attacks related to the attack organisation. Reference [3] divides the terrorist attacks in GTD into transnational events and domestic events, analyses and compares the impact of the both, and finds that transnational terrorism will bring more negative impact on a country's economic growth than domestic terrorism. The research results also show that domestic terrorism can evolve into transnational terrorism. The target country cannot ignore the domestic terrorism of its neighbours in order to reduce the risk of terrorist attacks in the country, and it should cooperate to eliminate terrorism with its neighbours. Literature [4] uses GTD as a data source, a visual environment is introduced to explore various types of terrorist attacks, including attack target type and attack strategy.

The paper is organised as follows: Section 2 discusses the research methods, i.e., data collection, preprocessing, and construction of the classifier. The analysis of the experiments conducted is presented in Section 3. Section 4 presents the conclusion.

Section snippets

Research methods

In this paper, a large-data prediction framework for terrorist attacks based on a mixed classifier is presented (see Fig. 1). The framework includes four steps: data collection, data preprocessing, construction of the mixed classifier, and data testing [5], [6], [7], [8]. In the data-collection phase, the framework learns about the types of attacks and attributes that GTD contains. In addition, more than 140,000 terrorist attacks in the database were examined in terms of the number of terrorist

Analysis and discussion of experimental results

The iterative curve of GA optimisation integrated classifier is presented in Fig. 8 to verify the convergence of the proposed algorithm in this paper. It can be observed that the optimal value of the population shows a step-down trend. Furthermore, its convergence speed is less than 30 times, and its prediction error rate is stable at 6%. The population mean curve fluctuates between 7.5% and 8.5%. The prediction accuracy is still higher than that of the optimal single classifier (bagging 91.5%)

Conclusion

Terrorists attacks have been increasing in recent times, and prediction of these attacks is almost impossible. The GTD was used as big data, and hybrid classifiers were trained to predict new attacks. The data, after preprocessing for noise, was categorised based on the type of attacks. Then, the hybrid classifier with KNN, decision tree C4.5, bagging, and SVM was designed with a GA to predict attacks. Experiments were also conducted for individual classifiers without the hybrid approach. From

Conflict of interest

None.

Meng Xi is currently a lecturer in the department of Crime Investigation and Counterterrorism of the People's Public Security University of China. Her research interest is mainly in the area of information visualization, counterterrorism, and semantic analysis. She has published several research papers in scholarly journals in the above research areas and has hosted two ministry-level projects.

References (25)

  • A.A. Mutlag et al.

    Enabling technologies for fog computing in healthcare IoT systems

    Futur Gener Comput Syst

    (2019)
  • G LaFree et al.

    Introducing the global terrorism database

    Terror Political Violence

    (2007)
  • A Khan et al.

    Terrorism and India: an economic perspective

    Qual Quant

    (2016)
  • W Enders et al.

    Domestic versus transnational terrorism: data, decomposition, and dynamics

    J Peace Res

    (2011)
  • D Guo et al.

    Visualizing patterns in a global terrorism incident database

    Environ Plann B Plan Des

    (2007)
  • KM Leung

    Naive Bayesian classifier

    (2007)
  • J J Hopfield

    Artificial neural networks

    IEEE Circuits Devices Mag

    (1988)
  • R. Kohavi

    Scaling up the accuracy of Naive-Bayes classifiers: a decision-tree hybrid

  • J Chen et al.

    An improved ID3 decision tree algorithm

  • S Nizamani et al.

    Detecting terrorism incidence type from news summary

    (2012)
  • S Shafiq et al.

    Attack type prediction using hybrid classifier

  • N Stepenosky et al.

    Majority vote and decision template based ensemble classifiers trained on event related potentials for early diagnosis of Alzheimer's Disease

    Int J Comput Appl

    (2006)
  • Cited by (22)

    • External blast flow field evolution and response mechanism of single-layer reticulated dome structure

      2023, Defence Technology
      Citation Excerpt :

      Statistics of global terrorism database (GTD) reveals that nearly 200,000 incidents of terrorist bombing have taken place in 1970–2019 [1], and in recent times there are on average 2000 incidents per year (i.e., more than five per day) [2].

    • A two-step machine learning method for casualty prediction under emergencies

      2022, Journal of Safety Science and Resilience
      Citation Excerpt :

      The subsequent explosion killed 15 people and injured at least an additional 151. To reduce the risk of emergencies to people and society, risk prediction and assessment have been widely used [1–8]. For instance, Paté-Cornell and Guikema [7] proposed a risk analysis model based on Bayesian probability, which further contributed to establishing a decision-making model for setting priorities among threats and countermeasures.

    • Investigation into the role of human and organizational factors in security work against terrorism at large-scale events

      2020, Safety Science
      Citation Excerpt :

      Therefore, LSE security management personnel should continually monitor the need to upgrade cyber security protection systems and update the training of cyber security professionals. In addition, the cumulative frequency of L5F1 is also 0.33, indicating that terrorism has posed a serious threat to the security and stability of various nations and regions for decades (Meng et al., 2019). China is facing a serious threat of terrorism (Zhang, 2019).

    View all citing articles on Scopus

    Meng Xi is currently a lecturer in the department of Crime Investigation and Counterterrorism of the People's Public Security University of China. Her research interest is mainly in the area of information visualization, counterterrorism, and semantic analysis. She has published several research papers in scholarly journals in the above research areas and has hosted two ministry-level projects.

    Nie Lingyu is currently an undergraduate student in the school of Investigation and Counterterrorism at Chinese People's Public Security University. Her research interest is mainly in the area of public security intelligence. She has participated some scientific research projects and made some achievements.

    Song Jiepeng is currently an undergraduate student in the school of Investigation and Counterterrorism at Chinese People's Public Security University. His research interest is mainly in the area of public security intelligence, and he has participated some scientific research projects.

    This paper is for CAEE special section SI-hai. Reviews processed and recommended for publication to the Editor-in-Chief by Associate Editor Dr. G. Ramirez Gonzalez.

    View full text