Skip to main content
Log in

LASSO-based feature selection and naïve Bayes classifier for crime prediction and its type

  • Original Research Paper
  • Published:
Service Oriented Computing and Applications Aims and scope Submit manuscript

Abstract

For centuries, crime has been viewed as random because it is based on human behavior; even now, it incorporates an excessive number of factors for current machine learning models to forecast accurately. In this work, we tend to discuss the early crime prediction results from a model developed using the data from the Chicago crime dataset. In any case, with a superior execution future crime is to anticipated accurately, it is a testing assignment as a result of the increase in several crimes in present days. Therefore, the crime foreseeing method is foremost, and it identifies the future crimes and number of crimes are degraded. In this paper, we built up a model to anticipate future crime occurrences at a future time and also predict which type of crime may be happening in a given area. First, we analyze how certain crime features like given a date, time and some geologically important relevant features such as latitude and longitude. Second, we discuss several analytics techniques we used to find meaning in our data, such as LASSO feature selection analysis, classification models like naïve Bayes and SVM. Finally, we select the best model for foreseeing crime type and seriousness of the crime for giving different features.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Kochar B, Chhillar R et al (2012) An effective data warehousing system for RFID using novel data cleaning, data transformation, and loading techniques. Arab J Inf Technol 9(3):208–216

    Google Scholar 

  2. Santhi P, Bhaskaran VM et al (2010) Performance of clustering algorithms in healthcare database. Int J Adv Comput Sci 2(1):26–31

    Google Scholar 

  3. Wahbeh AH, Al-Radaideh QA et al (2011) A comparison study between data mining tools over some classification methods. Int J Adv Comput Sci Appl (Special Issue):18–26

  4. Han J, Kamber M et al (2006) Data mining: concepts and techniques. Morgan Kaufmann Publishers, San Francisco

    MATH  Google Scholar 

  5. Li G, Wang Y et al (2012) A privacy-preserving classification method based on singular value decomposition. Arab J Inf Technol 9(6):529–534

    Google Scholar 

  6. Gorr W, Olligschlaeger A, Thompson Y (2003) Short-term forecasting of crime. Int J Forecast 19(4):579–594

    Article  Google Scholar 

  7. Chen P, Yuan H, Shu X et al (2008) Forecasting crime using the ARIMA model. In: Proceedings of the 5th international conference on fuzzy systems and knowledge discovery. pp 627–630

  8. Shingleton JS et al (2012) Crime trend prediction using regression models for salinas, California. Naval Postgraduate School, Monterey

    Google Scholar 

  9. Kianmehr K, Alhajj R et al (2006) Crime hot-spots prediction using SVM. In: Proceedings of the international conference on computer systems and applications. pp 952–959

  10. Wang P, Mathieu R, Ke J, Cai HJ et al (2010) Predicting criminal recidivism with SVM. In: Proceedings of the international conference on management and service science. pp 1–9

  11. Nath SV et al (2006) Crime pattern detection using data mining. Oracle Corporation, Redwood Shores

    Book  Google Scholar 

  12. Yu C-H, Ward MW, Morabito M, Ding W et al (2011) Crime forecasting using data mining techniques. University of Massachusetts Boston, Boston

    Book  Google Scholar 

  13. Wang T, Rudin C, Wagner D, Sevier R et al (2013) Learning to detect patterns of crime. Massachusetts Institute of Technology, Cambridge

    Book  Google Scholar 

  14. Kang H-W, Kang H-B (2017) Prediction of crime occurrence from multimodal data using deep learning. PLoS ONE 12(4):e0176244

    Article  MathSciNet  Google Scholar 

  15. Lin Y-L, Chen T-Y, Yu L-C (2017) Using machine learning to assist crime prevention. In: 2017 6th IIAI international congress on advanced applied informatics

  16. Ahishakiye E, Taremwa D, Opiyo E, Niyonzima I (2017) Crime prediction using decision tree (J48) classification algorithm. Int J Comput Inf Technol. 06(03) ISSN: 2279-0764

  17. BalaAnand M, Sankari S, Sowmipriya R, Sivaranjani S. Identifying fake user’s in social networks using non verbal behavior. Int J Technol Eng Syst (IJTES) 7(2):157–161

  18. Maram B, Gnanasekar JM, Manogaran G et al (2018) SOCA. https://doi.org/10.1007/s11761-018-0249-x

    Article  Google Scholar 

  19. https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-present-Dashboard/5cd6-ry5g

  20. Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Roy Stat Soc 58(1):267–288

    MathSciNet  MATH  Google Scholar 

  21. Liao R, Wang X, Li L, Qin Z et al (2010) A novel serial crime prediction model based on Bayesian learning theory. Proc IEEE Int Conf Mach Learn Cybern 4:1757–1762

    Google Scholar 

  22. Gnaneswara Rao N, Sravani T, Vijaya Kumar V (2014) OCRM: optimal cost region matching similarity measure for region based image retrieval. Int J Multimedia Ubiquitous Eng 9(4):327–342

    Article  Google Scholar 

  23. Gnaneswara Rao N, Vijaya Kumar V (2007) Texturebased image indexing and retrieval. In: VISAPP 2007 international conference on computer vision theory and applications, Barcelona, Spain, March 2007, pp 177-181

  24. Kang H-W, Kang H-B et al (2017) Prediction of crime occurrence from multimodal data using deep learning. PLoS ONE 12(4):e0176244

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gnaneswara Rao Nitta.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nitta, G.R., Rao, B.Y., Sravani, T. et al. LASSO-based feature selection and naïve Bayes classifier for crime prediction and its type. SOCA 13, 187–197 (2019). https://doi.org/10.1007/s11761-018-0251-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11761-018-0251-3

Keywords

Navigation