Abstract
For centuries, crime has been viewed as random because it is based on human behavior; even now, it incorporates an excessive number of factors for current machine learning models to forecast accurately. In this work, we tend to discuss the early crime prediction results from a model developed using the data from the Chicago crime dataset. In any case, with a superior execution future crime is to anticipated accurately, it is a testing assignment as a result of the increase in several crimes in present days. Therefore, the crime foreseeing method is foremost, and it identifies the future crimes and number of crimes are degraded. In this paper, we built up a model to anticipate future crime occurrences at a future time and also predict which type of crime may be happening in a given area. First, we analyze how certain crime features like given a date, time and some geologically important relevant features such as latitude and longitude. Second, we discuss several analytics techniques we used to find meaning in our data, such as LASSO feature selection analysis, classification models like naïve Bayes and SVM. Finally, we select the best model for foreseeing crime type and seriousness of the crime for giving different features.







Similar content being viewed by others
References
Kochar B, Chhillar R et al (2012) An effective data warehousing system for RFID using novel data cleaning, data transformation, and loading techniques. Arab J Inf Technol 9(3):208–216
Santhi P, Bhaskaran VM et al (2010) Performance of clustering algorithms in healthcare database. Int J Adv Comput Sci 2(1):26–31
Wahbeh AH, Al-Radaideh QA et al (2011) A comparison study between data mining tools over some classification methods. Int J Adv Comput Sci Appl (Special Issue):18–26
Han J, Kamber M et al (2006) Data mining: concepts and techniques. Morgan Kaufmann Publishers, San Francisco
Li G, Wang Y et al (2012) A privacy-preserving classification method based on singular value decomposition. Arab J Inf Technol 9(6):529–534
Gorr W, Olligschlaeger A, Thompson Y (2003) Short-term forecasting of crime. Int J Forecast 19(4):579–594
Chen P, Yuan H, Shu X et al (2008) Forecasting crime using the ARIMA model. In: Proceedings of the 5th international conference on fuzzy systems and knowledge discovery. pp 627–630
Shingleton JS et al (2012) Crime trend prediction using regression models for salinas, California. Naval Postgraduate School, Monterey
Kianmehr K, Alhajj R et al (2006) Crime hot-spots prediction using SVM. In: Proceedings of the international conference on computer systems and applications. pp 952–959
Wang P, Mathieu R, Ke J, Cai HJ et al (2010) Predicting criminal recidivism with SVM. In: Proceedings of the international conference on management and service science. pp 1–9
Nath SV et al (2006) Crime pattern detection using data mining. Oracle Corporation, Redwood Shores
Yu C-H, Ward MW, Morabito M, Ding W et al (2011) Crime forecasting using data mining techniques. University of Massachusetts Boston, Boston
Wang T, Rudin C, Wagner D, Sevier R et al (2013) Learning to detect patterns of crime. Massachusetts Institute of Technology, Cambridge
Kang H-W, Kang H-B (2017) Prediction of crime occurrence from multimodal data using deep learning. PLoS ONE 12(4):e0176244
Lin Y-L, Chen T-Y, Yu L-C (2017) Using machine learning to assist crime prevention. In: 2017 6th IIAI international congress on advanced applied informatics
Ahishakiye E, Taremwa D, Opiyo E, Niyonzima I (2017) Crime prediction using decision tree (J48) classification algorithm. Int J Comput Inf Technol. 06(03) ISSN: 2279-0764
BalaAnand M, Sankari S, Sowmipriya R, Sivaranjani S. Identifying fake user’s in social networks using non verbal behavior. Int J Technol Eng Syst (IJTES) 7(2):157–161
Maram B, Gnanasekar JM, Manogaran G et al (2018) SOCA. https://doi.org/10.1007/s11761-018-0249-x
https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-present-Dashboard/5cd6-ry5g
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Roy Stat Soc 58(1):267–288
Liao R, Wang X, Li L, Qin Z et al (2010) A novel serial crime prediction model based on Bayesian learning theory. Proc IEEE Int Conf Mach Learn Cybern 4:1757–1762
Gnaneswara Rao N, Sravani T, Vijaya Kumar V (2014) OCRM: optimal cost region matching similarity measure for region based image retrieval. Int J Multimedia Ubiquitous Eng 9(4):327–342
Gnaneswara Rao N, Vijaya Kumar V (2007) Texturebased image indexing and retrieval. In: VISAPP 2007 international conference on computer vision theory and applications, Barcelona, Spain, March 2007, pp 177-181
Kang H-W, Kang H-B et al (2017) Prediction of crime occurrence from multimodal data using deep learning. PLoS ONE 12(4):e0176244
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Nitta, G.R., Rao, B.Y., Sravani, T. et al. LASSO-based feature selection and naïve Bayes classifier for crime prediction and its type. SOCA 13, 187–197 (2019). https://doi.org/10.1007/s11761-018-0251-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11761-018-0251-3