Abstract
Brazil has a large prison population, which places it as the third country in the world with the most incarceration rate. In addition, the criminal caseload is increasing in Brazilian Judiciary, which is encouraging AI usage to advance in e-Justice. Within this context, the paper presents a case study with a dataset composed of 2,200 judgments from the Supreme Federal Court (STF) about pre-trial detention. These are cases in which a provisional prisoner requests for freedom through habeas corpus. We applied Machine Learning (ML) and Natural Language Processing (NLP) techniques to predict whether STF will release or not the provisional prisoner (text classification), and also to find a reliable association between the judgment outcome and the prisoners’ crime and/or the judge responsible for the case (association rules). We obtained satisfactory results in both tasks. Classification results show that, among the models used, Convolutional Neural Network (CNN) is the best, with 95% accuracy and 0.91 F1-Score. Association results indicate that, among the rules generated, there is a high probability of drug law crimes leading to a dismissed habeas corpus (which means the maintenance of pre-trial detention). We concluded that STF has not interfered in first degree decisions about pre-trial detention and that is necessary to discuss the drug criminalization in Brazil. The main contribution of the paper is to provide models that can support judges and pre-trial detainees.
E-Government UFSC Research Group - egov.ufsc@gmail.com.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Prisoners in Brazil Homepage. https://bityli.com/Bk8Ir. Accessed 10 Sept 2020
CONECTAS Homepage. Brazil ranks as the third country with the largest prison population in the world. https://bityli.com/lN18R. Accessed 10 Sept 2020
CNJ.: Justiça em Números 2020: ano-base 2019. CNJ, Brasília (2020). https://bityli.com/ORZNy. Accessed 4 Jan 2021
CNJ Homepage, Resolution no. 332 of August 21, 2020. https://atos.cnj.jus.br/files/original191707202008255f4563b35f8e8.pdf. Accessed 4 Jan 2021
CNJ Homepage, Ordinance no. 271 of December 4, 2020. https://atos.cnj.jus.br/files/original234208202012155fd949d04d990.pdf. Accessed 4 Jan 2021
Brazilian Criminal Procedure Code Homepage. http://www.planalto.gov.br/ccivil_03/decreto-lei/del3689.htm. Accessed 25 Mar 2021
Brazilian Federal Constitution Homepage. http://www.planalto.gov.br/ccivil_03/constituicao/constituicao.htm. Accessed 25 Mar 2021
Russell, S.J., Norvig, P., Davis, E.: Artificial Intelligence: A Modern Approach, 3rd edn. Prentice Hall, New Jersey (2013)
Géron, A.: Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, And Techniques to Build Intelligent Systems, 2nd edn. O’Reilly Media, Inc., Sebastopol (2019)
Bobadilla, J., Ortega, F., Hernando, A., Gutiérrez, A.: Recommender systems survey. Knowl.-Based Syst. 46, 109–132 (2013). https://doi.org/10.1016/j.knosys.2013.03.012,lastaccessed2021/04/02
Cios, K.J., Pedrycz, W., Swiniarski, R.W., Kurgan, L.A. (orgs.).: Data Mining: A Knowledge Discovery Approach. Springer, New York (2007). https://doi.org/10.1007/978-0-387-36795-8
Brin, S., Motwani, R., Silverstein, C.: Beyond market baskets: generalizing association rules to correlations. ACM SIGMOD Rec. 26(2), 265–276 (1997). https://doi.org/10.1145/253262.253327
Katz, D., Bommarito, M., Blackman, J.:. Predicting the behavior of the Supreme Court of the United States: a general approach. ArXiv, 1407.6333 v1, pp. 1–17. (2014)
Katz, D., Bommarito, M., Blackman, J.: A general approach for predicting the behavior of the Supreme Court of the United States. PLoS ONE (2017). https://doi.org/10.1371/journal.pone.0174698
Aletras, N., Tsarapatsanis, D., Preoţiuc-Pietro, D., Lampos, V.: Predicting judicial decisions of the European Court of human rights: a natural language processing perspective. Peer J. Comput. Sci. 2(93), 1–19 (2016)
Șulea, O., Zampieri, M., Vela, M., Genabith, J.: Predicting the Law area and decisions of French Supreme Court cases. In: Proceedings of Recent Advances in Natural Language Processing, Varna, Bulgaria. 2–8 September 2017, pp. 716–722. INCOMA Ltd Shoumen (2017)
Virtucio, M., et al.: Predicting decisions of the philippine supreme court using natural language processing and machine learning. In: IEEE International Conference on Computer Software and Applications, vol. 42. Tokyo (2018). https://doi.org/10.1109/COMPSAC.2018.10348. Accessed 6 Apr 2021
Silva, N., et al.: Document type classification for Brazil’s Supreme Court using a convolutional neural network. In: International Conference on Forensic Computer Science and Cyber Law, pp. 7–11. HTCIA (2018)
Yin, R.K.: Estudo de Caso: Planejamento e métodos. Bookman (2015)
Demšar, J., et al.: Orange: data mining toolbox in python. J. Mach. Learn. Res. 14(1), 2349–2353 (2013)
Keras GitHub Homepage. https://github.com/fchollet/keras. Accessed 10 Mar 2021
Brazilian Supreme Federal Court Homepage. http://portal.stf.jus.br. Accessed 10 Apr 2021
Van Rossum, G., Drake, F.L.: Python 3 Reference Manual. Create Space, Scotts Valley, CA (2009).
Bird, S., Loper, E.: NLTK: the natural language toolkit. In: Proceedings of the ACL Interactive Poster and Demonstration Sessions, pp. 214–217. Association for Computational Linguistics, Barcelona, Spain (2004)
García, S., Luengo, J., Herrera, F.: Data Preprocessing in Data Mining. Springer International Publishing, Switzerland (2015)
Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 3rd edn. Stanford University, Draft (2019)
Aggarwal, C.C.: Machine Learning for Text. In: Machine Learning for Text, pp. 1–16. Springer International Publishing, Cham (2018). https://doi.org/10.1007/978-3-319-73531-3
Kotu, V., Deshpande, B.: Data Science: Concepts and Practice, 2nd edn. Morgan Kaufmann (Elsevier), Cambridge (2019)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: 1st International Conference on Learning Representations, ICLR 2013 - Workshop Track Proceedings, pp. 1–12 (2013)
Uijlings, J.R., Van De Sande, K.E., Gevers, T., Smeulders, A.W.: Selective search for object recognition. Int. J. Comput. Vis. 104(2), 154–171 (2013)
Meriño, L.M., et al.: A bag-of-words model for task-load prediction from EEG in complex environments. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1227–1231 (2013)
Milosevic, N., Dehghantanha, A., Choo, K.K.R.: Machine learning aided android malware classification. Comput. Electr. Eng. 61, 266–274 (2017)
Kowsari, K., Meimandi, K.J., Heidarysafa, M., Mendu, S., Barnes, L., Brown, D.: Text classification algorithms: a survey. Inf. (Switzerland) 10(4), 150 (2019)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. 4171–4186 (2018)
Brown, T.B., et al.: Language models are few-shot learners. (2020)
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). vol. 19, pp. 1532–1543. Association for Computational Linguistics, Stroudsburg (2014)
Dal Pont, T.R., Sabo, I.C., Hübner, J.F., Rover, A.J.: Impact of Text Specificity and Size on Word Embeddings Performance: An Empirical Evaluation in Brazilian Legal Domain. In: Cerri, R., Prati, R.C. (eds.) BRACIS 2020. LNCS (LNAI), vol. 12319, pp. 521–535. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-61377-8_36
Sewak, M., Sahay, S.K., Rathore, H.: Comparison of deep learning and the classical machine learning algorithm for the malware detection. In: 2018 19th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), pp. 293–296. IEEE (2018)
Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). vol. 2017, pp. 1746–1751. Association for Computational Linguistics, Stroudsburg (2014)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. ACM SIGMOD Rec. 29(2), 1–12 (2000)
Agarwal, R.C., Aggarwal, C.C., Prasad, V.V.V.: Depth first generation of long patterns. In: Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2000), vol. 2, pp. 108–118. ACM Press, New York (2000)
Orange Data Mining Homepage. https://orangedatamining.com. Accessed 9 Apr 2021
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Dal Pont, T.R. et al. (2021). Classification and Association Rules in Brazilian Supreme Court Judgments on Pre-trial Detention. In: Kö, A., Francesconi, E., Kotsis, G., Tjoa, A.M., Khalil, I. (eds) Electronic Government and the Information Systems Perspective. EGOVIS 2021. Lecture Notes in Computer Science(), vol 12926. Springer, Cham. https://doi.org/10.1007/978-3-030-86611-2_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-86611-2_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86610-5
Online ISBN: 978-3-030-86611-2
eBook Packages: Computer ScienceComputer Science (R0)