Abstract
Goods and services trading taxation is the main source of revenue for Brazilian states, therefore its evasion directly affects public services. A particular case of evasion concerns the issuance and use of cold invoices – ones referring to transactions registered at Tax Administration, but which did not actually take place. Following the proposal by Mathews et al. [16], this work reports the application of classic supervised learning algorithms to identify circular trading behaviors involving taxpayers from Brazilian State of Goias, through the analysis of their goods and services trading operations. Experiments showed similar results to the original ones, but pointing to k-Nearest-Neighbours (and not Logistic Regression) as the most accurate technique for this purpose – given Brazilian context’s characteristics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abbass, H.A., Sarker, R., Newton, C.S.: Data Mining: A Heuristic Approach: A Heuristic Approach. IGI Global (2001)
Berrar, D.: Cross-validation (2019)
Breiman, L.: Random forest. Mach. Learn. 45(1), 5–32 (2001)
Buxton, E., Kriz, K., Cremeens, M., Jay, K.: An auto regressive deep learning model for sales tax forecasting from multiple short time series. In: 18th IEEE International Conference on Machine Learning and Applications, pp. 1359–1364 (2019)
Junqué de Fortuny, E., Stankova, M., Moeyersoms, J., Minnaert, B., Provost, F., Martens, D.: Corporate residence fraud detection. In: KDD 2014, pp. 1650–1659 (2014)
Guo, G., Wang, H., Bell, D., Bi, Y., Greer, K.: KNN model-based approach in classification. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds.) OTM 2003. LNCS, vol. 2888, pp. 986–996. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-39964-3_62
Han, J., Pei, J., Kamber, M.: Data Mining: Concepts and Techniques. Elsevier, Amsterdam (2011)
Hayashi, F.Y., et al.: Manipulação do mercado de capitais baseada em informaçães: análise do art. 27-c da lei n\(^{\rm o}\) 6.385/1976 sob a perspectiva da tutela constitucional da liberdade de expressão (2021)
Huang, S., Cai, N., Pacheco, P.P., Narrandes, S., Wang, Y., Xu, W.: Applications of support vector machine (SVM) learning in cancer genomics. Cancer Genomics Proteomics 15(1), 41–51 (2018)
Jatobá, A.E., Moura, D.L., Martins, I.C., Ramos, H.S., Aquino, A.L.: CALT: uma ferramenta automática para cobrança do icms em operações interestaduais. In: IX Workshop de Computação Aplicada em Governo Eletrônico, pp. 143–154. SBC (2021)
Jordão, R.V.D., Silva, M.S., Brasil, H.G., Vasconcelos, M.C.R.L.: Um caleidoscópio de perspectivas sobre o sistema público de escrituração digital-sped. Rev. Contemp. Contabilidade 12(26), 119–140 (2015)
Mathews, J., Mehta, P., Babu, C.S., Kasi Visweswara Rao, S.V.: An algorithmic approach to handle circular trading in commercial taxation system, pp. 67–75 (2018)
Mathews, J., Mehta, P., Sobhan Babu, C., Kasi Visweswara Rao, S.V.: Clustering collusive dealers in commercial taxation system. In: Arai, K., Kapoor, S., Bhatia, R. (eds.) IntelliSys 2018. AISC, vol. 869, pp. 703–717. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-01057-7_54
Mathews, J., Mehta, P., Kuchibhotla, S., Bisht, D., Chintapalli, S.B., Rao, S.V.K.V.: Regression analysis towards estimating tax evasion in goods and services tax. In: 2018 IEEE/WIC/ACM WI, pp. 758–761 (2018)
Priya, Mathews, J., Kumar, K.S., Babu, C., Rao, S., et al.: A collusion set detection in value added tax using Benford’s analysis. In: Arai, K., Kapoor, S., Bhatia, R. (eds.) SAI 2018. Advances in Intelligent Systems and Computing, vol. 858, pp. 909–921. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01174-1_70
Mathews, J., Mehta, P., Suryamukhi, Babu, S.: Link prediction techniques to handle tax evasion. In: 8th ACM IKDD CODS and 26th COMAD, pp. 307–315 (2021)
Mehta, P., Mathews, J., Suryamukhi, K., Kumar, K.S., Babu, C.S.: Predictive modeling for identifying return defaulters in goods and services tax. In: 2018 IEEE 5th International Conference on Data Science and Advanced Analytics, pp. 631–637 (2018)
Mehta, P., et al.: Big data analytics for tax administration. In: Kő, A., Francesconi, E., Anderst-Kotsis, G., Tjoa, A.M., Khalil, I. (eds.) EGOVIS 2019. LNCS, vol. 11709, pp. 47–57. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27523-5_4
Mehta, P., Mathews, J., Kumar, S., Suryamukhi, K., Sobhan Babu, C., Kasi Visweswara Rao, S.V.: Big data analytics for nabbing fraudulent transactions in taxation system. In: Chen, K., Seshadri, S., Zhang, L.-J. (eds.) BIGDATA 2019. LNCS, vol. 11514, pp. 95–109. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-23551-2_7
Michalski, R.S., Carbonell, J.G., Mitchell, T.M.: Machine Learning: An Artificial Intelligence Approach. Springer, Heidelberg (2013)
Moresi, E., et al.: Metodologia da pesquisa. Brasília: Univ. Católica de brasília 108(24), 5 (2003)
Rad, M.S., Shahbahrami, A.: High performance implementation of tax fraud detection algorithm. In: 2015 SPIS, pp. 6–9 (2015)
Silva, D., Carvalho, S.: Data science methods and techniques for goods and services trading taxation: a systematic mapping study (2021)
Speelman, D.: Logistic regression. Corpus Methods Semant. Quant. Stud. Polysemy Synonymy 43, 487–533 (2014)
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)
de Planejamento e Tributação, I.B.: Brasileiros trabalham 149 dias para pagar os impostos de 2021 (2021). https://ibpt.com.br/brasileiros-trabalham-149-dias-para-pagar-os-impostos-de-2021/. Accessed 16 Feb 2022
Witten, I.H., Frank, E., Hall, M.A., Pal, C.J.: Data Mining: Practical Machine Learning Tools and Techniques, vol. 3 (2011)
Yu, J., Qiao, Y., Shu, N., Sun, K., Zhou, S., Yang, J.: Neural network based transaction classification system for Chinese transaction behavior analysis. In: 2019 IEEE International Congress on Big Data (BigDataCongress), pp. 64–71 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Silva, D., Carvalho, S.T., Silva, N. (2022). Comparative Analysis of Classification Algorithms Applied to Circular Trading Prediction Scenarios. In: Kő, A., Francesconi, E., Kotsis, G., Tjoa, A.M., Khalil, I. (eds) Electronic Government and the Information Systems Perspective. EGOVIS 2022. Lecture Notes in Computer Science, vol 13429. Springer, Cham. https://doi.org/10.1007/978-3-031-12673-4_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-12673-4_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-12672-7
Online ISBN: 978-3-031-12673-4
eBook Packages: Computer ScienceComputer Science (R0)