Abstract
This paper describes an application of textual similarity techniques in the Legal Informatics domain. In European law, a relevant interest relates to the transposition of EU directives by the Member States, which can be complete, partial, or eventually absent. As part of an European project, legal experts annotated transpositions of six directives on a per-article basis. Following an established NLP pipeline, we explore a similarity-based technique to identify correspondences between transpositions of national implementations. Early results are promising and show the role that Artificial Intelligence may play within the process of harmonization and standardization of domestic legal systems as a result of the adoption of EU legislation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
From the scikit-learn python library sklearn.metrics.pairwise we adopted cosine_similarity method.
References
Amado, A., Cortez, P., Rita, P., Moro, S.: Research trends on big data in marketing: a text mining and topic modeling based literature analysis. Eur. Res. Manag. Bus. Econ. 24(1), 1–7 (2018)
Amantea, I.A., Caro, L.D., Humphreys, L., Nanda, R., Sulis, E.: Modelling norm types and their inter-relationships in EU directives. In: Ashley, K.D., et al. (eds.) Proceedings of the Third Workshop on Automated Semantic Analysis of Information in Legal Texts co-located with the 17th International Conference on Artificial Intelligence and Law (ICAIL 2019), Montreal, QC, Canada, 21 June 2019. CEUR Workshop Proceedings, vol. 2385. CEUR-WS.org (2019). http://ceur-ws.org/Vol-2385/paper8.pdf
Amantea, I.A., Robaldo, L., Sulis, E., Boella, G., Governatori, G.: Semi-automated checking for regulatory compliance in e-health. In: 25th International Enterprise Distributed Object Computing Workshop, EDOC Workshop 2021, Gold Coast, Australia, 25–29 October 2021, pp. 318–325. IEEE (2021). https://doi.org/10.1109/EDOCW52865.2021.00063
Andenas, M., Andersen, C.B.: Theory and Practice of Harmonisation. Edward Elgar Publishing (2012)
Ashley, K.D.: Artificial Intelligence and Legal Analytics: New Tools for Law Practice in the Digital Age. Cambridge University Press, Cambridge (2017)
Bhattacharya, P., Ghosh, K., Pal, A., Ghosh, S.: Methods for computing legal document similarity: a comparative study. CoRR abs/2004.12307 (2020). https://arxiv.org/abs/2004.12307
Biasiotti, M., Francesconi, E., Palmirani, M., Sartor, G., Vitali, F.: Legal informatics and management of legislative documents. Global Center for ICT in Parliament Working Paper 2 (2008)
Boella, G., Di Caro, L., Humphreys, L., Robaldo, L., Rossi, P., van der Torre, L.: Eunomos, a legal document and knowledge management system for the web to provide relevant, reliable and up-to-date information on the law. Artif. Intell. Law 24(3), 245–283 (2016)
Boella, G., Di Caro, L., Leone, V.: Semi-automatic knowledge population in a legal document management system. Artif. Intell. Law 27(2), 227–251 (2018). https://doi.org/10.1007/s10506-018-9239-8
Boella, G., Di Caro, L., Rispoli, D., Robaldo, L.: A system for classifying multi-label text into EuroVoc. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Law, pp. 239–240 (2013)
Cox, M.A., Cox, T.F.: Multidimensional scaling. In: Chen, C., Härdle, W., Unwin, A. (eds.) Handbook of Data Visualization, pp. 315–347. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-33037-0_14
Dimitrakopoulos, D.G.: The transposition of EU law:‘post-decisional politics’ and institutional autonomy. Eur. Law J. 7(4), 442–458 (2001)
Durante, M.: Computational Power: The Impact of ICT on Law, Society and Knowledge, Routledge (2021)
Elekes, Á., Schäler, M., Böhm, K.: On the various semantics of similarity in word embedding models. In: 2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017, Toronto, ON, Canada, 19–23 June 2017, pp. 139–148. IEEE Computer Society (2017). https://doi.org/10.1109/JCDL.2017.7991568
Feldman, R., Sanger, J.: The Text Mining Handbook - Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press, Cambridge (2007)
Ferreira-Mello, R., André, M., Pinheiro, A., Costa, E., Romero, C.: Text mining in education. Wiley Interdisc. Rev.: Data Min. Knowl. Discov. 9(6), e1332 (2019)
Friedrich, R., Luzzatto, M., Ash, E.: Entropy in legal language. In: Aletras, N., Androutsopoulos, I., Barrett, L., Meyers, A., Preotiuc-Pietro, D. (eds.) Proceedings of the Natural Legal Language Processing Workshop 2020 co-located with the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD 2020), Virtual Workshop, 24 August 2020. CEUR Workshop Proceedings, vol. 2645, pp. 25–30. CEUR-WS.org (2020). http://ceur-ws.org/Vol-2645/paper4.pdf
Haverland, M., Steunenberg, B., Van Waarden, F.: Sectors at different speeds: analysing transposition deficits in the European union. JCMS: J. Common Mark. Stud. 49(2), 265–291 (2011)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177 (2004)
Humphreys, L., Santos, C., Di Caro, L., Boella, G., Van Der Torre, L., Robaldo, L.: Mapping recitals to normative provisions in EU legislation to assist legal interpretation. In: JURIX, pp. 41–49 (2015)
John, A.K., Di Caro, L., Robaldo, L., Boella, G.: Legalbot: a deep learning-based conversational agent in the legal domain. In: Frasincar, F., Ittoo, A., Nguyen, L.M., Métais, E. (eds.) NLDB 2017. LNCS, vol. 10260, pp. 267–273. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59569-6_32
Katz, D.M., Dolin, R., Bommarito, M.J.: Legal Informatics. Cambridge University Press, Cambridge (2021)
Kaunert, C., Occhipinti, J.D., Léonard, S.: Introduction: supranational governance in the area of freedom, security and justice after the stockholm programme (2014)
Kim, M.-Y., Xu, Y., Goebel, R.: Legal question answering using ranking SVM and syntactic/semantic similarity. In: Murata, T., Mineshima, K., Bekki, D. (eds.) JSAI-isAI 2014. LNCS (LNAI), vol. 9067, pp. 244–258. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-48119-6_18
Kumar, S., Reddy, P.K., Reddy, V.B., Suri, M.: Finding similar legal judgements under common law system. In: Madaan, A., Kikuchi, S., Bhalla, S. (eds.) DNIS 2013. LNCS, vol. 7813, pp. 103–116. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37134-9_9
Levy, O., Goldberg, Y., Dagan, I.: Improving distributional similarity with lessons learned from word embeddings. Trans. Assoc. Comput. Linguist. 3, 211–225 (2015). https://doi.org/10.1162/tacl_a_00134
Mandal, A., Ghosh, K., Ghosh, S., Mandal, S.: Unsupervised approaches for measuring textual similarity between legal court case reports. Artif. Intell. Law 29(3), 417–451 (2021). https://doi.org/10.1007/s10506-020-09280-2
Meo, R., Sulis, E.: Processing affect in social media: a comparison of methods to distinguish emotions in tweets. ACM Trans. Internet Techn. 17(1), 7:1–7:25 (2017). https://doi.org/10.1145/2996187
Nanda, R., et al.: Unsupervised and supervised text similarity systems for automated identification of national implementing measures of European directives. Artif. Intell. Law 27(2), 199–225 (2018). https://doi.org/10.1007/s10506-018-9236-y
Nay, J.J.: Natural Language Processing for Legal Texts, pp. 99–113. Cambridge University Press, Cambridge (2021). https://doi.org/10.1017/9781316529683.011
Ontañón, S.: An overview of distance and similarity functions for structured data. Artif. Intell. Rev. 53(7), 5309–5351 (2020). https://doi.org/10.1007/s10462-020-09821-w
Renjit, S., Idicula, S.M.: CUSAT nlp@aila-fire2019: similarity in legal texts using document level embeddings. In: Mehta, P., Rosso, P., Majumder, P., Mitra, M. (eds.) Working Notes of FIRE 2019 - Forum for Information Retrieval Evaluation, Kolkata, India, 12–15 December 2019. CEUR Workshop Proceedings, vol. 2517, pp. 25–30. CEUR-WS.org (2019). http://ceur-ws.org/Vol-2517/T1-4.pdf
Robaldo, L., Villata, S., Wyner, A., Grabmair, M.: Introduction for artificial intelligence and law: special issue “natural language processing for legal texts” (2019). https://doi.org/10.1007/s10506-019-09251-2
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)
Satzger, H.: The harmonisation of criminal sanctions in the European union - a new approach. Eucrim (2019). https://doi.org/10.30709/eucrim-2019-007
Schroeder, W.: Limits to European harmonisation of criminal law. Eucrim (2020). https://doi.org/10.30709/eucrim-2020-008
Steunenberg, B., Rhinard, M.: The transposition of European law in EU member states: between process and politics. Eur. Polit. Sci. Rev. 2, 495–520 (2010). https://doi.org/10.1017/S1755773910000196
Sulis, E., Humphreys, L., Vernero, F., Amantea, I.A., Audrito, D., Di Caro, L.: Exploiting co-occurrence networks for classification of implicit inter-relationships in legal texts. Inf. Syst. 101821 (2021). https://doi.org/10.1016/j.is.2021.101821
Sulis, E., et al.: Exploring network analysis in a corpus-based approach to legal texts: a case study. In: Tagarelli, A., Zumpano, E., Latific, A.K., Calì, A. (eds.) Proceedings of the First International Workshop “CAiSE for Legal Documents” (COUrT 2020) Co-located with the 32nd International Conference on Advanced Information Systems Engineering (CAiSE 2020), Grenoble, France, 9 June 2020. CEUR Workshop Proceedings, vol. 2690, pp. 27–38. CEUR-WS.org (2020). http://ceur-ws.org/Vol-2690/COUrT-paper3.pdf
Sulis, E., Lai, M., Vinai, M., Sanguinetti, M.: Exploring sentiment in social media and official statistics: a general framework. In: Bosco, C., Cambria, E., Damiano, R., Patti, V., Rosso, P. (eds.) Proceedings of the 2nd International Workshop on Emotion and Sentiment in Social and Expressive Media: Opportunities and Challenges for Emotion-Aware Multiagent Systems Co-located with 14th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2015), Istanbul, Turkey, 5 May 2015. CEUR Workshop Proceedings, vol. 1351, pp. 96–105. CEUR-WS.org (2015). http://ceur-ws.org/Vol-1351/paper8.pdf
Van Rijsbergen, C.J., Robertson, S.E., Porter, M.F.: New models in probabilistic information retrieval, vol. 5587. British Library Research and Development Department London (1980)
Vogenauer, S., Weatherill, S.: The Harmonisation of European Contract Law: Implications for European Private Laws, Business and Legal Practice. Bloomsbury Publishing (2006). https://doi.org/10.1111/j.1468-0386.2007.00376_4.x
Wagh, R., Anand, D.: Application of citation network analysis for improved similarity index estimation of legal case documents: a study. In: 2017 IEEE International Conference on Current Trends in Advanced Computing (ICCTAC), pp. 1–5 (2017). https://doi.org/10.1109/ICCTAC.2017.8249996
Wagh, R.S., Anand, D.: Legal document similarity: a multi-criteria decision-making perspective. PeerJ Comput. Sci. 6, e262 (2020). https://doi.org/10.7717/peerj-cs.262
Wyner, A., Mochales-Palau, R., Moens, M.-F., Milward, D.: Approaches to text mining arguments from legal cases. In: Francesconi, E., Montemagni, S., Peters, W., Tiscornia, D. (eds.) Semantic Processing of Legal Texts. LNCS (LNAI), vol. 6036, pp. 60–79. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-12837-0_4
Zhong, H., Xiao, C., Tu, C., Zhang, T., Liu, Z., Sun, M.: How does NLP benefit legal system: a summary of legal artificial intelligence. In: Jurafsky, D., Chai, J., Schluter, N., Tetreault, J.R. (eds.) Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, 5–10 July 2020, pp. 5218–5230. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.acl-main.466
Acknowledgement
This work has been supported by the European Union’s Justice Programme (Grant Agreement No. 847346) for the project “Knowledge, Advisory and Capacity Building Information Tool for Criminal Procedural Rights in Judicial Cooperation”.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sulis, E., Humphreys, L.B., Audrito, D., Di Caro, L. (2022). Exploiting Textual Similarity Techniques in Harmonization of Laws. In: Bandini, S., Gasparini, F., Mascardi, V., Palmonari, M., Vizzari, G. (eds) AIxIA 2021 – Advances in Artificial Intelligence. AIxIA 2021. Lecture Notes in Computer Science(), vol 13196. Springer, Cham. https://doi.org/10.1007/978-3-031-08421-8_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-08421-8_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08420-1
Online ISBN: 978-3-031-08421-8
eBook Packages: Computer ScienceComputer Science (R0)