Skip to main content

Exploiting Textual Similarity Techniques in Harmonization of Laws

  • Conference paper
  • First Online:
AIxIA 2021 – Advances in Artificial Intelligence (AIxIA 2021)

Abstract

This paper describes an application of textual similarity techniques in the Legal Informatics domain. In European law, a relevant interest relates to the transposition of EU directives by the Member States, which can be complete, partial, or eventually absent. As part of an European project, legal experts annotated transpositions of six directives on a per-article basis. Following an established NLP pipeline, we explore a similarity-based technique to identify correspondences between transpositions of national implementations. Early results are promising and show the role that Artificial Intelligence may play within the process of harmonization and standardization of domestic legal systems as a result of the adoption of EU legislation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.crossjustice.eu.

  2. 2.

    https://www.crossjustice.eu/en/index.html#crossjustice-platform.

  3. 3.

    From the scikit-learn python library sklearn.metrics.pairwise we adopted cosine_similarity method.

References

  1. Amado, A., Cortez, P., Rita, P., Moro, S.: Research trends on big data in marketing: a text mining and topic modeling based literature analysis. Eur. Res. Manag. Bus. Econ. 24(1), 1–7 (2018)

    Article  Google Scholar 

  2. Amantea, I.A., Caro, L.D., Humphreys, L., Nanda, R., Sulis, E.: Modelling norm types and their inter-relationships in EU directives. In: Ashley, K.D., et al. (eds.) Proceedings of the Third Workshop on Automated Semantic Analysis of Information in Legal Texts co-located with the 17th International Conference on Artificial Intelligence and Law (ICAIL 2019), Montreal, QC, Canada, 21 June 2019. CEUR Workshop Proceedings, vol. 2385. CEUR-WS.org (2019). http://ceur-ws.org/Vol-2385/paper8.pdf

  3. Amantea, I.A., Robaldo, L., Sulis, E., Boella, G., Governatori, G.: Semi-automated checking for regulatory compliance in e-health. In: 25th International Enterprise Distributed Object Computing Workshop, EDOC Workshop 2021, Gold Coast, Australia, 25–29 October 2021, pp. 318–325. IEEE (2021). https://doi.org/10.1109/EDOCW52865.2021.00063

  4. Andenas, M., Andersen, C.B.: Theory and Practice of Harmonisation. Edward Elgar Publishing (2012)

    Google Scholar 

  5. Ashley, K.D.: Artificial Intelligence and Legal Analytics: New Tools for Law Practice in the Digital Age. Cambridge University Press, Cambridge (2017)

    Book  Google Scholar 

  6. Bhattacharya, P., Ghosh, K., Pal, A., Ghosh, S.: Methods for computing legal document similarity: a comparative study. CoRR abs/2004.12307 (2020). https://arxiv.org/abs/2004.12307

  7. Biasiotti, M., Francesconi, E., Palmirani, M., Sartor, G., Vitali, F.: Legal informatics and management of legislative documents. Global Center for ICT in Parliament Working Paper 2 (2008)

    Google Scholar 

  8. Boella, G., Di Caro, L., Humphreys, L., Robaldo, L., Rossi, P., van der Torre, L.: Eunomos, a legal document and knowledge management system for the web to provide relevant, reliable and up-to-date information on the law. Artif. Intell. Law 24(3), 245–283 (2016)

    Article  Google Scholar 

  9. Boella, G., Di Caro, L., Leone, V.: Semi-automatic knowledge population in a legal document management system. Artif. Intell. Law 27(2), 227–251 (2018). https://doi.org/10.1007/s10506-018-9239-8

    Article  Google Scholar 

  10. Boella, G., Di Caro, L., Rispoli, D., Robaldo, L.: A system for classifying multi-label text into EuroVoc. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Law, pp. 239–240 (2013)

    Google Scholar 

  11. Cox, M.A., Cox, T.F.: Multidimensional scaling. In: Chen, C., Härdle, W., Unwin, A. (eds.) Handbook of Data Visualization, pp. 315–347. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-33037-0_14

    Chapter  Google Scholar 

  12. Dimitrakopoulos, D.G.: The transposition of EU law:‘post-decisional politics’ and institutional autonomy. Eur. Law J. 7(4), 442–458 (2001)

    Article  Google Scholar 

  13. Durante, M.: Computational Power: The Impact of ICT on Law, Society and Knowledge, Routledge (2021)

    Google Scholar 

  14. Elekes, Á., Schäler, M., Böhm, K.: On the various semantics of similarity in word embedding models. In: 2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017, Toronto, ON, Canada, 19–23 June 2017, pp. 139–148. IEEE Computer Society (2017). https://doi.org/10.1109/JCDL.2017.7991568

  15. Feldman, R., Sanger, J.: The Text Mining Handbook - Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press, Cambridge (2007)

    Google Scholar 

  16. Ferreira-Mello, R., André, M., Pinheiro, A., Costa, E., Romero, C.: Text mining in education. Wiley Interdisc. Rev.: Data Min. Knowl. Discov. 9(6), e1332 (2019)

    Google Scholar 

  17. Friedrich, R., Luzzatto, M., Ash, E.: Entropy in legal language. In: Aletras, N., Androutsopoulos, I., Barrett, L., Meyers, A., Preotiuc-Pietro, D. (eds.) Proceedings of the Natural Legal Language Processing Workshop 2020 co-located with the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD 2020), Virtual Workshop, 24 August 2020. CEUR Workshop Proceedings, vol. 2645, pp. 25–30. CEUR-WS.org (2020). http://ceur-ws.org/Vol-2645/paper4.pdf

  18. Haverland, M., Steunenberg, B., Van Waarden, F.: Sectors at different speeds: analysing transposition deficits in the European union. JCMS: J. Common Mark. Stud. 49(2), 265–291 (2011)

    Google Scholar 

  19. Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177 (2004)

    Google Scholar 

  20. Humphreys, L., Santos, C., Di Caro, L., Boella, G., Van Der Torre, L., Robaldo, L.: Mapping recitals to normative provisions in EU legislation to assist legal interpretation. In: JURIX, pp. 41–49 (2015)

    Google Scholar 

  21. John, A.K., Di Caro, L., Robaldo, L., Boella, G.: Legalbot: a deep learning-based conversational agent in the legal domain. In: Frasincar, F., Ittoo, A., Nguyen, L.M., Métais, E. (eds.) NLDB 2017. LNCS, vol. 10260, pp. 267–273. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59569-6_32

    Chapter  Google Scholar 

  22. Katz, D.M., Dolin, R., Bommarito, M.J.: Legal Informatics. Cambridge University Press, Cambridge (2021)

    Book  Google Scholar 

  23. Kaunert, C., Occhipinti, J.D., Léonard, S.: Introduction: supranational governance in the area of freedom, security and justice after the stockholm programme (2014)

    Google Scholar 

  24. Kim, M.-Y., Xu, Y., Goebel, R.: Legal question answering using ranking SVM and syntactic/semantic similarity. In: Murata, T., Mineshima, K., Bekki, D. (eds.) JSAI-isAI 2014. LNCS (LNAI), vol. 9067, pp. 244–258. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-48119-6_18

    Chapter  Google Scholar 

  25. Kumar, S., Reddy, P.K., Reddy, V.B., Suri, M.: Finding similar legal judgements under common law system. In: Madaan, A., Kikuchi, S., Bhalla, S. (eds.) DNIS 2013. LNCS, vol. 7813, pp. 103–116. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37134-9_9

    Chapter  Google Scholar 

  26. Levy, O., Goldberg, Y., Dagan, I.: Improving distributional similarity with lessons learned from word embeddings. Trans. Assoc. Comput. Linguist. 3, 211–225 (2015). https://doi.org/10.1162/tacl_a_00134

    Article  Google Scholar 

  27. Mandal, A., Ghosh, K., Ghosh, S., Mandal, S.: Unsupervised approaches for measuring textual similarity between legal court case reports. Artif. Intell. Law 29(3), 417–451 (2021). https://doi.org/10.1007/s10506-020-09280-2

    Article  Google Scholar 

  28. Meo, R., Sulis, E.: Processing affect in social media: a comparison of methods to distinguish emotions in tweets. ACM Trans. Internet Techn. 17(1), 7:1–7:25 (2017). https://doi.org/10.1145/2996187

  29. Nanda, R., et al.: Unsupervised and supervised text similarity systems for automated identification of national implementing measures of European directives. Artif. Intell. Law 27(2), 199–225 (2018). https://doi.org/10.1007/s10506-018-9236-y

    Article  Google Scholar 

  30. Nay, J.J.: Natural Language Processing for Legal Texts, pp. 99–113. Cambridge University Press, Cambridge (2021). https://doi.org/10.1017/9781316529683.011

    Book  Google Scholar 

  31. Ontañón, S.: An overview of distance and similarity functions for structured data. Artif. Intell. Rev. 53(7), 5309–5351 (2020). https://doi.org/10.1007/s10462-020-09821-w

    Article  Google Scholar 

  32. Renjit, S., Idicula, S.M.: CUSAT nlp@aila-fire2019: similarity in legal texts using document level embeddings. In: Mehta, P., Rosso, P., Majumder, P., Mitra, M. (eds.) Working Notes of FIRE 2019 - Forum for Information Retrieval Evaluation, Kolkata, India, 12–15 December 2019. CEUR Workshop Proceedings, vol. 2517, pp. 25–30. CEUR-WS.org (2019). http://ceur-ws.org/Vol-2517/T1-4.pdf

  33. Robaldo, L., Villata, S., Wyner, A., Grabmair, M.: Introduction for artificial intelligence and law: special issue “natural language processing for legal texts” (2019). https://doi.org/10.1007/s10506-019-09251-2

  34. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)

    Article  Google Scholar 

  35. Satzger, H.: The harmonisation of criminal sanctions in the European union - a new approach. Eucrim (2019). https://doi.org/10.30709/eucrim-2019-007

  36. Schroeder, W.: Limits to European harmonisation of criminal law. Eucrim (2020). https://doi.org/10.30709/eucrim-2020-008

  37. Steunenberg, B., Rhinard, M.: The transposition of European law in EU member states: between process and politics. Eur. Polit. Sci. Rev. 2, 495–520 (2010). https://doi.org/10.1017/S1755773910000196

    Article  Google Scholar 

  38. Sulis, E., Humphreys, L., Vernero, F., Amantea, I.A., Audrito, D., Di Caro, L.: Exploiting co-occurrence networks for classification of implicit inter-relationships in legal texts. Inf. Syst. 101821 (2021). https://doi.org/10.1016/j.is.2021.101821

  39. Sulis, E., et al.: Exploring network analysis in a corpus-based approach to legal texts: a case study. In: Tagarelli, A., Zumpano, E., Latific, A.K., Calì, A. (eds.) Proceedings of the First International Workshop “CAiSE for Legal Documents” (COUrT 2020) Co-located with the 32nd International Conference on Advanced Information Systems Engineering (CAiSE 2020), Grenoble, France, 9 June 2020. CEUR Workshop Proceedings, vol. 2690, pp. 27–38. CEUR-WS.org (2020). http://ceur-ws.org/Vol-2690/COUrT-paper3.pdf

  40. Sulis, E., Lai, M., Vinai, M., Sanguinetti, M.: Exploring sentiment in social media and official statistics: a general framework. In: Bosco, C., Cambria, E., Damiano, R., Patti, V., Rosso, P. (eds.) Proceedings of the 2nd International Workshop on Emotion and Sentiment in Social and Expressive Media: Opportunities and Challenges for Emotion-Aware Multiagent Systems Co-located with 14th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2015), Istanbul, Turkey, 5 May 2015. CEUR Workshop Proceedings, vol. 1351, pp. 96–105. CEUR-WS.org (2015). http://ceur-ws.org/Vol-1351/paper8.pdf

  41. Van Rijsbergen, C.J., Robertson, S.E., Porter, M.F.: New models in probabilistic information retrieval, vol. 5587. British Library Research and Development Department London (1980)

    Google Scholar 

  42. Vogenauer, S., Weatherill, S.: The Harmonisation of European Contract Law: Implications for European Private Laws, Business and Legal Practice. Bloomsbury Publishing (2006). https://doi.org/10.1111/j.1468-0386.2007.00376_4.x

  43. Wagh, R., Anand, D.: Application of citation network analysis for improved similarity index estimation of legal case documents: a study. In: 2017 IEEE International Conference on Current Trends in Advanced Computing (ICCTAC), pp. 1–5 (2017). https://doi.org/10.1109/ICCTAC.2017.8249996

  44. Wagh, R.S., Anand, D.: Legal document similarity: a multi-criteria decision-making perspective. PeerJ Comput. Sci. 6, e262 (2020). https://doi.org/10.7717/peerj-cs.262

    Article  Google Scholar 

  45. Wyner, A., Mochales-Palau, R., Moens, M.-F., Milward, D.: Approaches to text mining arguments from legal cases. In: Francesconi, E., Montemagni, S., Peters, W., Tiscornia, D. (eds.) Semantic Processing of Legal Texts. LNCS (LNAI), vol. 6036, pp. 60–79. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-12837-0_4

    Chapter  Google Scholar 

  46. Zhong, H., Xiao, C., Tu, C., Zhang, T., Liu, Z., Sun, M.: How does NLP benefit legal system: a summary of legal artificial intelligence. In: Jurafsky, D., Chai, J., Schluter, N., Tetreault, J.R. (eds.) Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, 5–10 July 2020, pp. 5218–5230. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.acl-main.466

Download references

Acknowledgement

This work has been supported by the European Union’s Justice Programme (Grant Agreement No. 847346) for the project “Knowledge, Advisory and Capacity Building Information Tool for Criminal Procedural Rights in Judicial Cooperation”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Emilio Sulis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sulis, E., Humphreys, L.B., Audrito, D., Di Caro, L. (2022). Exploiting Textual Similarity Techniques in Harmonization of Laws. In: Bandini, S., Gasparini, F., Mascardi, V., Palmonari, M., Vizzari, G. (eds) AIxIA 2021 – Advances in Artificial Intelligence. AIxIA 2021. Lecture Notes in Computer Science(), vol 13196. Springer, Cham. https://doi.org/10.1007/978-3-031-08421-8_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-08421-8_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-08420-1

  • Online ISBN: 978-3-031-08421-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics