Combining Natural Language Processing Approaches for Rule Extraction from Legal Documents

Dragoni, Mauro; Villata, Serena; Rizzi, Williams; Governatori, Guido

doi:10.1007/978-3-030-00178-0_19

Combining Natural Language Processing Approaches for Rule Extraction from Legal Documents

Mauro Dragoni¹⁸,
Serena Villata¹⁹,
Williams Rizzi²⁰ &
…
Guido Governatori²¹

Conference paper
First Online: 23 October 2018

1709 Accesses
12 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10791))

Abstract

Legal texts express conditions in natural language describing what is permitted, forbidden or mandatory in the context they regulate. Despite the numerous approaches tackling the problem of moving from a natural language legal text to the respective set of machine-readable conditions, results are still unsatisfiable and it remains a major open challenge. In this paper, we propose a preliminary approach which combines different Natural Language Processing techniques towards the extraction of rules from legal documents. More precisely, we combine the linguistic information provided by WordNet together with a syntax-based extraction of rules from legal texts, and a logic-based extraction of dependencies between chunks of such texts. Such a combined approach leads to a powerful solution towards the extraction of machine-readable rules from legal documents. We evaluate the proposed approach over the Australian “Telecommunications consumer protections code”.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
https://wordnet.princeton.edu/.
2.
Note that these ontologies are explicitly called lightweight ontologies as they are not expected to be used to normalize the concepts of legal text by mapping the legal terms into concepts in ontology, and obtain the meaning of the text by using the ontology structure. They uniquely provide a support for detecting the deontic components in legal texts and the structure of such texts, respectively.
3.
http://nlp.stanford.edu/software/lex-parser.shtml.
4.
For more details about the meaning of each tag and dependency clauses used by the parser, please refer to the official Stanford documentation: http://nlp.stanford.edu/software/dependencies_manual.pdf.

References

van Engers, T., van Gog, R., Sayah, K.: A case study on automated norm extraction. In: Proceedings of JURIX, pp. 49–58 (2004)
Google Scholar
Wyner, A., Peters, W.: On rule extraction from regulations. In: JURIX 2011, pp. 113–122 (2011)
Google Scholar
Curran, J.R., Clark, S., Bos, J.: Linguistically motivated large-scale NLP with c&c and boxer. In: Carroll, J.A., van den Bosch, A., Zaenen, A. (eds.) ACL 2007, Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, 23–30 June 2007, Prague, Czech Republic. The Association for Computational Linguistics (2007)
Google Scholar
Soria, C., Bartolini, R., Lenci, A., Montemagni, S., Pirrelli, V.: Automatic extraction of semantics in law documents. European Press Academic Publishing (2005)
Google Scholar
Biagioli, C., Francesconi, E., Passerini, A., Montemagni, S., Soria, C.: Automatic semantics extraction in law documents. In: ICAIL 2015, pp. 133–140 (2005)
Google Scholar
de Araujo, D.A., Rigo, S., Muller, C., de Oliveira Chishman, R.L.: Automatic information extraction from texts with inference and linguistic knowledge acquisition rules. In: Web Intelligence/IAT Workshops, pp. 151–154 (2013)
Google Scholar
Kiyavitskaya, N., et al.: Automating the extraction of rights and obligations for regulatory compliance. In: Li, Q., Spaccapietra, S., Yu, E., Olivé, A. (eds.) ER 2008. LNCS, vol. 5231, pp. 154–168. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87877-3_13
Chapter Google Scholar
de Maat, E., Winkels, R.: Suggesting model fragments for sentences in Dutch laws. In: Proceedings of LOAIT, pp. 19–28 (2010)
Google Scholar
Brighi, R., Palmirani, M.: Legal text analysis of the modification provisions: a pattern oriented approach. In: ICAIL 2009, pp. 238–239 (2009)
Google Scholar
Francesconi, E.: Legal rules learning based on a semantic model for legislation. In: Proceedings of SPLeT Workshop (2010)
Google Scholar
Boella, G., Di Caro, L., Robaldo, L.: Semantic relation extraction from legislative text using generalized syntactic dependencies and support vector machines. In: Morgenstern, L., Stefaneas, P., Lévy, F., Wyner, A., Paschke, A. (eds.) RuleML 2013. LNCS, vol. 8035, pp. 218–225. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39617-5_20
Chapter Google Scholar
Dragoni, M., Governatori, G., Villata, S.: Automated rules generation from natural language legal texts. In: ICAIL 2015 Workshop on Automated Detection, Extraction and Analysis of Semantic Information in Legal Texts (2015)
Google Scholar
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Fondazione Bruno Kessler, Trento, Italy
Mauro Dragoni
CNRS, I3S Laboratory, Paris, France
Serena Villata
Universitá degli Studi di Trento, Trento, Italy
Williams Rizzi
NICTA Queensland, Brisbane, Australia
Guido Governatori

Authors

Mauro Dragoni
View author publications
You can also search for this author in PubMed Google Scholar
Serena Villata
View author publications
You can also search for this author in PubMed Google Scholar
Williams Rizzi
View author publications
You can also search for this author in PubMed Google Scholar
Guido Governatori
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mauro Dragoni .

Editor information

Editors and Affiliations

University of Turin, Turin, Italy
Ugo Pagallo
University of Bologna, Bologna, Italy
Monica Palmirani
La Trobe University, Melbourne, VIC, Australia
Pompeu Casanovas
University of Bologna, Bologna, Italy
Giovanni Sartor
Inria - Sophia Antipolis-Méditerranée, Sophia Antipolis, France
Serena Villata

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dragoni, M., Villata, S., Rizzi, W., Governatori, G. (2018). Combining Natural Language Processing Approaches for Rule Extraction from Legal Documents. In: Pagallo, U., Palmirani, M., Casanovas, P., Sartor, G., Villata, S. (eds) AI Approaches to the Complexity of Legal Systems. AICOL AICOL AICOL AICOL AICOL 2015 2016 2016 2017 2017. Lecture Notes in Computer Science(), vol 10791. Springer, Cham. https://doi.org/10.1007/978-3-030-00178-0_19

Download citation

DOI: https://doi.org/10.1007/978-3-030-00178-0_19
Published: 23 October 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00177-3
Online ISBN: 978-3-030-00178-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics