Skip to main content

Combining Natural Language Processing Approaches for Rule Extraction from Legal Documents

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10791))

Abstract

Legal texts express conditions in natural language describing what is permitted, forbidden or mandatory in the context they regulate. Despite the numerous approaches tackling the problem of moving from a natural language legal text to the respective set of machine-readable conditions, results are still unsatisfiable and it remains a major open challenge. In this paper, we propose a preliminary approach which combines different Natural Language Processing techniques towards the extraction of rules from legal documents. More precisely, we combine the linguistic information provided by WordNet together with a syntax-based extraction of rules from legal texts, and a logic-based extraction of dependencies between chunks of such texts. Such a combined approach leads to a powerful solution towards the extraction of machine-readable rules from legal documents. We evaluate the proposed approach over the Australian “Telecommunications consumer protections code”.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://wordnet.princeton.edu/.

  2. 2.

    Note that these ontologies are explicitly called lightweight ontologies as they are not expected to be used to normalize the concepts of legal text by mapping the legal terms into concepts in ontology, and obtain the meaning of the text by using the ontology structure. They uniquely provide a support for detecting the deontic components in legal texts and the structure of such texts, respectively.

  3. 3.

    http://nlp.stanford.edu/software/lex-parser.shtml.

  4. 4.

    For more details about the meaning of each tag and dependency clauses used by the parser, please refer to the official Stanford documentation: http://nlp.stanford.edu/software/dependencies_manual.pdf.

References

  1. van Engers, T., van Gog, R., Sayah, K.: A case study on automated norm extraction. In: Proceedings of JURIX, pp. 49–58 (2004)

    Google Scholar 

  2. Wyner, A., Peters, W.: On rule extraction from regulations. In: JURIX 2011, pp. 113–122 (2011)

    Google Scholar 

  3. Curran, J.R., Clark, S., Bos, J.: Linguistically motivated large-scale NLP with c&c and boxer. In: Carroll, J.A., van den Bosch, A., Zaenen, A. (eds.) ACL 2007, Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, 23–30 June 2007, Prague, Czech Republic. The Association for Computational Linguistics (2007)

    Google Scholar 

  4. Soria, C., Bartolini, R., Lenci, A., Montemagni, S., Pirrelli, V.: Automatic extraction of semantics in law documents. European Press Academic Publishing (2005)

    Google Scholar 

  5. Biagioli, C., Francesconi, E., Passerini, A., Montemagni, S., Soria, C.: Automatic semantics extraction in law documents. In: ICAIL 2015, pp. 133–140 (2005)

    Google Scholar 

  6. de Araujo, D.A., Rigo, S., Muller, C., de Oliveira Chishman, R.L.: Automatic information extraction from texts with inference and linguistic knowledge acquisition rules. In: Web Intelligence/IAT Workshops, pp. 151–154 (2013)

    Google Scholar 

  7. Kiyavitskaya, N., et al.: Automating the extraction of rights and obligations for regulatory compliance. In: Li, Q., Spaccapietra, S., Yu, E., Olivé, A. (eds.) ER 2008. LNCS, vol. 5231, pp. 154–168. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87877-3_13

    Chapter  Google Scholar 

  8. de Maat, E., Winkels, R.: Suggesting model fragments for sentences in Dutch laws. In: Proceedings of LOAIT, pp. 19–28 (2010)

    Google Scholar 

  9. Brighi, R., Palmirani, M.: Legal text analysis of the modification provisions: a pattern oriented approach. In: ICAIL 2009, pp. 238–239 (2009)

    Google Scholar 

  10. Francesconi, E.: Legal rules learning based on a semantic model for legislation. In: Proceedings of SPLeT Workshop (2010)

    Google Scholar 

  11. Boella, G., Di Caro, L., Robaldo, L.: Semantic relation extraction from legislative text using generalized syntactic dependencies and support vector machines. In: Morgenstern, L., Stefaneas, P., Lévy, F., Wyner, A., Paschke, A. (eds.) RuleML 2013. LNCS, vol. 8035, pp. 218–225. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39617-5_20

    Chapter  Google Scholar 

  12. Dragoni, M., Governatori, G., Villata, S.: Automated rules generation from natural language legal texts. In: ICAIL 2015 Workshop on Automated Detection, Extraction and Analysis of Semantic Information in Legal Texts (2015)

    Google Scholar 

  13. Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mauro Dragoni .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dragoni, M., Villata, S., Rizzi, W., Governatori, G. (2018). Combining Natural Language Processing Approaches for Rule Extraction from Legal Documents. In: Pagallo, U., Palmirani, M., Casanovas, P., Sartor, G., Villata, S. (eds) AI Approaches to the Complexity of Legal Systems. AICOL AICOL AICOL AICOL AICOL 2015 2016 2016 2017 2017. Lecture Notes in Computer Science(), vol 10791. Springer, Cham. https://doi.org/10.1007/978-3-030-00178-0_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00178-0_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00177-3

  • Online ISBN: 978-3-030-00178-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics