research-article

Extraction and Classification of Statute Facets using Few-shot Learning

Authors:

Girish K. Palshikar,

Ramandeep Singh,

Dhirendra SinghAuthors Info & Claims

ICAIL '23: Proceedings of the Nineteenth International Conference on Artificial Intelligence and Law

Pages 197 - 206

https://doi.org/10.1145/3594536.3595134

Published: 07 September 2023 Publication History

Abstract

In this paper, we focus on automatic extraction of statute facets from legal statutes such as Act documents. We define statute facets to be key specific aspects of a statute which can potentially be used in legal arguments. For example, Section 25F of the Industrial Disputes Act (India) contains statute facets such as workman, employer, retrenchment of workmen, continuous service for not less than one year, etc. Such statute facets are often used by lawyers as part of their argumentation and also by judges for deciding on a case. In this paper, we propose a weakly supervised technique for extracting such statute facets from legal text. We use dependency tree structure to extract candidate statute facets and use BM25 ranking function to determine statute-specificity of these candidates. We propose a set of facet types which enable us to realize the definition of statute facets in a more computational way. We use recent deep learning models in a few-shot setting to predict an appropriate facet type for each candidate. Only those candidates with high statute-specificity and for which a facet type is predicted with high confidence, are selected as acceptable statute facets. We evaluate the extracted statute facets through both direct and indirect evaluation as well as conduct a user-study to get validation and feedback from lawyers.

References

[1]

Vincent AWMM Aleven. 1997. Teaching case-based argumentation through a model and examples. Citeseer.

[2]

Paheli Bhattacharya, Shounak Paul, Kripabandhu Ghosh, Saptarshi Ghosh, and Adam Wyner. 2019. Identification of rhetorical roles of sentences in indian legal judgments. In Legal Knowledge and Information Systems: JURIX 2019: The Thirty-second Annual Conference, Vol. 322. IOS Press, 3.

[3]

Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Nikolaos Aletras, and Ion Androutsopoulos. 2020. LEGAL-BERT: The Muppets straight out of Law School. In Findings of the Association for Computational Linguistics: EMNLP 2020. Association for Computational Linguistics, Online, 2898--2904.

[4]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171--4186. https://aclanthology.org/N19-1423

[5]

Mohammad Hassan Falakmasir and Kevin D Ashley. 2017. Utilizing Vector Space Models for Identifying Legal Factors from Text. In JURIX. 183--192.

[6]

Ariel Gera, Alon Halfon, Eyal Shnarch, Yotam Perlitz, Liat Ein-Dor, and Noam Slonim. 2022. Zero-Shot Text Classification with Self-Training. In Conference on Empirical Methods in Natural Language Processing.

[7]

Matthew Honnibal, Ines Montani, Sofie Van Landeghem, Adriane Boyd, et al. 2020. spaCy: Industrial-strength natural language processing in python. (2020). https://spacy.io/

[8]

John F Horty and Trevor JM Bench-Capon. 2012. A factor-based definition of precedential constraint. Artificial intelligence and Law 20, 2 (2012), 181--214.

[9]

Andrew JI Jones and Marek Sergot. 1992. Deontic logic in the representation of law: Towards a methodology. Artificial Intelligence and Law 1 (1992), 45--64.

Digital Library

[10]

Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, and Ashutosh Modi. 2022. Corpus for Automatic Structuring of Legal Documents. In Proceedings of the Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, 4420--4429. https://aclanthology.org/2022.lrec-1.470

[11]

Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, and Luke Zettlemoyer. 2020. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 7871--7880. https://aclanthology.org/2020.acl-main.703

[12]

Jack Mumford, Katie Atkinson, and Trevor Bench-Capon. 2021. Explaining Factor Ascription. In Legal Knowledge and Information Systems. IOS Press, 191--196.

[13]

Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 3982--3992.

[14]

M Saravanan and Balaraman Ravindran. 2010. Identification of rhetorical roles for segmentation and summarization of a legal judgment. Artificial Intelligence and Law 18, 1 (2010), 45--76.

Digital Library

[15]

Hinrich Schütze, Christopher D Manning, and Prabhakar Raghavan. 2008. Introduction to information retrieval. Vol. 39. Cambridge University Press Cambridge.

[16]

Andrew Trotman, Antti Puurula, and Blake Burgess. 2014. Improvements to BM25 and language models examined. In Proceedings of the 2014 Australasian Document Computing Symposium. 58--65.

Digital Library

[17]

Michael van der Veen and Natalia Sidorova. 2021. Signal Phrase Extraction: A Gateway to Information Retrieval Improvement in Law Texts. In Legal Knowledge and Information Systems. IOS Press, 127--130.

[18]

Hannes Westermann, Vern R Walker, Kevin D Ashley, and Karim Benyekhlef. 2019. Using factors to predict and analyze landlord-tenant decisions to increase access to justice. In Proceedings of the Seventeenth International Conference on Artificial Intelligence and Law. 133--142.

Digital Library

[19]

Adam Wyner and Wim Peters. 2010. Towards annotating and extracting textual legal case factors. In Proceedings of the Language Resources and Evaluation Conference Workshop on Semantic Processing of Legal Texts, Malta.

[20]

Wenpeng Yin, Jamaal Hay, and Dan Roth. 2019. Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 3914--3923.

Index Terms

Extraction and Classification of Statute Facets using Few-shot Learning
1. Applied computing
  1. Law, social and behavioral sciences
    1. Law
2. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Recommendations

Unsupervised Legal Concept Extraction from Indian Case Documents using Statutes
FIRE '20: Proceedings of the 12th Annual Meeting of the Forum for Information Retrieval Evaluation

Finding legal concepts pertaining to court case judgement documents, is an important task in the field of legal data mining. These concepts are also popularly termed as catchwords/keywords. Existing methods for the task lack the ability to extract legal ...
Lifting facets of the cut polytope

The cut polytope P"c(G) of a graph G is the convex hull of the incidence vectors of the edge sets of all cuts of G. We give a sufficient condition for an inequality defining a facet of P"c(G) to define a facet of the cut polytope of a graph containing G ...
Clique-Web Facets for Multicut Polytopes

Let G = V , E be a graph. An edge set { uv ∈ E | u ∈ S _i , v ∈ S _j , i â j }, where S ₁, ', S _k is a partition of V , is called a multicut with k shores. We investigate the polytopes MC _k ^â n and MC _k ^â n that are defined ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICAIL '23: Proceedings of the Nineteenth International Conference on Artificial Intelligence and Law

June 2023

499 pages

ISBN:9798400701979

DOI:10.1145/3594536

Conference Chair:
Francisco Andrade
University of Minho, Portugal
,
Program Chair:
Matthias Grabmair
Technical University of Munich, Germany

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

IAAIL: Intl Asso for Artifical Intel & Law

In-Cooperation

UMinho: University of Minho
SIGAI: ACM Special Interest Group on Artificial Intelligence
AAAI: Am Assoc for Artifical Intelligence

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 September 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

ICAIL 2023

Sponsor:

IAAIL

ICAIL 2023: Nineteenth International Conference on Artificial Intelligence and Law

June 19 - 23, 2023

Braga, Portugal

Acceptance Rates

Overall Acceptance Rate 69 of 169 submissions, 41%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
106
Total Downloads

Downloads (Last 12 months)72
Downloads (Last 6 weeks)37

Reflects downloads up to 08 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten