Legal Case Retrieval by Essential Element Extraction Based on Reading Comprehension Model

  • Conference paper
  • First Online:
Innovative Mobile and Internet Services in Ubiquitous Computing (IMIS 2024)

Abstract

In this study, we propose a legal document retrieval pipeline. Given a legal case, we construct a scenario retrieval process based on the Essential Elements for Prosecution (EEP) associated with each criminal charge. We employ a reading comprehension model to extract the essential scenario details that satisfy the requirements of an individual charge. We then extract keywords from these essential scenarios and use the keyword embeddings to compute the cosine similarity for each essential element, thereby identifying the most closely related judgment documents. This approach breaks the overall direction of a judgment into smaller components and finds similar judgments by matching details within the judgment documents. We use the crimes of forgery and breach of trust as preliminary case types. We also use ChatGPT to assess the similarity between two judicial documents and show that its similarity judgments closely align with those of legal experts. The experimental results demonstrate the effectiveness of our legal document retrieval pipeline.
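The element-wise matching step described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the authors' implementation: the element names (`act`, `intent`), the three-dimensional toy vectors standing in for keyword embeddings, the function names, and in particular the choice to average the per-element cosine similarities into a single case score, which the abstract does not specify.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def case_similarity(query, candidate):
    """Mean cosine similarity over the essential elements both cases share.

    Averaging is an assumed aggregation rule, chosen only for illustration.
    """
    shared = sorted(set(query) & set(candidate))
    if not shared:
        return 0.0
    total = sum(cosine_similarity(query[e], candidate[e]) for e in shared)
    return total / len(shared)


def rank_cases(query, candidates):
    """Return (case_id, score) pairs sorted from most to least similar."""
    scored = [(cid, case_similarity(query, elems)) for cid, elems in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy embeddings: one vector per essential element of each case.
query = {"act": [1.0, 0.0, 0.0], "intent": [0.0, 1.0, 0.0]}
candidates = {
    "case_A": {"act": [0.9, 0.1, 0.0], "intent": [0.1, 0.9, 0.0]},
    "case_B": {"act": [0.0, 0.0, 1.0], "intent": [0.0, 1.0, 0.0]},
}

ranking = rank_cases(query, candidates)
for case_id, score in ranking:
    print(f"{case_id}: {score:.3f}")
```

In this toy setup `case_A` ranks above `case_B` because both of its elements are close to the query, whereas `case_B` matches on only one element. In a real pipeline the vectors would come from an embedding model applied to the extracted keywords rather than being written by hand.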

Corresponding author

Correspondence to Fang-Yie Leu.

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Huang, CH., Wang, CH., Fan, YC., Leu, FY. (2024). Legal Case Retrieval by Essential Element Extraction Based on Reading Comprehension Model. In: Barolli, L. (eds) Innovative Mobile and Internet Services in Ubiquitous Computing. IMIS 2024. Lecture Notes on Data Engineering and Communications Technologies, vol 214. Springer, Cham. https://doi.org/10.1007/978-3-031-64766-6_36
