Abstract
Due to the imbalance between a large number of litigation cases and the number of judicial personnel, many legal documents to be processed greatly increase the burden of legal practitioners. So the intelligent processing of legal documents is especially important. At present, machine learning and deep learning have made great achievements in the intelligent processing of legal documents, including the elements extraction of legal documents, classification of legal documents, generation of legal documents, abstract extraction of legal documents etc. The main aim of this paper is to present a review of legal documents intelligent processing based on deep learning from legal documents representation, elements extraction of legal documents, classification of legal documents, automatic generation of legal documents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Buchanan, B.G., Headrick, T.E.: Some speculation about artificial intelligence and legal reasoning. Stanford Law Rev. 23(1), 40–62 (1970)
Deedman, C., Smith, J.C.: The nervous shock adviser: a legal expert system in case- based law. In: Operational Expert Systems Applications in Canada, pp. 56–71 (1991)
Bench-Capon, T.: Knowledge based systems applied to law: a framework for discussion. In: Knowledge Based Systems and Legal Applications, pp. 329–342 (1991)
Hafner, C.D., Berman, D.H.: The role of context in case-based legal reasoning: teleological, temporal, and procedural. In: Artificial Intelligence and Law, vol. 10, pp. 19–64 (2002)
Nay, J.J.: Gov2vec: learning distributed representations of institutions and their legal texts. arXiv:1609.06616 (2016)
Chalkidis, I., Kampas, D.: Deep learning in law: early adaptation and legal word embeddings trained on large corpora. Artif. Intel. Law 27(2), 171–198 (2018)
Xia, C., He, T., Li, W.: Similarity analysis of law documents based on Word2vec. In: 2019 IEEE 19th International Conference on Software Quality, Reliability and Security Companion (QRS-C), pp. 354–357 (2019)
Zhong, H., Zhang, Z., Liu, Z., et al.: Open Chinese language pre-trained model zoo. In: Technical report (2019)
Chalkidis, I., Fergadiotis, M., Malakasiotis, P., et al.: LEGAL-BERT: The muppets straight out of law school. arXiv:2010.02559 (2020)
Li, S., Zhang, H., Ye, L.: Evaluating the rationality of judicial decision with LSTM-based case modeling. In: 2018 IEEE Third International Conference on Data Science in Cyberspace (DSC), pp. 392–397 (2018)
Li, S., Zhang, H., Ye, L.: MANN: a multichannel attentive neural network for legal judgment prediction. IEEE Access 7, 151144–151155 (2019)
Zou, A.: Construction of knowledge graph based on law. In: University of Electronic Science and Technology of China (2019)
Xie, Y.: Research on named entity recognition for Chinese legal texts. In: Nanjing Normal University (2018)
Chalkidis, I., Androutsopoulos, I.: A deep learning approach to contract element extraction. JURIX 2017, 155–164 (2017)
Li, Q., Zhang, Q., Yao, J.: Event extraction for criminal legal text. In: 2020 IEEE International Conference on Knowledge Graph (ICKG), pp. 573–580 (2020)
Wang, L.: Research on Chinese named entity recognition for legal documents. Soochow University (2018)
Wang, L.: Study and implementation of extraction of legal elements in judicial documents. In: Southeast University (2018)
Jiang, Z.: The application of ensemble learning on named entity recognition for legal knowledgebase of properties involved in criminal cases. In: 2020 IEEE International Conference on Advances in Electrical Engineering and Computer Applications (AEECA), pp. 701–705 (2020)
Samarawickrama, C., Almeida, M.D., Silva, D.N.: Party identification of legal documents using co-reference resolution and named entity recognition. In: 2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS), pp. 494–499 (2020)
Almeida, M.D., Samarawickrama, C., Silva, N.D.: Legal party extraction from legal opinion text with sequence to sequence learning. In: 2020 20th International Conference on Advances in ICT for Emerging Regions (ICTer), pp. 143–148 (2020)
Mustajar, H., Ge, S.A., Haider, M., Irshad, S.M.: Noman: a quantum spatial graph convolutional network for text classification. Comput. Syst. Sci. Eng. 36(2), 369–382 (2021)
Wu, H.P., Liu, Y.L., Wang, J.W.: Review of text classification methods on deep learning. Comput. Mater. Continua 63(3), 1309–1321 (2020)
Clavié, B., Gheewala, A., Briton, P., et al.: LegaLMFiT: efficient short legal text classification with LSTM language model pre-training. arXiv:2109.00993 (2021)
Li, G., Wang, Z., Ma, Y.: Combining domain knowledge extraction with graph long short-term memory for learning classification of Chinese legal documents. IEEE Access 7, 139616–139627 (2017)
Suyan, W., Entong, S., Binyang, L.: TextCNN-based Text Classification for E-government. In: 2019 6th International Conference on Information Science and Control Engineering (ICISCE), pp. 929–934 (2019)
Wei, F., Qin, H., Ye, S.: Empirical study of deep learning for text classification in legal document review. In: 2018 IEEE International Conference on Big Data (Big Data), pp. 3317–3320 (2018)
Hammami, E., Faiz, R., Akermi, I.: A dynamic convolutional neural network approach for legal text classification. In: Saad, I., Rosenthal-Sabroux, C., Gargouri, F., Arduin, P.-E. (eds.) ICIKS 2021. LNBIP, vol. 425, pp. 71–84. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85977-0_6
Devlin, J., Chang, M.W., Lee, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 (2018)
Adhikari, A., Ram, A., Tang, R., et al.: Docbert: bert for document classification. arXiv:1904.08398 (2019)
Bambroo, P., Awasthi, A.: LegalDB: long distilbert for legal document classification. In: 2021 International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT), vol. 2021, pp. 1–4 (2021)
Qing N., Cong, Y., Dong, Z.: Research on small sample text classification based on attribute extraction and data augmentation. In: 2021 IEEE 6th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA), pp. 53–57. IEEE (2021)
Fang, Y., Tian, X., Wu, H., et al.: Few-shot learning for Chinese legal controversial issues classification. IEEE Access 8, 75022–75034 (2020)
Shaheen, Z., Wohlgenannt, G., Filtz, E.: Large scale legal text classification using transformer models. arXiv:2010.12871 (2020)
JRC-Acquis (2020). https://ec.europa.eu/jrc/en/language-technologies/jrc-acquis
EURLEX57K dataset (2020). http://nlp.cs.aueb.gr/softwareanddatasets/EURLEX57K/
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997)
Alschner, W., Skougarevskiy, D.: Towards an automated production of legal texts using recurrent neural networks. In: Proceedings of the 16th Edition of the International Conference on Articial Intelligence and Law, pp. 229–232 (2017)
Tran, V.K., Nguyen, L.M.: Natural language generation for spoken dialogue system using RNN encoder-decoder networks. arXiv:1706.00139 (2017)
Park, S.H., Kim, B.D., Kang, C.M.: Sequence-to-sequence prediction of vehicle trajectory via LSTM encoder-decoder architecture. In: 2018 IEEE Intelligent Vehicles Symposium (IV), pp. 1672–1678. IEEE (2018)
Ye, H., Jiang, X., Luo, Z., et al.: Interpretable charge predictions for criminal cases: learning to generate court views from fact descriptions. arXiv:1802.08504 (2018)
Peric, L., Mijic, S., Stammbach, D.: Legal language modeling with transformers. CEUR Workshop Proc. CEUR-WS 2020, 2764 (2020)
Dai, Z., Yang, Z., Yang, Y.: Transformer-xl: Attentive language models beyond a fixed-length context. arXiv:1901.02860 (2019)
Huang, W., Liao, X., Xie, Z.: Generating reasonable legal text through the combination of language modeling and question answering. In: Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence, vol. 2021, pp. 3687–3693 (2021)
Radford, A., Narasimhan, K., Salimans, T., et al.: Improving language understanding by generative pre-training (2018)
Dong, L., Yang, N., Wang, W., et al.: Unified language model pre-training for natural language understanding and generation. arXiv:1905.03197 (2019)
Goodfellow, I., Pouget-Abadie, J., Mirza, M., et al.: Generative adversarial nets. In: Advances in Neural Information Processing Systems (2014)
Wang, C.L., Liu, Y.L., Tong, Y.J., Wang, J.W.: GAN-GLS: generative lyric steganography based on generative adversarial network. Comput. Mater. Continua 69(1), 1375–1390 (2021)
Cheng, J., Yang, Y., Tang, X., Xiong, N., Zhang, Y., Lei, F.: Generative adversarial networks: a literature review. KSII Trans. Internet Inf. Syst. 14(12), 4625–4647 (2020)
Zhang, Y., Gan, Z., Fan, K.: Adversarial feature matching for text generation. In: International Conference on Machine Learning, pp. 4006–4015. PMLR (2017)
Guo, J., Lu, S., Cai, H.: Long text generation via adversarial training with leaked information. In: Proceedings of the AAAI Conference on Artificial Intelligence (2018)
Katamesh, O., Abu-Elnasr, S.: Elmougy: deep learning multimodal for unstructured and semi-structured textual documents classification. Comput. Mater. Continua 68, 589–606 (2021)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhao, G., Liu, Y., Erdun, E. (2022). Review on Intelligent Processing Technologies of Legal Documents. In: Sun, X., Zhang, X., Xia, Z., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2022. Lecture Notes in Computer Science, vol 13338. Springer, Cham. https://doi.org/10.1007/978-3-031-06794-5_55
Download citation
DOI: https://doi.org/10.1007/978-3-031-06794-5_55
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06793-8
Online ISBN: 978-3-031-06794-5
eBook Packages: Computer ScienceComputer Science (R0)