Deep Learning for French Legal Data Categorization

Hammami, Eya; Akermi, Imen; Faiz, Rim; Boughanem, Mohand

doi:10.1007/978-3-030-32065-2_7

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 11815)

Included in the following conference series:

International Conference on Model and Data Engineering

Abstract

In recent years, deep learning has shown promising results in natural language processing (NLP). Neural networks (NNs) such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have been used for NLP tasks such as information retrieval, sentiment analysis, and document classification. In this paper, we explore the use of NN-based methods for legal text classification. In our case, the results show that NN models with a fixed input length outperform baseline methods.
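
The abstract does not say how the fixed input length is obtained. As an illustration only, one common way to give a network fixed-length inputs is to truncate long token sequences and pad short ones. The sketch below assumes whitespace tokenization, a hypothetical `PAD` token, and arbitrary `max_len` values; none of these come from the paper, and the function name `to_fixed_length` is invented for this example.

```python
from typing import List

PAD = "<pad>"  # hypothetical padding token, not taken from the paper


def to_fixed_length(text: str, max_len: int, pad: str = PAD) -> List[str]:
    """Tokenize on whitespace, then truncate or right-pad to exactly max_len tokens."""
    if max_len < 0:
        raise ValueError("max_len must be non-negative")
    tokens = text.split()[:max_len]  # drop tokens beyond max_len
    return tokens + [pad] * (max_len - len(tokens))  # pad short inputs on the right


# A short input is padded; a long input is truncated.
short = to_fixed_length("le tribunal rejette la demande", 8)
long = to_fixed_length("la cour de cassation casse et annule l'arrêt attaqué", 5)
```

Every output has the same length regardless of the input, which is what lets documents of different sizes be batched into a single tensor for a CNN or RNN.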

Author information

Authors and Affiliations

LARODEC Laboratory, University of Manouba, Manouba, Tunisia
Eya Hammami & Rim Faiz
IRIT Laboratory, University of Toulouse 3, Toulouse, France
Imen Akermi & Mohand Boughanem

Corresponding author

Correspondence to Eya Hammami.

Editor information

Editors and Affiliations

UIUC Institute, Zhejiang University, Zhejiang, China
Klaus-Dieter Schewe
INPT-ENSEEIHT/IRIT, Toulouse, France
Neeraj Kumar Singh

About this paper

Cite this paper

Hammami, E., Akermi, I., Faiz, R., Boughanem, M. (2019). Deep Learning for French Legal Data Categorization. In: Schewe, KD., Singh, N. (eds) Model and Data Engineering. MEDI 2019. Lecture Notes in Computer Science, vol 11815. Springer, Cham. https://doi.org/10.1007/978-3-030-32065-2_7

DOI: https://doi.org/10.1007/978-3-030-32065-2_7
Published: 21 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32064-5
Online ISBN: 978-3-030-32065-2
eBook Packages: Computer Science, Computer Science (R0)
