skip to main content
10.1145/3322640.3326726acmconferencesArticle/Chapter ViewAbstractPublication PagesicailConference Proceedingsconference-collections
research-article

Automated Bundle Pagination Using Machine Learning

Published: 17 June 2019 Publication History

Abstract

Coherent division of legal document bundles, whether this is done in the context of court bundles, briefs or some other application, is a time consuming and challenging task. We propose an approach whereby this process can be automated. Two variations are considered. The first addresses the scenario where the topic labelling is pre-defined and adopts a supervised learning approach. The second addresses the scenario where the topic labelling, for whatever reason, is not specified in advance and adopts an unsupervised learning approach. This paper reports on an investigation of both mechanisms using accident claims bundles. The evaluation results indicate that the proposed approaches can be successfully applied to divide legal document bundles.

References

[1]
Mehdi Allahyari, Seyed Amin Pouriyeh, Mehdi Assefi, Saied Safaei, Elizabeth D. Trippe, Juan B. Gutierrez, and Krys Kochut. 2017. A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques. CoRR abs/1707.02919 (2017). http://arxiv.org/abs/1707.02919
[2]
Purnima Bholowalia and Arvind Kumar. 2014. Article: EBK-Means: A Clustering Technique based on Elbow Method and K-Means in WSN. International Journal of Computer Applications 105, 9 (November 2014), 17--24.
[3]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet Allocation. Journal of Machine Learning Research 3, 4-5 (2003), 993--1022.
[4]
L. Karl Branting. 2017. Automating Judicial Document Analysis. In Proceedings of the Second Workshop on Automated Semantic Analysis of Information in Legal Texts.
[5]
Leo Breiman. 2001. Random Forests. Machine Learning 45, 1 (01 Oct 2001), 5--32.
[6]
Andreas Buja, Deborah F. Swayne, Michael L. Littman, Nathaniel Dean, Heike Hofmann, and Lisha Chen. 2008. Data visualization with multidimensional scaling. Journal of Computational and Graphical Statistics (2008).
[7]
Alexander Clark, Chris Fox, and Shalom Lappin. 2010. The Handbook of Computational Linguistics and Natural Language Processing. Wiley-Blackwell.
[8]
Jack G. Conrad, Khalid Al-Kofahi, Ying Zhao, and George Karypis. 2005. Effective Document Clustering for Large Heterogeneous Law Firm Collections. In Proceedings of the 10th International Conference on Artificial Intelligence and Law (ICAIL '05). ACM, 177--187.
[9]
Corinna Cortes and Vladimir Vapnik. 1995. Support-Vector Networks. Mach. Learn. 20, 3 (Sept. 1995), 273--297.
[10]
William M. Darling. 2011. A Theoretical and Practical Implementation Tutorial on Topic Modeling and Gibbs Sampling.
[11]
M Ikonomakis, S Kotsiantis, and V Tampakas. 2005. Text classification using machine learning techniques. WSEAS Transactions on Computers 4, 8 (2005), 966--974.
[12]
Mita K Dalal and Mukesh Zaveri. 2011. Automatic Text Classification: A Technical Review. International Journal of Computer Applications 28 (08 2011).
[13]
David G. Kleinbaum and Mitchel Klein. 2010. Introduction to Logistic Regression. Logistic Regression: A Self-Learning Text (2010), 1--39.
[14]
David MacKay. 2003. Information Theory, Inference and Learning Algorithms. Cambridge University Press.
[15]
R. Mihalcea and P. Tarau. 2004. TextRank: Bringing Order into Texts. In Proceedings of EMNLP-04 and the 2004 Conference on Empirical Methods in Natural Language Processing.
[16]
L. Page, S. Brin, R. Motwani, and T. Winograd. 1998. The PageRank citation ranking: Bringing order to the Web. In Proceedings of the 7th International World Wide Web Conference. 161--172.
[17]
Juan Ramos. 2003. Using TF-IDF to Determine Word Relevance in Document Queries. Technical Report.
[18]
Peter Rousseeuw. 1987. Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. J. Comput. Appl. Math. 20, 1 (Nov. 1987), 53--65.
[19]
Octavia-Maria Sulea, Marcos Zampieri, Shervin Malmasi, Mihaela Vela, Liviu P. Dinu, and Josef van Genabith. 2017. Exploring the Use of Text Classification in the Legal Domain. CoRR abs/1710.09306 (2017).
[20]
Ravi Kumar V and K. Raghuveer. 2012. Article: Legal Documents Clustering using Latent Dirichlet Allocation. International Journal of Applied Information Systems 2, 6 (2012), 27--33.
[21]
H. Zhang and D. Li. 2007. NaÃŕve Bayes Text Classifier. In 2007 IEEE International Conference on Granular Computing (GRC 2007). 708--708.

Cited By

View all
  • (2021)Administrative prejudice in cases of petty theft (the Article 7.27 of the Code of the Russian Federation on Administrative Offenses and the Article 158.1 of the Criminal Code of the Russian Federation): how the big data of judicial acts reflect humanization and quality of justiceЮридические исследования10.25136/2409-7136.2021.9.36521(81-124)Online publication date: Sep-2021
  • (2021)Application of Machine Learning Metrics for Dynamic E-justice Processes2021 28th Conference of Open Innovations Association (FRUCT)10.23919/FRUCT50888.2021.9347598(293-300)Online publication date: 27-Jan-2021

Index Terms

  1. Automated Bundle Pagination Using Machine Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICAIL '19: Proceedings of the Seventeenth International Conference on Artificial Intelligence and Law
    June 2019
    312 pages
    ISBN:9781450367547
    DOI:10.1145/3322640
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    • Univ. of Montreal: University of Montreal
    • AAAI
    • IAAIL: Intl Asso for Artifical Intel & Law

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 June 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICAIL '19
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 69 of 169 submissions, 41%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 08 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Administrative prejudice in cases of petty theft (the Article 7.27 of the Code of the Russian Federation on Administrative Offenses and the Article 158.1 of the Criminal Code of the Russian Federation): how the big data of judicial acts reflect humanization and quality of justiceЮридические исследования10.25136/2409-7136.2021.9.36521(81-124)Online publication date: Sep-2021
    • (2021)Application of Machine Learning Metrics for Dynamic E-justice Processes2021 28th Conference of Open Innovations Association (FRUCT)10.23919/FRUCT50888.2021.9347598(293-300)Online publication date: 27-Jan-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media