skip to main content
10.1145/3322640.3326715acmconferencesArticle/Chapter ViewAbstractPublication PagesicailConference Proceedingsconference-collections
research-article

Extracting the Gist of Chinese Judgments of the Supreme Court

Published: 17 June 2019 Publication History

Abstract

The gist of judgement documents encodes important experience and viewpoints of the Supreme Court, and provides instrumental and educational information for judges, lawyers, practitioners, and students. The Supreme Court in Taiwan appoints senior members to produce the gist for selected judgments of the Supreme Court, but is unable to offer the gist for all judgment documents. Based on our observation of the existing gist statements, we can treat the generation of the gist as a sentence classification problem. We apply machine-learning methods, including gradient boosting, multilayer perceptrons, and deep learning methods with long short-term memory units; and consider legal, linguistic, statistical information, and different word embedding methods to build several classifiers. By using more sophisticated classifiers and more relevant features, we gradually achieved better results, and the best result was 0.9372 in F1 measure.

References

[1]
E. Alfaro, N. Garcia, M. Gamez, and D. Elizondo (2008). Bankruptcy forecasting: An empirical comparison of AdaBoost and neural networks. Decision Support Systems, 45, 110--122.
[2]
E. Alpaydin (2010). Introduction to Machine Learning, the MIT Press.
[3]
G. Brown (2003). CHINATAX: exploring isomorphism with Chinese law. Proceedings of the Fourth International Conference on Artificial Intelligence and Law, 175--179.
[4]
P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov (2017). Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5, 135--146.
[5]
K.-C. Chen (2018). Automatic Extraction of Gist of Chinese Judgments of the Supreme Court, Master's Thesis, National Chengchi University, Taiwan. (advisor: Chao-Lin Liu)
[6]
T. Chen and C. Guestrin (2016). XGBoost: A scalable tree boosting system. Proceedings of the Twenty-Second SIGKDD Conference on Knowledge Discovery and Data Mining, 785--794.
[7]
T. Gonçalves and P. Quaresma (2005). Is linguistic information relevant for the classification of legal texts? Proceedings of the Tenth International Conference on Artificial Intelligence and Law, 168--176.
[8]
I. Goodfellow, Y. Bengio, and A. Courville (2016). Deep Learning, the MIT Press.
[9]
C. Grover, B. Hachey, I. Hughson, and C. Korycinski (2003). Automatic summarisation of legal documents. Proceedings of the Ninth International Conference on Artificial Intelligence and Law, 243--251.
[10]
A. Gupta, A. Z. Wang, K. Lin, H. Hong, H. Sun, B. L. Liebman, R. E. Stern, S. Dasgupta, and M. E. Roberts (2017). Toward building a legal knowledge-base of Chinese judicial documents for large-scale analytics. Proceedings of the Thirtieth Conference on Legal Knowledge and Information Systems, 135--144.
[11]
B. Hachey and C. Grover (2006). Extractive summarization of legal texts. Artificial Intelligence and Law, 14(4), 305--345.
[12]
T. K. Ho (1995). Random decision forests. Proceedings of the Third International Conference on Document Analysis and Recognition, volume 1, 278--282.
[13]
S. Hochreiter and Jürgen Schmidhuber (1997). Long short-term memory. Neural Computation, 9(8), 1735--1780.
[14]
A. Kanapala, S. Pal, and R. Pamula (2017). Text summarization from legal documents: a survey. Artificial Intelligence Review, 1--32, Springer.
[15]
G. Ke, Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, Q. Ye, and T.-Y. Liu (2017). LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems 30, 3149--3157.
[16]
D. P. Kingma and J. Ba (2015). Adam: A method for stochastic optimization. Proceedings of the Third International Conference for Learning Representations, 2015.
[17]
H.-L. Lin (2010). A Study on the Content Analysis and Metadata Design of Court Decisions, Section 4.1, Master's Thesis, National Chiao-Tung University, Taiwan. (advisor: Hao-Ren Ke)
[18]
C.-L. Liu, C.-T. Chang, and J.-H. Ho (2003). Classification and clustering for case-based criminal summary judgments. Proceedings of the Ninth International Conference on Artificial Intelligence and Law, 252--261.
[19]
W.-Y. Ma, and K.-J. Chen (2003). Introduction to CKIP Chinese word segmentation system for the first international Chinese word segmentation bakeoff. Proceedings of the Second SIGHAN Workshop on Chinese Language Processing, 168--171.
[20]
C. D. Manning and H. Schütze (1999). Foundations of Statistical Natural Language Processing, the MIT Press.
[21]
T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean (2013). Distributed Representations of Words and Phrases and their Compositionality. Proceedings of the Twenty-Sixth International Conference on Neural Information Processing Systems, volume 2, 3111--3119.
[22]
S. Polsley, P. Jhunjhunwala, and R Huang (2016). CaseSummarizer: A system for automated summarization of legal texts. Proceedings of the Twenty-Sixth International Conference on Computational Linguistics: System Demonstrations, 258--262.
[23]
M. Saravanan, B. Ravindran, and S. Raman (2006). Improving Legal Document Summarization Using Graphical Models. Proceedings of the Nineteenth Conference on Legal Knowledge and Information Systems, 51--60.
[24]
M. Saravanan, B. Ravindran, and S. Raman (2007). Using legal ontology for query enhancement in generating a document summary. Proceedings of the Twentieth Conference on Legal Knowledge and Information Systems, 171--172.
[25]
M. Schuster and K. K. Paliwal (1997). Bidirectional recurrent neural networks. IEEE Transactions on Signal Processing, 45(11), 2673--2681.
[26]
B. Wei and J. Huang (2015). Modelling dialogues in court using a gradual argumentation model: A case study. Proceedings of the Fifteenth International Conference on Artificial Intelligence and Law, 138--147.
[27]
M. Yousfi-Monod, A. Farzindar, and G. Lapalme (2010). Supervised machine learning for summarizing legal documents. Proceedings of the Twenty-Third Canadian conference on Advances in Artificial Intelligence, 51--62.

Cited By

View all
  • (2024)Studying Judicial Behaviour with Text AnalysisThe Oxford Handbook of Comparative Judicial Behaviour10.1093/oxfordhb/9780192898579.013.15(305-324)Online publication date: 22-Feb-2024
  • (2024)LAWSUIT: a LArge expert-Written SUmmarization dataset of ITalian constitutional court verdictsArtificial Intelligence and Law10.1007/s10506-024-09414-wOnline publication date: 9-Sep-2024
  • (2024)A support system for the detection of abusive clauses in B2C contractsArtificial Intelligence and Law10.1007/s10506-024-09408-8Online publication date: 26-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICAIL '19: Proceedings of the Seventeenth International Conference on Artificial Intelligence and Law
June 2019
312 pages
ISBN:9781450367547
DOI:10.1145/3322640
© 2019 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Sponsors

In-Cooperation

  • Univ. of Montreal: University of Montreal
  • AAAI
  • IAAIL: Intl Asso for Artifical Intel & Law

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 June 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Deep Learning
  2. Extractive Summarization
  3. Gradient Boosting
  4. Information Extraction
  5. Legal Informatics
  6. Long Short-Term Memory (LSTM)
  7. Machine Learning
  8. Multilayer Perceptrons
  9. Random Forest

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICAIL '19
Sponsor:

Acceptance Rates

Overall Acceptance Rate 69 of 169 submissions, 41%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)37
  • Downloads (Last 6 weeks)2
Reflects downloads up to 08 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Studying Judicial Behaviour with Text AnalysisThe Oxford Handbook of Comparative Judicial Behaviour10.1093/oxfordhb/9780192898579.013.15(305-324)Online publication date: 22-Feb-2024
  • (2024)LAWSUIT: a LArge expert-Written SUmmarization dataset of ITalian constitutional court verdictsArtificial Intelligence and Law10.1007/s10506-024-09414-wOnline publication date: 9-Sep-2024
  • (2024)A support system for the detection of abusive clauses in B2C contractsArtificial Intelligence and Law10.1007/s10506-024-09408-8Online publication date: 26-Jun-2024
  • (2024)Incorporating Domain Knowledge in Multi-objective Optimization Framework for Automating Indian Legal Case SummarizationPattern Recognition10.1007/978-3-031-78495-8_17(265-280)Online publication date: 4-Dec-2024
  • (2023)Legal Holding Extraction from Italian Case Documents using Italian-LEGAL-BERT Text SummarizationProceedings of the Nineteenth International Conference on Artificial Intelligence and Law10.1145/3594536.3595177(148-156)Online publication date: 19-Jun-2023
  • (2023)CASRank: A ranking algorithm for legal statute retrievalMultimedia Tools and Applications10.1007/s11042-023-15464-083:2(5369-5386)Online publication date: 2-Jun-2023
  • (2023)Bringing order into the realm of Transformer-based language models for artificial intelligence and lawArtificial Intelligence and Law10.1007/s10506-023-09374-732:4(863-1010)Online publication date: 20-Nov-2023
  • (2023)Ensemble methods for improving extractive summarization of legal case judgementsArtificial Intelligence and Law10.1007/s10506-023-09349-832:1(231-289)Online publication date: 4-Mar-2023
  • (2022)A full-process intelligent trial system for smart court一种智慧法院的全流程智能化审判系统Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.210004123:2(186-206)Online publication date: 18-Mar-2022
  • (2021)Incorporating domain knowledge for extractive summarization of legal case documentsProceedings of the Eighteenth International Conference on Artificial Intelligence and Law10.1145/3462757.3466092(22-31)Online publication date: 21-Jun-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media