skip to main content
10.1145/3331184.3331223acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Hierarchical Matching Network for Crime Classification

Published: 18 July 2019 Publication History

Abstract

Automatic crime classification is a fundamental task in the legal field. Given the fact descriptions, judges first determine the relevant violated laws, and then the articles. As laws and articles are grouped into a tree-shaped hierarchy (i.e., laws as parent labels, articles as children labels), this task can be naturally formalized as a two layers' hierarchical multi-label classification problem. Generally, the label semantics (i.e., definition of articles) and the hierarchical structure are two informative properties for judges to make a correct decision. However, most previous methods usually ignore the label structure and feed all labels into a flat classification framework, or neglect the label semantics and only utilize fact descriptions for crime classification, thus the performance may be limited. In this paper, we formalize crime classification problem into a matching task to address these issues. We name our model as Hierarchical Matching Network (HMN for short). Based on the tree hierarchy, HMN explicitly decomposes the semantics of children labels into the residual and alignment components. The residual components keep the unique characteristics of each individual children label, while the alignment components capture the common semantics among sibling children labels, which are further aggregated as the representation of their parent label. Finally, given a fact description, a co-attention metric is applied to effectively match the relevant laws and articles. Experiments on two real-world judicial datasets demonstrate that our model can significantly outperform the state-of-the-art methods.

Supplementary Material

MP4 File (cite3-17h20-d1.mp4)

References

[1]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR abs/1409.0473 (2014). arXiv:1409.0473 http://arxiv.org/abs/1409.0473.
[2]
Simon Baker and Anna Korhonen. 2017. Initializing neural networks for hierarchical multi-label text classification. In BioNLP 2017, Vancouver, Canada, August 4, 2017. 307--315.
[3]
Zafer Barutcuoglu, Robert E. Schapire, and Olga G. Troyanskaya. 2006. Hierarchical multi-label prediction of gene function. Bioinformatics 22, 7 (2006), 830--836.
[4]
Emily M. Bender, Leon Derczynski, and Pierre Isabelle (Eds.). 2018. Proceedings of the 27th International Conference on Computational Linguistics, COLING 2018, Santa Fe, New Mexico, USA, August 20-26, 2018. Association for Computational Linguistics.
[5]
Matthew R. Boutell, Jiebo Luo, Xipeng Shen, and Christopher M. Brown. 2004. Learning multi-label scene classification. Pattern Recognition 37, 9 (2004), 1757--1771.
[6]
Samuel R. Bowman, Gabor Angeli, Christopher Potts, and Christopher D. Manning. 2015. A large annotated corpus for learning natural language inference. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 632--642.
[7]
Ricardo Cerri, Rodrigo C. Barros, and André Carlos Ponce Leon Ferreira de Carvalho. 2014. Hierarchical multi-label classification using local neural networks. J. Comput. Syst. Sci. 80, 1 (2014), 39--56.
[8]
Ricardo Cerri, Rodrigo C. Barros, and André C. P. L. F. de Carvalho. 2015. Hierarchical classification of Gene Ontology-based protein functions with neural networks. In 2015 International Joint Conference on Neural Networks, IJCNN 2015, Killarney, Ireland, July 12-17, 2015. 1--8.
[9]
Nicolò Cesa-Bianchi, Claudio Gentile, and Luca Zaniboni. 2006. Incremental Algorithms for Hierarchical Classification. J. Mach. Learn. Res. 7 (Dec. 2006), 31--54. http://dl.acm.org/citation.cfm?id=1248547.1248549.
[10]
Kyunghyun Cho, Bart Van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. empirical methods in natural language processing (2014), 1724--1734.
[11]
Eduardo Costa, Ana Lorena, Andre Carvalho, and Alex Freitas. 2007. A review of performance evaluation measures for hierarchical classifiers. AAAI Workshop - Technical Report (01 2007).
[12]
Eduardo P. Costa, Ana C. Lorena, André C. P. L. F. Carvalho, Alex A. Freitas, and Nicholas Holden. 2007. Comparing Several Approaches for Hierarchical Classification of Proteins with Decision Trees. In Proceedings of the 2nd Brazilian Conference on Advances in Bioinformatics and Computational Biology (BSB'07). Springer-Verlag, Berlin, Heidelberg, 126--137. http://dl.acm.org/citation.cfm?id=1776474.1776487.
[13]
Andre Elisseeff and Jason Weston. 2001. A kernel method for multi-labelled classification. (2001), 681--687.
[14]
Eva Gibaja and Sebastian Ventura. 2014. Multilabel Learning: A Review of the State of The Art and Ongoing Research. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery (11 2014).
[15]
Jiafeng Guo, Yixing Fan, Qingyao Ai, and W. Bruce Croft. 2016. Semantic Matching by Non-Linear Word Transportation for Information Retrieval. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (CIKM '16). ACM, New York, NY, USA, 701--710.
[16]
Hua He and Jimmy J. Lin. 2016. Pairwise Word Interaction Modeling with Deep Neural Networks for Semantic Similarity Measurement. In NAACL HLT 2016, The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego California, USA, June 12-17, 2016. 937--948.
[17]
Posen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero, and Larry P. Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. (2013), 2333--2338.
[18]
Rong Jin, Alex G. Hauptmann, and Cheng Xiang Zhai. 2002. Title Language Model for Information Retrieval. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '02). ACM, New York, NY, USA, 42--48.
[19]
Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. empirical methods in natural language processing (2014), 1746--1751.
[20]
Xin Li and Yuhong Guo. 2015. Multi-label classification with feature-aware non-linear label space transformation. (2015), 3635--3642.
[21]
Jimmy J. Lin. 2007. An exploration of the principles underlying redundancy-based factoid question answering. ACM Transactions on Information Systems 25, 2 (2007), 6.
[22]
Shangbang Long, Cunchao Tu, Zhiyuan Liu, and Maosong Sun. 2018. Automatic Judgment Prediction via Legal Reading Comprehension. CoRR abs/1809.06537 (2018). arXiv:1809.06537 http://arxiv.org/abs/1809.06537.
[23]
Shangbang Long, Cunchao Tu, Zhiyuan Liu, and Maosong Sun. 2018. Automatic Judgment Prediction via Legal Reading Comprehension. CoRR abs/1809.06537 (2018). arXiv:1809.06537 http://arxiv.org/abs/1809.06537.
[24]
Bingfeng Luo, Yansong Feng, Jianbo Xu, Xiang Zhang, and Dongyan Zhao. 2017. Learning to Predict Charges for Criminal Cases with Legal Basis. CoRR abs/1707.09168 (2017). http://arxiv.org/abs/1707.09168.
[25]
Mallinali Ramírez-Corona, L. Enrique Sucar, and Eduardo F. Morales. 2016. Hierarchical Multilabel Classification Based on Path Evaluation. Int. J. Approx. Reasoning 68, C (Jan. 2016), 179--193.
[26]
Jesse Read, Bernhard Pfahringer, Geoff Holmes, and Eibe Frank. 2011. Classifier chains for multi-label classification. Machine Learning 85, 3 (2011), 333--359.
[27]
Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun'ichi Tsujii (Eds.). 2018. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018. Association for Computational Linguistics.
[28]
Sawinee Sangsuriyun, Sanparith Marukatat, and Kitsana Waiyamai. 2010. Hierarchical Multi-label Associative Classification (HMAC) using negative rules. In Proceedings of the 9th IEEE International Conference on Cognitive Informatics, ICCI 2010, July 7-9, 2010, Beijing, China. 919--924.
[29]
Leander Schietgat, Celine Vens, Jan Struyf, Hendrik Blockeel, Dragi Kocev, and Saso Dzeroski. 2010. Predicting gene function using hierarchical multi-label decision tree ensembles. BMC Bioinformatics 11 (2010), 2.
[30]
Gehui Shen, Yunlun Yang, and Zhi-Hong Deng. 2017. Inter-Weighted Alignment Network for Sentence Pair Modeling. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, Copenhagen, Denmark, September 9-11, 2017. 1179--1189.
[31]
Aixin Sun and Ee-Peng Lim. 2001. Hierarchical Text Classification and Evaluation. In Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM '01). IEEE Computer Society, Washington, DC, USA, 521--528. http://dl.acm.org/citation.cfm?id=645496.657884.
[32]
Grigorios Tsoumakas and Ioannis Katakis. 2007. Multi-Label Classification: An Overview. International Journal of Data Warehousing and Mining 3, 3 (2007), 1--13.
[33]
Celine Vens, Jan Struyf, Leander Schietgat, Saso Dzeroski, and Hendrik Blockeel. 2008. Decision trees for hierarchical multi-label classification. Machine Learning 73, 2 (2008), 185--214.
[34]
Pengfei Wang, Ze Yang, Shuzi Niu, Yongfeng Zhang, Lei Zhang, and ShaoZhang Niu. 2018. Modeling Dynamic Pairwise Attention for Crime Classification over Legal Articles. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR '18). ACM, New York, NY, USA, 485--494.
[35]
Jonatas Wehrmann, Ricardo Cerri, and Rodrigo Barros. 2018. Hierarchical MultiLabel Classification Networks. In Proceedings of the 35th International Conference on Machine Learning (Proceedings of Machine Learning Research), Jennifer Dy and Andreas Krause (Eds.), Vol. 80. PMLR, Stockholmsmässan, Stockholm Sweden, 5075--5084.
[36]
Jonatas Wehrmann, Ricardo Cerri, and Rodrigo C. Barros. 2018. Hierarchical Multi-Label Classification Networks. In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018. 5225--5234.
[37]
Caiming Xiong, Victor Zhong, and Richard Socher. 2017. Dynamic Coattention Networks For Question Answering. international conference on learning representations (2017).
[38]
Zichao Yang, Diyi Yang, Chris Dyer, Xiaodong He, Alexander J. Smola, and Eduard H. Hovy. 2016. Hierarchical Attention Networks for Document Classification. In NAACL HLT 2016, The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego California, USA, June 12-17, 2016. 1480--1489.
[39]
Minling Zhang and Zhihua Zhou. 2006. Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization. IEEE Transactions on Knowledge and Data Engineering 18, 10 (2006), 1338--1351.
[40]
Minling Zhang and Zhihua Zhou. 2014. A Review on Multi-Label Learning Algorithms. IEEE Transactions on Knowledge and Data Engineering 26, 8 (2014), 1819--1837.
[41]
Haoxi Zhong, Guo Zhipeng, Cunchao Tu, Chaojun Xiao, Zhiyuan Liu, and Maosong Sun. 2018. Legal Judgment Prediction via Topological Learning. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 3540--3549. http://aclweb.org/anthology/D18-1390.

Cited By

View all
  • (2025)Enhancing Rhetorical Role Identification in Legal Documents Using Large Language Models and IN_place Data AugmentationComputing, Communication and Learning10.1007/978-3-031-79041-6_8(90-100)Online publication date: 1-Feb-2025
  • (2024)Legal Statute Identification: A Case Study using State-of-the-Art Datasets and MethodsProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657879(2231-2240)Online publication date: 10-Jul-2024
  • (2024)A Circumstance-Aware Neural Framework for Explainable Legal Judgment PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338758036:11(5453-5467)Online publication date: Nov-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2019
1512 pages
ISBN:9781450361729
DOI:10.1145/3331184
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 July 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. crime classification
  2. hierarchical matching network
  3. hierarchical multi-label classification

Qualifiers

  • Research-article

Funding Sources

  • the fundamental Research for the Central Universities
  • National Natural Science Foundation of China

Conference

SIGIR '19
Sponsor:

Acceptance Rates

SIGIR'19 Paper Acceptance Rate 84 of 426 submissions, 20%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)39
  • Downloads (Last 6 weeks)8
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Enhancing Rhetorical Role Identification in Legal Documents Using Large Language Models and IN_place Data AugmentationComputing, Communication and Learning10.1007/978-3-031-79041-6_8(90-100)Online publication date: 1-Feb-2025
  • (2024)Legal Statute Identification: A Case Study using State-of-the-Art Datasets and MethodsProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657879(2231-2240)Online publication date: 10-Jul-2024
  • (2024)A Circumstance-Aware Neural Framework for Explainable Legal Judgment PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338758036:11(5453-5467)Online publication date: Nov-2024
  • (2024)$\boldsymbol{R}^{2}$: A Novel Recall & Ranking Framework for Legal Judgment PredictionIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2024.336538932(1609-1622)Online publication date: 19-Feb-2024
  • (2024)Legal Judgement Prediction via Contrastive Learning Based-Retrieval and Semantic Embedding2024 IEEE International Symposium on Product Compliance Engineering - Asia (ISPCE-ASIA)10.1109/ISPCE-ASIA64773.2024.10756244(1-6)Online publication date: 25-Oct-2024
  • (2024)HD-LJPKnowledge-Based Systems10.1016/j.knosys.2024.112033299:COnline publication date: 18-Oct-2024
  • (2024)CADLRA: A multi-charge prediction method based on the Criminal Act-Driven Law Retrieval AugmentationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.108619134(108619)Online publication date: Aug-2024
  • (2024)Rhetorical Role Detection in Legal Judgements Using Zero-Shot LearningProceedings of Data Analytics and Management10.1007/978-981-99-6544-1_42(559-574)Online publication date: 14-Jan-2024
  • (2024)A Decision Tree Approach for Identifying Indian Penal Code Sections Across Different Crime AspectsContributions Presented at The International Conference on Computing, Communication, Cybersecurity and AI, July 3–4, 2024, London, UK10.1007/978-3-031-74443-3_45(773-782)Online publication date: 20-Dec-2024
  • (2023)Coupled point process-based sequence modeling for privacy-preserving network alignmentProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/678(6112-6120)Online publication date: 19-Aug-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media