skip to main content
research-article

Syntactic-Informed Graph Networks for Sentence Matching

Published:27 September 2023Publication History
Skip Abstract Section

Abstract

Matching two natural language sentences is a fundamental problem in both natural language processing and information retrieval. Preliminary studies have shown that the syntactic structures help improve the matching accuracy, and different syntactic structures in natural language are complementary to sentence semantic understanding. Ideally, a matching model would leverage all syntactic information. Existing models, however, are only able to combine limited (usually one) types of syntactic information due to the complex and heterogeneous nature of the syntactic information. To deal with the problem, we propose a novel matching model, which formulates sentence matching as a representation learning task on a syntactic-informed heterogeneous graph. The model, referred to as SIGN (Syntactic-Informed Graph Network), first constructs a heterogeneous matching graph based on the multiple syntactic structures of two input sentences. Then the graph attention network algorithm is applied to the matching graph to learn the high-level representations of the nodes. With the help of the graph learning framework, the multiple syntactic structures, as well as the word semantics, can be represented and interacted in the matching graph and therefore collectively enhance the matching accuracy. We conducted comprehensive experiments on three public datasets. The results demonstrate that SIGN outperforms the state of the art and also can discriminate the sentences in an interpretable way.

REFERENCES

  1. [1] Abney Steven. 1996. Partial parsing via finite-state cascades. Natural Language Engineering 2, 4 (1996), 337344.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Bai Jiangang, Wang Yujing, Chen Yiren, Yang Yaming, Bai Jing, Yu Jing, and Tong Yunhai. 2021. Syntax-BERT: Improving pre-trained transformers with syntax trees. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 30113020. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Bastings Jasmijn, Titov Ivan, Aziz Wilker, Marcheggiani Diego, and Sima’an Khalil. 2017. Graph convolutional encoders for syntax-aware neural machine translation. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 19571967. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Bowman Samuel R., Angeli Gabor, Potts Christopher, and Manning Christopher D.. 2015. A large annotated corpus for learning natural language inference. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 632642. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Bu Fan, Li Hang, and Zhu Xiaoyan. 2013. An introduction to string re-writing kernel. In Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI’13). 2982–2986. http://www.aaai.org/ocs/index.php/IJCAI/IJCAI13/paper/view/6544Google ScholarGoogle Scholar
  6. [6] Chen Haolan, Han Fred X., Niu Di, Liu Dong, Lai Kunfeng, Wu Chenglin, and Xu Yu. 2018. MIX: Multi-channel information crossing for text matching. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’18). ACM, New York, NY, 110119. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Chen Lu, Zhao Yanbin, Lyu Boer, Jin Lesheng, Chen Zhi, Zhu Su, and Yu Kai. 2020. Neural graph matching networks for Chinese short text matching. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 61526158. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Chen Qian, Zhu Xiaodan, Ling Zhenhua, Wei Si, and Jiang Hui. 2016. Enhancing and combining sequential and tree lstm for natural language inference. ArXiv preprint abs/1609.06038 (2016). https://arxiv.org/abs/1609.06038Google ScholarGoogle Scholar
  9. [9] Chen Qian, Zhu Xiaodan, Ling Zhen-Hua, Wei Si, Jiang Hui, and Inkpen Diana. 2017. Enhanced LSTM for natural language inference. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 16571668. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Das Dipanjan and Smith Noah A.. 2009. Paraphrase identification as probabilistic quasi-synchronous recognition. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. 468476. https://aclanthology.org/P09-1053Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Devlin Jacob, Chang Ming-Wei, Lee Kenton, and Toutanova Kristina. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long and Short Papers). 41714186. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Fitria Tira Nur. 2021. QuillBot as an online tool: Students’ alternative in paraphrasing and rewriting of English writing. Englisia: Journal of Language, Education, and Humanities 9, 1 (2021), 183196.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Friederici Angela D. and Weissenborn Jürgen. 2007. Mapping sentence form onto meaning: The syntax–semantic interface. Brain Research 1146 (2007), 5058.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Gao Jianfeng, Pantel Patrick, Gamon Michael, He Xiaodong, and Deng Li. 2014. Modeling interestingness with deep neural networks. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP’14). 213. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Gong Yichen, Luo Heng, and Zhang Jian. 2018. Natural language inference over interaction space. In Proceedings of the 6th International Conference on Learning Representations: Conference Track Proceedings (ICLR’18). https://openreview.net/forum?id=r1dHXnH6-Google ScholarGoogle Scholar
  16. [16] Gouvea Ana C., Phillips Colin, Kazanina Nina, and Poeppel David. 2010. The linguistic processes underlying the P600. Language and Cognitive Processes 25, 2 (2010), 149188.Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Guo Jiafeng, Fan Yixing, Ai Qingyao, and Croft W. Bruce. 2016. A deep relevance matching model for ad-hoc retrieval. In Proceedings of the 25th ACM International Conference on Information and Knowledge Management (CIKM’16). 55–64. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] Hu Baotian, Lu Zhengdong, Li Hang, and Chen Qingcai. 2014. Convolutional neural network architectures for matching natural language sentences. In Proceedings of the 27th International Conference on Neural Information Processing Systems (NIPS’14), Vol. 2. 2042–2050. https://proceedings.neurips.cc/paper/2014/hash/b9d487a30398d42ecff55c228ed5652b-Abstract.htmlGoogle ScholarGoogle Scholar
  19. [19] Huang Po-Sen, He Xiaodong, Gao Jianfeng, Deng Li, Acero Alex, and Heck Larry P.. 2013. Learning deep structured semantic models for web search using clickthrough data. In Proceedings of the 22nd ACM International Conference on Information and Knowledge Management (CIKM’13). ACM, New York, NY, 23332338. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. [20] Khot Tushar, Sabharwal Ashish, and Clark Peter. 2018. SciTaiL: A textual entailment dataset from science question answering. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence(AAAI’18), the 30th Innovative Applications of Artificial Intelligence (IAAI’18), and the 8thAAAI Symposium on Education Advances in Artificial Intelligence (EAAI’18). 5189–5198. https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/17368Google ScholarGoogle Scholar
  21. [21] Kim Seonhoon, Kang Inho, and Kwak Nojun. 2019. Semantic sentence matching with densely-connected recurrent and co-attentive information. In Proceedings of the 33rd AAAI Conference on ArtificialIntelligence (AAAI’19), the 31st Innovative Applications of Artificial Intelligence Conference (IAAI’19), and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI’19). 65866593. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Kingma Diederik P. and Ba Jimmy. 2015. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR’15). http://arxiv.org/abs/1412.6980Google ScholarGoogle Scholar
  23. [23] Kipf Thomas N. and Welling Max. 2017. Semi-supervised classification with graph convolutional networks. In Proceedings of the 5th International Conference on Learning Representations: Conference Track Proceedings (ICLR’17). https://openreview.net/forum?id=SJU4ayYglGoogle ScholarGoogle Scholar
  24. [24] Li Hang and Xu Jun. 2014. Semantic matching in search. Foundations and Trends in Information Retrieval 7, 5 (2014), 343469.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Liu Tao, Wang Xin, Lv Chengguo, Zhen Ranran, and Fu Guohong. 2020. Sentence matching with syntax- and semantics-aware BERT. In Proceedings of the 28th International Conference on Computational Linguistics. 33023312. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Liu Xiaodong, Duh Kevin, and Gao Jianfeng. 2018. Stochastic answer networks for natural language inference. arXiv preprint abs/1804.07888 (2018). https://arxiv.org/abs/1804.07888Google ScholarGoogle Scholar
  27. [27] Liu Yang, Gardner Matt, and Lapata Mirella. 2018. Structured alignment networks for matching sentences. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 15541564. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Liu Yinhan, Ott Myle, Goyal Naman, Du Jingfei, Joshi Mandar, Chen Danqi, Levy Omer, Lewis Mike, Zettlemoyer Luke, and Stoyanov Veselin. 2019. RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint abs/1907.11692 (2019). https://arxiv.org/abs/1907.11692Google ScholarGoogle Scholar
  29. [29] Lu Zhengdong and Li Hang. 2013. A deep architecture for matching short texts. In Proceedings of the 26th International Conference on Neural Information Processing Systems, Vol. 1 (NIPS’13). 1367–1375. https://proceedings.neurips.cc/paper/2013/hash/8a0e1141fd37fa5b98d5bb769ba1a7cc-Abstract.htmlGoogle ScholarGoogle Scholar
  30. [30] Ma Nianzu, Mazumder Sahisnu, Wang Hao, and Liu Bing. 2020. Entity-aware dependency-based deep graph attention network for comparative preference classification. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 57825788. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Manning Christopher, Surdeanu Mihai, Bauer John, Finkel Jenny, Bethard Steven, and McClosky David. 2014. The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations. 5560. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Mitra Bhaskar, Diaz Fernando, and Craswell Nick. 2017. Learning to match using local and distributed representations of text for web search. In Proceedings of the 26th International Conference on World Wide Web (WWW’17). ACM, New York, NY, 12911299. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. [33] Mohammad Al-Smadi, Jaradat Zain, Mahmoud Al-Ayyoub, and Jararweh Yaser. 2017. Paraphrase identification and semantic text similarity analysis in Arabic news tweets using lexical, syntactic, and semantic features. Information Processing & Management 53, 3 (2017), 640652.Google ScholarGoogle Scholar
  34. [34] Mou Lili, Men Rui, Li Ge, Xu Yan, Zhang Lu, Yan Rui, and Jin Zhi. 2016. Natural language inference by tree-based convolution and heuristic matching. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 130136. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Pang Liang, Lan Yanyan, Guo Jiafeng, Xu Jun, Wan Shengxian, and Cheng Xueqi. 2016. Text matching as image recognition. In Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2793–2799.. http://www.aaai.org/ocs/index.php/AAAI/AAAI16/paper/view/11895Google ScholarGoogle Scholar
  36. [36] Parikh Ankur, Täckström Oscar, Das Dipanjan, and Uszkoreit Jakob. 2016. A decomposable attention model for natural language inference. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 22492255. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Potthast Martin, Barrón-Cedeño Alberto, Stein Benno, and Rosso Paolo. 2011. Cross-language plagiarism detection. Language Resources and Evaluation 45 (2011), 4562.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. [38] Sachan Devendra, Zhang Yuhao, Qi Peng, and Hamilton William L.. 2021. Do syntax trees help pre-trained transformers extract information? In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 26472661. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Shen Yelong, He Xiaodong, Gao Jianfeng, Deng Li, and Mesnil Grégoire. 2014. Learning semantic representations using convolutional neural networks for web search. In Proceedings of the 23rd International Conference on World Wide Web. 373374.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Tan Chuanqi, Wei Furu, Wang Wenhui, Lv Weifeng, and Zhou Ming. 2018. Multiway attention networks for modeling sentence pairs. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI’18). 4411–4417. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  41. [41] Tay Yi, Luu Anh Tuan, and Hui Siu Cheung. 2018. Co-stack residual affinity networks with multi-level attention refinement for matching text sequences. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 44924502. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Tay Yi, Luu Anh Tuan, and Hui Siu Cheung. 2018. Compare, compress and propagate: Enhancing neural architectures with alignment factorization for natural language inference. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 15651575. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Tay Yi, Luu Anh Tuan, and Hui Siu Cheung. 2018. Hermitian co-attention networks for text matching in asymmetrical domains. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI’18). 4425–4431. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  44. [44] Tomar Gaurav Singh, Duque Thyago, Täckström Oscar, Uszkoreit Jakob, and Das Dipanjan. 2017. Neural paraphrase identification of questions with noisy pretraining. In Proceedings of the 1st Workshop on Subword and Character Level Models in NLP. 142147. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Velickovic Petar, Cucurull Guillem, Casanova Arantxa, Romero Adriana, Liò Pietro, and Bengio Yoshua. 2018. Graph attention networks. In Proceedings of the 6th International Conference on Learning Representations: Conference Track Proceedings (ICLR’18). https://openreview.net/forum?id=rJXMpikCZGoogle ScholarGoogle Scholar
  46. [46] Wang Mingxuan, Lu Zhengdong, Li Hang, and Liu Qun. 2015. Syntax-based deep matching of short texts. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). 13541361. http://ijcai.org/Abstract/15/195Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Wang Xiao, Ji Houye, Shi Chuan, Wang Bai, Ye Yanfang, Cui Peng, and Yu Philip S.. 2019. Heterogeneous graph attention network. In Proceedings of the World Wide Web Conference (WWW’19). ACM, New York, NY, 20222032. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. [48] Wang Xiaoyan, Kapanipathi Pavan, Musa Ryan, Yu Mo, Talamadupula Kartik, Abdelaziz Ibrahim, Chang Maria, Fokoue Achille, Makni Bassem, Mattei Nicholas, and Witbrock Michael. 2019. Improving natural language inference using external knowledge in the science questions domain. In Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI’19), the 31st Innovative Applications of Artificial Intelligence Conference (IAAI’19), and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI’19).72087215. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. [49] Wang Zhiguo, Hamza Wael, and Florian Radu. 2017. Bilateral multi-perspective matching for natural language sentences. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI’17). 41444150. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  50. [50] Wang Zhiguo, Hamza Wael, and Florian Radu. 2017. Bilateral multi-perspective matching for natural language sentences. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI’17). 41444150. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Wu Zonghan, Pan Shirui, Chen Fengwen, Long Guodong, Zhang Chengqi, and Philip S. Yu. 2021. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems 32, 1 (2021), 4–24.Google ScholarGoogle ScholarCross RefCross Ref
  52. [52] Xiong Chenyan, Dai Zhuyun, Callan Jamie, Liu Zhiyuan, and Power Russell. 2017. End-to-end neural ad-hoc ranking with kernel pooling. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 5564. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. [53] Xu Chen, Xu Jun, Dong Zhenhua, and Wen Ji-Rong. 2022. Semantic sentence matching via interacting syntax graphs. In Proceedings of the 29th International Conference on Computational Linguistics. 938949. https://aclanthology.org/2022.coling-1.78Google ScholarGoogle Scholar
  54. [54] Xu Jun, He Xiangnan, and Li Hang. 2019. Deep learning for matching in search and recommendation. In Proceedings of the 12th ACM International Conference on Web Search and Data Mining (WSDM’19). ACM, New York, NY, 832833. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. [55] Yang Runqi, Zhang Jianhai, Gao Xing, Ji Feng, and Chen Haiqing. 2019. Simple and effective text matching with richer alignment features. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 46994709. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  56. [56] Yao Liang, Mao Chengsheng, and Luo Yuan. 2019. Graph convolutional networks for text classification. In Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI’19), the 31st Innovative Applications of Artificial Intelligence Conference (IAAI’19), and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI’19). 73707377. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. [57] Yu Xueli, Xu Weizhi, Cui Zeyu, Wu Shu, and Wang Liang. 2021. Graph-based hierarchical relevance matching signals for ad-hoc retrieval. In Proceedings of the Web Conference 2021. 778787.Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. [58] Zhang Bo, Zhang Yue, Wang Rui, Li Zhenghua, and Zhang Min. 2020. Syntax-aware opinion role labeling with dependency graph convolutional networks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 32493258. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  59. [59] Zhang Zhuosheng, Wu Yuwei, Zhao Hai, Li Zuchao, Zhang Shuailiang, Zhou Xi, and Zhou Xiang. 2020. Semantics-aware BERT for language understanding. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI’20), the 32nd Innovative Applications of Artificial Intelligence Conference (IAAI’20), and the 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI’20). 96289635. https://aaai.org/ojs/index.php/AAAI/article/view/6510Google ScholarGoogle Scholar

Index Terms

  1. Syntactic-Informed Graph Networks for Sentence Matching

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Information Systems
        ACM Transactions on Information Systems  Volume 42, Issue 2
        March 2024
        897 pages
        ISSN:1046-8188
        EISSN:1558-2868
        DOI:10.1145/3618075
        Issue’s Table of Contents

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 September 2023
        • Online AM: 19 July 2023
        • Accepted: 5 July 2023
        • Revised: 29 August 2022
        • Received: 16 December 2021
        Published in tois Volume 42, Issue 2

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
      • Article Metrics

        • Downloads (Last 12 months)278
        • Downloads (Last 6 weeks)35

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      View Full Text