A hybrid method for entity hyponymy acquisition in Chinese complex sentences

Automatic Control and Computer Sciences

Abstract

Extracting entity hyponymy from Chinese complex sentences is difficult. This paper proposes a hybrid approach that combines parsing with supervised and semi-supervised learning. First, a conditional random field (CRF) model is used to identify candidate domain named entities. Pattern matching is then used to acquire candidate hyponymy relations. Next, predicate and symbol features, syntactic analysis, and semantic roles are added to the CRF feature template to identify hyponymy entity pairs. Finally, both the parallel relationships of entities among sentences and the entity pairs found in simple sentences are analyzed to obtain the hyponymy entity pairs in Chinese complex sentences. Experimental results show that the proposed method reduces the manual work required for CRF labeling and improves overall performance compared with the baseline methods.

Author information

Corresponding author

Correspondence to Yunru Cheng.

Additional information

The article is published in the original.

About this article

Cite this article

Cheng, Y., Guo, J., Xian, Y. et al. A hybrid method for entity hyponymy acquisition in Chinese complex sentences. Aut. Control Comp. Sci. 50, 369–377 (2016). https://doi.org/10.3103/S0146411616050035

  • DOI: https://doi.org/10.3103/S0146411616050035
