A hybrid method for entity hyponymy acquisition in Chinese complex sentences

Cheng, Yunru; Guo, Jianyi; Xian, Yantuan; Yu, Zhengtao; Chen, Wei; Yang, Qiyue

doi:10.3103/S0146411616050035

A hybrid method for entity hyponymy acquisition in Chinese complex sentences

Published: 12 November 2016

Volume 50, pages 369–377, (2016)
Cite this article

Automatic Control and Computer Sciences Aims and scope Submit manuscript

Yunru Cheng¹,
Jianyi Guo^1,2,
Yantuan Xian^1,2,
Zhengtao Yu^1,2,
Wei Chen^1,2 &
…
Qiyue Yang¹

52 Accesses
1 Citation
Explore all metrics

Abstract

Extracting entity hyponymy in Chinese complex sentences can be a highly difficult process. This paper proposes a novel hybrid approach that combines parsing with supervised learning and semi-supervised learning. First, conditional random fields (CRF) model is employed to obtain the candidate domain named entity. Pattern matching is then used to acquire candidate hyponymy. Next, predicate and symbol features, syntactic analysis, and semantic roles are introduced into the CRF features template to identify the hyponymy entity pairs. Finally, analysis of both the parallel relationship of entities among sentences and entity pairs in simple sentences is conducted to obtain the hyponymy entity pairs in Chinese complex sentences. The experimental results show that the proposed method reduces the manual work required for CRF markers and has an improved overall performance in comparison with the baseline methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Extracting hyponymy of domain entity using Cascaded Conditional Random Fields

Article 01 July 2017

Research on Pattern Representation and Reliability in Semi-Supervised Entity Relation Extraction

OntoILPER: an ontology- and inductive logic programming-based system to extract entities and relations from text

Article 09 October 2017

References

Nakaya, N., Kurematsu, M., and Yamaguchi, T., A domain ontology development environment using a MRD and text corpus, Proc. of the Joint Conf. on Knowledge Based Software Engineering, 2002, pp. 242–253.
Google Scholar
WordNet: A Lexical Database for English, Princeton University. http://wordnet.princeton.edu/wordnet/.
Li, H., Li, W., Liang, R., et al., Toponym ontology concept semantic relation research based on place name dictionary and thesaurus, China Place Name, 2010, vol. 10, pp. 71–74.
Google Scholar
Dong, Z. and Dong, Q., HowNet. http://www.keenage.com/html/c_index.html.
Hearst, M.A., Automatic acquisition of hyponyms from large text corpora, Proceedings of the 14th Conference on Computational Linguistics, 1992, vol. 2, pp. 539–545.
Article Google Scholar
Tuan, L.A., Kim, J., and Kiong, N.S., Taxonomy construction using syntactic contextual evidence, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014, pp. 810–819.
Google Scholar
Bansal, M., Burkett, D., Melo, D.G., et al., Structured learning for taxonomy induction with belief propagation, ACL, 2014, no. 1, pp. 1041–1051.
Google Scholar
Wu, J., Luo, B., and Cao, C., Acquisition and verification of mereological knowledge from Web page texts, J. East China Univ. Sci. Technol., 2006, vol. 32, no. 11, p. 1310.
Google Scholar
Tang, Q., Lv, X.Q., and Li, Z., Research on domain ontology concept hyponymy relation extraction, Microelectron. Comput., 2014, vol. 6, pp. 68–71.
Google Scholar
Tian, F., Yuan, C., and Ren, F., Hyponym extraction from the web by bootstrapping, IEEJ Trans. Electr. Electron. Eng., 2012, vol. 7, no. 1, pp. 62–68.
Article Google Scholar
Fan, M., Zhao, D., Zhou, Q., et al., Distant supervision for relation extraction with matrix completion, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014, vol. 1, pp. 839–849.
Google Scholar
Xia, F., Cao, X., Fu, J., et al., Extracting part-whole relations based on coordinate structure, J. Chin. Inf. Process., 2015, vol. 29, no. 1, pp. 88–96.
Google Scholar
Fu, R., Qin, B., and Liu, T., Exploiting multiple sources for open-domain hypernym discovery, EMNLP, 2013, pp. 1224–1234.
Google Scholar
Sang, E.T.K. and Hofmann, K., Lexical patterns or dependency patterns: Which is better for hypernym extraction?, Proceedings of the Thirteenth Conference on Computational Natural Language Learning, 2009, pp. 174–182.
Chapter Google Scholar
Liu, H., Che, W., and Liu, T., Feature engineering for Chinese semantic role labeling, J. Chin. Inf. Process., 2007, vol. 21, no. 1, pp. 79–84.
Google Scholar
Chen, Y., Zheng, Q., and Chen, P., Feature assembly method for extracting relations in Chinese, Artif. Intell., 2015, vol. 228, pp. 179–184.
Article MathSciNet MATH Google Scholar
Zhang, H., NLPIR: Chinese word segmentation system. http://ictclas.nlpir.org/.
Pennacchiotti, M. and Pantel, P., A bootstrapping algorithm for automatically harvesting semantic relations, Proceedings of Inference in Computational Semantics (ICoS-06), 2006, pp. 87–96.
Google Scholar
The Research Center for Social Computing and Information Retrieval at Harbin Institute of Technology (HITSCIR): Language Technology Platform. http://www.ltp-cloud.com/.
Mo, Y., Guo, J., Yu, Z., et al., Hyponymy extraction of domain ontology concept based on CCRF, Comput. Eng., 2014, vol. 40, no. 6, pp. 138–141.
Google Scholar
Wang, C. and Yang, Z., An acquisition method of domain-specific terminological hyponym based on structure features of sentence, J. Chongqing Univ. Posts Telecommun. (Nat. Sci. Ed.), 2014, vol. 3, p. 19.
Google Scholar
Chang, C. and Lin, C., LIBSVM–A library for support vector machines. http://www.csie.ntu.edu.tw/~cjlin/libsvm/.
Kudo, T., CRF++: Yet another CRF toolkit. https://taku910.github.io/crfpp/.
Che, W., Li, Z., and Liu, T., LTP: A Chinese language technology platform, Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations, 2010, pp. 13–16.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, Yunnan, China
Yunru Cheng, Jianyi Guo, Yantuan Xian, Zhengtao Yu, Wei Chen & Qiyue Yang
Key Laboratory of Pattern recognition And Intelligent computing of Yunnan College, Kunming, Yunnan, China
Jianyi Guo, Yantuan Xian, Zhengtao Yu & Wei Chen

Authors

Yunru Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Jianyi Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yantuan Xian
View author publications
You can also search for this author in PubMed Google Scholar
Zhengtao Yu
View author publications
You can also search for this author in PubMed Google Scholar
Wei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Qiyue Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yunru Cheng.

Additional information

The article is published in the original.

About this article

Cite this article

Cheng, Y., Guo, J., Xian, Y. et al. A hybrid method for entity hyponymy acquisition in Chinese complex sentences. Aut. Control Comp. Sci. 50, 369–377 (2016). https://doi.org/10.3103/S0146411616050035

Download citation

Received: 19 April 2016
Accepted: 28 July 2016
Published: 12 November 2016
Issue Date: September 2016
DOI: https://doi.org/10.3103/S0146411616050035

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A hybrid method for entity hyponymy acquisition in Chinese complex sentences

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Extracting hyponymy of domain entity using Cascaded Conditional Random Fields

Research on Pattern Representation and Reliability in Semi-Supervised Entity Relation Extraction

OntoILPER: an ontology- and inductive logic programming-based system to extract entities and relations from text

References

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

A hybrid method for entity hyponymy acquisition in Chinese complex sentences

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Extracting hyponymy of domain entity using Cascaded Conditional Random Fields

Research on Pattern Representation and Reliability in Semi-Supervised Entity Relation Extraction

OntoILPER: an ontology- and inductive logic programming-based system to extract entities and relations from text

References

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation