A Construction Method for the Semantic Relation Corpus of Traditional Chinese Medicine

Chen, Jing; Wang, Haitao; Liu, Liangliang; Zhang, Xiaoru; Zhao, Jing; Zhang, Fan; Cao, Xinyu

doi:10.1007/978-3-319-71084-6_65

A Construction Method for the Semantic Relation Corpus of Traditional Chinese Medicine

Jing Chen¹⁸,
Haitao Wang¹⁹,
Liangliang Liu¹⁹,
Xiaoru Zhang¹⁸,
Jing Zhao¹⁹,
Fan Zhang¹⁹ &
…
Xinyu Cao²⁰

Conference paper
First Online: 18 November 2017

2309 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10676))

Abstract

It is difficult to use a search engine to acquire knowledge directly due to the complexity of Web resources. In this paper we proposed a method for semantic relation corpus construction of traditional Chinese medicine based on combination of multiple encyclopedias. For a known conceptual pair, we got some search results by automatically constructing search requests based on URLs’ characteristics of the encyclopedia search engine, and used regular expressions to extract meaningful texts from the search results to form semantic relation corpus. The experiment result shows that the precision and recall are 92.1% and 65.3%, respectively.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Liu, Y., Duan, H., Wang, H., Zhou, Y., Wang, Z., Li, H.: Research on corpus creation and development of Chinese traditional medicine. J. Chin. Inf. Process. 22(4), 24–30 (2008). http://doi.org/10.3969/j.issn.1003-0077.2008.04.004. (in Chinese)
Google Scholar
Roberts, A., Gaizauskas, R., Hepple, M., et al.: Building a semantically annotated corpus of clinical texts. J. Biomed. Inform. 42(5), 950–966 (2009). http://doi.org/10.1016/j.jbi.2008.12.013
Article Google Scholar
Chapman, W.W., Savova, G.K., Zheng, J., et al.: Anaphoric reference in clinical reports: characteristics of an annotated corpus. J. Biomed. Inform. 45(3), 507–521 (2012). http://doi.org/10.1016/j.jbi.2012.01.010
Article Google Scholar
Yu, Q., Cui, M., Liu, L., Liu, J., Liu, H.: Current status of research on database of TCM medical records. China Digit. Med. 8(3), 71–74 (2013). http://doi.org/10.3969/j.issn.1673-7571.2013.03.023. (in Chinese)
Google Scholar
Qu, C., Guan, Y., Yang, J., Zhao, Y., Liu, X.: The construction of annotated corpora of named entities for Chinese electronic medical records. Chin. High Technol. Lett. 02, 143–150 (2015). http://doi.org/10.3772/j.issn.1002-0470.2015.02.005. (in Chinese)
Google Scholar
Feng, L.: Automatic Approaches to Develop Large-scale TCM Electronic Medical Record Corpus for Named Entity Recognition Tasks. BeiJing JiaoTong University (2015). (in Chinese)
Google Scholar
Li, Z., Liu, F., Antieau, L., Cao, Y., Yu, H.: Lancet: a high precision medication event extraction system for clinical text. J. Am. Med. Inform. Assoc. 17(5), 563 (2010). http://doi.org/10.1136/jamia.2010.004077
Article Google Scholar
Collier, N., Mima, H., Ohta, T., Tateisi, Y., Yakushiji, A.: The GENIA project: knowledge acquisition from biology texts. Genome Inform. 11, 448–449 (2001). http://doi.org/10.11234/gi1990.11.448
Google Scholar
Friedman, C., Kra, P., Rzhetsky, A.: Two biomedical sublanguages: a description based on the theories of Zellig Harris. J. Biomed. Inform. 35(4), 222 (2002). http://doi.org/10.1016/S1532-0464(03)00012-1
Article Google Scholar
Yang, J.F., Guan, Y., He, B., Qu, C.Y., Yu, Q.B., Liu, Y.X., Zhao, Y.J.: Corpus construction for named entities and entity relations on Chinese electronic medical records. J. Softw. 27(11), 2725–2746 (2016). (in Chinese)
Google Scholar
Yang, Y.: Demonstrative study of semantic relation in comprehensive clinical terminologies of traditional Chinese Medicine. Chinese Academy of traditional Chinese Medicine (2007). (in Chinese)
Google Scholar
Bai, L., Zhou, Y., Yue, X.: Thoughts and methods of digital informationization of ancient chinese medicine. J. Tradit. Chin. Med. 05, 12 (2009). (in Chinese)
Google Scholar
Zhu, L., Yu, T., Yang, F.: Study on semantic relations discovery based on key verbs in chinese classical medical books. China Digit. Med. 05, 73–75 (2016). http://doi.org/10.3969/j.issn.1673-7571.2016.05.023. (in Chinese)
Google Scholar
Wang, S.: Study on Pathogenesis of Traditional Chinese Medicine Symptoms and Its Relationship Mining. Xiamen University (2009). (in Chinese)
Google Scholar
Yao, Y., Wang, S., Xu, R., Liu, G., Gui, L., Lu, Q., Wang, X.: The construction of an emotion annotated corpus on microblog text. J. Chin. Inf. Process. 05, 83–91 (2014). http://doi.org/10.3969/j.issn.1003-0077.2014.05.011. (in Chinese)
Google Scholar
Han, Z.: Construction of dynamic corpus based on web - a case study of Chinese political news corpus. China Educ. Technol. Equip. 23, 66–68 (2013). http://doi.org/10.3969/j.issn.1671-489X.2013.23.066. (in Chinese)
Google Scholar
Cao, X., Cao, C.: A method for acquiring corpus rich in part-whole relation from the web. J. Chin. Inf. Process. 05, 17–23 (2011). http://doi.org/10.3969/j.issn.1003-0077.2011.05.003. (in Chinese)
Google Scholar
Hu, H., Yao, T.: Sentence alignment of bilingual verbs based on Wikipedia. J. Chin. Inf. Process. 01, 198–203 (2016). (in Chinese)
Google Scholar

Download references

Acknowledgments

This work was supported by National Key R&D Program of China (2016YFF0202806), National Natural Science Foundation of China (81403281), project of China National Institute of Standardization (712016Y-4941, 522016Y-4681).

Author information

Authors and Affiliations

School of Computer Science and Engineering, Jiangsu University of Science and Technology, Zhenjiang, China
Jing Chen & Xiaoru Zhang
China National Institute of Standardization, Beijing, China
Haitao Wang, Liangliang Liu, Jing Zhao & Fan Zhang
Institute of Basic Research in Clinical Medicine, Chinese Academy of Chinese Medical Sciences, Beijing, China
Xinyu Cao

Authors

Jing Chen
View author publications
You can also search for this author in PubMed Google Scholar
Haitao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Liangliang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoru Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Fan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xinyu Cao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinyu Cao .

Editor information

Editors and Affiliations

National Taichung University of Science and Technology, Taichung City, Taiwan
Tien-Chi Huang
City University of Hong Kong, Kowloon, Hong Kong
Rynson Lau
Department of Engineering Science, National Cheng Kung University, Tainan, Taiwan
Yueh-Min Huang
University of Caen Normandie, Caen, France
Marc Spaniol
City University of Hong Kong, Kowloon, Hong Kong
Chun-Hung Yuen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, J. et al. (2017). A Construction Method for the Semantic Relation Corpus of Traditional Chinese Medicine. In: Huang, TC., Lau, R., Huang, YM., Spaniol, M., Yuen, CH. (eds) Emerging Technologies for Education. SETE 2017. Lecture Notes in Computer Science(), vol 10676. Springer, Cham. https://doi.org/10.1007/978-3-319-71084-6_65

Download citation

DOI: https://doi.org/10.1007/978-3-319-71084-6_65
Published: 18 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-71083-9
Online ISBN: 978-3-319-71084-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics