Skip to main content

Corpus Construction for Generating Knowledge Graph of Sichuan Cuisine

  • Conference paper
  • First Online:
Chinese Lexical Semantics (CLSW 2022)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13496))

Included in the following conference series:

Abstract

Sichuan cuisine and its culture play an important role in Chinese food culture. Unfortunately, people can only learn Sichuan cuisine through traditional ways such as the Web and books. To address the problem, we developed the annotation system of Sichuan cuisine and designed the annotation schemes of named entities and entity relations under the guidance of Sichuan cuisine masters. There are 220,000 words in the corpus that was annotated and proofread manually by an annotation system. We constructed a corpus of named entities and entity relations of Sichuan cuisine. It contains 691 kinds of Sichuan cuisine recipes, including 8,683 entities and 8,700 entity relations. The consistency of multiple-around annotation is 0.94 and 0.93, respectively. Based on the corpus, a knowledge graph of Sichuan cuisine was constructed and will be used to develop the tourism QA system in the future.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zhao, J., Liu, K., He, S., et al.: Knowledge Graph, pp. 7–10. Higher Education Press, Beijing (2018)

    Google Scholar 

  2. Amit, S.: Introducing the Knowledge Graph: Things, Not Strings. Official Blog of Google, America (2012)

    Google Scholar 

  3. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., et al. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-76298-0_52

    Chapter  Google Scholar 

  4. Etzioni, O., et al.: Web-scale information extraction in knowitall: (preliminary results). In: Proceedings of the 13th International Conference on World Wide Web. ACM, pp. 100–110 (2004)

    Google Scholar 

  5. Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka, E., Mitchell, T.: Toward an architecture for never-ending language learning. Proc. AAAI Conf. Artif. Intell. 24(1), 1306–1313 (2010)

    Google Scholar 

  6. Suchanek, F.M., Kasneci, G., Weikum, G. YAGO: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706 (2007)

    Google Scholar 

  7. Dong, X.: Knowledge vault: a web-scale approach to probabilistic knowledge fusion. In: Proceedings of the 20th ACM Special Interest Group on Knowledge Discovery and Data Mining. New York, USA, pp. 601–610 (2014)

    Google Scholar 

  8. Byambasuren, O., et al.: Preliminary study on the construction of Chinese medical knowledge graph. J. Chin. Inform. Process. 33(10), 1–9 (2019)

    Google Scholar 

  9. Kemp, C., Tenenbaum, J.B., Griffiths, T.L., Yamada, T., Ueda, N.: Learning systems of concepts with an infinite relational model. In: Proceedings of the 21st National Conference on Artificial Intelligence. Boston, USA, pp. 381–388 (2006)

    Google Scholar 

  10. Belleau, F., Nolin, M.-A., Tourigny, N., Rigault, P., Morissette, J.: Bio2RDF: towards a mashup to build bioinformatics knowledge systems. J. Biomed. Inform. 41(5), 706–716 (2008)

    Article  Google Scholar 

  11. Liu, Y., Li, H.: Survey on domain knowledge graph research. Comput. Syst. Appl. 29(6), 1–12 (2020)

    Google Scholar 

  12. Humphreys, K., et al.: University of sheffield: description of the LaSIE-II system as used for MUC-7. Association for Computational Linguistics (1998)

    Google Scholar 

  13. Guodong, Z., Jie, Z., Jian, S., Dan, S., Chew, L.T.: Recognizing names in biomedical texts: a machine learning approach. Bioinformatics 20(7), 1178–1190 (2004)

    Article  Google Scholar 

  14. Ronan, C., Jason, W., Leon, B., Michael, K., Koray, K., Pavel, K.: Natural Language Processing (Almost) from Scratch. J. Mach. Learn. Res. 12(Aug), 2493–2537 (2011)

    MATH  Google Scholar 

  15. Hripcsak, G., Rothschild, A.S.: Agreement, the f-measure, and reliability in information retrieval. J. Am. Med. Inform. Assoc. 12(3), 296–298 (2005)

    Article  Google Scholar 

Download references

Acknowledgments

This work is supported partly by Science And Technology Bureau Of LeShan Town (No. 21GZD008).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xia Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yang, X., Jing, S., Chen, X., Jin, P. (2023). Corpus Construction for Generating Knowledge Graph of Sichuan Cuisine. In: Su, Q., Xu, G., Yang, X. (eds) Chinese Lexical Semantics. CLSW 2022. Lecture Notes in Computer Science(), vol 13496. Springer, Cham. https://doi.org/10.1007/978-3-031-28956-9_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-28956-9_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-28955-2

  • Online ISBN: 978-3-031-28956-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics