An Investigation of an Interontologia: Comparison of the Thousand-Character Text and Roget’s Thesaurus

  • Conference paper
Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy (ICCPOL 2009)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5459)

Abstract

This study presents a lexical category analysis of the Thousand-Character Text and Roget’s Thesaurus. Both were preprocessed and built into databases. For easier analysis and more efficient research, we also developed a system that searches Roget’s Thesaurus for the categories corresponding to the Chinese characters in the Thousand-Character Text. According to the results, most of the 39 sections of Roget’s Thesaurus, except the ‘Creative Thought’ section, were relevant to Chinese characters in the Thousand-Character Text. Three sections, ‘Space in General’, ‘Dimensions’ and ‘Matter in General’, have higher mapping rates. The correlation coefficient is around 0.94, indicating high category relevance at the section level between the Thousand-Character Text and Roget’s Thesaurus.
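As a rough illustration of the two section-level quantities mentioned above, the following Python sketch computes a per-section mapping rate and a Pearson correlation coefficient. The abstract does not state how either figure was calculated, so everything here is an assumption: the mapping rate is taken to be mapped categories divided by total categories in a section, the correlation is taken between the two count series, and the section labels and counts are invented placeholders rather than data from the paper.

```python
from math import sqrt


def section_mapping_rates(counts):
    """Return mapped/total for each section.

    `counts` maps a section label to (total_categories, mapped_categories).
    This definition of "mapping rate" is an assumption, not the paper's.
    """
    return {
        name: (mapped / total if total else 0.0)
        for name, (total, mapped) in counts.items()
    }


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical placeholder counts: (total categories, mapped categories).
    toy_counts = {
        "section_a": (20, 15),
        "section_b": (35, 21),
        "section_c": (12, 3),
        "section_d": (50, 40),
    }
    rates = section_mapping_rates(toy_counts)
    totals = [total for total, _ in toy_counts.values()]
    mapped = [m for _, m in toy_counts.values()]
    for name, rate in rates.items():
        print(f"{name}: mapping rate {rate:.2f}")
    print(f"section-level correlation: {pearson(totals, mapped):.3f}")
```

With real per-section counts in place of the placeholders, a coefficient close to 1 would correspond to the kind of section-level agreement the abstract reports.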

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kim, SR., Yang, JG., Bae, JH.J. (2009). An Investigation of an Interontologia: Comparison of the Thousand-Character Text and Roget’s Thesaurus. In: Li, W., Mollá-Aliod, D. (eds) Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy. ICCPOL 2009. Lecture Notes in Computer Science, vol 5459. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00831-3_40

  • DOI: https://doi.org/10.1007/978-3-642-00831-3_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00830-6

  • Online ISBN: 978-3-642-00831-3

  • eBook Packages: Computer Science, Computer Science (R0)
