Skip to main content

An Approach to Acquire Semantic Relationships Between Words from Web Document

  • Conference paper
Advances in Web-Based Learning – ICWL 2005 (ICWL 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3583))

Included in the following conference series:

  • 898 Accesses

Abstract

In this paper, we focus on the semantic relationships acquisition from Chinese web documents motivated by the large requirement of web question answering system in e-Learning. With our scheme, we dwindle in numbers of text to be analyzed and obtain initial sentence-level text in pre-process phase. Then linguistic rules, which are broken down into unambiguous and ambiguous, designed for Chinese phrases are applied to these sentence-level text to extract the synonymy relationship, hyponymy relationship, hypernymy relationship and parataxis relationship. Lastly, candidates are refined using two heuristics. Compared to other previous works, we apply not only strict unambiguous linguistic rules but also loose ambiguous linguistic rules to extract relationships and proposed efficient approach to refine the outputs of these rules. Experiments show that this method can acquire semantic relationships efficiently and effectively.

Funding for this work was provided by NSF grant 60373105 and 60473136.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Sun, X., Zheng, Q.: Semantics-based Answers Selection in Question Answering System. In: Liu, W., Shi, Y., Li, Q. (eds.) ICWL 2004. LNCS, vol. 3143, pp. 354–362. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  2. Girju, R., Badulescu, A., Moldovan, D.: Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations. In: Proceedings of HLT-NAACL (2003)

    Google Scholar 

  3. Gildea, D., Jurafsky, D.: Automatically Labeling Semantic classes. In: Proceedings of Annual Conference of the Association for Computational Linguistics, ACL (2004)

    Google Scholar 

  4. Pantel, P., Lin, D.: Discovering Word Senses from Text. In: Proceedings of ACM Conference on Knowledge Discovery and Data Mining, SIGKDD (2002)

    Google Scholar 

  5. Matthew, B., Eugene, C.: Finding Parts in Very Large Corpora. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, Maryland (1999)

    Google Scholar 

  6. Li, Y., He, Q., Shi, Z.: Association Retrieve Based On Concept Semantic Space. Journal of University of Science and Technology, Beijing (2001)

    Google Scholar 

  7. WordNet, http://www.cogsci.princeton.edu/~wn/index.shtml

  8. Google, http://www.google.com/

  9. Wu, P., Chen, Q., Ma, L.: The Study on Large Scale Duplicated Web Pages of Chinese Fast Deletion Algorithm Based on String of Feature Code. Journal of Chinese Information Processing, Beijing (2003)

    Google Scholar 

  10. Zheng, Q., Zhang, S.: A Novel Algorithm of Eliminating the Chinese Word Segmentation Ambiguities for Web Answer. Computer Engineering and Applications (2004)

    Google Scholar 

  11. Wang, Z., Zheng, Q.: An Approach of POS Tagging for Web Answer. Computer Engineering and Applications (2004)

    Google Scholar 

  12. Sun, X., Zheng, Q.: A Method of Special Domain Lexicon Construction Based on Raw Materials. Mini-Micro Systems (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sun, X., Zheng, Q., Dang, H., Hu, Y., Bai, H. (2005). An Approach to Acquire Semantic Relationships Between Words from Web Document. In: Lau, R.W.H., Li, Q., Cheung, R., Liu, W. (eds) Advances in Web-Based Learning – ICWL 2005. ICWL 2005. Lecture Notes in Computer Science, vol 3583. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11528043_23

Download citation

  • DOI: https://doi.org/10.1007/11528043_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27895-5

  • Online ISBN: 978-3-540-31716-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics