Abstract
In this paper, we focus on the semantic relationships acquisition from Chinese web documents motivated by the large requirement of web question answering system in e-Learning. With our scheme, we dwindle in numbers of text to be analyzed and obtain initial sentence-level text in pre-process phase. Then linguistic rules, which are broken down into unambiguous and ambiguous, designed for Chinese phrases are applied to these sentence-level text to extract the synonymy relationship, hyponymy relationship, hypernymy relationship and parataxis relationship. Lastly, candidates are refined using two heuristics. Compared to other previous works, we apply not only strict unambiguous linguistic rules but also loose ambiguous linguistic rules to extract relationships and proposed efficient approach to refine the outputs of these rules. Experiments show that this method can acquire semantic relationships efficiently and effectively.
Funding for this work was provided by NSF grant 60373105 and 60473136.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Sun, X., Zheng, Q.: Semantics-based Answers Selection in Question Answering System. In: Liu, W., Shi, Y., Li, Q. (eds.) ICWL 2004. LNCS, vol. 3143, pp. 354–362. Springer, Heidelberg (2004)
Girju, R., Badulescu, A., Moldovan, D.: Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations. In: Proceedings of HLT-NAACL (2003)
Gildea, D., Jurafsky, D.: Automatically Labeling Semantic classes. In: Proceedings of Annual Conference of the Association for Computational Linguistics, ACL (2004)
Pantel, P., Lin, D.: Discovering Word Senses from Text. In: Proceedings of ACM Conference on Knowledge Discovery and Data Mining, SIGKDD (2002)
Matthew, B., Eugene, C.: Finding Parts in Very Large Corpora. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, Maryland (1999)
Li, Y., He, Q., Shi, Z.: Association Retrieve Based On Concept Semantic Space. Journal of University of Science and Technology, Beijing (2001)
Google, http://www.google.com/
Wu, P., Chen, Q., Ma, L.: The Study on Large Scale Duplicated Web Pages of Chinese Fast Deletion Algorithm Based on String of Feature Code. Journal of Chinese Information Processing, Beijing (2003)
Zheng, Q., Zhang, S.: A Novel Algorithm of Eliminating the Chinese Word Segmentation Ambiguities for Web Answer. Computer Engineering and Applications (2004)
Wang, Z., Zheng, Q.: An Approach of POS Tagging for Web Answer. Computer Engineering and Applications (2004)
Sun, X., Zheng, Q.: A Method of Special Domain Lexicon Construction Based on Raw Materials. Mini-Micro Systems (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sun, X., Zheng, Q., Dang, H., Hu, Y., Bai, H. (2005). An Approach to Acquire Semantic Relationships Between Words from Web Document. In: Lau, R.W.H., Li, Q., Cheung, R., Liu, W. (eds) Advances in Web-Based Learning – ICWL 2005. ICWL 2005. Lecture Notes in Computer Science, vol 3583. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11528043_23
Download citation
DOI: https://doi.org/10.1007/11528043_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27895-5
Online ISBN: 978-3-540-31716-6
eBook Packages: Computer ScienceComputer Science (R0)