Abstract
This article investigates the effectiveness of an information inference mechanism on Chinese text. The information inference derives implicit associations via computation of information flow on a high dimensional conceptual space, which is approximated by a cognitively motivated lexical semantic space model, namely Hyperspace Analogue to Language (HAL). A dictionary-based Chinese word segmentation system was used to segment words. To evaluate the Chinese-based information flow model, it is applied to query expansion, in which a set of test queries are expanded automatically via information flow computations and documents are retrieved. Standard recall-precision measures are used to measure performance. Experimental results for TREC-5 Chinese queries and People Daily’s corpus suggest that the Chinese information flow model significantly increases average precision, though the increase is not as high as those achieved using English corpus. Nevertheless, there is justification to believe that the HAL-based information flow model, and in turn our psychologistic stance on the next generation of information processing systems, have a promising degree of language independence.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Barwise, J., Seligman, J.: Information Flow: The Logic of Distributed Systems. Cambridge Tracts in Theoretical Computer Science 44 (1997)
Bruza, P.D., Song, D.: Inferring Query Models by Information Flow Analysis. In: Proceedings of the 11th International ACM Conference on Information and Knowledge Management (CIKM 2002), pp. 260–269 (2002)
Burgess, C., Lund, K.: Parsing Constraints and High-Dimensional Semantic Space. Language and Cognitive Processes 12, 177–210 (1997)
Burgess, C., Livesay, L., Lund, K.: Explorations in Context Space: Words, Sentences, Discourse. In: Foltz, P.W. (ed.) Quantitative Approaches to Semantic Knowledge Representation, vol. 25(2&3), pp. 179–210 (1998)
Gärdenfors, P.: Conceptual Spaces: The Geometry of Thought. MIT Press, Cambridge (2000)
Landauer, T., Dumais, S.: A Solution to Plato.s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review 104(2), 211–240 (1997)
Lund, K., Burgess, C.: Producing High-dimensional Semantic Spaces from Lexical Co-occurrence. Behavior Research Methods. Instruments, & Computers 28(2), 203–208 (1996)
Robertson, S.E., Walker, S., Spark-Jones, K., Hancock-Beaulieu, M.M., Gatford, M.: OKAPI at TREC-3. In: Proceedings of the 3rd Text Retrieval Conference (TREC-3), pp. 109–126 (1994)
Song, D., Bruza, P.D.: Towards context-sensitive information inference. Journal of the American Society for Information Science and Technology 54(4), 321–334 (2003)
Text Retrieval Conference (TREC), National Institution of Standards and Technology( NIST), http://trec.nist.gov/data/
Cheong, P., Song, D., Bruza, P.D., Wong, K.F.: Information Flow Analysis with Chinese Text. In: Su, K.-Y., Tsujii, J., Lee, J.-H., Kwong, O.Y. (eds.) IJCNLP 2004. LNCS (LNAI), vol. 3248, pp. 91–98. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cheong, P., Song, D., Bruza, P., Wong, KF. (2005). Information Flow Analysis with Chinese Text. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-30211-7_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer ScienceComputer Science (R0)