Skip to main content

Sociomapping in Text Retrieval Systems

  • Conference paper
Flexible Query Answering Systems (FQAS 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5822))

Included in the following conference series:

Abstract

This contribution describes clustering of most informative keywords within full-text query results and its visualization in 2D or 3D space using so-called sociomapping. The main goal of the clustering is to help user with orientation in the term space and with the reformulating – more detail specification – of ambiguous queries. Test data were obtained from web search engines like Yahoo!, Google etc. To be able to evaluate quality of the used clustering method we have used 2 metrics to compare it with manually classified collection (Reuters Corpus Volume 1), moreover, the quality of preserving mutual distances of clusters from original multi-dimensional space was measured by Spearman’s rank correlation coefficient.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Liu, Y., Zhang, B., Chen, Z., Lyu, M.R., Ma, W.-Y.: Affinity Rank: A New Scheme for Efficient Web Search. In: WWW 2004, May 17–22. ACM, New York (2004), 1-58113-912-8/04/0005

    Google Scholar 

  2. Jenček, P., Höschl, C., Hudeček, J., Šebesta, J.: Mail Archiver Project, Software project documentation. Charles University, Prague (2009)

    Google Scholar 

  3. Höschl, C.: Sociomaps Visualisation, bachelor thesis, Charles University, Prague (in czech language) (2006), http://www.hoschl.cz/cyril/bakalarka/errata.pdf

  4. Salton, G., Buckley, C.: Term weighting approaches in automatic text retrieval. Information Processing and Management 24(5), 513–523 (1988)

    Article  Google Scholar 

  5. MacQueen, J.B.: Some Methods for classification and Analysis of Multivariate Observations. In: Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press, Berkeley (1967)

    Google Scholar 

  6. Wang, D., Yu, G., Bao, Y., Zhang, M.: An Optimized K-Means Algorithm of Reducing Cluster Intra-dissimilarity for Document of Clustering Advances. In: Fan, W., Wu, Z., Yang, J. (eds.) WAIM 2005. LNCS, vol. 3739, pp. 785–790. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  7. Dunn, J.C.: A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters. Journal of Cybernetics 3, 32–57 (1973)

    Article  MATH  MathSciNet  Google Scholar 

  8. Bahbouh, R., Warrenfeltz, R.: The Application of Sociomapping to Executive Team Development, http://www.hoganassessments.com/_hoganweb/documents/ApplicationOfSociomapping.pdf

  9. Spearman, C.: The proof and measurement of association between two things. Amer. J. Psychol. 15, 72–101 (1904)

    Article  Google Scholar 

  10. Borg, I., Groenen, P.: Modern Multidimensional Scaling: theory and applications, 2nd edn. Springer, New York (2005)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jenček, P., Vojtáš, P., Kopecký, M., Hösch, C. (2009). Sociomapping in Text Retrieval Systems. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds) Flexible Query Answering Systems. FQAS 2009. Lecture Notes in Computer Science(), vol 5822. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04957-6_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04957-6_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04956-9

  • Online ISBN: 978-3-642-04957-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics