Abstract
The termvisual text analytics describes a class of information analysis techniques and processes that enable knowledge discovery via the use of interactive graphical representations of textual data. These techniques enable discovery and understanding via the recruitment of human visual pattern recognition and spatial reasoning capabilities. Visual text analytics is a subclass of visual data mining / visual analytics, which more generally encompasses analytical techniques that employ visualization of non-physically-based (or “abstract”) data of all types. Text visualization is a key component in visual text analytics. While the term “text visualization” has been used to describe a variety of methods for visualizing both structured and unstructured characteristics of text-based data, it is most closely associated with techniques for depicting the semantic characteristics of the free-text components of documents in large document collections. In contrast with text clustering techniques which serve only to partition text corpora into sets of related items, these so-called semantic mapping methods also typically strive to depict detailed inter- and intra-set similarity structure. Text analytics software typically couples semantic mapping techniques with additional visualization techniques to enable interactive comparison of semantic structure with other characteristics of the information, such as publication date or citation information. In this way, value can be derived from the material in the form of multidimensional relationship patterns existing among the discrete items in the collection. The ultimate goal of these techniques is to enable human understanding and reasoning about the contents of large and complexly related text collections.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American 284(5), 34–43 (2001)
Feldman, R., Dagan, I.: KDT - Knowledge discovery in text. In: Proceedings of the First International Conference on Knowledge Discovery (KDD 1995), Montreal (1995)
Thomas, J.J., Cook, K.A. (eds.): Illuminating the Path: The Research and Development Agenda for Visual Analytics. IEEE Computer Society Press, Los Alamitos (2005)
Eick, S., et al.: WEB-Based Text Visualization. 1997, National Institute of Statistical Sciences: Technical Report 64. p. http://www.niss.org/technicalreports/tr64.pdf
Rohrer, R.M., Ebert, D.S., Sibert, J.L.: The shape of Shakespeare: Visualizing text using implicit surfaces. In: Proceedings of the IEEE Symposium on Information Visualization 1998, North Carolina (1998)
Miller, N.E., et al.: Topic Islands: A wavelet-based text visualization system. In: Proceedings of the 9th IEEE Conference on Visualization VIS 1998, Research Triangle Park, North Carolina (1998)
Lin, X., Soergel, D., Marchionini, D.: A self-organizing semantic map for information retrieval. In: Proceedings of the Fourteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Chicago (1991)
Chalmers, M., Chitson, P.: Bead: Explorations in information visualization. In: Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Copenhagen (1992)
Wise, J.A., et al.: Visualizing the non-visual: Spatial analysis and interaction with information from text documents. In: Proceedings of the IEEE Information Visualization Symposium InfoVis 1995, Atlanta, Georgia (1995)
Kohonen, T., et al.: Self organization of a massive document collection. IEEE Transactions on Neural Networks 11(3), 574–585 (2000)
Doyle, L.: Semantic roadmaps for literature searchers. Journal of the Association for Computing Machinery 8(4), 367–391 (1961)
Sammon, J.W.: A nonlinear mapping for data structure analysis. IEEE Transactions on Computing 18(5), 401–409 (1969)
Crouch, D.: The visual display of information in an information retrieval environment. In: Proceedings of the 9th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Pisa (1986)
York, J., et al.: Clustering and dimensionality reduction in SPIRE. In: Proceedings of the Symposium on Advanced Information Processing and Analysis, AIPA 1995, Tysons Corner, VA (1995)
Salton, G.: Automatic Text Processing. Addison-Wesley, Reading (1989)
Wall, M.E., Rechtsteiner, A., Rocha, L.M.: Singular value decomposition and principal component analysis. In: Berrar, D.P., Dubitzky, W., Granzow, M. (eds.) A Practical Approach to Microarray Data Analysis, pp. 91–109. Kluwer, Norwell (2003)
Deerwester, S.C., et al.: Indexing by latent semantic analysis. Journal of American Society for Information Science 41(6), 391–407 (1990)
Booker, A., et al.: Visualizing text data sets. Computing in Science & Engineering 1(4), 26–35 (1999)
Huang, S., Ward, M., Rundensteiner, E.: Exploration of dimensionality reduction for text visualization. Worcester Polytechnic Institute, Computer Science Department: Technical Report TR-03-14, Worcester (2003)
Chatfield, C., Collins, A.: Introduction to Multivariate Analysis. Chapman & Hall, London (1980)
Kruskal, J.: Multidimensional scaling by optimizing goodness-of-fit to a nonmetric hypothesis. Psychometrika 29, 1–27 (1964); Reprinted in Key Texts in Multidimensional Scaling, In: Davies, P.M., Coxon, A.P.M.(eds.). Heinemann Educational Books, Exeter, N.H., pp. 59–83 (1982)
Risch, J.S., et al.: The STARLIGHT information visualization system. In: Card, S., Mackinlay, J., Shneiderman, B. (eds.) Readings in Information Visualization: Using Vision To Think, pp. 551–560. Morgan Kaufmann, San Francisco (1999)
Hartigan, J.A.W., A., M.: Algorithm AS136: A k-Means Clustering Algorithm. Applied Statistics 28, 100–108 (1979)
Bertin, J.: Semiology of Graphics: Diagrams, Networks, Maps. University of Wisconsin Press, Madison (1983)
Lagus, K., Kaski, S., Kohonen, T.: Mining massive document collections by the WEBSOM method. Information Sciences 163(1-3), 135–156 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Risch, J., Kao, A., Poteet, S.R., Wu, Y.J.J. (2008). Text Visualization for Visual Text Analytics. In: Simoff, S.J., Böhlen, M.H., Mazeika, A. (eds) Visual Data Mining. Lecture Notes in Computer Science, vol 4404. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71080-6_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-71080-6_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71079-0
Online ISBN: 978-3-540-71080-6
eBook Packages: Computer ScienceComputer Science (R0)