Skip to main content

AVEDA: Statistical Tests for Finding Interesting Visualisations

  • Conference paper
Book cover Knowledge-Based and Intelligent Information and Engineering Systems (KES 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5711))

  • 859 Accesses

Abstract

Visualisation is usually one of the first steps in handling any data analysis problem. Visualisations are an intuitive way to discover inconsistencies, outliers, dependencies, interesting patterns and peculiarities in the data. However, due to modern computer technology, a vast number of visualisation techniques is available nowadays. Even if only simple scatterplots, plotting pairs of variables against each other, are considered, the number of scatterplots is too large for high-dimensional data to visually inspect each scatterplot. In this paper, we propose a system architecture called AVEDA (Automatic Visual Exploratory Data Analysis) which computes a large number of visualisations, filters out those ones that might contain special patterns and shows only these interesting visualisations to the user. The filtering process for the visualisations is based on statistical tests and statistical measures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Tukey, J.W.: Exploratory Data Analysis. Addison-Wesley, Reading (1977)

    MATH  Google Scholar 

  2. Borg, I., Groenen, P.: Modern Multidimensional Scaling: Theory and Applications. Springer, Berlin (1997)

    Book  MATH  Google Scholar 

  3. Jolliffe, I.: Principal Component Analysis. Springer, New York (2002)

    MATH  Google Scholar 

  4. Soukup, T., Davidson, I.: Visual Data Mining: Techniques and Tools for Data Visualization and Mining. Wiley, New York (2002)

    Google Scholar 

  5. Morrison, A., Ross, G., Chalmers, M.: Fast multidimensional scaling through sampling, springs and interpolation. Information Visualization 2 (2003)

    Google Scholar 

  6. Rehm, F., Klawonn, F., Kruse, R.: MDS polar – a new approach for dimension reduction to visualize high dimensional data. In: Famili, A.F., Kook, J.N., Peña, J.M., Siebes, A., Feelders, A. (eds.) IDA 2005. LNCS, vol. 3646, pp. 316–327. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  7. Lowe, D., Tipping, M.: Feed-forward neural networks topographic mapping for exploratory data analysis. Neural Computing and Applications 4, 83–95 (1996)

    Article  Google Scholar 

  8. Scholz, M., Kaplan, F., Guy, C., Kopka, J., Selbig, J.: Non-linear pca: A missing data approach. Bioinformatics 21, 3887–3895 (2005)

    Article  Google Scholar 

  9. Kolodyazhniy, V., Klawonn, F., Tschumitschew, K.: Neuro-fuzzy model for dimensionality reduction and its application. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 15, 571–593 (2007)

    Article  MATH  Google Scholar 

  10. Friedman, J., Tukey, J.: A projection pursuit algorithm for exploratory data analysis. IEEE Transactions on Computers C-23, 881–890 (1974)

    Article  MATH  Google Scholar 

  11. Diaconis, P., Freedman, D.: Asymptotics of graphical projection pursuit. The Annals of Statistics 17, 793–815 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  12. Huber, P.: Projection pursuit. The Annals of Statistics 13, 435–475 (1985)

    Article  MathSciNet  MATH  Google Scholar 

  13. Friedman, J.: Exploratory projection pursuit. Journal of the American Statistical Assoc. 82, 249–266 (1987)

    Article  MathSciNet  MATH  Google Scholar 

  14. Hall, P.: On polynomial-based projection indices for exploratory projection pursuit. The Annals of Statistics 17, 589–605 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  15. Cook, D., Buja, A., Cabrera, J.: Projection pursuit indices based on orthonormal function expansion. Journal of Computational and Graphical Statistics 2, 225–250 (1993)

    Article  MathSciNet  Google Scholar 

  16. Posse, C.: Projection pursuit exploratory data analysis. Computational Statistics and Data Analysis 20, 669–687 (1995)

    Article  MathSciNet  MATH  Google Scholar 

  17. Shaffer, J.P.: Multiple hypothesis testing. Ann. Rev. Psych 46, 561–584 (1995)

    Article  Google Scholar 

  18. Holm, S.: A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics 6, 65–70 (1979)

    MathSciNet  MATH  Google Scholar 

  19. Hopkins, B.: A new method of determining the type of distribution of plant individuals. Annals of Botany 18, 213–226 (1954)

    Google Scholar 

  20. Leban, G., Bratko, I., Petrovic, U., Curk, T., Zupan, B.: VizRank: Finding informative data projections in functional genomics by machine learning. Bioinformatics 21, 413–414 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tschumitschew, K., Klawonn, F. (2009). AVEDA: Statistical Tests for Finding Interesting Visualisations. In: Velásquez, J.D., Ríos, S.A., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2009. Lecture Notes in Computer Science(), vol 5711. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04595-0_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04595-0_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04594-3

  • Online ISBN: 978-3-642-04595-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics