Skip to main content

Visualisation and Exploration of Scientific Data Using Graphs

  • Chapter
Data Mining

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3755))

  • 3367 Accesses

Abstract

We present a prototype application for graph-based exploration and mining of online databases, with particular emphasis on scientific data. The application builds structured graphs that allow the user to explore patterns in a data set, including clusters, trends, outliers, and relationships. A number of different graphs can be rapidly generated, giving complementary insights into a given data set. The application has a Flash-based graphical interface and uses semantic information from the data sources to keep this interface as intuitive as possible. Data can be accessed from local and remote databases and files. Graphs can be explored using an interactive visual browser, or graph-analytic algorithms. We demonstrate the approach using marine sediment data, and show that differences in benthic species compositions in two Antarctic bays are related to heavy metal contamination.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Washio, T., Motoda, H.: State of the art graph-based data mining. SIGKDD Explorations: Newsletter of the ACM Special Interest Group on Knowledge Discovery & Data Mining 5(1), 59–68 (2003)

    Google Scholar 

  2. Kuramochi, M., Desphande, M., Karypis, G.: Mining Scientific Datasets Using Graphs. In: Kargupta, H., Joshi, A., Sivakumar, K., Yesha, Y. (eds.) Next Generation Data Mining, pp. 315–334. MIT/AAAI Press (2003)

    Google Scholar 

  3. Brieger, R.L.: The analysis of social networks. In: Hardy, M., Bryman, A. (eds.) Handbook of Data Analysis, pp. 505–526. SAGE Publications, London (2004)

    Google Scholar 

  4. Lusseau, D., Newman, M.E.J.: Identifying the role that individual animals play in their social networks. Proceedings of the Royal Society of London B 271, S477–S481 (2004)

    Article  Google Scholar 

  5. Luczkovich, J.J., Borgatti, S.P., Johnson, J.C., Everett, M.G.: Defining and measuring trophic role similarity in food webs using regular equivalence. Journal of Theoretical Biology 220(3), 303–321 (2003)

    Article  MathSciNet  Google Scholar 

  6. Yook, S.-H., Oltavai, Z.N., Barabási, A.-L.: Functional and topological characterization of protein interaction networks. Proteomics 4, 928–942 (2004)

    Article  Google Scholar 

  7. De Raedt, L., Kramer, S.: The level wise version space algorithm and its application to molecular fragment finding. In: Proceedings of the Seventeenth International Joint Conference on Articial Intelligence, pp. 853–862. Morgan Kaufmann, San Francisco (2001)

    Google Scholar 

  8. Comiso, J.: Bootstrap sea ice concentrations for NIMBUS-7, SMMR and DMSP SSM/ National Snow and Ice Data Center, I. Boulder, CO, USA (1999, updated 2002)

    Google Scholar 

  9. Global Biodiversity Information Facility, http://www.gbif.net

  10. Swayne, D.F., Buja, A., Temple Lang, D.: Exploratory visual analysis of graphs in GGobi. In: Proceedings of the 3rd International Workshop on Distributed Statistical Computing, Vienna (2003)

    Google Scholar 

  11. Adar, E., Tyler, J.R.: Zoomgraph, http://www.hpl.hp.com/research/idl/projects/graphs/

  12. Winter, A., Kullbach, B., Riediger, V.: An overview of the GXL graph exchange language. In: Diehl, S. (ed.) Dagstuhl Seminar 2001. LNCS, vol. 2269, pp. 324–336. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  13. Shapiro, A.: Touchgraph, http://www.touchgraph.com

  14. Cook, D.J., Holder, L.B.: Graph-based data mining. IEEE Intelligent Systems 15(2), 32–41 (2000)

    Article  Google Scholar 

  15. Kuramochi, M., Karypis, G.: Finding frequent patterns in a large sparse graph. In: Berry, M.W., Dayal, U., Kamath, C., Skillicorn, D.B. (eds.) Proceedings of the Fourth SIAM International Conference on Data Mining, Florida, USA. SIAM, Philadelphia (2004)

    Google Scholar 

  16. Cortes, C., Pregibon, D., Volinsky, C.: Computational methods for dynamic graphs. J. Computational and Graphical Statistics 12, 950–970 (2003)

    Article  MathSciNet  Google Scholar 

  17. Inokuchi, A., Washio, T., Motoda, H.: Complete mining of frequent patterns from graphs: mining graph data. Machine Learning 50, 321–354 (2003)

    Article  MATH  Google Scholar 

  18. Yan, X., Han, J.: CloseGraph: Mining closed frequent graph patterns. In: Getoor, L., Senator, T.E., Domingos, P., Faloutsos, C. (eds.) Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 286–295. ACM, Washington (2003)

    Chapter  Google Scholar 

  19. Quigley, A., Eades, P.: FADE: graph drawing, clustering, and visual abstraction. In: Marks, J. (ed.) GD 2000. LNCS, vol. 1984, pp. 197–210. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  20. Shekhar, S., Lu, C.T., Zhang, P.: Detecting graph-based spatial outliers: algorithms and applications (a summary of results). In: Provost, F., Srikant, R. (eds.) Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 371–376 (2001)

    Google Scholar 

  21. Noble, C.C., Cook, D.J.: Graph-based anomaly detection. In: Getoor, L., Senator, T.E., Domingos, P., Faloutsos, C. (eds.) Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 631–636. ACM, Washington (2003)

    Chapter  Google Scholar 

  22. Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA 99, 7821–7826 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  23. Drossel, B., McKane, A.J.: Modelling food webs. In: Bornholdt, S., Schuster, H.G. (eds.) Handbook of Graphs and Networks: From the Genome to the Internet, pp. 218–247. Wiley-VCH, Berlin (2003)

    Google Scholar 

  24. Moody, J.: Peer influence groups: identifying dense clusters in large networks. Social Networks 23, 216–283 (2001)

    Article  Google Scholar 

  25. Stark, J.S., Riddle, M.J., Snape, I., Scouller, R.C.: Human impacts in Antarctic marine soft-sediment assemblages: correlations between multivariate biological patterns and environmental variables at Casey Station. Estuarine, Coastal and Shelf Science 56, 717–734 (2003)

    Article  Google Scholar 

  26. Abello, J., Korn, J.: MGV: a system for visualizing massive multi-digraphs. IEEE Transactions on Visualization and Computer Graphics 8, 21–38 (2002)

    Article  Google Scholar 

  27. Wills, G.J.: NicheWorks — interactive visualization of very large graphs. J. Computational and Graphical Statistics 8(2), 190–212 (1999)

    Article  MathSciNet  Google Scholar 

  28. Batagelj, V., Mrvar, A.: Pajek - Program for Large Network Analysis, http://vlado.fmf.uni-lj.si/pub/networks/pajek/

  29. Borgatti, S., Chase, R.: UCINET: social network analysis software, http://www.analytictech.com/ucinet.htm

  30. Bongiovanni, B., Choplin, S., Lalande, J.F., Syska, M., Verhoeven, Y.: Mascotte Optimization project, http://www-sop.inria.fr/mascotte/mascopt/index.html

  31. White, S., O’Madadhain, J., Fisher, D., Boey, Y.-B.: Java Universal Network/Graph Framework, http://jung.sourceforge.net

  32. Auber, D.: Tulip — A Huge Graph Visualization Framework, http://www.tulip-software.org/

  33. Adai, A.T., Date, S.V., Wieland, S., Marcotte, E.M.: LGL: creating a map of protein function with an algorithm for visualizing very large biological networks. Journal of Molecular Biology 340(1), 179–190 (2004)

    Article  Google Scholar 

  34. Ellson, J., North, S.: Graphviz - Graph Visualization Software, http://www.graphviz.org/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Raymond, B., Belbin, L. (2006). Visualisation and Exploration of Scientific Data Using Graphs. In: Williams, G.J., Simoff, S.J. (eds) Data Mining. Lecture Notes in Computer Science(), vol 3755. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11677437_2

Download citation

  • DOI: https://doi.org/10.1007/11677437_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32547-5

  • Online ISBN: 978-3-540-32548-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics