Skip to main content

A Visual Interface for on-the-fly Biological Database Integration and Workflow Design Using VizBuilder

  • Conference paper
Data Integration in the Life Sciences (DILS 2009)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 5647))

Included in the following conference series:

Abstract

Data integration plays a major role in modern Life Sciences research primarily because required resources are geographically distributed across continents and experts depend upon leveraging these digitally archived resources. The ever changing and exponentially growing digital archives pose a significant challenge for traditional data integration efforts and efficient development of data processing pipelines for scientific investigations. Thus a forceful argument can be made that in Life Sciences fast and ad hoc application design requiring data integration and workflow processing is urgently warranted. In this paper, we present a new visual application design toolkit, called VizBuilder, that can be used to design ad hoc applications at throw away costs by naive users in the Life Sciences to integrate remote databases and to design application pipelines. We leverage the recent development of BioFlow query language and LifeDB database management system that allows for such application design. We show that VizBuilder can be used as a front end query interface for LifeDB, with which robust and a wide range of applications can be developed at a very abstract and conceptual level without having to learn BioFlow. Users are able to express their application by drawing it using VizBuilder icons and connecting them in a meaningful way. Once completed, VizBuilder can compile and transform the visual application into an error-free BioFlow program to be executed in LifeDB. We present the functionality of VizBuilder using a real life application that our Life Sciences researchers have implemented and executed in LifeDB as part of their investigation.

This research was partially supported by National Science Foundation grants CNS 0521454 and IIS 0612203.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. JUNG - Java Universal Network/Graph Framework, http://jung.sourceforge.net

  2. Microsoft Robotics, http://msdn.microsoft.com/en-us/library/bb483088.aspx

  3. Web Services Description Language (WSDL) Version 2.0, http://www.w3.org/TR/wsdl20/

  4. XAML - Extensible Application Markup Language, http://msdn.microsoft.com/en-us/library/ms752059.aspx

  5. Altintas, I., Berkley, C., Jaeger, E., Jones, M., Ludäscher, B., Mock, S.: Kepler: An extensible system for design and execution of scientific workflows. In: SSDBM, pp. 21–23 (2004)

    Google Scholar 

  6. Amin, M.S., Jamil, H.: FastWrap: An Efficient wrapper for Tabular Data Extraction from the Web. In: Tenth IEEE International Conference on Information Reuse and Integration, Las Vegas, Nevada (August 2009)

    Google Scholar 

  7. Bhattacharjee, A., Jamil, H.: OntoMatch: A Monotonically Improving Schema Matching System for Autonomous Data Integration. In: Tenth IEEE International Conference on Information Reuse and Integration, Las vegas, Nevada (August 2009)

    Google Scholar 

  8. Boulakia, S.C., Biton, O., Davidson, S.B., Froidevaux, C.: Bioguidesrs: querying multiple sources with a user-centric perspective. Bioinformatics 23(10), 1301–1303 (2007)

    Article  Google Scholar 

  9. Bowers, S., Ludäscher, B., Ngu, A.H.H., Critchlow, T.: Enabling scientificworkflow reuse through structured composition of dataflow and control-flow. In: Proceedings of the 22nd International Conference on Data Engineering Workshops, Washington, DC, USA, p. 70. IEEE Computer Society Press, Los Alamitos (2006)

    Google Scholar 

  10. Catarci, T., Costabile, M.F., Levialdi, S., Batini, C.: Visual query systems for databases: A survey. Journal of Visual Languages & Computing 8(2), 215–260 (1997)

    Article  Google Scholar 

  11. Dogru, S., Rajan, V., Rieck, K., Slagle, J.R., Tjan, B.S., Wang, Y.: A graphical data flow language for retrieval, analysis, and visualisation of a scientific database. Journal of Visual Languages and Computing, 247–265 (1996)

    Google Scholar 

  12. Dotan, D., Pinter, R.Y.: HyperFlow: An integrated visual query and dataflow language for end-user information analysis. In: Proceedings of the IEEE Symposium on Visual Languages and Human-Centric Computing, Washington, DC, USA, pp. 27–34 (2005)

    Google Scholar 

  13. Ehrig, K., Ermel, C., Hänsgen, S., Taentzer, G.: Generation of visual editors as eclipse plug-ins. In: Proceedings of the 20th IEEE/ACM international Conference on Automated software engineering, pp. 134–143 (2005)

    Google Scholar 

  14. Gil, Y., Deelman, E., Ellisman, M., Fahringer, T., Fox, G., Gannon, D., Goble, C., Livny, M., Moreau, L., Myers, J.: Examining the challenges of scientific workflows. Computer 40(12), 24–32 (2007)

    Article  Google Scholar 

  15. Gusfield, D., Stoye, J.: Relationships between p63 binding, dna sequence, transcription activity, and biological function in human cells. Mol. Cell 24(4), 593–602 (2006)

    Article  Google Scholar 

  16. Hossain, S., Jamil, H.: VizBuilder – a generic editor for visual languages. Technical report, Computer Science, Wayne State University (December 2008)

    Google Scholar 

  17. Jamil, H., El-Hajj-Diab, B.: Bioflow: A web-based declarative workflow language for life sciences. In: IEEE International Workshop on Scientific Workflow, July 2008, pp. 453–460 (2008)

    Google Scholar 

  18. Jamil, H., Islam, A.: The power of declarative languages:a comparative exposition of scientific workflow design using BioFlow and Taverna. In: IEEE International Workshop on Scientific Workflow, Los Angeles, CA (2009)

    Google Scholar 

  19. Kesselman, C., Foster, I.: The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann Publishers, San Francisco (1998)

    Google Scholar 

  20. Ludäscher, B., Altintas, I., Berkley, C., Higgins, D., Jaeger, E., Jones, M., Lee, E.A., Tao, J., Zhao, Y.: Scientific workflow management and the kepler system. Concurrency and Computation: Practice & Experience 18(10), 1039–1065 (2006)

    Article  Google Scholar 

  21. McPhillips, T., Bowers, S., Zinn, D., Ludäscher, B.: Scientific workflow design for mere mortals. In: Future Generation Computer Systems (2008)

    Google Scholar 

  22. Oinn, T., Addis, M., Ferris, J., Marvin, D., Senger, M., Greenwood, M., Carver, T., Glover, K., Pocock, M., Wipat, A., Li, P.: Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20(17), 3045–3054 (2004)

    Article  CAS  PubMed  Google Scholar 

  23. Paton, N.M.N.: Kaleidoquery: A visual query language for object databases. In: Advanced Visual Interfaces, pp. 247–257. ACM Press, New York (1998)

    Google Scholar 

  24. Plake, C., Schiemann, T., Pankalla, M., Hakenberg, J., Leser, U.: Alibaba: Pubmed as a graph. Bioinformatics 22(19), 2444–2445 (2006)

    Article  CAS  PubMed  Google Scholar 

  25. Santucci, G., Sottile, P.A.: Query by diagram: a visual environment for querying databases. Software - Practice and Experience 23(3), 317–340 (1993)

    Article  Google Scholar 

  26. Siau, K., Chan, H., Tan, K.: Visual knowledge query language as a front-end to relational systems. In: Proceedings of the Fifteenth Annual International on Computer Software and Applications, September 1991, pp. 373–378 (1991)

    Google Scholar 

  27. Sismanis, Y., Brown, P., Haas, P.J., Reinwald, B.: GORDIAN: efficient and scalable discovery of composite keys. In: VLDB, pp. 691–702 (2006)

    Google Scholar 

  28. Taylor, I., Shields, M., Wang, I., Harrison, A.: The Triana Workflow Environment: Architecture and Applications. In: Taylor, I., Deelman, E., Gannon, D., Shields, M. (eds.) Workflows for e-Science, pp. 320–339. Springer, New York (2007)

    Chapter  Google Scholar 

  29. Tolvanen, J.-P., Rossi, M.: Metaedit+: defining and using domain-specific modeling languages and code generators. In: Companion of the 18th annual ACM SIGPLAN Conference on Object-oriented programming, systems, languages, and applications, pp. 92–93 (2003)

    Google Scholar 

  30. Vangheluwe, H., de Lara, J.: Computer automated multi-paradigm modelling for analysis and design of traffic networks. In: Proceedings of the 36th conference on Winter simulation, pp. 249–258 (2004)

    Google Scholar 

  31. Zhang, D.-Q., Zhang, K.: VisPro: A visual language generation toolset. In: Proceedings of the IEEE Symposium on Visual Languages, Washington, DC, p. 195 (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hossain, S., Jamil, H. (2009). A Visual Interface for on-the-fly Biological Database Integration and Workflow Design Using VizBuilder. In: Paton, N.W., Missier, P., Hedeler, C. (eds) Data Integration in the Life Sciences. DILS 2009. Lecture Notes in Computer Science(), vol 5647. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02879-3_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02879-3_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02878-6

  • Online ISBN: 978-3-642-02879-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics