Abstract
Data integration plays a major role in modern Life Sciences research primarily because required resources are geographically distributed across continents and experts depend upon leveraging these digitally archived resources. The ever changing and exponentially growing digital archives pose a significant challenge for traditional data integration efforts and efficient development of data processing pipelines for scientific investigations. Thus a forceful argument can be made that in Life Sciences fast and ad hoc application design requiring data integration and workflow processing is urgently warranted. In this paper, we present a new visual application design toolkit, called VizBuilder, that can be used to design ad hoc applications at throw away costs by naive users in the Life Sciences to integrate remote databases and to design application pipelines. We leverage the recent development of BioFlow query language and LifeDB database management system that allows for such application design. We show that VizBuilder can be used as a front end query interface for LifeDB, with which robust and a wide range of applications can be developed at a very abstract and conceptual level without having to learn BioFlow. Users are able to express their application by drawing it using VizBuilder icons and connecting them in a meaningful way. Once completed, VizBuilder can compile and transform the visual application into an error-free BioFlow program to be executed in LifeDB. We present the functionality of VizBuilder using a real life application that our Life Sciences researchers have implemented and executed in LifeDB as part of their investigation.
This research was partially supported by National Science Foundation grants CNS 0521454 and IIS 0612203.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
JUNG - Java Universal Network/Graph Framework, http://jung.sourceforge.net
Microsoft Robotics, http://msdn.microsoft.com/en-us/library/bb483088.aspx
Web Services Description Language (WSDL) Version 2.0, http://www.w3.org/TR/wsdl20/
XAML - Extensible Application Markup Language, http://msdn.microsoft.com/en-us/library/ms752059.aspx
Altintas, I., Berkley, C., Jaeger, E., Jones, M., Ludäscher, B., Mock, S.: Kepler: An extensible system for design and execution of scientific workflows. In: SSDBM, pp. 21–23 (2004)
Amin, M.S., Jamil, H.: FastWrap: An Efficient wrapper for Tabular Data Extraction from the Web. In: Tenth IEEE International Conference on Information Reuse and Integration, Las Vegas, Nevada (August 2009)
Bhattacharjee, A., Jamil, H.: OntoMatch: A Monotonically Improving Schema Matching System for Autonomous Data Integration. In: Tenth IEEE International Conference on Information Reuse and Integration, Las vegas, Nevada (August 2009)
Boulakia, S.C., Biton, O., Davidson, S.B., Froidevaux, C.: Bioguidesrs: querying multiple sources with a user-centric perspective. Bioinformatics 23(10), 1301–1303 (2007)
Bowers, S., Ludäscher, B., Ngu, A.H.H., Critchlow, T.: Enabling scientificworkflow reuse through structured composition of dataflow and control-flow. In: Proceedings of the 22nd International Conference on Data Engineering Workshops, Washington, DC, USA, p. 70. IEEE Computer Society Press, Los Alamitos (2006)
Catarci, T., Costabile, M.F., Levialdi, S., Batini, C.: Visual query systems for databases: A survey. Journal of Visual Languages & Computing 8(2), 215–260 (1997)
Dogru, S., Rajan, V., Rieck, K., Slagle, J.R., Tjan, B.S., Wang, Y.: A graphical data flow language for retrieval, analysis, and visualisation of a scientific database. Journal of Visual Languages and Computing, 247–265 (1996)
Dotan, D., Pinter, R.Y.: HyperFlow: An integrated visual query and dataflow language for end-user information analysis. In: Proceedings of the IEEE Symposium on Visual Languages and Human-Centric Computing, Washington, DC, USA, pp. 27–34 (2005)
Ehrig, K., Ermel, C., Hänsgen, S., Taentzer, G.: Generation of visual editors as eclipse plug-ins. In: Proceedings of the 20th IEEE/ACM international Conference on Automated software engineering, pp. 134–143 (2005)
Gil, Y., Deelman, E., Ellisman, M., Fahringer, T., Fox, G., Gannon, D., Goble, C., Livny, M., Moreau, L., Myers, J.: Examining the challenges of scientific workflows. Computer 40(12), 24–32 (2007)
Gusfield, D., Stoye, J.: Relationships between p63 binding, dna sequence, transcription activity, and biological function in human cells. Mol. Cell 24(4), 593–602 (2006)
Hossain, S., Jamil, H.: VizBuilder – a generic editor for visual languages. Technical report, Computer Science, Wayne State University (December 2008)
Jamil, H., El-Hajj-Diab, B.: Bioflow: A web-based declarative workflow language for life sciences. In: IEEE International Workshop on Scientific Workflow, July 2008, pp. 453–460 (2008)
Jamil, H., Islam, A.: The power of declarative languages:a comparative exposition of scientific workflow design using BioFlow and Taverna. In: IEEE International Workshop on Scientific Workflow, Los Angeles, CA (2009)
Kesselman, C., Foster, I.: The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann Publishers, San Francisco (1998)
Ludäscher, B., Altintas, I., Berkley, C., Higgins, D., Jaeger, E., Jones, M., Lee, E.A., Tao, J., Zhao, Y.: Scientific workflow management and the kepler system. Concurrency and Computation: Practice & Experience 18(10), 1039–1065 (2006)
McPhillips, T., Bowers, S., Zinn, D., Ludäscher, B.: Scientific workflow design for mere mortals. In: Future Generation Computer Systems (2008)
Oinn, T., Addis, M., Ferris, J., Marvin, D., Senger, M., Greenwood, M., Carver, T., Glover, K., Pocock, M., Wipat, A., Li, P.: Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20(17), 3045–3054 (2004)
Paton, N.M.N.: Kaleidoquery: A visual query language for object databases. In: Advanced Visual Interfaces, pp. 247–257. ACM Press, New York (1998)
Plake, C., Schiemann, T., Pankalla, M., Hakenberg, J., Leser, U.: Alibaba: Pubmed as a graph. Bioinformatics 22(19), 2444–2445 (2006)
Santucci, G., Sottile, P.A.: Query by diagram: a visual environment for querying databases. Software - Practice and Experience 23(3), 317–340 (1993)
Siau, K., Chan, H., Tan, K.: Visual knowledge query language as a front-end to relational systems. In: Proceedings of the Fifteenth Annual International on Computer Software and Applications, September 1991, pp. 373–378 (1991)
Sismanis, Y., Brown, P., Haas, P.J., Reinwald, B.: GORDIAN: efficient and scalable discovery of composite keys. In: VLDB, pp. 691–702 (2006)
Taylor, I., Shields, M., Wang, I., Harrison, A.: The Triana Workflow Environment: Architecture and Applications. In: Taylor, I., Deelman, E., Gannon, D., Shields, M. (eds.) Workflows for e-Science, pp. 320–339. Springer, New York (2007)
Tolvanen, J.-P., Rossi, M.: Metaedit+: defining and using domain-specific modeling languages and code generators. In: Companion of the 18th annual ACM SIGPLAN Conference on Object-oriented programming, systems, languages, and applications, pp. 92–93 (2003)
Vangheluwe, H., de Lara, J.: Computer automated multi-paradigm modelling for analysis and design of traffic networks. In: Proceedings of the 36th conference on Winter simulation, pp. 249–258 (2004)
Zhang, D.-Q., Zhang, K.: VisPro: A visual language generation toolset. In: Proceedings of the IEEE Symposium on Visual Languages, Washington, DC, p. 195 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hossain, S., Jamil, H. (2009). A Visual Interface for on-the-fly Biological Database Integration and Workflow Design Using VizBuilder. In: Paton, N.W., Missier, P., Hedeler, C. (eds) Data Integration in the Life Sciences. DILS 2009. Lecture Notes in Computer Science(), vol 5647. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02879-3_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-02879-3_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02878-6
Online ISBN: 978-3-642-02879-3
eBook Packages: Computer ScienceComputer Science (R0)