ABSTRACT
UltraScan Solution Modeler (US-SOMO) processes atomic and lower-resolution bead model representations of biological and other macromolecules to compute hydrodynamic parameters, such as sedimentation and diffusion coefficients, relaxation times, and intrinsic viscosity, as well as small-angle scattering curves, all of which contribute to our understanding of molecular structure in solution. Knowledge of the structure of biological macromolecules helps researchers understand their function, as a path to disease prevention and to therapeutics for conditions such as cancer, thrombosis, and Alzheimer's disease. US-SOMO provides a convergence of experimental, computational, and modeling techniques, in which detailed molecular structure and properties are determined from data obtained by a range of experimental techniques that, by themselves, give incomplete information. Our goal in this work is to develop the infrastructure and user interfaces that will enable a wide range of scientists to carry out complicated experimental data analysis techniques on XSEDE. Our user community consists predominantly of biophysics and structural biology researchers. A recent PubMed search reports 9,205 papers in the past decade referencing the techniques we support. We believe our software will give these researchers a convenient and unique framework for refining structures, thus advancing their research.
The computed hydrodynamic parameters and scattering curves are screened against experimental data, effectively pruning potential structures into equivalence classes. Experimental methods may include analytical ultracentrifugation, dynamic light scattering, small-angle X-ray and neutron scattering, NMR, fluorescence spectroscopy, and others. One source of macromolecular models is X-ray crystallography. However, the conformation in solution may not match that observed in the crystal form. Using computational techniques, an initial fixed model can be expanded into a search space by high-temperature molecular dynamics approaches or by stochastic methods such as Brownian dynamics. The number of structures produced can vary greatly, ranging from hundreds to tens of thousands or more. This introduces a number of cyberinfrastructure challenges. Computing hydrodynamic parameters and small-angle scattering curves can be computationally intensive for each structure, and therefore cluster compute resources are essential for timely results. Input and output data sizes can vary greatly, from less than 1 MB to 2 GB or more. Although the parallelization is trivial, compute requirements vary as widely as data sizes, ranging from one to potentially thousands of cores, with compute times of minutes to hours.
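The screening step described above can be illustrated with a minimal sketch. It is not US-SOMO's implementation: the function names (`chi2_per_point`, `screen`), the goodness-of-fit measure (mean squared normalized residual), the cutoff value, and the toy curves are all assumptions made for illustration. Each candidate structure is scored independently, which is why the work divides trivially across workers. A thread pool stands in here for the cluster cores that a real run would use.

```python
from concurrent.futures import ThreadPoolExecutor
from math import fsum


def chi2_per_point(model, experiment, sigma):
    """Mean squared residual, normalized by the experimental error (hypothetical metric)."""
    if not (len(model) == len(experiment) == len(sigma)):
        raise ValueError("curves must have the same number of points")
    total = fsum(((m - e) / s) ** 2 for m, e, s in zip(model, experiment, sigma))
    return total / len(experiment)


def screen(candidates, experiment, sigma, cutoff, workers=4):
    """Return the sorted names of candidates whose computed curve fits within `cutoff`.

    Every candidate is scored on its own, so the map below has no
    dependencies between tasks (the trivially parallel case).
    """
    names = list(candidates)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        scores = list(
            pool.map(
                lambda name: chi2_per_point(candidates[name], experiment, sigma),
                names,
            )
        )
    return sorted(name for name, score in zip(names, scores) if score <= cutoff)


# Toy data for illustration only; these are not real scattering curves.
experiment = [10.0, 8.0, 5.0, 2.0]
sigma = [0.5, 0.5, 0.5, 0.5]
candidates = {
    "model_a": [10.1, 7.9, 5.2, 2.1],
    "model_b": [12.0, 10.0, 7.0, 4.0],
    "model_c": [9.8, 8.3, 4.9, 1.8],
}

accepted = screen(candidates, experiment, sigma, cutoff=1.0)
print(accepted)
```

With these toy values, `model_b` deviates from the experimental curve by four error bars at every point and is pruned, while the other two candidates pass the assumed cutoff.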
In addition to the distributed computing infrastructure challenges, an important concern was how to let a user conveniently submit jobs, monitor them, and retrieve results from within the C++/Qt GUI application while maintaining a method for authentication, approval, and registered publication usage throttling. Middleware supporting these design goals has been integrated into the application with assistance from the Open Gateway Computing Environments (OGCE) collaboration team. The approach was tested on various XSEDE clusters and local compute resources. This paper reviews current US-SOMO functionality and implementation, with a focus on the newly deployed cluster integration.