Abstract
The Genetic Linkage Analysis of SNP (Single Nucleotide Polymorphism) markers permits the discovery of genetic correlations in complex diseases following their transmission through family generations. However, all major algorithms proposed in the literature require high computational power and memory availability, making large data sets very hard to analyze on a single CPU. A facility for achieving a Whole-Genome Linkage Analysis has been set up as a web application upon a highly distributed infrastructure: the EGEE Grid. Test cases have been run with 10,000 to one million SNPs per Chip and, after validation, the application has been effectively used for a study on cardiac conduction disorders.
Similar content being viewed by others
References
Abecasis, G.R., Cherny, S.S., Cookson, W.O., Cardon, L.R.: Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat. Genet. 30, 97–101 (2002)
Lander, E.S., Kruglyak, L.: Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results. Nat. Genet. 11, 241–247 (1995)
Elston, R.C., Stewart, J.: A general model for the genetic analysis of pedigree data. Hum. Hered. 21(6), 523–542 (1971)
Lander, E.S., Green, P.: Construction of multilocus genetic linkage maps in humans. Proc. Natl. Acad. Sci. USA 84(8), 2363–2367 (1987)
Fishelson, M., Dovgolevsky, N., Geiger, D.: Maximum likelihood haplotyping for general pedigrees. Hum. Hered. 59, 41–60 (2005)
Kruglyak, L., Daly, M.J., Reeve-Daly, M.P., Lander, E.S.: Parametric and nonparametric linkage analysis: a unified multipoint approach. Am. J. Hum. Genet. 58, 1347–1363 (1996)
Kurbasic, A., Hössjer, O.: A general method for linkage disequilibrium correction for multipoint linkage and association. Genet. Epidemiol. 32, 647–657 (2008)
Markianos, K., Daly, M.J., Kruglyak, L.: Efficient multipoint linkage analysis through reduction of inheritance space. Am. J. Hum. Genet. 68(4), 963–977 (2001)
Piccolboni, A., Gusfield, D.: On the complexity of fundamental computational problems in pedigree analysis. J. Comput. Biol. 10(5), 763–773 (2003). doi:10.1089/106652703322539088
Trombetti, G.A., Bonnal, R.J.P., Rizzi, E., De Bellis, G., Milanesi, L.: Data handling strategies for high throughput pyrosequencers. BMC Bioinformatics 8(1), S22 (2007). doi:10.1186/1471-2105-8-S1-S22
Waldo, A.L., Feld, G.K.: Inter-relationships of atrial fibrillation and atrial flutter mechanisms and clinical implications. J. Am. Coll. Cardiol. 51(8), 779–786 (2008)
Andrade, J., et al.: The use of Grid computing to drive data-intensive genetic research. Eur. J. Hum. Genet. 15, 694–702 (2007)
Hernández-Sánchez, J., Grunchec, J.A., Knott, S.: A web application to perform linkage disequilibrium and linkage analyses on a computational Grid. Bioinformatics 25, 1377–1383 (2009)
EGEE project homepage: http://www.eu-egee.org/
gLite-3 user’s guide: https://edms.cern.ch/file/722398//gLite-3-UserGuide.html
Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the Grid: enabling scalable virtual organizations. Int. J. Supercomput. Appl. 15, 3 (2001)
Foster, I., Kesselman, C.: The Grid: Blueprint for a New Computing Infrastructure, 2nd ed. Kaufman, San Francisco (2003)
Alonso, J.M., Ferrero, J.M. Jr., Hernández, V., Moltó, G., Saiz, J., Trénor, B.: A Grid computing-based approach for the acceleration of simulations in cardiology. IEEE Trans. Inf. Technol. Biomed. 12(2), 138–144 (2008)
Jacq, N., Salzemann, J., Jacq, F., Legré, Y., Medernach, E., Montagnat, J., Maaß, A., Reichstadt, M., Schwichtenberg, H., Sridhar, M., Kasam, V., Zimmermann, M., Hofmann, M., Breton, V.: Grid-enabled virtual screening against malaria. J. Grid Computing 6, 29–43 (2008)
Korkhov, V.V., Moscicki, J.T., Krzhizhanovskaya, V.V.: Dynamic workload balancing of parallel applications with user-level scheduling on the Grid. Future Gener. Comput. Syst. 25(1), 28–34 (2009)
Germain-Renaud, C., Loomis, C., Moscicki, J.T., Texier, R.: Scheduling for responsive Grids. J. Grid Computing 6, 15–27 (2008)
Maeno, T., Pan, D.A.: Distributed production and distributed analysis system for ATLAS. J. Phys. Conf. Ser. 119, 062036 (2008)
Lee, H.C., Salzemann, J., Jacq, N., Chen, H.Y., Ho, L.Y., Morelli, I., Milanesi, L., Breton, V., Lin, S.C., Wu, Y.T.: Grid-enabled high-throughput in silico screening against influenza A neuraminidase. IEEE Trans. Nanobioscience 5(4), 288–295 (2006)
Pugliese, A., Talia, D., Yahyapour, R.: Modeling and supporting Grid scheduling. J. Grid Computing 6(2), 195–213 (2007)
McClatchey, R., Anjum, A., Stockinger, H., Ali, A., Willers, I., Thomas, M.: Data intensive and network aware (DIANA) Grid scheduling. J. Grid Computing 5(1), 43–64 (2007)
Rood, B., Lewis, M.J.: Grid resource availability prediction-base scheduling and task replication. J. Grid Computing 7, 479–500 (2009)
Christodoulopoulos, K., Gkamas, V., Varvarigos, E.: Statistical analysis and modeling of jobs in a Grid environment. J. Grid Computing 6, 77–101 (2008)
Ibarra, O.H., Kim, C.E.: Heuristic algorithms for scheduling independent tasks on nonidentical processors. J. ACM 24(2), 280–289 (1977)
Fernandez-Baca, D.: Allocating modules to processors in a distributed system. IEEE Trans. Softw. Eng. 15(11), 1427–1436 (1989)
Braun, T.D., Siegel, H.J., Beck, N., Boloni, L.L., Maheswaran, M., Reuther, A.I., Robertson, J.P., Theys, M.D., Yao, B., Hensgen, D., Freund, R.F.: A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems. J. Parallel Distrib. Comput. 61(6), 810–837 (2001)
Sfiligoi, I.: glideInWMS: a generic pilot-based workload management system. J. Phys.: Conf. Ser. 119, 9 (2008)
Cecchi, M., Capannini, F., Dorigo, A., Ghiselli, A., Giacomini, F., Maraschini, A., Marzolla, M., Monforte, S., Pacini, F., Petronzio, L., Prelz, F.: The gLite Workload Management System, in Advances in Grid and Pervasive Computing, pp 256–268. Springer, New York (2009)
Rahman, R., Barker, K., Alhajj, R.: Replica placement strategies in data Grid. J. Grid Computing 6, 103–123 (2008)
Dix, A.J., Finlay, J.E., Abowd, G.D., Beale, R.: Human-Computer Interaction, 3rd ed. Prentice-Hall, London (2003)
Trombetti, G.A.: Enabling computationally intensive bioinformatics applications on the Grid platform. Ph.D. Thesis, DEIS Department, Università degli Studi di Bologna. Available at http://amsdottorato.cib.unibo.it/922/ (2008)
EDGeS project homepage: www.edges-grid.eu
Sfiligoi, I., Koeroo, O., Venekamp, G., Yocum, D., Groep, D., Petravick, D.: Addressing the pilot security problem with gLExec. J. Phys.: Conf. Ser. 119, 052029 (2008). doi:10.1088/1742-6596/119/5/052029
Floros, E., Loomis, C.: Interactive and real-time applications on the EGEE Grid infrastructure. In: Remote Instrumentation and Virtual Laboratories, vol. 3, pp. 263–273. Springer (2010). doi:10.1007/978-1-4419-5597-5_22
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Calabria, A., Di Pasquale, D., Gnocchi, M. et al. Grid Based Genome Wide Studies on Atrial Flutter. J Grid Computing 8, 511–527 (2010). https://doi.org/10.1007/s10723-010-9163-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10723-010-9163-y