Abstract
In this paper we introduce a novel push and pull technique to analyze pedigree data. We present the Push and Pull Pedigree Analyzer (PPPA), which organizes large and complex pedigrees and supports the investigation of how genetic diseases develop. PPPA receives as input a pedigree (ancestry information) of different families. For each person, the pedigree contains information about the occurrence of a specific genetic disease. We propose a new solution that arranges and visualizes the individuals of the pedigree based on the relationships between them and on the disease information. PPPA starts with random positions for the individuals, and iteratively pushes apart non-relatives with opposite disease patterns and pulls together relatives with identical disease patterns. The goal is a visualization that groups families with homogeneous disease patterns.
We investigate our solution experimentally with genetic data from people from South Tyrol, Italy. We show that the algorithm converges independently of the number of individuals n and of the complexity of the relationships. The runtime of the algorithm is super-linear with respect to n, and its space complexity is linear with respect to n. The visual analysis of the method confirms that our push and pull technique successfully deals with large and complex pedigrees.
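The push and pull iteration described above can be illustrated with a minimal sketch. It is not the PPPA implementation: the abstract does not specify the force law, the step size, or the stopping rule, so the names `push_pull_layout`, `step`, and `push_radius`, the 2D layout, the boolean disease status, and the fixed iteration count are all assumptions made for illustration.

```python
import math
import random


def push_pull_layout(status, relatives, iterations=200, step=0.05,
                     push_radius=1.0, seed=0):
    """Illustrative push and pull layout (a sketch, not the PPPA algorithm).

    status:    dict mapping a person id to a disease status (True/False).
    relatives: set of frozenset({a, b}) pairs that are related.
    Returns a dict mapping each person id to an [x, y] position.
    """
    rng = random.Random(seed)
    # Start from random positions, as the abstract describes.
    pos = {p: [rng.random(), rng.random()] for p in status}
    ids = sorted(status)

    for _ in range(iterations):
        for i, a in enumerate(ids):
            for b in ids[i + 1:]:
                related = frozenset((a, b)) in relatives
                same = status[a] == status[b]
                dx = pos[b][0] - pos[a][0]
                dy = pos[b][1] - pos[a][1]
                dist = math.hypot(dx, dy)
                if dist == 0.0:
                    continue  # coincident points: no direction to move along

                if related and same:
                    # Pull: each point closes a fraction `step` of the gap
                    # (assumed force law).
                    mx, my = step * dx, step * dy
                elif not related and not same and dist < push_radius:
                    # Push: each point moves `step` away along the unit
                    # vector, but only while closer than `push_radius`
                    # (assumed, to keep the layout bounded).
                    mx, my = -step * dx / dist, -step * dy / dist
                else:
                    continue

                pos[a][0] += mx
                pos[a][1] += my
                pos[b][0] -= mx
                pos[b][1] -= my
    return pos
```

The double loop over all pairs makes this sketch quadratic in the number of individuals per iteration, which is in line with the super-linear runtime stated above, although the actual cost model of PPPA may differ. With two small families, one affected and one unaffected, relatives end up closer to each other than to members of the other family.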
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mazeika, A., Petersons, J., Böhlen, M.H. (2006). PPPA: Push and Pull Pedigree Analyzer for Large and Complex Pedigree Databases. In: Manolopoulos, Y., Pokorný, J., Sellis, T.K. (eds) Advances in Databases and Information Systems. ADBIS 2006. Lecture Notes in Computer Science, vol 4152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11827252_26
DOI: https://doi.org/10.1007/11827252_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37899-0
Online ISBN: 978-3-540-37900-3
eBook Packages: Computer Science (R0)