Abstract
The analysis of genetic diseases has classically been directed towards establishing direct links between cause, a genetic variation, and effect, the observable deviation of phenotype. For complex diseases which are caused by multiple factors and which show a wide spread of variations in the phenotypes this is unlikely to succeed. One example is the Attention Deficit Hyperactivity Disorder (ADHD), where it is expected that phenotypic variations will be caused by the overlapping effects of several distinct genetic mechanisms. The classical statistical models to cope with overlapping subgroups are mixture models, essentially convex combinations of density functions, which allow inference of descriptive models from data as well as the deduction of groups. An extension of conventional mixtures with attractive properties for clustering is the context-specific independence (CSI) framework. CSI allows for an automatic adaption of model complexity to avoid overfitting and yields a highly descriptive model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Y. BARASH and N. FRIEDMAN (2002): Context-specific Bayesian clustering for gene ex-pression data. J Comput Biol, 9,169-91.
C. BIERNACKI, G. CELEUX and G. GOVAERT (1999): An improvement of the NEC crite-rion for assessing the number of clusters in a mixture model. Non-Linear Anal., 20,267-272.
E. H. Jr. COOK, M. A. STEIN, M. D. KRASOWSKI, N. J. COX, D. M. OLKON, J. E. KIEFFER and B. L. LEVENTHAL (1995): Association of attention-deficit disorder and the dopamine transporter gene. Am. J. Hum. Genet., 56,993-998.
A. DEMPSTER and N. LAIRD and D. RUBIN (1977): Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 1-38.
N. FRIEDMAN (1998): The Bayesian Structural EM Algorithm. Proceedings of the Four-teenth Conference on Uncertainty in Artificial Intelligence,129-138.
B. GEORGI and A. SCHLIEP (2006): Context-specific Independence Mixture Modeling for Positional Weight Matrices. Bioinformatics, 22, 166-73.
M. GILL, G. DALY, S. HERON, Z. HAWI and M. FITGERALD (1997): Confirmation of association between attention deficit hyperactivity disorder and a dopamine transporter polymorphism. Molec. Psychiat, 2, 311-313.
F. C. LUFT (2000): Can complex genetic diseases be solved ? J Mol Med, 78, 469-71.
G.J. MCLACHLAN and D. PEEL (2000): Finite Mixture Models. John Wiley & Sons J. SWANSON, J. OOSTERLAAN, M. MURIAS, S. SCHUCK, P. FLODMAN, M. A. SPENCE, M. WASDELL,Y. DING, H. C. CHI, M. SMITH, M. MANN, C. CARLSON, J. L. KENNEDY, J. A. SERGEANT, P. LEUNG, Y. P. ZHANG,A. SADEH, C. CHEN, C. K. WHALEN, K. A. BABB, R. MOYZIS and M. I. POSNER (2000b): Attention deficit/hyperactivity disorder children with a 7-repeat allele of the dopamine receptor D4 gene have extreme behavior but normal performance on critical neuropsychological tests of attention. Proc Natl Acad Sci U S A, 97,4754-4759.
J. SWANSON, P. FLODMAN, J. L. KENNEDY, M. A. SPENCE, R. MOYZIS, S. SCHUCK, M. MURIAS, J. MORIARITY, C. BARR, M. SMITH and M. POSNER (2000a): Dopamine genes and ADHD. Neurosci Biobehav Rev, 24, 21-25.
T. J. WOODRUFF, D. A. AXELRAD, A. D. KYLE, O. NWEKE, G. G. MILLER and B. J. HURLEY (2004): Trends in environmentally related childhood illnesses. Pediatrics, 113, 1133-40.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Georgi, B., Spence, M.A., Flodman, P., Schliep, A. (2008). Mixture Model Based Group Inference in Fused Genotype and Phenotype Data. In: Preisach, C., Burkhardt, H., Schmidt-Thieme, L., Decker, R. (eds) Data Analysis, Machine Learning and Applications. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78246-9_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-78246-9_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78239-1
Online ISBN: 978-3-540-78246-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)