Skip to main content

Mixture Model Based Group Inference in Fused Genotype and Phenotype Data

  • Conference paper
Data Analysis, Machine Learning and Applications
  • 6014 Accesses

Abstract

The analysis of genetic diseases has classically been directed towards establishing direct links between cause, a genetic variation, and effect, the observable deviation of phenotype. For complex diseases which are caused by multiple factors and which show a wide spread of variations in the phenotypes this is unlikely to succeed. One example is the Attention Deficit Hyperactivity Disorder (ADHD), where it is expected that phenotypic variations will be caused by the overlapping effects of several distinct genetic mechanisms. The classical statistical models to cope with overlapping subgroups are mixture models, essentially convex combinations of density functions, which allow inference of descriptive models from data as well as the deduction of groups. An extension of conventional mixtures with attractive properties for clustering is the context-specific independence (CSI) framework. CSI allows for an automatic adaption of model complexity to avoid overfitting and yields a highly descriptive model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Y. BARASH and N. FRIEDMAN (2002): Context-specific Bayesian clustering for gene ex-pression data. J Comput Biol, 9,169-91.

    Article  Google Scholar 

  • C. BIERNACKI, G. CELEUX and G. GOVAERT (1999): An improvement of the NEC crite-rion for assessing the number of clusters in a mixture model. Non-Linear Anal., 20,267-272.

    MATH  Google Scholar 

  • E. H. Jr. COOK, M. A. STEIN, M. D. KRASOWSKI, N. J. COX, D. M. OLKON, J. E. KIEFFER and B. L. LEVENTHAL (1995): Association of attention-deficit disorder and the dopamine transporter gene. Am. J. Hum. Genet., 56,993-998.

    Google Scholar 

  • A. DEMPSTER and N. LAIRD and D. RUBIN (1977): Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 1-38.

    Google Scholar 

  • N. FRIEDMAN (1998): The Bayesian Structural EM Algorithm. Proceedings of the Four-teenth Conference on Uncertainty in Artificial Intelligence,129-138.

    Google Scholar 

  • B. GEORGI and A. SCHLIEP (2006): Context-specific Independence Mixture Modeling for Positional Weight Matrices. Bioinformatics, 22, 166-73.

    Article  Google Scholar 

  • M. GILL, G. DALY, S. HERON, Z. HAWI and M. FITGERALD (1997): Confirmation of association between attention deficit hyperactivity disorder and a dopamine transporter polymorphism. Molec. Psychiat, 2, 311-313.

    Article  Google Scholar 

  • F. C. LUFT (2000): Can complex genetic diseases be solved ? J Mol Med, 78, 469-71.

    Article  Google Scholar 

  • G.J. MCLACHLAN and D. PEEL (2000): Finite Mixture Models. John Wiley & Sons J. SWANSON, J. OOSTERLAAN, M. MURIAS, S. SCHUCK, P. FLODMAN, M. A. SPENCE, M. WASDELL,Y. DING, H. C. CHI, M. SMITH, M. MANN, C. CARLSON, J. L. KENNEDY, J. A. SERGEANT, P. LEUNG, Y. P. ZHANG,A. SADEH, C. CHEN, C. K. WHALEN, K. A. BABB, R. MOYZIS and M. I. POSNER (2000b): Attention deficit/hyperactivity disorder children with a 7-repeat allele of the dopamine receptor D4 gene have extreme behavior but normal performance on critical neuropsychological tests of attention. Proc Natl Acad Sci U S A, 97,4754-4759.

    Google Scholar 

  • J. SWANSON, P. FLODMAN, J. L. KENNEDY, M. A. SPENCE, R. MOYZIS, S. SCHUCK, M. MURIAS, J. MORIARITY, C. BARR, M. SMITH and M. POSNER (2000a): Dopamine genes and ADHD. Neurosci Biobehav Rev, 24, 21-25.

    Article  Google Scholar 

  • T. J. WOODRUFF, D. A. AXELRAD, A. D. KYLE, O. NWEKE, G. G. MILLER and B. J. HURLEY (2004): Trends in environmentally related childhood illnesses. Pediatrics, 113, 1133-40.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Georgi, B., Spence, M.A., Flodman, P., Schliep, A. (2008). Mixture Model Based Group Inference in Fused Genotype and Phenotype Data. In: Preisach, C., Burkhardt, H., Schmidt-Thieme, L., Decker, R. (eds) Data Analysis, Machine Learning and Applications. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78246-9_15

Download citation

Publish with us

Policies and ethics