Skip to main content
Log in

Improved chromosome classification using monotonic functions of mahalanobis distance and the transportation method

  • Articles
  • Published:
Zeitschrift für Operations Research Aims and scope Submit manuscript

Abstract

It is shown that the accuracy of chromosome classification constrained by class size can be improved over previously reported results by a combination of straightforward modifications to previously used methods. These are (i) the use of the logarithm of the Mahalanobis distance of an unknown chromosome's feature vector to estimated class mean vectors as the basis of the transportation method objective function, rather than the estimated likelihood; (ii) the use of all available features and full estimated covariance to compute the Mahalanobis distance, rather than a subset of features and the diagonal (variance) terms only; (iii) a modification to the way the transportation model deals with the constraint on the number of sex chromosomes in a metaphase cell; and (iv) the use of a newly discovered heuristic to weight off-diagonal elements of the covariance; this proved to be particularly valuable in cases where relatively few training examples were available to estimate covariance. The methods have been verified using 5 different sets of chromosome data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Anderson TW (1984) An introduction to multivariate statistical analysis, Second Edition. Wiley Series in Probability and Mathematical Statistics. Wiley

  2. Duda RO, Hart PE (1973) Pattern classification and scene analysis. Wiley New York

    Google Scholar 

  3. Efron B, Tibshirani RJ (1993) An introduction to the bootstrap. Chapman and Hall New York

    Google Scholar 

  4. Errington P, Graham J (1993) Neural network chromosome classification. Cytometry 14:627–639

    Google Scholar 

  5. Fahrmeir L, Hamerle A (eds) (1984) Multivariate statistische Verfahren. De Gruyter Berlin New York

    Google Scholar 

  6. Goodman MM (1972) Distance analysis in biology. Systematic Zoology 21(2): 174–186, reprinted in: Atchley WR, Bryant EH (eds) (1975) Multivariate statistical methods. Among-Group Covariation, Benchmark Papers in Systematic and Evolutionary Biology 1:377–389, Dowden Hutchinson & Ross

    Google Scholar 

  7. Granum E (1982) Application of statistical and syntactical methods of analysis and classification to chromosome data. In: Kittler J, Ku KS, Pau LF (eds) Pattern recognition theory and applications. NATO ASI (Oxford 1981) Reidel Dordrecht 373–398

    Google Scholar 

  8. Granum E, Thomason M (1990) Automatically inferred Markov network models for classification of chromosomal band pattern structures. Cytometry 11:26–39

    Google Scholar 

  9. Greon FCA, ten Kate TK, Smeulders AWM, Young IT (1989) Human chromosome classification based on local band descriptors. Pattern Recognition Letters 9:211–222

    Google Scholar 

  10. Habbema JDF (1979) Statistical methods for classification of human chromosomes. Biometrics 35:103–118

    Google Scholar 

  11. ISCN (1985) An international system for human cytogenetic nomenclature. Karger S Basel

    Google Scholar 

  12. Kirby SPJ, Theobald CM, Piper J, Carothers A (1991) Some methods of combining class information in multivariate normal discrimination for the classification of human chromosomes. Statistics in Medicine 10:141–149

    Google Scholar 

  13. Kleinschmidt P, Mitterreiter I, Rank C (1994) A hybrid method for automatic chromosome karyotyping. Pattern Recognition Letters 15:87–96

    Google Scholar 

  14. Lundsteen C, Gerdes T, Maahr J (1986) Automatic classification of chromosomes as part of a routine system for clinical analysis. Cytometry 7:1–7

    Google Scholar 

  15. Lundsteen C, Lind A-M, Granum E (1976) Visual classification of banded human chromosomes. I. Karyotyping compared with classification of isolated chromosomes. Annals of Human Genetics 40:87–97

    Google Scholar 

  16. Paton K (1969) Automatic chromosome identification by the maximum-likelihood method. Annals of Human Genetics 33:177–184

    Google Scholar 

  17. Piper J (1986) Classification of chromosomes constrained by expected class size pattern recognition letters 4:391–395

    Google Scholar 

  18. Piper J (1987) The effect of zero feature correlation assumption on maximum likelihood based classification of chromosomes. Signal Processing 12:49–57

    Google Scholar 

  19. Piper J (1990) A new approach to constrained maximum likelihood chromosome classification. Research Note, MRC Human Genetics Unit Edinburgh

    Google Scholar 

  20. Piper J (1982) Variability and bias in experimentally measured classifier error rates. Pattern Recognition Letters 13:685–692

    Google Scholar 

  21. Piper J, Breckon G (1989) An automated system for karyotyping mouse chromosomes. Cytogenetics and Cell Genetics 50:111–115

    Google Scholar 

  22. Piper J, Granum E (1989) On fully automatic feature measurement for banded chromosome classification. Cytometry 10:242–255

    Google Scholar 

  23. Piper J, Nickolls P, McLaren W, Rutovitz D, Chisholm A, Johnstone I (1981) The effect of digital image filtering on the performance of an automatic chromosome classifier. Signal Processing 4:361–373

    Google Scholar 

  24. Rohlf FJ (1971) Perspectives on the application of multivariate statistics to taxonomy. Taxon 20(1): 85–90. Reprinted in: Atchley WR, Bryant EH (eds) (1975) Multivariate statistical methods, among-group covariation. Benchmark Papers in Systematic and Evolutionary Biology 1:6–11, Dowden Hutchinson & Ross

    Google Scholar 

  25. Slot RE (1979) On the profit of taking into account the known number of objects per class in classification methods. IEEE Transactions on Information Theory IT-25:484–488

    Google Scholar 

  26. Sweeney W Classification of chromosomes using probabilistic neural network. Cytometry, submitted

  27. Theobald CM, Kirby SPJ Discrimination using covariance selection models for the automated allocation of human chromosomes. Submitted

  28. Tso M, Graham J (1983) The transportation algorithm as an aid to chromosome classification. Pattern Recognition Letters 1:489–496

    Google Scholar 

  29. Tso M, Kleinschmidt P, Mitterreiter I, Graham J (1991) An efficient transportation algorithm for automatic chromosome karyotyping. Pattern Recognition Letters 12:117–126

    Google Scholar 

  30. Zimmermann SO, Johnston DA, Arrighi FE, Rupp ME (1986) Automated homologue matching of human G-banded chromosomes. Computers in Biology and Medicine 16:223–233

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kleinschmidt, P., Mitterreiter, I. & Piper, J. Improved chromosome classification using monotonic functions of mahalanobis distance and the transportation method. ZOR - Methods and Models of Operations Research 40, 305–323 (1994). https://doi.org/10.1007/BF01432971

Download citation

  • Received:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01432971

Keywords

Navigation