Abstract
Background. DNA amplifications and deletions characterize cancer genome and are often related to disease evolution. Microarray based techniques for measuring these DNA copy-number changes use fluorescence ratios at arrayed DNA elements (BACs, cDNA or oligonucleotides) to provide signals at high resolution, in terms of genomic locations. These data are then further analyzed to map aberrations and boundaries and identify biologically significant structures.
Methods. We develop a statistical framework that enables the casting of several DNA copy number data analysis questions as optimization problems over real valued vectors of signals. The simplest form of the optimization problem seeks to maximize \(\varphi (I) = \sum v_i/\sqrt{|I|}\) over all subintervals I in the input vector. We present and prove a linear time approximation scheme for this problem. Namely, a process with time complexity O(nε − 2) that outputs an interval for which ϕ(I) is at least Opt/α(ε), where Opt is the actual optimum and α(ε) → 1 as ε → 0. We further develop practical implementations that improve the performance of the naive quadratic approach by orders of magnitude. We discuss properties of optimal intervals and how they apply to the algorithm performance.
Examples. We benchmark our algorithms on synthetic as well as publicly available DNA copy number data. We demonstrate the use of these methods for identifying aberrations in single samples as well as common alterations in fixed sets and subsets of breast cancer samples.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Balsara, B.R., Testa, J.R.: Chromosomal imbalances in human lung cancer. Oncogene 21(45), 6877–6883 (2002)
Barrett, M.T., Scheffer, A., Ben-Dor, A., Sampas, N., Lipson, D., Kincaid, R., Tsang, P., Curry, B., Baird, K., Meltzer, P.S., Yakhini, Z., Bruhn, L., Laderman, S.: Comparative genomic hybridization using oligonucleotide microarrays and total genomic DNA. PNAS 101(51), 17765–17770 (2004)
Bignell, G., Huang, J., Greshock, J., Watt, S., Butler, A., West, S., Grigorova, M., Jones, K., Wei, W., Stratton, M., Futreal, P., Weber, B., Shapero, M., Wooster, R.: High-resolution analysis of DNA copy number using oligonucleotide microarrays. Genome Research 14(2), 287–295 (2004)
Brennan, C., Zhang, Y., Leo, C., Feng, B., Cauwels, C., Aguirre, A.J., Kim, M., Protopopov, A., Chin, L.: High-resolution global profiling of genomic alterations with long oligonucleotide microarray. Cancer Research 64(14), 4744–4748 (2004)
DeGroot, M.H.: Probability and Statistics, ch. 5.7, p. 275. Addison-Wesley, Reading (1989)
Sebat, J., et al.: Large-scale copy number polymorphism in the human genome. Science 305(5683), 525–528 (2004)
Feller, W.: An Introduction to Probability Theory and Its Applications, ch. VII.6, vol. I, p. 193. John Wiley & Sons, Chichester (1970)
Fu, K.S., Mui, J.K.: A survey of image segmentation. Pattern Recognition 13(1), 3–16 (1981)
Himberg, J., Korpiaho, K., Mannila, H., Tikanmki, J., Toivonen, H.T.T.: Time series segmentation for context recognition in mobile devices. In: Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM 2001), pp. 203–210 (2001)
Hyman, E., Kauraniemi, P., Hautaniemi, S., Wolf, M., Mousses, S., Rozenblum, E., Ringner, M., Sauter, G., Monni, O., Elkahloun, A., Kallioniemi, O.P., Kallioniemi, A.: Impact of DNA amplification on gene expression patterns in breast cancer. Cancer Research 62, 6240–6245 (2002)
Kallioniemi, O.P., Kallioniemi, A., Sudar, D., Rutovitz, D., Gray, J.W., Waldman, F., Pinkel, D.: Comparative genomic hybridization: a rapid new method for detecting and mapping DNA amplification in tumors. Semin Cancer Biol. 4(1), 41–46 (1993)
Lipson, D., Ben-Dor, A., Dehan, E., Yakhini, Z.: Joint analysis of DNA copy numbers and expression levels. In: Jonassen, I., Kim, J. (eds.) WABI 2004. LNCS (LNBI), vol. 3240, pp. 135–146. Springer, Heidelberg (2004)
Mertens, F., Johansson, B., Hoglund, M., Mitelman, F.: Chromosomal imbalance maps of malignant solid tumors: a cytogenetic survey of 3185 neoplasms. Can. Res. 57, 2765–2780 (1997)
Morley, M., Molony, C., Weber, T., Devlin, J., Ewens, K., Spielman, R., Cheung, V.: Genetic analysis of genome-wide variation in human gene expression. Nature 430(7001), 743–747 (2004)
Hupe, P., Stransky, N., Thiery, J.P., Radvanyi, F., Barillot, E.: Analysis of array CGH data: from signal ratio to gain and loss of DNA regions. Bioinformatics (2004) (Epub ahead of print)
Pinkel, D., Segraves, R., Sudar, D., Clark, S., Poole, I., Kowbel, D., Collins, C., Kuo, W.L., Chen, C., Zhai, Y., Dairkee, S.H., Ljung, B.M., Gray, J.W., Albertson, D.G.: High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays. Nature Genetics 20(2), 207–211 (1998)
Platzer, P., Upender, M.B., Wilson, K., Willis, J., Lutterbaugh, J., Nosrati, A., Willson, J.K., Mack, D., Ried, T., Markowitz, S.: Silence of chromosomal amplifications in colon cancer. Cancer Research 62(4), 1134–1138 (2002)
Pollack, J.R., Perou, C.M., Alizadeh, A.A., Eisen, M.B., Pergamenschikov, A., Williams, C.F., Jeffrey, S.S., Botstein, D., Brown, P.O.: Genome-wide analysis of DNA copy-number changes using cDNA microarrays. Nature Genetics 23(1), 41–46 (1999)
Pollack, J.R., Sorlie, T., Perou, C.M., Rees, C.A., Jeffrey, S.S., Lonning, P.E., Tibshirani, R., Botstein, D., Borresen-Dale, A., Brown, P.O.: Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors. PNAS 99(20), 12963–12968 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lipson, D., Aumann, Y., Ben-Dor, A., Linial, N., Yakhini, Z. (2005). Efficient Calculation of Interval Scores for DNA Copy Number Data Analysis. In: Miyano, S., Mesirov, J., Kasif, S., Istrail, S., Pevzner, P.A., Waterman, M. (eds) Research in Computational Molecular Biology. RECOMB 2005. Lecture Notes in Computer Science(), vol 3500. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11415770_6
Download citation
DOI: https://doi.org/10.1007/11415770_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25866-7
Online ISBN: 978-3-540-31950-4
eBook Packages: Computer ScienceComputer Science (R0)