ABSTRACT
In situ staining of a target mRNA at several time points during the development of a D. melanogaster embryo gives one a detailed spatio-temporal view of the expression pattern of a given gene. We have developed algorithms and software for analyzing a database of such images with the goal of being able to identify coordinately expressed genes and further our understanding of cis-regulatory control during embryogenesis. Our approach combines measures of similarity at both the global and local levels, based on Gaussian Mixture Model (GMM) decompositions. At the global level, the observed distribution of pixel values is quantized using an adaptive GMM decomposition and then quantized images are compared using mutual information. At the local level, we decompose quantized images into 2-dimensional Gaussian kernels or "blobs" and then develop a blob-set matching method to search for the best matching traits in different pattern-images. A hybrid scoring method is proposed to combine both global and local matching results. We further develop a voting scheme to search for genes with similar spatial staining patterns over the time course of embryo development. To evaluate the effectiveness of our approach, we compare it with several global image matching schemes and a controlled vocabulary method. We then apply our method to 4400 images of 136 genes to detect potentially co-regulated genes that have similar spatio-temporal patterns, using expert-annotation to evaluate our results.
- BDGP: Berkeley Drosophila Genome Project, http://www.fruitfly.org.Google Scholar
- Berman, B.P., Nibu, Y., Pfeiffer, B.D., Tomancak, P., Celniker, S.E., Levine, M., Rubin, G.M., and Eisen, M.B., Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome, Proc Natl Acad Sci USA, 99 (2002), 757--762.Google ScholarCross Ref
- Carson, C., Belongie, S., Greenspan, H., and, Malik, J., Blobworld: image segmentation using expectation-maximization and its application to image querying, IEEE Trans Pattern Analysis and Machine Intelligence, 24, 8 (2002), 1026--1038. Google ScholarDigital Library
- Cover, T., and Thomas, J., Elements of Information Theory, New York: Wiley, 1991. Google ScholarDigital Library
- De Gregorio, E., Spellman, P.T., Rubin, G.M., and Lemaitre, B., Genome-wide analysis of the Drosophila immune response by using oligonucleotide microarrays, Proc Natl Acad Sci USA, 98, 22 (2001), 12590--12595.Google ScholarCross Ref
- Gieseler, K., Wilder, E., Mariol, M.C., Buratovitch, M., Berenger, H., et al., DWnt4 and wingless elicit similar cellular responses during imaginal development, Dev. Biol., 232 (2001), 339--350.Google ScholarCross Ref
- Gonzalez, R.C., and Woods, R.E., Digital Image Processing, (2nd Ed), Addison-Wesley, 2002. Google ScholarDigital Library
- Jain, A., Fundamentals of Digital Image Processing, Prentice-Hall, 1986. Google ScholarDigital Library
- Kumar, S., Jayaraman, K., Panchanathan, S., Gurunathan, R., Marti-Subirana, A., and Newfeld, S., BEST: a novel computational approach for comparing gene expression patterns from early stages of Drosophlia melanogaster develeopment, Genetics, 169 (2002), 2037--2047.Google ScholarCross Ref
- Long, F.H., Zhang, H.J., and Feng, D.G., Fundamental of content-based image retrieval, In Multimedia Information Retrieval and Management, Feng, D., Siu, W.C., Zhang, H. J. (Eds.), Springer-Verlag, (2003), 1--12.Google ScholarCross Ref
- Mitchell, T., Machine Learning, McGraw-Hill. 1997. Google ScholarDigital Library
- Peng, H.C., Long F.H., Chi Z., and Siu W., Template document image matching based on component block list, Pattern Recognition Letters, 22, 9 (2001), 1033--1042.Google ScholarCross Ref
- Peng, H.C., Long, F.H., and Chi, Z., Document image recognition based on template matching of component block projections, IEEE Trans. Pattern Analysis and Machine Intelligence, 25, 9 (2003), 1188--1192. Google ScholarDigital Library
- Roberts, S.J., Husmeier, D., Rezek, I., and Penny, W., Bayesian approaches to Gaussian mixture modeling, IEEE Trans. Pattern Analysis and Machine Intelligence, 20, 11 (1998), 1133--1142. Google ScholarDigital Library
- Takaesu, N.T., Johnson, A.N., Sultani, O.H., and Newfeld, S.J., Combinatorial signaling by an unconventional Wg path- way and the Dpp pathway requires Nejire (CBP/p300) to regulate dpp expression in posterior tracheal branches, Dev. Biol., 247 (2002), 225--236.Google ScholarCross Ref
- Tomancak, P., Beaton, A., Weiszmann, R., Kwan, E., Shu, S., Lewis, S.E., Richards, S., Ashburner, M., Hartenstein, V., Celniker, S.E., and Rubin, G.M., Systematic determination of patterns of gene expression during Drosophila embryogenesis, Genome Biology, 3, 12 (2002).Google ScholarCross Ref
- Tsai, C.C., Kramer, S.G., and Gergen, J.P., Pair-rule gene runt restricts orthodenticle expression to the presumptive head of the Drosophila embryo, Dev. Genet., 23 (1998), 35--44.Google ScholarCross Ref
- Webb, A., Statistical Pattern Recognition, Arnold, 1999.Google Scholar
- Zhang, H., and Levine, M., Groucho and dCtBP mediate separate pathways of transcriptional repression in the Drosophila embryo, Proc. Natl. Acad. Sci. USA, 96 (1999), 535--540.Google ScholarCross Ref
Index Terms
- Comparing in situ mRNA expression patterns of drosophila embryos
Recommendations
Quantitative characterization of the transcriptional regulatory network in the yeast cell cycle
Motivation: Genome-wide gene expression programs have been monitored and analyzed in the yeast Saccharomyces cerevisiae , but how cells regulate global gene expression programs in response to environmental changes is still far from being understood. ...
Inferring dynamic architecture of cellular networks using time series of gene expression, protein and metabolite data
Motivation: High-throughput technologies have facilitated the acquisition of large genomics and proteomics datasets. However, these data provide snapshots of cellular behavior, rather than help us reveal causal relations. Here, we propose how these ...
Hypothesis-driven approach to predict transcriptional units from gene expression data
Motivation: A major issue in computational biology is the reconstruction of functional relationships among genes, for example the definition of regulatory or biochemical pathways. One step towards this aim is the elucidation of transcriptional units, ...
Comments