Skip to main content
Log in

Markov Random Fields with Asymmetric Interactions for Modelling Spatial Context in Structured Scene Labelling

  • Published:
Journal of Signal Processing Systems Aims and scope Submit manuscript

Abstract

In this paper we propose a Markov random field with asymmetric Markov parameters to model the spatial and topological relationships between objects in structured scenes. The field is formulated in terms of conditional probabilities learnt from a set of training images. A locally consistent labelling of new scenes is achieved by relaxing the Markov random field directly using these conditional probabilities. We evaluate our model on a varied collection of several hundred hand-segmented images of buildings. The incorporation of spatial information is shown to improve greatly the performance of some trivial classifiers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4

Similar content being viewed by others

Notes

  1. Note that although a directed graph for the computer vision community cannot be called an MRF, and as a consequence, according to the same community, asymmetric MRFs do not exist, from the mathematical point of view the concept is not only possible, but, in our opinion, also prevalent in nature: if the value of a node is assumed to be the outcome of a random experiment, we are clearly dealing with a random field; if we model this outcome by considering only the neighbours (defined in some pre-specified sense) of the node, we are clearly dealing with a Markovian random field. Nowhere in these two definitions enters the concept of directionality. We consider the assertion that a directed graph cannot be a Markov random field rather idiosyncratic of the computer vision community and not valid if strict mathematical definitions are applied. That is why in this paper we use the mathematically correct terms “asymmetric MRFs” or “non-Gibbsian MRFs”.

  2. The images alongside their annotation and segmentation information are available at http://www.commsp.ee.ic.ac.uk/~dheesch/ngmrf/data/.

References

  1. Bar, M., & Aminoff, E. (2003). Cortical analysis of visual context. Neuron, 38, 347–358.

    Article  Google Scholar 

  2. Bar, M., Kassam, K., Ghuman, A., Boshyan, J., Schmidt, A., Dale, A., et al. (2006). Top-down facilitation of visual recognition. Proceedings of the National Academy of Sciences, 103(2), 449–454.

    Article  Google Scholar 

  3. Besag, J. (1974). Spatial interaction and the statistical analysis of lattice systems (with discussion). Journal of the Royal Statistical Society, B, 36, 192–236.

    MATH  MathSciNet  Google Scholar 

  4. Carbonetto, P., de Freitas, N., & Barnard, K. (2004). A statistical model for general contextual object recognition. In Proc European conf computer vision (pp. 350–362).

  5. Christmas, W. J., Kittler, J., & Petrou, M. (1995). Structural matching in computer vision using probabilistic relaxation. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI, 17, 749–764.

    Article  Google Scholar 

  6. Csurka, G., Bray, C., Dance, C., & Fan, L. (2004). Visual categorisation with bags of keypoints. In Proc European conf computer vision, Prague, Czech Republic, May 11–14.

    Google Scholar 

  7. He, X., Zemel, R., & Ray, D. (2006). Learning and incorporating top-down cues in image segmentation. In Proc European conf computer vision, Graz, Austria, May 7–13.

  8. Kumar, S., & Hebert, H. (2003). Discriminative random fields: A discriminative framework for contextual interaction in classification. In Proc int’l conf computer vision, Nice, France, October 14–17.

  9. Li, Z. (1999). Visual segmentation by contextual influences via intra-cortical interactions in the primary visual cortex. Networks: Computation in Neural Systems, 10, 187–212.

    Article  MATH  Google Scholar 

  10. Li, Z. (2001). Computational design and nonlinear dynamics of a recurrent network model of the primary visual cortex. Neural Computation, 13(8), 1749–1780.

    Article  MATH  Google Scholar 

  11. Li, S. (1995). Markov random field modeling in computer vision. New York: Springer.

    Google Scholar 

  12. Modestino, J., & Zhang, J. (1992). A Markov random field model-based approach to image interpretation. IEEE Trans Pattern Analysis and Machine Intelligence, 14(6), 606–615.

    Article  Google Scholar 

  13. Oliva, A., & Torralba, A. (2001). Modelling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42(3), 145–175.

    Article  MATH  Google Scholar 

  14. Petrou, M., & Garcia Sevilla, P. (2006). Image processing: Dealing with texture. New York: Wiley, ISBN-13: 978-0-470-02628-1.

  15. Petrou, M., Gautam, S., & Giannoutakis, K. N. (2006). Simulating a digital business ecosystem. In M. Costantino & C. A. Brebbia (Eds.), Computational finance and its applications II (pp. 277–287). Southampton: WIT, ISBN 1-84564-1744.

  16. Petrou, M. (2007). Learning in computer vision: Some thoughts. In L. Rueda, D. Mery and J. Kittler (Eds.), Progress in pattern recognition, image analysis and applications. The 12th Iberoamerican congress on pattern recognition, CIARP 2007, Vina del Mar-Valparaiso, November, LNCS 4756, pp 1–12. New York: Springer.

    Google Scholar 

  17. Rosenfeld, A., Hummel, A., & Zucker, S. (1976). Scene labelling by relaxation operations. Trans SMC, 6(6), 420–433.

    MATH  MathSciNet  Google Scholar 

  18. Serre, T., Oliva, A., & Poggio, T. (2007). A feedforward architecture accounts for rapid categorisation. Proceedings of the National Academy of Sciences of the United States of America, 104(15), 6424–6429.

    Article  Google Scholar 

  19. Shotton, J., Winn, J., Rother, C., & Criminisi, A. (2006). Textonboost: Joint appearance, shape and context modelling for multi-class object recognition and segmentation. In Proc European conf computer vision, Graz, Austria, May 7–13.

  20. Sivic, J., & Zisserman, A. (2003). Video google: A text retrieval approach to object matching in videos. In Proc int’l conf computer vision, Nice, France, October 14–17, pp. 1–8.

  21. Torralba, A., Murphy, K., & Freeman, W. (2004). Sharing features: Efficient boosting procedures for multiclass object detection. In Proc int’l conf computer vision and pattern recognition (pp. 762–769).

  22. Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. In Proc int’l conf computer vision and pattern recognition, Kauai, Hawaii, December 9–14.

  23. Winn, J., Criminisi, A., & Minka, T. (2005). Object categorisation by learnt universal visual dictionary. In Proc int’l conf computer vision (pp. 1800–1807).

Download references

Acknowledgements

This work was supported by the FP6 European project eTRIMS.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maria Petrou.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Heesch, D., Petrou, M. Markov Random Fields with Asymmetric Interactions for Modelling Spatial Context in Structured Scene Labelling. J Sign Process Syst 61, 95–103 (2010). https://doi.org/10.1007/s11265-009-0349-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11265-009-0349-0

Keywords

Navigation