A unified framework for scene illuminant estimation

https://doi.org/10.1016/j.imavis.2006.12.003

Abstract

Most existing illuminant estimation algorithms assume a specific source type (e.g. a directional or a point light source). This assumption brings two main limitations that significantly restrict their applicability: first, prior knowledge about the source types is needed; second, they cannot handle complex scenes where multiple source types co-exist. In this paper, we develop a general light source model designed to represent arbitrary light sources. With this general source model, we are able to estimate multiple illuminants of different types within a single unified framework. We use a plastic sphere to probe both the diffuse and the specular reflections of a scene. From the specular reflections, we estimate the geometric parameters of the illuminants with a novel ray tracing and matching algorithm. The diffuse reflections are used to estimate the photometric parameters of the illuminants. We also observe that common source types are degenerate cases of the general source model under certain conditions; light sources can therefore be approximated by common source types by checking these conditions. The approximations serve as type determinations of the light sources, which makes our framework extremely useful for lighting-related algorithms that require prior knowledge about the light sources. Experimental results on a variety of real images demonstrate the efficiency and accuracy of our algorithm.

Introduction

Many light source models have been used in computer vision and computer graphics, among which the most common are the point light source, the directional light source and the area light source. Each of these models provides an appropriate approximation to a specific category of real light sources. In real applications, however, the situation can be complex, with multiple types of light sources co-existing in the scene. Unfortunately, most light source estimation algorithms assume one specific type of light source, and there is currently no method in computer vision for inferring the types of light sources from images. This is problematic because choosing which model to apply in a lighting-related problem requires prior knowledge about the types of light sources present in the scene. It is therefore desirable to have a general light source model that allows us to estimate light sources of different types within a single framework.

Most previous approaches to illuminant estimation assume directional light sources. For example, many schemes have been proposed in the context of shape from shading [23], [1], [10], [6], [21] to estimate a single directional light source. More recently, several attempts have been made to detect multiple directional light sources by exploiting various cues. Sato and Ikeuchi [15] estimated the illumination distribution by analyzing the shadows cast by an object of known shape onto another object in the scene. Zhou and Kambhamettu [24] utilized the specular properties of a shiny surface to estimate light source directions. Yang and Yuille [20] introduced the idea of the occluding boundary to constrain light source directions. Zhang and Yang [22] extended the occluding-boundary idea and estimated lighting directions from critical points whose surface normals are perpendicular to an illuminant direction. Since the detection of critical points is sensitive to noise, Wang and Samaras [18], [19] further segmented the surface into “virtual light patches” along the critical boundaries and estimated the illuminants by minimizing a global error. Li et al. [9] integrated shading, shadow and specular cues into a single framework, which results in more robust estimation.

All the above methods assume directional light sources, which are reasonable approximations to real scene illuminants provided that the distances between the light sources and the illuminated objects are large compared with the sizes of both the light sources and the illuminated objects. However, this approximation may not be valid in some cases, especially when the light sources are within a limited range, e.g. in a small room. The proximal point source model can be used in this situation, as in [12], [8], where the 3-D positions of the illuminants are estimated instead of their directions. Nevertheless, all these algorithms are designed for one specific type of light source and require prior knowledge about the types of light sources present in the scene.

To estimate complex scene illumination without type assumptions, Debevec [2] introduced a light-based model. The model is constructed by mapping the reflections on a spherical mirror onto a geometric model of the scene. Although the algorithm works well for superimposing virtual objects onto real images with convincing shading, the geometry of every object in the local scene has to be given a priori, while shape modeling of real objects is itself a difficult research topic. Sato et al. [14] addressed the problem with a radiance-map model. They first build a rough geometric model of the scene as a triangular mesh from a pair of omni-directional images taken from different locations; the radiance map is then constructed by mapping the radiance of the scene onto the reconstructed geometric model. Although these models have been successfully used to render virtual objects into real images, they were designed specifically for augmented reality and consequently cannot serve as a general light source model for all lighting-related applications in computer vision and computer graphics. Specifically, these models suffer from three limitations: first, direct and indirect illumination are not explicitly distinguished, which is problematic when an application only needs information about the direct illumination (as is the case for most computer vision algorithms); second, the illumination is estimated as a radiance map, so parameters of the light sources such as distance and size are not explicitly recovered; finally, the geometric model of the scene must be built a priori or roughly estimated as a triangular mesh, making the estimation of the light sources expensive. These limitations may not be problems for augmented reality applications, but for most computer vision algorithms, such as shape from shading and photometric stereo, accurate estimation of the parameters of the light sources (direct illumination), such as their directions and distances, is crucial. To our knowledge, the most closely related work was presented recently by Takai et al. [17], who estimated point light sources and directional light sources in a single framework by analyzing the intensity difference between two diffuse spheres, although the case of area light sources is not considered.

In [5], Langer et al. demonstrated that the set of rays within a free space is a 4-D manifold. By modeling light sources in terms of their emitting source rays, various light sources can be classified as sources of different dimensions, ranging from 0 to 4, so the analysis of all types of light sources can be performed within a 4-D light source hypercube. In this paper, we extend the idea of the 4-D hypercube and propose a general light source model that is better suited to light source estimation. With this general light source model, different types of light sources can be treated in the same manner; Section 2 discusses the light source modeling in detail. The purpose of this work is to design a unified light source estimation framework. By employing the general light source model, we remove the assumption about source types; the framework therefore works for all types of light sources and can deal with complex scenes where different types of light sources co-exist.

Examining the relationship between the general light source model and the common source types, we notice that the common source types are degenerate cases of the general source model that satisfy certain conditions. By checking these conditions, we can approximate the light sources with the most appropriate common source types. This approximation serves as a type determination of the light sources present in the scene, which makes our algorithm extremely useful for lighting-related algorithms that require prior knowledge about the source types.

The rest of this paper is organized as follows: Section 2 proposes a general light source model covering all types of light sources; Section 3 describes the unified estimation framework in detail; Section 4 discusses the conditions under which an appropriate type approximation can be made to the light sources; Section 5 presents experimental results on real images; and Section 6 gives conclusions and future work.

Section snippets

Light source modeling

In [5], Langer et al. proposed a theoretical framework to compare different types of light sources. They demonstrated that the set of rays (both non-source rays and source rays) within a free space is a 4-D manifold, and that the set of source rays can be modeled as a 4-D light source hypercube. Fig. 1a illustrates the idea: a plane P is placed between the light source and the scene, and a ray r piercing the plane at a point (x0, y0) in direction (p0, q0, 1) is parameterized as (x0, y0, p0, q0).
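
As a concrete illustration of this parameterization (a minimal sketch, not part of the original paper), the following Python fragment maps a single source ray to the (x0, y0, p0, q0) coordinates of the hypercube, assuming the reference plane P is placed at z = 0 and that rays are not parallel to P:

```python
import numpy as np

def parameterize_ray(point_on_plane, direction):
    """Parameterize a source ray crossing the reference plane P (assumed at z = 0).

    point_on_plane: (x0, y0, 0), the point where the ray pierces P.
    direction:      a 3-D direction vector, rescaled so its z-component is 1,
                    giving the form (p0, q0, 1).
    Returns the 4-D coordinates (x0, y0, p0, q0) used in the light source hypercube.
    """
    d = np.asarray(direction, dtype=float)
    if abs(d[2]) < 1e-12:
        raise ValueError("ray is parallel to the reference plane P")
    d = d / d[2]                       # normalize so the direction is (p0, q0, 1)
    x0, y0 = point_on_plane[0], point_on_plane[1]
    return np.array([x0, y0, d[0], d[1]])

# Hypothetical example: a ray piercing P at (0.2, -0.1) along direction (0.5, 0.3, 1)
print(parameterize_ray((0.2, -0.1, 0.0), (0.5, 0.3, 1.0)))   # -> [ 0.2 -0.1  0.5  0.3]
```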

Framework outline

The reflections from inhomogeneous objects such as plastic, ceramics, etc. are mainly composed of two components: diffuse reflection and specular reflection. Specular reflection has the useful geometric property that the incident angle and the reflection angle are equal. Diffuse reflection has a well-studied photometric property that is well described by Lambert’s law. In our framework, we use a similar experimental setup to the one proposed in our previous works
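
As a rough illustration of the two cues the framework relies on (not the authors’ implementation), the snippet below computes the ideal specular reflection direction about a surface normal and the Lambertian diffuse intensity for a given light direction; the vectors and the albedo/intensity values in the example are hypothetical:

```python
import numpy as np

def specular_reflection(light_dir, normal):
    """Ideal mirror reflection: the incident and reflected angles are equal.
    light_dir points from the surface toward the light; both vectors are normalized."""
    l = light_dir / np.linalg.norm(light_dir)
    n = normal / np.linalg.norm(normal)
    return 2.0 * np.dot(n, l) * n - l            # direction of the reflected ray

def lambertian_intensity(light_dir, normal, albedo, source_intensity):
    """Diffuse shading following Lambert's law: I = rho * E * max(n . l, 0)."""
    l = light_dir / np.linalg.norm(light_dir)
    n = normal / np.linalg.norm(normal)
    return albedo * source_intensity * max(np.dot(n, l), 0.0)

# Hypothetical surface point with normal (0, 0, 1) lit from direction (1, 0, 1)
n = np.array([0.0, 0.0, 1.0])
l = np.array([1.0, 0.0, 1.0])
print(specular_reflection(l, n))                 # mirror direction of the light
print(lambertian_intensity(l, n, 0.8, 1.0))      # diffuse intensity at the point
```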

Common source model approximation

As pointed out in [5], most real light sources lie at the corners of the 4-D hypercube (the corners are defined by taking the parameters hx, hy, hp, hq to their limits, i.e. 0 or ∞). For example, a fluorescent tube is a 3-D source (hx = ∞, hy = 0, hp = ∞, hq = ∞), and sunlight is a 2-D source (hx = ∞, hy = ∞, hp = 0, hq = 0). In other words, the common source types are simply degenerate cases of the general light source model under certain conditions. By checking these conditions, we can approximate the light sources to the
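
The classification itself can be sketched as a simple set of threshold checks on the estimated extents. The fragment below is an illustrative Python sketch based only on the degeneracy conditions and examples quoted above; the function name and the `small`/`large` thresholds are assumptions for the example, not values from the paper:

```python
def approximate_source_type(hx, hy, hp, hq, small=1e-3, large=1e3):
    """Approximate a general source by a common source type, following the
    observation that common sources sit at corners of the 4-D hypercube
    (each extent close to 0 or to infinity). The thresholds `small` and
    `large` are illustrative choices."""
    def lo(h): return h <= small           # extent effectively zero
    def hi(h): return h >= large           # extent effectively infinite

    if lo(hx) and lo(hy) and hi(hp) and hi(hq):
        return "point light source"            # single location, all directions
    if hi(hx) and hi(hy) and lo(hp) and lo(hq):
        return "directional light source"      # e.g. sunlight (2-D source)
    if ((hi(hx) and lo(hy)) or (lo(hx) and hi(hy))) and hi(hp) and hi(hq):
        return "linear (tube) light source"    # e.g. fluorescent tube (3-D source)
    return "general light source (no common-type approximation)"

print(approximate_source_type(hx=1e6, hy=1e6, hp=0.0, hq=0.0))   # directional
print(approximate_source_type(hx=1e6, hy=0.0, hp=1e6, hq=1e6))   # fluorescent tube
```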

Experiment results

In order to test and evaluate our approach in practice, we performed experiments on real images. The images were acquired with a Bumblebee camera, a binocular stereo camera designed by PointGrey Inc. This device provides rectified images and camera calibration parameters.

In total, ninety light sources of different sizes and locations were used to create different kinds of images. The sizes and locations of the light sources were manually measured as ground truth. First, the light sources

Conclusion and future work

We have presented a unified framework for scene illuminant estimation. Unlike previous work in this area, we estimate the illuminants using a general light source model, which makes it possible to estimate light sources of different types within a single framework. The primary contributions of our work toward illuminant estimation can be summarized as follows:

  • 1. A general light source model (GLM) is proposed to model different types of light sources. Under the GLM, different

References (26)

  • C.H. Lee et al., Improved methods of estimating shape from shading using the light source coordinate system, Artificial Intelligence (1985)
  • Y. Wang et al., Estimation of multiple directional light sources for synthesis of Augmented Reality images, Graphical Models (2003)
  • M.J. Brooks, B.K.P. Horn, Shape and source from shading, Proceedings of the 9th International Joint Conference on...
  • P. Debevec, Rendering synthetic objects into real scenes: bridging traditional and image-based graphics with global illumination and high dynamic range photography, ACM SIGGraph, Computer Graphics (1998)
  • P. Debevec et al., Recovering high dynamic range radiance maps from photographs, ACM SIGGraph, Computer Graphics (1997)
  • M.A. Fischler et al., Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, Communications of the ACM (1981)
  • M.S. Langer et al., What is a light source?, IEEE Proc. Computer Vision and Pattern Recognition (1997)
  • S.W. Lee et al., Detection of specularity using color and multiple views, European Conference on Computer Vision (1992)
  • H.P.A. Lensch et al., Image-based reconstruction of spatial appearance and geometric detail, ACM Transactions on Graphics (2003)
  • Y. Li, S. Lin, H. Lu, H. Shum, Multiple-cue illumination estimation in textured scenes, IEEE Proc. 9th International...
  • A. Pentland, Linear shape from shading, International Journal of Computer Vision (1990)
  • B.T. Phong, Illumination for computer generated pictures, Communications of the ACM (1975)
  • M.W. Powell et al., Calibration of light sources, IEEE Proc. Computer Vision and Pattern Recognition (2000)