Introduction

Koch, Reinhard; Pears, Nick; Liu, Yonghuai

doi:10.1007/978-1-4471-4063-4_1

Reinhard Koch⁴,
Nick Pears⁵ &
Yonghuai Liu⁶

3186 Accesses
1 Citations

Abstract

3D Imaging, Analysis and Applications is a comprehensive textbook on 3D shape capture, 3D shape processing and how such capture and processing can be used. Eleven chapters cover a broad range of concepts, algorithms and applications and they are split into three parts, as follows: Part I, 3D Imaging and Shape Representation, presents techniques for capture, representation and visualization of 3D data; Part II, 3D Shape Analysis and Processing presents feature-based methods of analysis, registration and shape matching and, finally, Part III, 3D Imaging Applications presents application areas in 3D face recognition, remote sensing and medical imaging. This introduction provides the reader with historical and background information, such as that relating to the development of computer vision; in particular, the development of automated 3D imaging. It briefly discusses general depth estimation principles for 3D imaging, details a selection of seminal papers, sketches applications of 3D imaging and concludes with an outline of the book’s remaining chapters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.95; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Typically, this term is used when the 3D data is acquired from multiple viewpoint 2D images.
2.
Typically, this term is used when a scanner acquired the 3D data, such as a laser stripe scanner.
3.
Typically, this term is used when the data is ordered in a regular grid, such as the 2D array of depth values in a range image, or a 3D array of data in volumetric medical imaging.
4.
Euclid of Alexandria, Greek mathematician, also referred to as the Father of Geometry, lived in Alexandria during the reign of Ptolemy I (323–283 BC).
5.
Alhazen (Ibn al-Haytham), born 965 CE in Basra, Iraq, died in 1040. Introduced the concept of physical optics and experimented with lenses, mirrors, camera obscura, refraction and reflection.
6.
Sir Austen Henry Layard (1817–1894), British archaeologist, found a polished rock crystal during the excavation of ancient Nimrud, Iraq. The lens has a diameter of 38 mm, presumed creation date 750–710 BC and now on display at the British Museum, London.
7.
Lucius Annaeus Seneca, around 4 BC–65 CE, was a Roman philosopher, statesman, dramatist, tutor and adviser of Nero.
8.
Small and thin bi-convex lenses look like lentils, hence the name lens, which is Latin for lentil.
9.
Nicéphore Niépce, 1765–1833, is credited as one of the inventors of photography by solar light etching (Heliograph) in 1826. He later worked with Louis-Jacques-Mandé Daguerre, 1787–1851, who acquired a patent for his Daguerreotype, the first practical photography process based on silver iodide, in 1839. In parallel, William Henry Fox Talbot, 1800–1877, developed the calotype process, which uses paper coated with silver iodide. The calotype produced a negative image from which a positive could be printed using silver chloride coated paper [19].
10.
The Greek word stereos for solid is used to indicate a spatial 3D extension of vision, hence stereoscopic stands for a 3D form of visual information.
11.
Gabriel Lippmann, 1845–1921, French scientist, received the 1908 Nobel price in Physics for his method to reproduce color pictures by interferometry.
12.
Sir Charles Wheatstone, 1802–1875, English physicist and inventor.
13.
The terms disparity and parallax are sometimes used interchangeably in the literature and this misuse of terminology is a source of confusion. One way to think about parallax is that it is induced by the difference in disparity between foreground and background objects over a pair of views displaced by a translation. The end result is that the foreground is in alignment with different parts of the background. Disparity of foreground objects and parallax then only become equivalent when the distance of background objects can be treated as infinity (e.g. distant stars), in this case the background objects are stationary in the image.
14.
Sir David Brewster, 1781–1868, Scottish physicist and inventor.
15.
Szeliski, Computer Vision: Algorithms and Applications, p. 10 [49].
16.
Intrinsic Image Dimension (IID) describes the local change in the image. Constant image: 0D, linear structures: 1D, point structures: 2D.
17.
A pdf version is also available for personal use on the website http://szeliski.org/Book/.
18.
This triangle defines an epipolar plane, which is discussed in Chap. 2.
19.
Kinect is a trademark of Microsoft.
20.
Figures are a preprint from the forthcoming Encyclopedia of Computer Vision [29].
21.
Twelve milestones is a small number, with the selection somewhat subjective and open to debate. We are merely attempting to give a glimpse of the subject’s development and diversity, not a definitive and comprehensive history.
22.
Zhang’s seminal work is pre-dated by a large body of pioneering work on calibration, such as D.C. Brown’s work in the context of photogrammetry, which dates back to the 1950s and many other works in computer vision, such as the seminal two-stage method of Tsai [53].
23.
A geodesic distance between two points on a surface is the minimal across-surface distance.
24.
Kinect and XBox are trademarks of Microsoft Corporation.

References

Adelson, E.H., Bergen, J.R.: The plenoptic function and the elements of early vision. In: Landy, M., Movshon, J.A. (eds.) Computational Models of Visual Processing (1991)
Google Scholar
Arun, K.S., Huang, T.S., Blostein, S.D.: Least-squares fitting of two 3d point sets. IEEE Trans. Pattern Anal. Mach. Intell. 9(5), 698–700 (1987)
Article Google Scholar
Bartczak, B., Vandewalle, P., Grau, O., Briand, G., Fournier, J., Kerbiriou, P., Murdoch, M., Mller, M., Goris, R., Koch, R., van der Vleuten, R.: Display-independent 3d-TV production and delivery using the layered depth video format. IEEE Trans. Broadcast. 57(2), 477–490 (2011)
Article Google Scholar
Bennet, R.: Representation and Analysis of Signals. Part xxi: The Intrinsic Dimensionality of Signal Collections, Rep. 163. The Johns Hopkins University, Baltimore (1965)
Google Scholar
Besl, P., McKay, N.D.: A method for registration of 3D shapes. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), 239–256 (1992)
Article Google Scholar
Bigun, J., Granlund, G.: Optimal orientation detection of linear symmetry. In: First International Conference on Computer Vision, pp. 433–438. IEEE Computer Society, New York (1987)
Google Scholar
Blundell, B., Schwarz, A.: The classification of volumetric display systems: characteristics and predictability of the image space. IEEE Trans. Vis. Comput. Graph. 8, 66–75 (2002)
Article Google Scholar
Boyer, K., Kak, A.: Color-encoded structured light for rapid active ranging. IEEE Trans. Pattern Anal. Mach. Intell. 9(1) (1987)
Google Scholar
Brewster, S.D.: The Stereoscope: Its History, Theory, and Construction with Applications to the fine and useful Arts and to Education. John Murray, Albemarle Street, London (1856)
Google Scholar
Brownson, C.D.: Euclid’s optics and its compatibility with linear perspective. Arch. Hist. Exact Sci. 24, 165–194 (1981). doi:10.1007/BF00357417
Article MathSciNet MATH Google Scholar
Creusot, C.: Automatic landmarking for non-cooperative 3d face recognition. Ph.D. thesis, Department of Computer Science, University of York, UK (2011)
Google Scholar
Curtis, G.: The Cave Painters. Knopf, New York (2006)
Google Scholar
Faugeras, O.: What can be seen in three dimensions with an uncalibrated stereorig? In: Sandini, G. (ed.) Computer Vision: ECCV’92. Lecture Notes in Computer Science, vol. 588, pp. 563–578. Springer, Berlin (1992)
Google Scholar
Faugeras, O., Luong, Q., Maybank, S.: Camera self-calibration: theory and experiments. In: Sandini, G. (ed.) Computer Vision: ECCV’92. Lecture Notes in Computer Science, vol. 588, pp. 321–334. Springer, Berlin (1992)
Google Scholar
Faugeras, O.D., Hebert, M.: The representation, recognition and locating of 3-d objects. Int. J. Robot. Res. 5(3), 27–52 (1986)
Article Google Scholar
Forsyth, D., Ponce, J.: Computer Vision: A Modern Approach. Prentice Hall, Upper Saddle River (2003)
Google Scholar
Fusiello, A.: Visione computazionale. Appunti delle lezioni. Pubblicato a cura dell’autore (2008)
Google Scholar
Gennery, D.B.: A stereo vision system for an autonomous vehicle. In: Proc. 5th Int. Joint Conf. Artificial Intell (IJCAI), pp. 576–582 (1977)
Google Scholar
Gernsheim, H., Gernsheim, A.: The History of Photography. Mc Graw-Hill, New York (1969)
Google Scholar
Harris, C., Stephens, M.J.: A combined corner and edge detector. In: Alvey Vision Conference (1988)
Google Scholar
Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University, Cambridge (2003). ISBN 0-521-54051-8
Google Scholar
Hartley, R.I.: In defence of the 8-point algorithm. In: Proceedings of the Fifth International Conference on Computer Vision, ICCV’95, p. 1064. IEEE Computer Society, Washington (1995)
Chapter Google Scholar
Hartley, R.I.: In defence of the 8-point algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 19(6), 580–593 (1997)
Article Google Scholar
Horn, B.K.P.: Shape from shading: a method for obtaining the shape of a smooth opaque object from one view. Ph.D. thesis, MIT, Cambridge, MA, USA (1970)
Google Scholar
Horn, B.K.P.: Closed-form solution of absolute orientation using unit quaternions. J. Opt. Soc. Am. A 4(4), 629–642 (1987)
Article MathSciNet Google Scholar
Johnson, A.E., Hebert, M.: Using spin images for efficient object recognition in cluttered 3d scenes. IEEE Trans. Pattern Anal. Mach. Intell. 21(5), 433–449 (1997)
Article Google Scholar
Jordt, A., Koch, R.: Fast tracking of deformable objects in depth and colour video. In: Proceedings of the British Machine Vision Conference (BMVC) (2011)
Google Scholar
King, H.: The History of Telescope. Griffin, London (1955)
Google Scholar
Koch, R.: Depth estimation. In: Ikeuchi, K. (ed.) Encyclopedia of Computer Vision. Springer, New York (2013)
Google Scholar
Koch, R., Schiller, I., Bartczak, B., Kellner, F., Koeser, K.: Mixin3d: 3d mixed reality with ToF-camera. In: Dynamic 3D Imaging DAGM 2009 Workshop, Dyn3D, Jena, Germany. Lecture Notes in Computer Science, vol. 5742, pp. 126–141 (2009)
Chapter Google Scholar
Kolb, A., Barth, E., Koch, R., Larsen, R.: Time-of-flight cameras in computer graphics. Comput. Graph. Forum 29(1), 141–159 (2010)
Article Google Scholar
Kolb, A., Koch, R.: Dynamic 3D Imaging. Lecture Notes in Computer Science, vol. 5742. Springer, Berlin (2009)
Book Google Scholar
Kriss, T.C., Kriss, V.M.: History of the operating microscope: from magnifying glass to microneurosurgery. Neurosurgery 42(4), 899–907 (1998)
Article Google Scholar
Lippmann, G.: La photographie integrale (English translation Fredo Durant, MIT-csail). In: Academy Francaise: Photography-Reversible Prints. Integral Photographs (1908)
Google Scholar
Longuet-Higgins, H.C.: A computer algorithm for re-constructing a scene from two projections. Nature 293, 133–135 (1981)
Article Google Scholar
Ma, Y., Soatto, S., Kosecka, J., Sastry, S.: An Invitation to 3D Vision: From Images to Geometric Models. Springer, Berlin (2003)
Google Scholar
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 381–395 (1981)
Article MathSciNet Google Scholar
Marr, D.: Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. Freeman, New York (1982)
Google Scholar
Nayar, S.K., Watanabe, M., Noguchi, M.: Real-time focus range sensor. IEEE Trans. Pattern Anal. Mach. Intell. 18(12), 1186–1198 (1996)
Article Google Scholar
Rioux, M.: Laser range finder based on synchronized scanners. Appl. Opt. 23(21), 3837–3844 (1984)
Article Google Scholar
Savran, A., Alyuz, N., Dibeklioglu, H., Celiktutan, O., Gokberk, B., Sankur, B., Akarun, L.: Bosphorus database for 3d face analysis. In: Biometrics and Identity Management. Lecture Notes in Computer Science, vol. 5372, pp. 47–56 (2008)
Chapter Google Scholar
Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vis. 47, 7–42 (2002)
Article MATH Google Scholar
Schölkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridge (2002)
Google Scholar
Schwarte, R., Xu, Z., Heinol, H.G., Olk, J., Klein, R., Buxbaum, B., Fischer, H., Schulte, J.: New electro-optical mixing and correlating sensor: facilities and applications of the photonic mixer device (PMD). In: Proc. SPIE, vol. 3100 (1997)
Google Scholar
Shirai, Y.: Recognition of polyhedrons with a range finder. Pattern Recognit. 4, 243–250 (1972)
Article Google Scholar
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: CVPR (2011)
Google Scholar
Smith, A.M.: Alhacen’s theory of visual perception: a critical edition, with English translation and commentary, of the first three books of Alhacen’s de aspectibus, the medieval Latin version of Ibn al-Haytham’s Kitab al-Manazir. Trans. Am. Philos. Soc. 91 (2001)
Google Scholar
Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. Comput. Graph. Forum 28(5), 1383–1392 (2009)
Article Google Scholar
Szeliski, R.: Computer Vision, Algorithms and Applications. Springer, Berlin (2010)
Google Scholar
Tanimoto, S., Pavlidis, T.: A hierarchal data structure for picture processing. Comput. Graph. Image Process. 4, 104–113 (1975)
Article Google Scholar
Triggs, B., McLauchlan, P.F., Hartley, R.I., Fitzgibbon, A.W.: Bundle adjustment—a modern synthesis. In: Proceedings of the International Workshop on Vision Algorithms: Theory and Practice, ICCV’99, pp. 298–372. Springer, London (2000). http://portal.acm.org/citation.cfm?id=646271.685629
Chapter Google Scholar
Trucco, E., Verri, A.: Introductory Techniques for 3-D Computer Vision. Prentice Hall, New York (1998)
Google Scholar
Tsai, R.Y.: A versatile camera calibration technique for high accuracy 3d machine vision metrology using off-the-shelf TV cameras and lenses. IEEE J. Robot. Autom. 3(4), 323–344 (1987)
Article Google Scholar
Wheatstone, C.: Contributions to the physiology of vision. Part the first. On some remarkable, and hitherto unobserved, phenomena of binocular vision. In: Philosophical Transactions of the Royal Society of London, pp. 371–394 (1838)
Google Scholar
Witkin, A.P.: Scale-space filtering. In: Proceedings of the Eighth International Joint Conference on Artificial Intelligence, vol. 2, pp. 1019–1022. Morgan Kaufmann, San Francisco (1983). http://portal.acm.org/citation.cfm?id=1623516.1623607
Google Scholar
Yang, R., Pollefeys, M.: A versatile stereo implementation on commodity graphics hardware. Real-Time Imaging 11, 7–18 (2005)
Article Google Scholar
Zhang, Z.: A flexible new technique for camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. 22(11), 1330–1334 (2000)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science, Christian-Albrechts-University of Kiel, Kiel, Germany
Reinhard Koch
Department of Computer Science, University of York, Deramore Lane, York, YO10 5GH, UK
Nick Pears
Department of Computer Science, Aberystwyth University, Aberystwyth, Ceredigion, SY23 3DB, UK
Yonghuai Liu

Authors

Reinhard Koch
View author publications
You can also search for this author in PubMed Google Scholar
Nick Pears
View author publications
You can also search for this author in PubMed Google Scholar
Yonghuai Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Reinhard Koch .

Editor information

Editors and Affiliations

Department of Computer Science, University of York, Deramore Lane, Heslington, York, YO10 5GH, United Kingdom
Nick Pears
Department of Computer Science, Aberystwyth University, Llandinam Building, Ceredigion, Aberystwyth, SY23 3DB, United Kingdom
Yonghuai Liu
Institute of Geography and Earth Science, Aberystwyth University, Penglais Campus, Ceredigion, Aberystwyth, SY23 3DB, United Kingdom
Peter Bunting

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Koch, R., Pears, N., Liu, Y. (2012). Introduction. In: Pears, N., Liu, Y., Bunting, P. (eds) 3D Imaging, Analysis and Applications. Springer, London. https://doi.org/10.1007/978-1-4471-4063-4_1

Download citation

DOI: https://doi.org/10.1007/978-1-4471-4063-4_1
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4062-7
Online ISBN: 978-1-4471-4063-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics