Skip to main content

Abstract

3D Imaging, Analysis and Applications is a comprehensive textbook on 3D shape capture, 3D shape processing and how such capture and processing can be used. Eleven chapters cover a broad range of concepts, algorithms and applications and they are split into three parts, as follows: Part I, 3D Imaging and Shape Representation, presents techniques for capture, representation and visualization of 3D data; Part II, 3D Shape Analysis and Processing presents feature-based methods of analysis, registration and shape matching and, finally, Part III, 3D Imaging Applications presents application areas in 3D face recognition, remote sensing and medical imaging. This introduction provides the reader with historical and background information, such as that relating to the development of computer vision; in particular, the development of automated 3D imaging. It briefly discusses general depth estimation principles for 3D imaging, details a selection of seminal papers, sketches applications of 3D imaging and concludes with an outline of the book’s remaining chapters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.95
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Typically, this term is used when the 3D data is acquired from multiple viewpoint 2D images.

  2. 2.

    Typically, this term is used when a scanner acquired the 3D data, such as a laser stripe scanner.

  3. 3.

    Typically, this term is used when the data is ordered in a regular grid, such as the 2D array of depth values in a range image, or a 3D array of data in volumetric medical imaging.

  4. 4.

    Euclid of Alexandria, Greek mathematician, also referred to as the Father of Geometry, lived in Alexandria during the reign of Ptolemy I (323–283 BC).

  5. 5.

    Alhazen (Ibn al-Haytham), born 965 CE in Basra, Iraq, died in 1040. Introduced the concept of physical optics and experimented with lenses, mirrors, camera obscura, refraction and reflection.

  6. 6.

    Sir Austen Henry Layard (1817–1894), British archaeologist, found a polished rock crystal during the excavation of ancient Nimrud, Iraq. The lens has a diameter of 38 mm, presumed creation date 750–710 BC and now on display at the British Museum, London.

  7. 7.

    Lucius Annaeus Seneca, around 4 BC–65 CE, was a Roman philosopher, statesman, dramatist, tutor and adviser of Nero.

  8. 8.

    Small and thin bi-convex lenses look like lentils, hence the name lens, which is Latin for lentil.

  9. 9.

    Nicéphore Niépce, 1765–1833, is credited as one of the inventors of photography by solar light etching (Heliograph) in 1826. He later worked with Louis-Jacques-Mandé Daguerre, 1787–1851, who acquired a patent for his Daguerreotype, the first practical photography process based on silver iodide, in 1839. In parallel, William Henry Fox Talbot, 1800–1877, developed the calotype process, which uses paper coated with silver iodide. The calotype produced a negative image from which a positive could be printed using silver chloride coated paper [19].

  10. 10.

    The Greek word stereos for solid is used to indicate a spatial 3D extension of vision, hence stereoscopic stands for a 3D form of visual information.

  11. 11.

    Gabriel Lippmann, 1845–1921, French scientist, received the 1908 Nobel price in Physics for his method to reproduce color pictures by interferometry.

  12. 12.

    Sir Charles Wheatstone, 1802–1875, English physicist and inventor.

  13. 13.

    The terms disparity and parallax are sometimes used interchangeably in the literature and this misuse of terminology is a source of confusion. One way to think about parallax is that it is induced by the difference in disparity between foreground and background objects over a pair of views displaced by a translation. The end result is that the foreground is in alignment with different parts of the background. Disparity of foreground objects and parallax then only become equivalent when the distance of background objects can be treated as infinity (e.g. distant stars), in this case the background objects are stationary in the image.

  14. 14.

    Sir David Brewster, 1781–1868, Scottish physicist and inventor.

  15. 15.

    Szeliski, Computer Vision: Algorithms and Applications, p. 10 [49].

  16. 16.

    Intrinsic Image Dimension (IID) describes the local change in the image. Constant image: 0D, linear structures: 1D, point structures: 2D.

  17. 17.

    A pdf version is also available for personal use on the website http://szeliski.org/Book/.

  18. 18.

    This triangle defines an epipolar plane, which is discussed in Chap. 2.

  19. 19.

    Kinect is a trademark of Microsoft.

  20. 20.

    Figures are a preprint from the forthcoming Encyclopedia of Computer Vision [29].

  21. 21.

    Twelve milestones is a small number, with the selection somewhat subjective and open to debate. We are merely attempting to give a glimpse of the subject’s development and diversity, not a definitive and comprehensive history.

  22. 22.

    Zhang’s seminal work is pre-dated by a large body of pioneering work on calibration, such as D.C. Brown’s work in the context of photogrammetry, which dates back to the 1950s and many other works in computer vision, such as the seminal two-stage method of Tsai [53].

  23. 23.

    A geodesic distance between two points on a surface is the minimal across-surface distance.

  24. 24.

    Kinect and XBox are trademarks of Microsoft Corporation.

References

  1. Adelson, E.H., Bergen, J.R.: The plenoptic function and the elements of early vision. In: Landy, M., Movshon, J.A. (eds.) Computational Models of Visual Processing (1991)

    Google Scholar 

  2. Arun, K.S., Huang, T.S., Blostein, S.D.: Least-squares fitting of two 3d point sets. IEEE Trans. Pattern Anal. Mach. Intell. 9(5), 698–700 (1987)

    Article  Google Scholar 

  3. Bartczak, B., Vandewalle, P., Grau, O., Briand, G., Fournier, J., Kerbiriou, P., Murdoch, M., Mller, M., Goris, R., Koch, R., van der Vleuten, R.: Display-independent 3d-TV production and delivery using the layered depth video format. IEEE Trans. Broadcast. 57(2), 477–490 (2011)

    Article  Google Scholar 

  4. Bennet, R.: Representation and Analysis of Signals. Part xxi: The Intrinsic Dimensionality of Signal Collections, Rep. 163. The Johns Hopkins University, Baltimore (1965)

    Google Scholar 

  5. Besl, P., McKay, N.D.: A method for registration of 3D shapes. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), 239–256 (1992)

    Article  Google Scholar 

  6. Bigun, J., Granlund, G.: Optimal orientation detection of linear symmetry. In: First International Conference on Computer Vision, pp. 433–438. IEEE Computer Society, New York (1987)

    Google Scholar 

  7. Blundell, B., Schwarz, A.: The classification of volumetric display systems: characteristics and predictability of the image space. IEEE Trans. Vis. Comput. Graph. 8, 66–75 (2002)

    Article  Google Scholar 

  8. Boyer, K., Kak, A.: Color-encoded structured light for rapid active ranging. IEEE Trans. Pattern Anal. Mach. Intell. 9(1) (1987)

    Google Scholar 

  9. Brewster, S.D.: The Stereoscope: Its History, Theory, and Construction with Applications to the fine and useful Arts and to Education. John Murray, Albemarle Street, London (1856)

    Google Scholar 

  10. Brownson, C.D.: Euclid’s optics and its compatibility with linear perspective. Arch. Hist. Exact Sci. 24, 165–194 (1981). doi:10.1007/BF00357417

    Article  MathSciNet  MATH  Google Scholar 

  11. Creusot, C.: Automatic landmarking for non-cooperative 3d face recognition. Ph.D. thesis, Department of Computer Science, University of York, UK (2011)

    Google Scholar 

  12. Curtis, G.: The Cave Painters. Knopf, New York (2006)

    Google Scholar 

  13. Faugeras, O.: What can be seen in three dimensions with an uncalibrated stereorig? In: Sandini, G. (ed.) Computer Vision: ECCV’92. Lecture Notes in Computer Science, vol. 588, pp. 563–578. Springer, Berlin (1992)

    Google Scholar 

  14. Faugeras, O., Luong, Q., Maybank, S.: Camera self-calibration: theory and experiments. In: Sandini, G. (ed.) Computer Vision: ECCV’92. Lecture Notes in Computer Science, vol. 588, pp. 321–334. Springer, Berlin (1992)

    Google Scholar 

  15. Faugeras, O.D., Hebert, M.: The representation, recognition and locating of 3-d objects. Int. J. Robot. Res. 5(3), 27–52 (1986)

    Article  Google Scholar 

  16. Forsyth, D., Ponce, J.: Computer Vision: A Modern Approach. Prentice Hall, Upper Saddle River (2003)

    Google Scholar 

  17. Fusiello, A.: Visione computazionale. Appunti delle lezioni. Pubblicato a cura dell’autore (2008)

    Google Scholar 

  18. Gennery, D.B.: A stereo vision system for an autonomous vehicle. In: Proc. 5th Int. Joint Conf. Artificial Intell (IJCAI), pp. 576–582 (1977)

    Google Scholar 

  19. Gernsheim, H., Gernsheim, A.: The History of Photography. Mc Graw-Hill, New York (1969)

    Google Scholar 

  20. Harris, C., Stephens, M.J.: A combined corner and edge detector. In: Alvey Vision Conference (1988)

    Google Scholar 

  21. Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University, Cambridge (2003). ISBN 0-521-54051-8

    Google Scholar 

  22. Hartley, R.I.: In defence of the 8-point algorithm. In: Proceedings of the Fifth International Conference on Computer Vision, ICCV’95, p. 1064. IEEE Computer Society, Washington (1995)

    Chapter  Google Scholar 

  23. Hartley, R.I.: In defence of the 8-point algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 19(6), 580–593 (1997)

    Article  Google Scholar 

  24. Horn, B.K.P.: Shape from shading: a method for obtaining the shape of a smooth opaque object from one view. Ph.D. thesis, MIT, Cambridge, MA, USA (1970)

    Google Scholar 

  25. Horn, B.K.P.: Closed-form solution of absolute orientation using unit quaternions. J. Opt. Soc. Am. A 4(4), 629–642 (1987)

    Article  MathSciNet  Google Scholar 

  26. Johnson, A.E., Hebert, M.: Using spin images for efficient object recognition in cluttered 3d scenes. IEEE Trans. Pattern Anal. Mach. Intell. 21(5), 433–449 (1997)

    Article  Google Scholar 

  27. Jordt, A., Koch, R.: Fast tracking of deformable objects in depth and colour video. In: Proceedings of the British Machine Vision Conference (BMVC) (2011)

    Google Scholar 

  28. King, H.: The History of Telescope. Griffin, London (1955)

    Google Scholar 

  29. Koch, R.: Depth estimation. In: Ikeuchi, K. (ed.) Encyclopedia of Computer Vision. Springer, New York (2013)

    Google Scholar 

  30. Koch, R., Schiller, I., Bartczak, B., Kellner, F., Koeser, K.: Mixin3d: 3d mixed reality with ToF-camera. In: Dynamic 3D Imaging DAGM 2009 Workshop, Dyn3D, Jena, Germany. Lecture Notes in Computer Science, vol. 5742, pp. 126–141 (2009)

    Chapter  Google Scholar 

  31. Kolb, A., Barth, E., Koch, R., Larsen, R.: Time-of-flight cameras in computer graphics. Comput. Graph. Forum 29(1), 141–159 (2010)

    Article  Google Scholar 

  32. Kolb, A., Koch, R.: Dynamic 3D Imaging. Lecture Notes in Computer Science, vol. 5742. Springer, Berlin (2009)

    Book  Google Scholar 

  33. Kriss, T.C., Kriss, V.M.: History of the operating microscope: from magnifying glass to microneurosurgery. Neurosurgery 42(4), 899–907 (1998)

    Article  Google Scholar 

  34. Lippmann, G.: La photographie integrale (English translation Fredo Durant, MIT-csail). In: Academy Francaise: Photography-Reversible Prints. Integral Photographs (1908)

    Google Scholar 

  35. Longuet-Higgins, H.C.: A computer algorithm for re-constructing a scene from two projections. Nature 293, 133–135 (1981)

    Article  Google Scholar 

  36. Ma, Y., Soatto, S., Kosecka, J., Sastry, S.: An Invitation to 3D Vision: From Images to Geometric Models. Springer, Berlin (2003)

    Google Scholar 

  37. Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 381–395 (1981)

    Article  MathSciNet  Google Scholar 

  38. Marr, D.: Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. Freeman, New York (1982)

    Google Scholar 

  39. Nayar, S.K., Watanabe, M., Noguchi, M.: Real-time focus range sensor. IEEE Trans. Pattern Anal. Mach. Intell. 18(12), 1186–1198 (1996)

    Article  Google Scholar 

  40. Rioux, M.: Laser range finder based on synchronized scanners. Appl. Opt. 23(21), 3837–3844 (1984)

    Article  Google Scholar 

  41. Savran, A., Alyuz, N., Dibeklioglu, H., Celiktutan, O., Gokberk, B., Sankur, B., Akarun, L.: Bosphorus database for 3d face analysis. In: Biometrics and Identity Management. Lecture Notes in Computer Science, vol. 5372, pp. 47–56 (2008)

    Chapter  Google Scholar 

  42. Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vis. 47, 7–42 (2002)

    Article  MATH  Google Scholar 

  43. Schölkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridge (2002)

    Google Scholar 

  44. Schwarte, R., Xu, Z., Heinol, H.G., Olk, J., Klein, R., Buxbaum, B., Fischer, H., Schulte, J.: New electro-optical mixing and correlating sensor: facilities and applications of the photonic mixer device (PMD). In: Proc. SPIE, vol. 3100 (1997)

    Google Scholar 

  45. Shirai, Y.: Recognition of polyhedrons with a range finder. Pattern Recognit. 4, 243–250 (1972)

    Article  Google Scholar 

  46. Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: CVPR (2011)

    Google Scholar 

  47. Smith, A.M.: Alhacen’s theory of visual perception: a critical edition, with English translation and commentary, of the first three books of Alhacen’s de aspectibus, the medieval Latin version of Ibn al-Haytham’s Kitab al-Manazir. Trans. Am. Philos. Soc. 91 (2001)

    Google Scholar 

  48. Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. Comput. Graph. Forum 28(5), 1383–1392 (2009)

    Article  Google Scholar 

  49. Szeliski, R.: Computer Vision, Algorithms and Applications. Springer, Berlin (2010)

    Google Scholar 

  50. Tanimoto, S., Pavlidis, T.: A hierarchal data structure for picture processing. Comput. Graph. Image Process. 4, 104–113 (1975)

    Article  Google Scholar 

  51. Triggs, B., McLauchlan, P.F., Hartley, R.I., Fitzgibbon, A.W.: Bundle adjustment—a modern synthesis. In: Proceedings of the International Workshop on Vision Algorithms: Theory and Practice, ICCV’99, pp. 298–372. Springer, London (2000). http://portal.acm.org/citation.cfm?id=646271.685629

    Chapter  Google Scholar 

  52. Trucco, E., Verri, A.: Introductory Techniques for 3-D Computer Vision. Prentice Hall, New York (1998)

    Google Scholar 

  53. Tsai, R.Y.: A versatile camera calibration technique for high accuracy 3d machine vision metrology using off-the-shelf TV cameras and lenses. IEEE J. Robot. Autom. 3(4), 323–344 (1987)

    Article  Google Scholar 

  54. Wheatstone, C.: Contributions to the physiology of vision. Part the first. On some remarkable, and hitherto unobserved, phenomena of binocular vision. In: Philosophical Transactions of the Royal Society of London, pp. 371–394 (1838)

    Google Scholar 

  55. Witkin, A.P.: Scale-space filtering. In: Proceedings of the Eighth International Joint Conference on Artificial Intelligence, vol. 2, pp. 1019–1022. Morgan Kaufmann, San Francisco (1983). http://portal.acm.org/citation.cfm?id=1623516.1623607

    Google Scholar 

  56. Yang, R., Pollefeys, M.: A versatile stereo implementation on commodity graphics hardware. Real-Time Imaging 11, 7–18 (2005)

    Article  Google Scholar 

  57. Zhang, Z.: A flexible new technique for camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. 22(11), 1330–1334 (2000)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Reinhard Koch .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag London

About this chapter

Cite this chapter

Koch, R., Pears, N., Liu, Y. (2012). Introduction. In: Pears, N., Liu, Y., Bunting, P. (eds) 3D Imaging, Analysis and Applications. Springer, London. https://doi.org/10.1007/978-1-4471-4063-4_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-4063-4_1

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-4062-7

  • Online ISBN: 978-1-4471-4063-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics