Abstract
In this paper, we discuss the design and implementation of a Field-Programmable Gate Array (FPGA) based stereo depth measurement system that is capable of handling a very large disparity range. The system performs rectification of the input video stream and a left-right consistency check to improve the accuracy of the results and generates subpixel disparities at 30 frames/second on 480 × 640 images. The system is based on the Local Weighted Phase-Correlation algorithm [9] which estimates disparity using a multi-scale and multi-orientation approach. Though FPGAs are ideal devices to exploit the inherent parallelism in many computer vision algorithms, they have a finite resource capacity which poses a challenge when adapting a system to deal with large image sizes or disparity ranges. In this work, we take advantage of the temporal information available in a video sequence to design a novel architecture for the correlation unit to achieve correlation over a large range while keeping the resource utilisation very low as compared to a naive approach of designing a correlation unit in hardware.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Alonso, J.D.: Real-time optical flow computation using FPGAs. In: Proceedings of the Early Cognitive Vision Workshop, Isle of Skye, Scotland (June 2004)
Burt, P.J.: A pyramid-based front-end processor for dynamic vision applications. Proceedings of the IEEE 90(7), 1188–1200 (2002)
Altera Corporation. Stratix devices (2003), http://www.altera.com/products/devices/stratix/stx-index.jsp
Crossley, S., Thacker, N.A., Seed, N.L.: Robust stereo via temporal consistency. In: Proceedings of the British Machine Vision Conference, pp. 659–668 (1997)
Darabiha, A., Rose, J., MacLean, W.J.: Video-rate stereo depth measurement on programmable hardware. In: Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision & Pattern Recognition, Madison, WI, June 2003, vol. 1, pp. 203–210 (2003)
Faugeras, O., Hotz, B., Mathieu, H., Viéville, T., Zhang, Z., Fua, P., Théron, E., Moll, L., Berry, G., Vuillemin, J., Bertin, P., Proy, C.: Real time correlation-based stereo: Algorithm, implementations and applications. Technical Report Research Report 2013, INRIA Sophia Antipolis (August 1993)
Fender, J.: Transmogrifier 4 preliminary information (August 2003), http://www.eecg.toronto.edu/~fender/tm4/sointroduction.shtml
Fleet, D.J.: Measurement of Image Velocity. Kluwer Academic Publishers, Boston (1992)
Fleet, D.J.: Disparity from local weighted phase correlation. In: International Conference on Systems, Man and Cybernetics, vol. 1, pp. 48–54 (1994)
Hirschmüller, H., Innocent, P.R., Garibaldi, J.: Real-time correlation-based stereo vision with reduced border errors. International Journal of Computer Vision 47(1/2/3), 229–246 (2002) stereo,intensity correlation,MMX, fast
Hou, K.M., Belloum, A.: A reconfigurable and flexible parallel 3d vision system for a mobile robot. In: IEEE Workshop on Computer Architecture for Machine Perception, New Orleans, Louisiana (December 1993)
Kanade, T., Yoshida, A., Oda, K., Kano, H., Tanaka, M.: A stereo machine for video-rate dense depth mapping and its new applications. In: Proceedings of the 15th IEEE Computer Vision & Pattern Recognition Conference, San Francisco, June 1996, pp. 196–202 (1996)
Konolige, K.: Small vision systems: Hardware and implmentation. In: Proceedings of the Eighth International Symposium on Robotics Research (Robotics Research 8), Hayama, Japan, October 1997, pp. 203–212 (1997)
Mühlmann, K., Maier, D., Hesser, J., Anner, R.M.: Calculating dense disparity maps from color stereo images, an efficient implementation. International Journal of Computer Vision 47(1/2/3), 79–88 (2002) stereo,intensity correlation,MMX,fast
Okutomi, M., Katayama, Y.: A simple stereo algorithm to recover precise object boundaries and smooth surfaces. In: Proceedings of the IEEE Workshop on Stereo and Multi-Baseline Vision—SMBV 2001(2001)
van der Wal, G., Burt, P.: A VLSI pyramid chip for multiresolution image analysis. Int. Journal of Computer Vision 8, 177–190 (1992)
Wolberg, G.: Digital Image Warping. IEEE Computer Society Press, Los Alamitos (1994)
Woodfill, J., Von Herzen, B.: Real time stereo vision on the parts reconfigurable computer. In: 5th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, pp. 201–210 (1997)
Yang, R., Pollefeys, M.: Multi-resolution real-time stereo on commodity graphics hardware. In: Proceedings of the 2003 IEEE Conference on Computer Vision and Pattern Recognition, Madison, Wisconsin, June 2003, pp. 211–218 (2003)
Zabih, R., Woodfill, J.: Non-parametric local transforms for computing visual correspondence. In: Proceedings of the 3rd European Conference on Computer Vision, May 1994, pp. 150–158 (1994), http://www.cs.cornell.edu/rdz/Papers/Archive/neccv.ps ; http://www.cs.cornell.edu/rdz/Papers/Archive/nplt-journal.ps.gz
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Masrani, D.K., MacLean, W.J. (2006). A Real-Time Large Disparity Range Stereo-System Using FPGAs. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3852. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612704_5
Download citation
DOI: https://doi.org/10.1007/11612704_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31244-4
Online ISBN: 978-3-540-32432-4
eBook Packages: Computer ScienceComputer Science (R0)