ABSTRACT
The ability to access textual information is crucial for visually impaired people to achieve greater independence in everyday life. There is therefore a need for a mobile, easy-to-use reading device capable of dealing with the complexity of the outdoor environment. This paper presents a wearable camera-based solution that aims to improve the performance of existing systems through the use of stereo vision. Specific aspects of the stereo matching problem in document images are discussed, and an approach for integrating stereo matching into the document processing procedure is introduced. We conclude with experimental results from a prototype system, which demonstrate the practical benefits of the presented approach.