Abstract
A camera-assisted digital writing tablet was invented recently. It preserves the familiar experience of filling out a paper form while allowing automatic conversion of relevant handwritten field entries into electronic form, without explicit form scanning. In this paper, we focus on two key computer vision problems associated with the invention of this device, namely, form indexing and field projection. These are needed for accurate association of tablet writing with corresponding entries in the electronic form. Form indexing is modeled as the problem of shape-based content retrieval using the perspectively-distorted form appearances seen from the tablet camera. Fast form indexing is achieved using geometric hashing based on projective invariants. The invariants derived from curve and line features reduce the basis search space considerably while still providing for robust localization. We derive field projection as a sequence of projective transformations between the tablet, the camera and the original electronic form coordinates. Results of extensive testing on a medical form database are reported.
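The field-projection idea, chaining projective transformations from tablet to camera to form coordinates, can be illustrated with a minimal sketch. The abstract does not give the actual matrices or the exact order of steps, so the two homographies below (`TABLET_TO_CAMERA`, `CAMERA_TO_FORM`) and the helper names `compose` and `project` are hypothetical placeholders, not the authors' implementation.

```python
from typing import List, Tuple

Matrix = List[List[float]]  # 3x3 homography in row-major order


def compose(a: Matrix, b: Matrix) -> Matrix:
    """Return the 3x3 product a * b, i.e. apply b first, then a."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
        for i in range(3)
    ]


def project(h: Matrix, point: Tuple[float, float]) -> Tuple[float, float]:
    """Map a 2D point through homography h using homogeneous coordinates."""
    x, y = point
    u = h[0][0] * x + h[0][1] * y + h[0][2]
    v = h[1][0] * x + h[1][1] * y + h[1][2]
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    if abs(w) < 1e-12:
        raise ValueError("point maps to infinity under this homography")
    return (u / w, v / w)


# Hypothetical example matrices (assumptions, not values from the paper).
# Tablet -> camera: a pure translation, for simplicity.
TABLET_TO_CAMERA: Matrix = [
    [1.0, 0.0, 10.0],
    [0.0, 1.0, 20.0],
    [0.0, 0.0, 1.0],
]
# Camera -> form: scaling plus a small perspective term in the last row.
CAMERA_TO_FORM: Matrix = [
    [2.0, 0.0, 0.0],
    [0.0, 2.0, 0.0],
    [0.001, 0.0, 1.0],
]

# A single matrix that takes tablet coordinates directly to form coordinates.
TABLET_TO_FORM: Matrix = compose(CAMERA_TO_FORM, TABLET_TO_CAMERA)

pen_on_tablet = (5.0, 5.0)
pen_on_form = project(TABLET_TO_FORM, pen_on_tablet)
```

Because homographies compose by matrix multiplication, the chain can be collapsed into one matrix once per indexed form and then applied to every pen sample, rather than mapping each point through the intermediate camera frame.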
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Syeda-Mahmood, T., Zimmerman, T. (2006). FormPad: A Camera-Assisted Digital Notepad. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3852. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612704_20
DOI: https://doi.org/10.1007/11612704_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31244-4
Online ISBN: 978-3-540-32432-4
eBook Packages: Computer Science, Computer Science (R0)