skip to main content
10.1145/1734583.1734604acmconferencesArticle/Chapter ViewAbstractPublication PageshotmobileConference Proceedingsconference-collections
research-article

Mobile image recognition: architectures and tradeoffs

Published: 22 February 2010 Publication History

Abstract

We argue that the most desirable architecture for mobile image recognition runs the complete algorithm on the mobile device. Alternative solutions that run the recognizer on a remote server will not be as desirable because of the delay between image capture and receipt of a result that can cause users to abandon the technique. We present a method for mobile recognition of paper documents and an application to newspapers that lets readers retrieve electronic data linked to articles, photos, and advertisements. We show that the index for a reasonable collection of daily newspapers can be downloaded in less than a minute and will fit in the memory of today's mid-range smart phones. Experimental results show that the recognition system has an overall error rate of less than 1%. We achieved a run time of 1.2 secs. per image with a collection of 140 newspaper pages on an HTC-8282 Windows Mobile phone.

References

[1]
Commercial offerings that recognize individual images submitted from camera phones include www.doog.mobi and www.snaptell.com.
[2]
J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, "Object retrieval with large vocabularies and fast spatial matching," Proc. of the IEEE CVPR, 2007.
[3]
J. J. Hull, B. Erol, J. Graham, Q. Ke, H. Kishi, J. Moraleda, and D. Van Olst, "Paper-Based Augmented Reality," 17th Int. Conf. on Augmented Reality and Telexistence, Esbjerg, Denmark, Nov. 28--30 2007, 205--209.
[4]
B. Erol, E. Antunez, and J. J. Hull, "HOTPAPER: multimedia interaction with paper using mobile phones," Proc. of the 16th ACM Intl. Conf. on Multimedia, Vancouver, Canada, 2008, pp. 399--408.
[5]
S. Dekleva, J. P. Shim, U. Varshney, and G. Knoerzer, "Evolution and emerging issues in mobile wireless networks," Comm. ACM, v. 50, no. 6, June 2007, pp. 38--43.
[6]
T. Nakai, K. Kise, and M. Iwamura, "Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval," Lecture Notes in Computer Science (7th International Workshop DAS2006, vol. 3872, 2006.
[7]
X. Liu and D. Doermann, "Mobile Retriever: access to digital documents from their physical source," International Journal on Document Analysis and Recognition, vol. 11, 2008, pp. 19--27.
[8]
D. Wagner, G. Reitmayr, A. Mulloni, T. Drummond, and D. Schmalstieg, "Pose tracking from natural features on mobile phones," Proc. of the 7th IEEE/ACM Int. Symp. on Mixed and Augmented Reality (Sept. 15--18, 2008), pp. 125--134.
[9]
G. Takacs, V. Chandrasekhar, N. Gelfand, Y. Xiong, W. C. Chen, T. Bismpigiannis, R. Grzeszczuk, K. Pulli, and B. Girod, "Outdoors augmented reality on mobile phone using loxel-based visual feature organization," ACM International Conference on Multimedia Information Retrieval (MIR'08), Vancouver, Canada, Oct. 2008.

Cited By

View all
  • (2022)The Effect of the Characteristics of an Image Search Service on Continued Usage Intention in the Fashion Product Shopping SituationJournal of the Korean Society of Costume10.7233/jksc.2022.72.2.09672:2(96-110)Online publication date: 30-Apr-2022
  • (2015)Human Assisted Positioning Using Textual SignsProceedings of the 16th International Workshop on Mobile Computing Systems and Applications10.1145/2699343.2699347(87-92)Online publication date: 12-Feb-2015
  • (2012)Evaluating and understanding the usability of a pen-based command system for interactive paperACM Transactions on Computer-Human Interaction10.1145/2147783.214778619:1(1-24)Online publication date: 4-May-2012
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HotMobile '10: Proceedings of the Eleventh Workshop on Mobile Computing Systems & Applications
February 2010
99 pages
ISBN:9781450300056
DOI:10.1145/1734583
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 February 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. camera phone
  2. document retrieval
  3. visual search

Qualifiers

  • Research-article

Conference

HotMobile '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 96 of 345 submissions, 28%

Upcoming Conference

HOTMOBILE '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)The Effect of the Characteristics of an Image Search Service on Continued Usage Intention in the Fashion Product Shopping SituationJournal of the Korean Society of Costume10.7233/jksc.2022.72.2.09672:2(96-110)Online publication date: 30-Apr-2022
  • (2015)Human Assisted Positioning Using Textual SignsProceedings of the 16th International Workshop on Mobile Computing Systems and Applications10.1145/2699343.2699347(87-92)Online publication date: 12-Feb-2015
  • (2012)Evaluating and understanding the usability of a pen-based command system for interactive paperACM Transactions on Computer-Human Interaction10.1145/2147783.214778619:1(1-24)Online publication date: 4-May-2012
  • (2011)An investigation into the use of partial face in the mobile environmentProceedings of the 7th international conference on Advances in visual computing - Volume Part II10.5555/2045195.2045254(526-535)Online publication date: 26-Sep-2011
  • (2011)Dynamic deployment and quality adaptation for mobile augmented reality applicationsJournal of Systems and Software10.1016/j.jss.2011.06.06384:11(1871-1882)Online publication date: 1-Nov-2011
  • (2010)1. A Survey on Human Interface Studies Using Image Recognition TechniquesThe Journal of The Institute of Image Information and Television Engineers10.3169/itej.64.179264:12(1792-1796)Online publication date: 2010
  • (2010)Embedded media markerProceedings of the 18th ACM international conference on Multimedia10.1145/1873951.1874261(1503-1504)Online publication date: 25-Oct-2010

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media