Conferences >2017 IEEE International Confe...

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Visual words of Bag-of-Visual-Words (BoVW) framework are independent each other, which results in not only discarding spatial orders between visual words but also lacking...Show More

Metadata

Abstract:

Visual words of Bag-of-Visual-Words (BoVW) framework are independent each other, which results in not only discarding spatial orders between visual words but also lacking semantic information. This study is inspired by word embeddings that a similar embedding procedure is applied to a large number of visual words. By this way, the corresponding embedding vectors of the visual words can be formulated. For a word image, the average of embedding vectors of all visual words within the word image is taken as its embedding vector. Moreover, Recurrent Neural Network (RNN) is utilized to encode each word image into embeddings like an auto-encoder. The RNN embeddings and the visual word embeddings are complementary. In this study, all word images are represented by combining visual word embeddings and RNN embeddings. Experimental results show that the proposed representation approach is superior to the traditional BoVW, spatial pyramid matching and latent Dirichlet allocation.

Published in: 2017 IEEE International Conference on Multimedia and Expo (ICME)

Date of Conference: 10-14 July 2017

Date Added to IEEE Xplore: 31 August 2017

ISBN Information:

Electronic ISSN: 1945-788X

DOI: 10.1109/ICME.2017.8019403

Conference Location: Hong Kong, China

Contents

References is not available for this document.

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?