Skip to main content

Multimodal Query Based Approach for Document Image Retrieval

  • Conference paper
  • First Online:
Computer Vision and Image Processing (CVIP 2019)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1147))

Included in the following conference series:

Abstract

Scanning and storage of documents are regular practices. Retrieval of such documents is necessary to support office work. In this paper, a novel multimodal query based approach for retrieving documents using text, non-text contents is presented. This work focuses on logos, stamps, signatures for non-text query; and dates and keywords for text query to do retrieval. The proposed algorithm is called as multimodal document image retrieval algorithm (MDIRA), uses separation of text and non-text to simplify document indexing using both textual and non-textual contents. A single feature space for non-text contents is proposed for indexing. Various date formats can be recognized using regular expressions and mapped to uniform representation useful for indexing and retrieval. The proposed algorithm supports formation of multimodal queries using multiple attributes of documents. Results on a publicly available color document dataset, show the effectiveness of the proposed technique for document retrieval.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ahmed, S., Malik, M.I., Liwicki, M., Dengel, A.: Signature segmentation from document images. In: International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 425–429. IEEE (2012)

    Google Scholar 

  2. Ahmed, S., Shafait, F., Liwicki, M., Dengel, A.: A generic method for stamp segmentation using part-based features. In: Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 708–712. IEEE (2013)

    Google Scholar 

  3. Alaei, A., Roy, P.P., Pal, U.: Logo and seal based administrative document image retrieval: a survey. Comput. Sci. Rev. 22, 47–63 (2016)

    Article  MathSciNet  Google Scholar 

  4. Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110(3), 346–359 (2008)

    Article  Google Scholar 

  5. Duan, L.Y., Ji, R., Chen, Z., Huang, T., Gao, W.: Towards mobile document image retrieval for digital library. IEEE Trans. Multimedia 16(2), 346–359 (2014)

    Article  Google Scholar 

  6. Jain, R., Doermann, D.: Logo retrieval in document images. In: Proceedings of the 10th IAPR International Workshop on Document Analysis Systems, pp. 135–139. IEEE (2012)

    Google Scholar 

  7. Le, V.P., Nayef, N., Visani, M., Ogier, J.M., De Tran, C.: Document retrieval based on logo spotting using key-point matching. In: Proceedings of the 22nd International Conference on Pattern Recognition (ICPR), pp. 3056–3061. IEEE (2014)

    Google Scholar 

  8. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  9. Muja, M., Lowe, D.G.: Scalable nearest neighbor algorithms for high dimensional data. IEEE Trans. Pattern Anal. Mach. Intell. 36(11), 2227–2240 (2014)

    Article  Google Scholar 

  10. Nandedkar, A.V., Mukherjee, J., Sural, S.: Text and non-text separation in scanned color-official documents. In: Mukherjee, S., et al. (eds.) ICVGIP 2016. LNCS, vol. 10481, pp. 231–242. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-68124-5_20

    Chapter  Google Scholar 

  11. Ning, Q., Zhu, J., Zhong, Z., Hoi, S.C., Chen, C.: Scalable image retrieval by sparse product quantization. IEEE Trans. Multimedia 19(3), 586–597 (2016)

    Article  Google Scholar 

  12. Roy, P.P., Pal, U., Lladós, J.: Document seal detection using GHT and character proximity graphs. Pattern Recogn. 44(6), 1282–1295 (2011)

    Article  Google Scholar 

  13. Rusiñol, M., Lladós, J.: Efficient logo retrieval through hashing shape context descriptors. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 215–222. ACM (2010)

    Google Scholar 

  14. Srihari, S.N., et al.: Document image retrieval using signatures as queries. In: Proceedings of the 2nd International Conference on Document Image Analysis for Libraries (DIAL), pp. 198–203. IEEE (2006)

    Google Scholar 

  15. Tencer, L., Renáková, M., Cheriet, M.: Sketch-based retrieval of document illustrations and regions of interest. In: Proceedings of the 12th International Conference on Document Analysis and Recognition, pp. 728–732. IEEE (2013)

    Google Scholar 

  16. Zhu, G., Zheng, Y., Doermann, D., Jaeger, S.: Signature detection and matching for document image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 31(11), 2015–2031 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amit V. Nandedkar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nandedkar, A.V., Nandedkar, A.V. (2020). Multimodal Query Based Approach for Document Image Retrieval. In: Nain, N., Vipparthi, S., Raman, B. (eds) Computer Vision and Image Processing. CVIP 2019. Communications in Computer and Information Science, vol 1147. Springer, Singapore. https://doi.org/10.1007/978-981-15-4015-8_32

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-4015-8_32

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-4014-1

  • Online ISBN: 978-981-15-4015-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics