skip to main content
10.1145/3371238.3371248acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccseConference Proceedingsconference-collections
research-article

Tag Information Recognition Approaches and Algorithms for Cross-Border Products Checking

Authors Info & Claims
Published:18 October 2019Publication History

ABSTRACT

The images with fixed layouts, such as images from ID cards, driving licenses, and invoices can be recognized from prior knowledge[1]-[7]. However, The non-immobilized images, such as product labels at ports, is very difficult to be extracted structured data information from tag images because the formats and contents of tags in different countries and different product vary widely[8]. The process is complex and the error rate is high.

This paper combines the characteristics of the Cross-Border Products label, overall format complex and simple local structure (top-to-down and left-to-right), and proposes a method for identifying and structuring port commodity label information. The method mainly establishes a template library of keyword and data unit information of commodity labels according to the port commodity classification and then separates the keyword and the data information from the multi-line text with accurate location information recognized by the OCR engine. Finally, the keyword and data are structured according to the local layout pattern between the keyword and the data, and the structured Cross-Border product information is obtained.

References

  1. Z. Zhang, C. Zhang, W. Shen, C. Yao, W. Liu, and X. Bai, (2016)"Multi-oriented text detection with fully convolutional networks," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4159--4167.Google ScholarGoogle Scholar
  2. S. Tian, Y. Pan, C. Huang, S. Lu, K. Yu, and C. Lim Tan, (,2015) "Text flow: A unifed text detection system in natural scene images," in Proceedings of the IEEE international conference on computer vision, pp. 4651--4659.Google ScholarGoogle Scholar
  3. M. Jaderberg, K. Simonyan, A. Vedaldi, and A. Zisserman, (2016)"Reading text in the wild with convolutional neural networks," International Journal of Computer Vision, vol. 116, no. 1, pp. 1--20.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. He, W. Huang, Y. Qiao, C. C. Loy, and X. Tang, (2016) "Reading scene text in deep convolutional sequences.," in AAAI, vol. 16, pp. 3501--3508.Google ScholarGoogle Scholar
  5. T. He, W. Huang, Y. Qiao, and J. Yao, (2016)"Accurate text localization innatural image with cascaded convolutional text network," arXiv preprint arXiv:1603.09423.Google ScholarGoogle Scholar
  6. A. Gupta, A. Vedaldi, and A. Zisserman, (2016) "Synthetic data for textlocalisation in natural images," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2315--2324.Google ScholarGoogle Scholar
  7. W. Huang, Y. Qiao, and X. Tang, (2014)"Robust scene text detection with convolution neural network induced mser trees," in European Conference on Computer Vision, pp. 497--511, Springer.Google ScholarGoogle Scholar
  8. J.-Y. Ramel, M. Crucianu, N. Vincent, C. Faure (2006). Detection, Extraction and Representation of Tables. Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR'03).Google ScholarGoogle Scholar
  9. Y. Shinyama and S. Sekine, (2006)"Preemptive information extraction using unrestricted relation discovery," in Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pp. 304--311, Association for Computational Linguistics.Google ScholarGoogle Scholar
  10. H. Dejean, (2015). "Extracting structured data from unstructured document with incomplete resources". in Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, pp. 271--275, IEEE, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. A. Ferrucci, (2012)"Introduction to "this is watson"," IBM Journalof Research and Development, vol. 56, no. 3.4, pp. 1--1.Google ScholarGoogle Scholar
  12. A. Arasu and H. Garcia-Molina, (2003) "Extracting structured data from web pages," in Proceedings of the 2003 ACM SIGMOD international conference on Management of data, pp. 337--348, ACM.Google ScholarGoogle Scholar
  13. B. Liu, S. Zhang, Z. Hong and X. Ye, (2018) A Horizontal Tilt Correction Method for Ship License Numbers Recognition, Journal of Physics: Conference Series, IOP Publishing.Google ScholarGoogle Scholar
  14. M. Busta, L. Neumann, and J. Matas, (2015) "Fastext: Efcient unconstrained scene text detector," in Proceedings of the IEEE International Conference on Computer Vision, pp. 1206--1214.Google ScholarGoogle Scholar
  15. Y. Ye, S. Zhu, J. Wang, Q. Du, Y. Yang, D. Tu, L. Wang and J. Luo (2018). A unifed scheme of text localization and structured data extraction for joint OCR and data mining. 2018 IEEE International Conference on Big Data (Big Data).Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Tag Information Recognition Approaches and Algorithms for Cross-Border Products Checking

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          ICCSE'19: Proceedings of the 4th International Conference on Crowd Science and Engineering
          October 2019
          246 pages
          ISBN:9781450376402
          DOI:10.1145/3371238

          Copyright © 2019 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 18 October 2019

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          ICCSE'19 Paper Acceptance Rate35of92submissions,38%Overall Acceptance Rate92of247submissions,37%
        • Article Metrics

          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader