Skip to main content

Knowledge Inference Model of OCR Conversion Error Rules Based on Chinese Character Construction Attributes Knowledge Graph

  • Conference paper
  • First Online:
Natural Language Processing and Chinese Computing (NLPCC 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12431))

Abstract

OCR is a character conversion method based on image recognition. The complexity of the character and the image quality plays a key role in the conversion accuracy. The OCR conversion process has the characteristics of irregular conversion errors and the combination between incorrect conversion words and context of original location in certain text scenarios is established in semantic. In this paper, we propose an OCR conversion error rules inference model based on Chinese character construction attribute knowledge graph to analyze and inference the structure and complexity of Chinese characters. The model integrates a variety of coding methods, extracts features of entities and relationships of different data types with different encoder in the knowledge graph, uses convolutional neural networks to learn and inference the unknown error rules in the OCR conversion. In addition, in order to enable the triple feature matrix to fully contain the construction attribute information of the Chinese characters, a feature crossover algorithm for feature diffusion of the triple feature matrix is introduced. In this algorithm, the relation matrix and the entities matrix are crossed to generate the new feature matrix which can better represent the triple of knowledge graph. The experimental results show that, compared with the current mainstream knowledge inference model, the OCR conversion error rules inference model incorporating the feature cross algorithm has achieved important improvements in MRR, Hits@1, Hits@2 and other evaluation indicators on public data sets and task-related data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Fink, F., Schulz, K.U., Springmann, U.: Profiling of OCR’ed historical texts revisited. In: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, Gottingen, pp. 61–66. Association for Computing Machinery (2017)

    Google Scholar 

  2. Bast, H., Claudius, K.: A benchmark and evaluation for text extraction from pdf. In: Proceedings of the 17th ACM/IEEE Joint Conference on Digital Libraries, Toronto, pp. 99–108. IEEE Press (2017)

    Google Scholar 

  3. Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, Lake Tahoe, vol. 2, pp. 2787–2795. Curran Associates Inc. (2013)

    Google Scholar 

  4. Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, Quebec, pp. 1112–1119. AAAI Press (2014)

    Google Scholar 

  5. Lin, Y., Liu, Z., Sun, M., Liu, Y., Zhu, X.: Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, pp. 2181–2187. AAAI Press (2015)

    Google Scholar 

  6. Ji, G., Liu, K., He, S., Zhao, J.: Knowledge graph completion with adaptive sparse transfer matrix. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, pp. 985–991. AAAI Press (2016)

    Google Scholar 

  7. Wen, Z.: Knowledge graph embedding with diversity of structures. In: Proceedings of the 26th International Conference on World Wide Web Companion, Perth, pp. 747–753. International World Wide Web Conferences Steering Committee (2017)

    Google Scholar 

  8. Pouya, P., Liyan, C., Sameer, S.: Embedding multimodal relational data for knowledge base completion. ArXiv, abs/1809.01341 (2018)

    Google Scholar 

  9. Kanojia, V., Maeda, H., Togashi, R., Fujita, S.: Enhancing knowledge graph embedding with probabilistic negative sampling. In: Proceedings of the 26th International Conference on World Wide Web Companion, Perth, pp. 801–802. International World Wide Web Conferences Steering Committee (2017)

    Google Scholar 

  10. Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Doha, vol. 14, pp. 1532–1543. Association for Computational Linguistics (2014)

    Google Scholar 

  11. Yan, S., Shuming, S., Jing, L., Haisong, Z.: Directional skip-gram: explicitly distinguishing left and right context for word embeddings. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, New Orleans, vol. 2, pp. 175–180. Association for Computational Linguistics (2018)

    Google Scholar 

Download references

Acknowledgment

This work has been support by the North Minzu university key research project (No. 2019KJ26), the Ningxia first-class discipline and scientific research projects (electronic science and technology) (NO. NXYLXK2017A07) and the Natural science foundation of Ningxia Province (NO. 2020AAC03218).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hairong Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, X., Wang, H., Gu, W. (2020). Knowledge Inference Model of OCR Conversion Error Rules Based on Chinese Character Construction Attributes Knowledge Graph. In: Zhu, X., Zhang, M., Hong, Y., He, R. (eds) Natural Language Processing and Chinese Computing. NLPCC 2020. Lecture Notes in Computer Science(), vol 12431. Springer, Cham. https://doi.org/10.1007/978-3-030-60457-8_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-60457-8_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60456-1

  • Online ISBN: 978-3-030-60457-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics