Knowledge Inference Model of OCR Conversion Error Rules Based on Chinese Character Construction Attributes Knowledge Graph

Zhang, Xiaowen; Wang, Hairong; Gu, Wenjie

doi:10.1007/978-3-030-60457-8_34

Xiaowen Zhang¹²,
Hairong Wang¹² &
Wenjie Gu¹²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12431))

Included in the following conference series:

CCF International Conference on Natural Language Processing and Chinese Computing

1961 Accesses
1 Citations

Abstract

OCR is a character conversion method based on image recognition. The complexity of the character and the image quality plays a key role in the conversion accuracy. The OCR conversion process has the characteristics of irregular conversion errors and the combination between incorrect conversion words and context of original location in certain text scenarios is established in semantic. In this paper, we propose an OCR conversion error rules inference model based on Chinese character construction attribute knowledge graph to analyze and inference the structure and complexity of Chinese characters. The model integrates a variety of coding methods, extracts features of entities and relationships of different data types with different encoder in the knowledge graph, uses convolutional neural networks to learn and inference the unknown error rules in the OCR conversion. In addition, in order to enable the triple feature matrix to fully contain the construction attribute information of the Chinese characters, a feature crossover algorithm for feature diffusion of the triple feature matrix is introduced. In this algorithm, the relation matrix and the entities matrix are crossed to generate the new feature matrix which can better represent the triple of knowledge graph. The experimental results show that, compared with the current mainstream knowledge inference model, the OCR conversion error rules inference model incorporating the feature cross algorithm has achieved important improvements in MRR, Hits@1, Hits@2 and other evaluation indicators on public data sets and task-related data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Fink, F., Schulz, K.U., Springmann, U.: Profiling of OCR’ed historical texts revisited. In: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, Gottingen, pp. 61–66. Association for Computing Machinery (2017)
Google Scholar
Bast, H., Claudius, K.: A benchmark and evaluation for text extraction from pdf. In: Proceedings of the 17th ACM/IEEE Joint Conference on Digital Libraries, Toronto, pp. 99–108. IEEE Press (2017)
Google Scholar
Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, Lake Tahoe, vol. 2, pp. 2787–2795. Curran Associates Inc. (2013)
Google Scholar
Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, Quebec, pp. 1112–1119. AAAI Press (2014)
Google Scholar
Lin, Y., Liu, Z., Sun, M., Liu, Y., Zhu, X.: Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, pp. 2181–2187. AAAI Press (2015)
Google Scholar
Ji, G., Liu, K., He, S., Zhao, J.: Knowledge graph completion with adaptive sparse transfer matrix. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, pp. 985–991. AAAI Press (2016)
Google Scholar
Wen, Z.: Knowledge graph embedding with diversity of structures. In: Proceedings of the 26th International Conference on World Wide Web Companion, Perth, pp. 747–753. International World Wide Web Conferences Steering Committee (2017)
Google Scholar
Pouya, P., Liyan, C., Sameer, S.: Embedding multimodal relational data for knowledge base completion. ArXiv, abs/1809.01341 (2018)
Google Scholar
Kanojia, V., Maeda, H., Togashi, R., Fujita, S.: Enhancing knowledge graph embedding with probabilistic negative sampling. In: Proceedings of the 26th International Conference on World Wide Web Companion, Perth, pp. 801–802. International World Wide Web Conferences Steering Committee (2017)
Google Scholar
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Doha, vol. 14, pp. 1532–1543. Association for Computational Linguistics (2014)
Google Scholar
Yan, S., Shuming, S., Jing, L., Haisong, Z.: Directional skip-gram: explicitly distinguishing left and right context for word embeddings. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, New Orleans, vol. 2, pp. 175–180. Association for Computational Linguistics (2018)
Google Scholar

Download references

Acknowledgment

This work has been support by the North Minzu university key research project (No. 2019KJ26), the Ningxia first-class discipline and scientific research projects (electronic science and technology) (NO. NXYLXK2017A07) and the Natural science foundation of Ningxia Province (NO. 2020AAC03218).

Author information

Authors and Affiliations

School of Computer Science and Engineering, North Minzu University, Yinchuan, 750021, China
Xiaowen Zhang, Hairong Wang & Wenjie Gu

Authors

Xiaowen Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hairong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wenjie Gu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hairong Wang .

Editor information

Editors and Affiliations

ECE & Ingenuity Labs Research Institute, Queen’s University, Kingston, ON, Canada
Xiaodan Zhu
Department of Computer Science and Technology, Tsinghua University, Beijing, China
Min Zhang
School of Computer Science and Technology, Soochow University, Suzhou, China
Yu Hong
College of Intelligence and Computing, Tianjin University, Tianjin, China
Ruifang He

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, X., Wang, H., Gu, W. (2020). Knowledge Inference Model of OCR Conversion Error Rules Based on Chinese Character Construction Attributes Knowledge Graph. In: Zhu, X., Zhang, M., Hong, Y., He, R. (eds) Natural Language Processing and Chinese Computing. NLPCC 2020. Lecture Notes in Computer Science(), vol 12431. Springer, Cham. https://doi.org/10.1007/978-3-030-60457-8_34

Download citation

DOI: https://doi.org/10.1007/978-3-030-60457-8_34
Published: 02 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60456-1
Online ISBN: 978-3-030-60457-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the China Computer Federation (CCF) (opens in a new tab)