Abstract
OCR is a character conversion method based on image recognition. The complexity of the character and the image quality plays a key role in the conversion accuracy. The OCR conversion process has the characteristics of irregular conversion errors and the combination between incorrect conversion words and context of original location in certain text scenarios is established in semantic. In this paper, we propose an OCR conversion error rules inference model based on Chinese character construction attribute knowledge graph to analyze and inference the structure and complexity of Chinese characters. The model integrates a variety of coding methods, extracts features of entities and relationships of different data types with different encoder in the knowledge graph, uses convolutional neural networks to learn and inference the unknown error rules in the OCR conversion. In addition, in order to enable the triple feature matrix to fully contain the construction attribute information of the Chinese characters, a feature crossover algorithm for feature diffusion of the triple feature matrix is introduced. In this algorithm, the relation matrix and the entities matrix are crossed to generate the new feature matrix which can better represent the triple of knowledge graph. The experimental results show that, compared with the current mainstream knowledge inference model, the OCR conversion error rules inference model incorporating the feature cross algorithm has achieved important improvements in MRR, Hits@1, Hits@2 and other evaluation indicators on public data sets and task-related data sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fink, F., Schulz, K.U., Springmann, U.: Profiling of OCR’ed historical texts revisited. In: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, Gottingen, pp. 61–66. Association for Computing Machinery (2017)
Bast, H., Claudius, K.: A benchmark and evaluation for text extraction from pdf. In: Proceedings of the 17th ACM/IEEE Joint Conference on Digital Libraries, Toronto, pp. 99–108. IEEE Press (2017)
Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, Lake Tahoe, vol. 2, pp. 2787–2795. Curran Associates Inc. (2013)
Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, Quebec, pp. 1112–1119. AAAI Press (2014)
Lin, Y., Liu, Z., Sun, M., Liu, Y., Zhu, X.: Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, pp. 2181–2187. AAAI Press (2015)
Ji, G., Liu, K., He, S., Zhao, J.: Knowledge graph completion with adaptive sparse transfer matrix. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, pp. 985–991. AAAI Press (2016)
Wen, Z.: Knowledge graph embedding with diversity of structures. In: Proceedings of the 26th International Conference on World Wide Web Companion, Perth, pp. 747–753. International World Wide Web Conferences Steering Committee (2017)
Pouya, P., Liyan, C., Sameer, S.: Embedding multimodal relational data for knowledge base completion. ArXiv, abs/1809.01341 (2018)
Kanojia, V., Maeda, H., Togashi, R., Fujita, S.: Enhancing knowledge graph embedding with probabilistic negative sampling. In: Proceedings of the 26th International Conference on World Wide Web Companion, Perth, pp. 801–802. International World Wide Web Conferences Steering Committee (2017)
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Doha, vol. 14, pp. 1532–1543. Association for Computational Linguistics (2014)
Yan, S., Shuming, S., Jing, L., Haisong, Z.: Directional skip-gram: explicitly distinguishing left and right context for word embeddings. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, New Orleans, vol. 2, pp. 175–180. Association for Computational Linguistics (2018)
Acknowledgment
This work has been support by the North Minzu university key research project (No. 2019KJ26), the Ningxia first-class discipline and scientific research projects (electronic science and technology) (NO. NXYLXK2017A07) and the Natural science foundation of Ningxia Province (NO. 2020AAC03218).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, X., Wang, H., Gu, W. (2020). Knowledge Inference Model of OCR Conversion Error Rules Based on Chinese Character Construction Attributes Knowledge Graph. In: Zhu, X., Zhang, M., Hong, Y., He, R. (eds) Natural Language Processing and Chinese Computing. NLPCC 2020. Lecture Notes in Computer Science(), vol 12431. Springer, Cham. https://doi.org/10.1007/978-3-030-60457-8_34
Download citation
DOI: https://doi.org/10.1007/978-3-030-60457-8_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60456-1
Online ISBN: 978-3-030-60457-8
eBook Packages: Computer ScienceComputer Science (R0)