Abstract
In the traditional teaching procedure, the repetitive labor of correcting arithmetic exercise brings huge human costs. To reduce these costs and improve the given teaching efficiency, we propose a novel intelligent arithmetic evaluation system, which can automatically identify the meaning of each arithmetic question and make a reasonable judgment or decision. The designed evaluation system can be divided into two modules with detection and identification. In the detection module, due to the intensive distribution and various formats of arithmetic questions in the test papers, we adopt the MixNet-YOLOv3 network with scale balance and lightweight to achieve speed-accuracy trade-off with the mAP being up to 0.989; In the recognition module, considering the formats of each arithmetic problem are mostly fixed, we employ the CRNN network based on the CTC decoding mechanism to achieve an accuracy being up to 0.971. By the incorporation of two networks, the proposed system is capable of intelligently evaluating arithmetic exercise in mobile devices.
T. Liu, C. Liang, X. Dai are also with Jiangsu Provincial Key Laboratory of Image Processing and Image Communication, Key Laboratory of Broadband Wireless Communication and Sensor Network Technology, Ministry of Education, and also with Jiangsu Provincial Engineering Research Center for High Performance Computing and Intelligent Processing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2963–2970 (2010)
Huang, W., Qiao, Y., Tang, X.: Robust scene text detection with convolutional neural networks induced mser trees. Eur. Conf. Comput. Vis. 1(2), 3 (2014)
Tian, S., Pan, Y., Huang, C., Lu, S., Yu, K., Tan, C.L.: Text flow: a unified text detection system in natural scene images. In: IEEE International Conference on Computer Vision, pp. 4651–4659 (2015)
Redmon, J., Farhadi, A.: Yolov3: An incremental improvement, arXiv preprint arXiv:1804.02767 (2018)
Tan, M., Le, Q.V.: Mixconv: Mixed depthwise convolutional kernels, arXiv preprint arXiv:1907.09595 (2019)
Shi, B., Bai, X., Yao, C.: An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39(11), 2298–2304 (2016)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Watter, M., Springenberg, J., Boedecker, J., Riedmiller, M.: Embed to control: a locally linear latent dynamics model for control from raw images. In: Advances in Neural Information Processing Systems, pp. 2746–2754 (2015)
Liu, M., Yang, J., Song, T., Hu, J., Gui, G.: Deep learning-inspired message passing algorithm for efficient resource allocation in cognitive radio networks. IEEE Trans. Veh. Technol. 69(1), 641–653 (2019)
Wang, Y., Liu, M., Yang, J., Gui, G.: Data-driven deep learning for automatic modulation recognition in cognitive radios. IEEE Trans. Veh. Technol. 68(4), 4074–4077 (2019)
Zhao, Y., Chen, Q., Cao, W., Yang, J., Xiong, J., Gui, G.: Deep learning for risk detection and trajectory tracking at construction sites. IEEE Access 7, 30905–30912 (2019)
Shao, L., Li, M., Yuan, L., Gui, G.: InMAS: deep learning for designing intelligent making system. IEEE Access 7, 51104–51111 (2019)
Tian, Z., Huang, W., He, T., He, P., Qiao, Yu.: Detecting text in natural image with connectionist text proposal network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 56–72. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_4
Liao, M., Shi, B., Bai, X., Wang, X., Liu, W.: Textboxes: a fast text detector with a single deep neural network. In: Thirty-First AAAI Conference on Artificial Intelligence (2017)
Lyu, P., Liao, M., Yao, C., Wu, W., Bai, X.: Mask textspotter: an end-to-end trainable neural network for spotting text with arbitrary shapes. In: European Conference on Computer Vision (ECCV), pp. 67–83 (2018)
Wang, W., et al.: Shape robust text detection with progressive scale expansion network. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 9336–9345 (2019)
Zhou, X., et al.: East: an efficient and accurate scene text detector. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5551–5560 (2017)
Pusateri, E., Ambati, B.R., Brooks, E., Platek, O., McAllaster, D., Nagesha, V.: A mostly data-driven approach to inverse text normalization. In: INTERSPEECH, pp. 2784–2788 (2017)
Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Liu, W., et al.: SSD: Single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask r-cnn. In: IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)
Shi, B., Yang, M., Wang, X., Lyu, P., Yao, C., Bai, X.: Aster: an attentional scene text recognizer with flexible rectification. IEEE Trans. Pattern Anal. Mach. Intell. 41(9), 2035–2048 (2018)
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.-C.: Mobilenetv2: inverted residuals and linear bottlenecks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4510–4520 (2018)
Tan, M., Le, Q.V.: Efficientnet: Rethinking model scaling for convolutional neural networks, arXiv preprint arXiv:1905.11946 (2019)
Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., Xu, C.: Ghostnet: more features from cheap operations. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1580–1589 (2020)
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China (61001152, 61071091, 31671006, 61572503, 61772286, 61872199, 61872424 and 6193000388), China Scholarship Council.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, T., Liang, C., Dai, X., Luo, J. (2021). Arithmetic Evaluation System Based on MixNet-YOLOv3 and CRNN Neural Networks. In: Del Bimbo, A., et al. Pattern Recognition. ICPR International Workshops and Challenges. ICPR 2021. Lecture Notes in Computer Science(), vol 12665. Springer, Cham. https://doi.org/10.1007/978-3-030-68821-9_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-68821-9_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-68820-2
Online ISBN: 978-3-030-68821-9
eBook Packages: Computer ScienceComputer Science (R0)