Abstract
While paleographers face various challenges in the dating process of historical manuscripts, computer scientists encounter multiple obstacles in automating them. To address this problem, machine learning and deep learning techniques which have proven effective in other domains have been used. This study introduces a system that integrates two primary methods of feature extraction - deep learning and hand-crafted features. The Harris-detector was utilized to extract key-points from the manuscripts, and the K-means algorithm was applied to cluster them. From these clusters, patches of size nxn were extracted. Then, a densnet model was trained on these patches using transfer learning, finally we used majority voting on document patches to determine the date based on document level. The effectiveness of this approach was evaluated on two datasets, MPS and CLaMM, and achieved a CS of 96% and 65% and an MAE of 4.01 years and 20.1 years respectively for both datasets.
Similar content being viewed by others
References
Omayio, E.O., Indu, S., Panda, J.: Historical manuscript dating: traditional and current trends. Multimed. Tools Appl. (2022). https://doi.org/10.1007/s11042-022-12927-8
He, S., Samara, P., Burgers, J., Schomaker, L.: Towards style-based dating of historical documents. In: International Conference of Frontiers in Handwriting Recognition (ICFHR), pp. 265–270 (2014)
He, S., Samara, P., Burgers, J., Schomaker, L.: A Multiple-Label guided clustering algorithm for historical document dating and localization. IEEE Trans. Image Process. 25(11), 5252–5265 (2016). https://doi.org/10.1109/TIP.2016.2602078
He, S., Samara, P., Burgers, J., Schomaker, L.: Historical manuscript dating based on temporal pattern codebook. Comput. Vis. Image Underst. 152, 167–175 (2016). https://doi.org/10.1016/j.cviu.2016.08.008
Hamid, A., Bibi, M., Siddiqi, I., Moetesum, M.: Historical manuscript dating using textural measures. In: International Conference on Frontiers of Information Technology (FIT), pp. 235–241 (2018). https://doi.org/10.1109/FIT.2018.00048
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.A.: Inception-v4, inception-resnet and the impact of residual connections on learning. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 4278–4284 (2017)
Buades, A., Coll, B., Morel, J.M.: Non-local means denoising. Image Process. On Line 1, 208–212 (2011)
Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 6, 679–698 (1986)
Stutzmann, D.: Clustering of medieval scripts through computer image analysis: towards an evaluation protocol. Digit Medievalist J. 10 (2016). https://doi.org/10.16995/dm.61
Cloppet, F., Eglin, V., Helias-Baron, M., Kieu, V.C., Stutzmann, D., Vincent, N.: ICDAR 2017 competition on the classification of medieval handwritings in latin script. In: 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) 2017, Kyoto, pp. 1371–1376 (2017). https://doi.org/10.1109/ICDAR.2017.224
He, K., Zhang, X., Ren, S., Sun, J.: Deep Residual Learning for Image Recognition, arXiv151203385 Cs (2015)
Ioffe, S., Szegedy, C.: Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, arXiv150203167 Cs (2015)
Christlein, V., Bernecker, D., Angelopoulou, E.: Writer identification using VLAD encoded contour-Zernike moments. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Nancy, pp. 906–910 (2015)
Cloppet, F., Eglin, V., Kieu, V.C., Stutzmann, D., Vincent, N.: ICFHR2016 Competition on the classification of medieval handwritings in latin script. In: Proceedings of the International Conference on Frontiers in Handwriting Recognition (ICFHR) (2016)
Jin, X., Han, J.: K-Means Clustering. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning. Springer, Boston (2011). https://doi.org/10.1007/978-0-387-30164-8_425
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proceedings of the 4th Alvey Vision Conference, pp. 147–151 (1988)
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017)
Wu, K., Otoo, E., Suzuki, K.: Two strategies to speed up connected component labeling algorithms. IEEE Trans. Image Process. 9(11)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Boudraa, M., Bennour, A. (2024). Combination of Local Features and Deep Learning to Historical Manuscripts Dating. In: Bennour, A., Bouridane, A., Chaari, L. (eds) Intelligent Systems and Pattern Recognition. ISPR 2023. Communications in Computer and Information Science, vol 1940. Springer, Cham. https://doi.org/10.1007/978-3-031-46335-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-031-46335-8_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46334-1
Online ISBN: 978-3-031-46335-8
eBook Packages: Computer ScienceComputer Science (R0)