Abstract
For the content-based management and access to domain-specific data in digital libraries, special domain-knowledge and knowledge processing functionality are required. However, the integration of knowledge components has not yet become an integral part of existing digital library systems. The current paper represents the realization of a digital archive of historical music scores, integrating special domain-specific data and functionality for writer identification in historical music scores. We introduce the basic formalisms and heuristics for the representation of handwriting characteristics. To compare two handwritings we propose the usage of a normalized, weighted Hamming distance function to calculate the degree of similarity between their handwriting characteristics. For the identification of writers we employ the k-nearest neighbor method to build clusters of similar writers, based on the calculated distance. And finally, we represent and evaluate the test results from the prototype implementation of the system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Crane, G., Wulfman, C., Cerrato, L., Mahoney, A., Milbank, T., Mimno, D., Rydberg-Cox, A., Smith, D., York, C.: Towards a cultural heritage digital library. In: 2003 Joint Conference on Digital Libraries (2003)
Bruder, I., Finger, A., Heuer, A., Ignatova, T.: Towards a digital document archive for historical handwritten music scores. In: Sembok, T.M.T., Zaman, H.B., Chen, H., Urs, S.R., Myaeng, S.-H. (eds.) ICADL 2003. LNCS, vol. 2911, pp. 411–414. Springer, Heidelberg (2003)
Hand, D., Mannila, H., Smyth, P.: Principles of Data Mining. MIT Press, Cambridge (2001)
Witten, I.H., Frank, E.: Data Mining - Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)
Berkhin, P.: Survey of clustering data mining techniques. Technical report, Accrue Software, San Jose, CA (2002)
Aha, D., Kibler, D.: Instance-based learning algorithms. Machine Learning 6 (1991)
Zhang, B., Srihari, S.N., Lee, S.: Individuality of handwritten characters. In: 7th International Conference on Document Analysis and Recognition (2003)
Bensefia, A., Paquet, T., Heutte, L.: Information retrieval based writer identification. In: 5th International Conference on Enterprise Information Systems (2003)
Oracle: Darwin Installation and Administration. Release 3.7 (2000)
Kohavi, R., Sommerfield, D.: MLC++ Machine Lerning Library in C++ (1996)
Aha, D.: Tolerating noisy, irrelevant and novel attributes in instance-based learning algorithms. International Journal of Man-Machine Studies 36(1) (1992)
van Rijsbergen, C.J.: Information Retrieval (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bruder, I., Ignatova, T., Milewski, L. (2004). Knowledge-Based Scribe Recognition in Historical Music Archives. In: Heery, R., Lyon, L. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2004. Lecture Notes in Computer Science, vol 3232. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30230-8_28
Download citation
DOI: https://doi.org/10.1007/978-3-540-30230-8_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23013-7
Online ISBN: 978-3-540-30230-8
eBook Packages: Springer Book Archive