Abstract
For a large company a prototype for automatic detection of similar objects in database systems has been developed. This task has been accomplished by transferring the database object classification problem into a text classification problem and applying standard classification algorithms. Although the data provided for the task did not look promising due to the small number of positive examples, the results turned out to be very good.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Beuster, G.: MIC — A System for Classification of Structured and Unstructured Texts. Master’s thesis, University Koblenz (2001), http://www.gb/papers/thesis_mic/mic.pdf
Bouguettaya, A., Benatallah, B., Elmagarmid, A.K.: Interconnecting Heterogeneous Information Systems. Kluwer Academic Publishers, Dordrecht (1998)
Chinchor, N.: Muc-4 evaluation metrics. In: Fourth Message Understanding Conference, pp. 22–29. Morgan Kaufmann, San Francisco (1992)
Marco, D.: Building and Managing the Meta Data Repository: A Full Lifecycle Guide. John Wiley & Sons, Chichester (2000)
Maron, M.: Automatic indexing: An experimental inquiry. Journal of the ACM (JACM) 8, 404–417 (1961)
Mitchell, T.M.: Machine Learning. McGraw-Hill International Editions (1997)
Quinlan, J.: Discovering rules by induction from a large collection of examples. In: Michie, D. (ed.) Expert systems in the Micro-Electronic Age, pp. 168–201. Edinburgh University Press, Edinburgh (1979)
Rumelhart, D.D., Hinton, G.E., Williams, R.J.: Learning representations by backpropagating errors. Nature, 533–536 (1986)
Shannon, C.: A mathematical theory of communication. Bell System Technical Journal 27, 379–423 (1948)
Sheth, A., Larson, J.: Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Computing Surveys 22, 183–236 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Beuster, G., Furbach, U., Gross-Hardt, M., Thomas, B. (2003). Automatic Classification for the Identification of Relationships in a Meta-Data Repository. In: Grieser, G., Tanaka, Y., Yamamoto, A. (eds) Discovery Science. DS 2003. Lecture Notes in Computer Science(), vol 2843. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39644-4_24
Download citation
DOI: https://doi.org/10.1007/978-3-540-39644-4_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20293-6
Online ISBN: 978-3-540-39644-4
eBook Packages: Springer Book Archive