Abstract
A file containing the last three characters of words from Roget's Thesaurus was created. Every entry was classified as belonging to one of the five parts of speech: nouns, verbs, adjectives, adverbs, and prepositions. The machine learning system LERS induced rules from this file. The paper describes this experiment. Two interesting regularities of the English language were discovered. Moreover, using a set of rules induced by LERS it is feasible to recognize part of speech of a word on the basis of its last three characters with the expected error rate of 26.71 %.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Booker, L. B., Goldberg, D. E., and Holland, J. F.: Classifier systems and genetic algorithms. In Machine Learning. Paradigms and Methods. Carbonell, J. G. (ed.), The MIT Press, 1990, 235–282.
Chan, C. C. and Grzymala-Busse, J. W.: On the attribute redundancy and the learning programs ID3, PRISM, and LEM2. Department of Computer Science, University of Kansas, TR-91-14, 1991, 20 pp.
Grzymala-Busse, J. W.: Managing Uncertainty in Expert Systems. Kluwer Academic Publishers, 1991.
Grzymala-Busse, J. W.: LERS—A system for learning from examples based on rough sets. In Intelligent Decision Support. Handbook of Applications and Advances of the Rough Sets Theory. Slowinski, R. (ed.), Kluwer Academic Publishers, 1992, 3–18.
Grzymala-Busse, J. W.: Managing uncertainty in machine learning from examples. Proc. of the Third Intelligent Information Systems Workshop, Wigry, Poland, June 6–11, 1994, 70–84.
Grzymala-Busse, J. W. and Wang, C. P. B.: Classification and rule induction based on rough sets. Proc. of the 5th IEEE International Conference on Fuzzy Systems FUZZ-IEEE'96, New Orleans, Louisiana, September 8–11, 1996, 744–747.
Holland, J. H., Holyoak K. J., and Nisbett, R. E.: Induction. Processes of Inference, Learning, and Discovery. The MIT Press, 1986.
Michalski, R. S., Mozetic, I., Hong, J. and Lavrac, N. The AQ15 inductive learning system: An overview and experiments. Department of Computer Science, University of Illinois, Rep. UIUCDCD-R-86-1260, 1986.
Pawlak, Z.: Rough sets. International Journal Computer and Information Sciences 11, 1982, 341–356.
Pawlak, Z.: Rough Sets. Theoretical Aspects of Reasoning about Data. Kluwer Academic Publishers, 1991.
Ras, Z. W.: Cooperative query answering. Proc. of the Workshop Intelligent Inf. Syst. IV, Augustow, Poland, June 5–9, 1995, 32–41.
Roget's International Thesaurus, Thomas Y. Crowell Company, 1962.
Slowinski, R. and Stefanowski, J. Handling various types of uncertainty in the rough set approach. Proc of the RKSD-93, International Workshop on Rough Sets and Knowledge Discovery, 1993, 395–397.
Ziarko, W. Analysis of uncertain information in the framework of variable precision rough sets. Found. Computing Decision Sci. 18, 1993, 381–396.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Grzymala-Busse, J.W., Old, L.J. (1997). A machine learning experiment to determine part of speech from word-endings. In: RaÅ›, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1997. Lecture Notes in Computer Science, vol 1325. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63614-5_48
Download citation
DOI: https://doi.org/10.1007/3-540-63614-5_48
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63614-4
Online ISBN: 978-3-540-69612-4
eBook Packages: Springer Book Archive