Retrieval Of Morphological Variants In Searches Of Latin Text Databases

Schinke, Robyn; Greengrass, Mark; Robertson, Alexander M.; Willett, Peter

doi:10.1023/A:1000996413558

Retrieval Of Morphological Variants In Searches Of Latin Text Databases

Published: September 1997

Volume 31, pages 409–432, (1997)
Cite this article

Computers and the Humanities Aims and scope Submit manuscript

Robyn Schinke¹,
Mark Greengrass¹,
Alexander M. Robertson² &
…
Peter Willett²

77 Accesses
2 Citations
Explore all metrics

Abstract

This paper reports a detailed evaluation of the effectiveness of a system that has been developed for the identification and retrieval of morphological variants in searches of Latin text databases. A user of the retrieval system enters the principal parts of the search term (two parts for a noun or adjective, three parts for a deponent verb, and four parts for other verbs), this enabling the identification of the type of word that is to be processed and of the rules that are to be followed in determining the morphological variants that should be retrieved. Two different search algorithms are described. The algorithms are applied to the Latin portion of the Hartlib Papers Collection and to a range of classical, vulgar and medieval Latin texts drawn from the Patrologia Latina and from the PHI Disk 5.3 datasets. The effectiveness of these searches demonstrates the effectiveness of our procedures in providing access to the full range of classical and post-classical Latin text databases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Computer morphology for investigations of a variable text

Article 01 June 2017

Morphosyntactic Annotation of Historical Texts. The Making of the Baroque Corpus of Polish

Lexical Analysis Using Regular Expressions for Information Retrieval from a Legal Corpus

References

Ahmad F., M. Yusoff and T.M.T. Sembok "Experiments with a Malay Stemming Algorithm". Journal of the American Society for Information Science, 47 (1996), 909–918.
Article Google Scholar
Frakes, W.B. "Stemming Algorithms". In Information Retrieval: Data Structures and Algorithms. Eds. W.B. Frakes and R. Baeza-Yates, Englewood Cliffs: Prentice-Hall, 1992.
Google Scholar
Greengrass, M. "The Hartlib Papers Project. An Electronic Edition of the Past for the Future". In Changing Patterns of Online Information. UKOLUG State-Of-The-Art Conference1994. Eds. C.J. Armstrong and R.J. Hartley. Oxford: Learned Information Limited, 1994.
Google Scholar
Kalamboukis, T.Z. "Suffix Stripping with Modern Greek". Program, 29 (1995), 313–321.
Google Scholar
Kraaij,W. and R. Pohlmann "Evaluation of a Dutch Stemming Algorithm". New Review of Document and Text Management, 1 (1995), 25–43.
Google Scholar
Lennon, M., D.S. Peirce, B.D. Tarry and P. Willett "An Evaluation of some Conflation Algorithms for Information Retrieval". Journal of Information Science, 3 (1981), 177–183.
Google Scholar
Leslie, M. "The Hartlib Papers Project: Text Retrieval in Large Datasets". Literary and Linguistic Computing, 5 (1990), 58–69.
Article Google Scholar
Lovins, J.B. "Error Evaluation for Stemming Algorithms as Clustering Algorithms". Journal of the American Society for Information Science, 22 (1971), 28–40.
Google Scholar
Paice, C.D. "An Evaluation Method for Stemming Algorithms". In Proceedings of the Seventeenth Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Eds. W.B. Croft and C.J. van Rijsbergen, London: Springer-Verlag, 1994.
Google Scholar
Popovic, M. and P. Willett "The Effectiveness of Stemming for Natural-Language Access to Slovene Textual Data". Journal of the American Society for Information Science, 43 (1992), 384–390.
Article Google Scholar
Savoy, J. "Stemming of French Words Based on Grammatical Categories". Journal of the American Society for Information Science, 44 (1993), 1–9.
Article Google Scholar
Schinke, R., M. Greengrass, A.M. Robertson and P. Willett "A Stemming Algorithm for Latin Text Databases". Journal of Documentation, 52 (1996), 172–187.
Google Scholar
Solak, A. and K. Oflazer "Design and Implementation of a Spelling Checker for Turkish". Literary and Linguistic Computing, 8 (1993), 113–130.
Article Google Scholar
Sparck Jones, K. and P. Willett, Eds. Readings in Information Retrieval. San Francisco: Morgan Kaufman, 1997.
Google Scholar
Sproat, R. Morphology and Computation. Cambridge MA: MIT Press, 1992.
Google Scholar

Download references

Author information

Authors and Affiliations

Humanities Research Institute and Department of History, University of Sheffield, Sheffield, S10 2TN, UK
Robyn Schinke & Mark Greengrass
Humanities Research Institute and Department of Information Studies, University of Sheffield, Sheffield, S10 2TN, UK
Alexander M. Robertson & Peter Willett

Authors

Robyn Schinke
View author publications
You can also search for this author in PubMed Google Scholar
Mark Greengrass
View author publications
You can also search for this author in PubMed Google Scholar
Alexander M. Robertson
View author publications
You can also search for this author in PubMed Google Scholar
Peter Willett
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alexander M. Robertson.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Schinke, R., Greengrass, M., Robertson, A.M. et al. Retrieval Of Morphological Variants In Searches Of Latin Text Databases. Computers and the Humanities 31, 409–432 (1997). https://doi.org/10.1023/A:1000996413558

Download citation

Issue Date: September 1997
DOI: https://doi.org/10.1023/A:1000996413558

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Retrieval Of Morphological Variants In Searches Of Latin Text Databases

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Computer morphology for investigations of a variable text

Morphosyntactic Annotation of Historical Texts. The Making of the Baroque Corpus of Polish

Lexical Analysis Using Regular Expressions for Information Retrieval from a Legal Corpus

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Subscribe and save

Buy Now

Navigation

Retrieval Of Morphological Variants In Searches Of Latin Text Databases

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Computer morphology for investigations of a variable text

Morphosyntactic Annotation of Historical Texts. The Making of the Baroque Corpus of Polish

Lexical Analysis Using Regular Expressions for Information Retrieval from a Legal Corpus

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Subscribe and save

Buy Now

Search

Navigation