research-article

The Impact of Unassimilated Loanwords on the Latin Lexicon. A Qualitative and Quantitative Analysis

Authors:
Marco Budassi

Università degli Studi di Pavia, Pavia, Italy

Università degli Studi di Pavia, Pavia, Italy
View Profile

,
Marco Passarotti

Università Cattolica del Sacro Cuore, Milan, Italy

Università Cattolica del Sacro Cuore, Milan, Italy
View Profile

DATeCH2017: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural HeritageJune 2017Pages 85–90https://doi.org/10.1145/3078081.3078083

Published:01 June 2017Publication History

DATeCH2017: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage

Pages 85–90

ABSTRACT

The recent enhancement of the morphological analyser for Latin Lemlat with a large Onomasticon enables us to analyse both the morphology and the distribution of loanwords in the Latin lexicon. In this paper, first we describe the categories of proper names that were not possible to insert into Lemlat automatically, showing that a large part of them are loanwords. Then, we present the results of a qualitative analysis of loanwords to detect those 'exceptional' endings that identify loanwords featuring inflectional properties not assimilated to those regular in the morphological system of Latin. In the end, we report a quantitative analysis of data to study the frequency of such loanwords in Latin texts.

References

Marco Budassi and Marco Passarotti. 2016. Nomen Omen. Enhancing the Latin Morphological Analyser Lemlat with an Onomasticon. In Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH), Berlin, Germany. 90--94.Google ScholarCross Ref
Roberto Busa. 1988. Totius Latinitatis lemmata quae ex Aeg. Forcellini Patavina editione 1940 a fronte, a tergo atque morphologice opera IBM automati ordinaverat Robertus Busa SJ. Istituto Lombardo, Accademia di scienze e lettere, Milano.Google Scholar
Gregory Crane. 1991. Generating and parsing classical Greek. Literary and Linguistic Computing 6, 4 (1991), 243--245. Google ScholarCross Ref
Egidio Forcellini. 1940. Lexicon Totius Latinitatis / ad Aeg. Forcellini lucubratum, dein a Jos. Furlanetto emendatum et auctum; nunc demum Fr. Corradini et Jos. Perin curantibus emendatius et auctius meloremque informam redactum adjecto altera quasi parte Onomastico totius latinitatis opera et studio ejusdem Jos. Perin. Typis Seminarii, Padova.Google Scholar
Karl E Georges and Heinrich Georges. 1913--1918. Ausführliches Lateinisch-Deutsches Handwörterbuch. Hahn, Hannover.Google Scholar
Peter GW Glare. 1982. Oxford latin dictionary. Clarendon Press. Oxford University Press, Oxford.Google Scholar
Otto Gradenwitz. 1904. Laterculi Vocum Latinarum. Hirzel, Leipzig.Google Scholar
Roberto Gusmani. 1973. Aspetti delprestito linguistico. Libreria scientifica editrice, Napoli.Google Scholar
Roberto Gusmani. 1973. Di alcuni presunti prestiti greci in latino. BSL 3 (1973), 76--88.Google Scholar
Marco Passarotti. 2004. Development and perspectives of the Latin morphological analyser LEMLAT. Linguistica Computazionale 20, A (2004), 397--414.Google Scholar
Sarah Grey Thomason and Terrence Kaufman. 1992. Language contact, creolization, and genetic linguistics. University of California Press, Berkeley.Google Scholar
Paul Tombeur. 1998. Thesaurus formarum totius Latinitatis: a Plauto usque ad saeculum XXum; TF.[2]. CETEDOC Index of Latin forms: database for the study of the vocabulary of the entire Latin world; base de données pour l'étude du vocabulaire de toute la latinité. Brepols, Turnhout.Google Scholar
Margaret MT Watmough. 1997. Studies in the Etruscan loanwords in Latin. Vol. 33. Olschki, Firenze.Google Scholar

Index Terms

The Impact of Unassimilated Loanwords on the Latin Lexicon. A Qualitative and Quantitative Analysis
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Phonology / morphology

Recommendations

A novel unsupervised corpus-based stemming technique using lexicon and corpus statistics
Abstract
Word Stemming is a widely used mechanism in the fields of Natural Language Processing, Information Retrieval, and Language Modeling. Language-independent stemmers discover classes of morphologically related words from the ambient ...
Read More
An unsupervised method for identifying loanwords in Korean

This paper presents an unsupervised method for developing a character-based n-gram classifier that identifies loanwords or transliterated foreign words in Korean text. The classifier is trained on an unlabeled corpus using the Expectation Maximization ...
Read More
Impact of Morphological Segmentation on Pre-trained Language Models
Intelligent Systems
Abstract
Pre-trained Language Models are the current state-of-the-art in many natural language processing tasks. These models rely on subword-based tokenization to solve the problem of out-of-vocabulary words. However, commonly used subword segmentation ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

DATeCH2017: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage
June 2017
179 pages
ISBN:9781450352659
DOI:10.1145/3078081

Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 June 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Language Contact
Latin
Lexicography
Loanwords
Morphology
Natural Language Processing
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
DATeCH2017 Paper Acceptance Rate29of37submissions,78%Overall Acceptance Rate60of86submissions,70%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 34
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

The Impact of Unassimilated Loanwords on the Latin Lexicon. A Qualitative and Quantitative Analysis

DATeCH2017: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage

ABSTRACT

References

Cited By

Index Terms

Recommendations

A novel unsupervised corpus-based stemming technique using lexicon and corpus statistics

An unsupervised method for identifying loanwords in Korean

Impact of Morphological Segmentation on Pre-trained Language Models

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

The Impact of Unassimilated Loanwords on the Latin Lexicon. A Qualitative and Quantitative Analysis

DATeCH2017: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage

ABSTRACT

References

Cited By

Index Terms

Recommendations

A novel unsupervised corpus-based stemming technique using lexicon and corpus statistics

An unsupervised method for identifying loanwords in Korean

Impact of Morphological Segmentation on Pre-trained Language Models

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media