A Quantitative Categorization of Phonemic Dialect Features in Context

Nagy, Naomi; Zhang, Xiaoli; Nagy, George; Schneider, Edgar W.

doi:10.1007/11508373_25

Naomi Nagy²²,
Xiaoli Zhang²³,
George Nagy²³ &
…
Edgar W. Schneider²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3554))

Included in the following conference series:

International and Interdisciplinary Conference on Modeling and Using Context

1218 Accesses
3 Citations

Abstract

We test a method of clustering dialects of English according to patterns of shared phonological features. Previous linguistic research has generally considered phonological features as independent of each other, but context is important: rather than considering each phonological feature individually, we compare the patterns of shared features, or Mutual Information (MI). The dependence of one phonological feature on the others is quantified and exploited. The results of this method of categorizing 59 dialect varieties by 168 binary internal (pronunciation) features are compared to traditional groupings based on external features (e.g., ethnic, geographic). The MI and size of the groups are calculated for taxonomies at various levels of granularity and these groups are compared to other analyses of geographic and ethnic distribution. Applications that could be improved by using MI methods are suggested.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Classifying World Englishes from a Lexical Perspective: A Corpus-Based Approach

Cluster Analysis for Commonalities Between Words of Different Languages

Dialect Typology: Recent Advances

References

Fetzer, A.: Recontextualizing context: Grammaticality meets appropriateness. Benjamins, Philadelphia (2004)
Google Scholar
Giunchiglia, F., Bouquet, P.: Introduction to contextual reasoning. An Artificial Intelligence Perspective. In: Kokinov, B. (ed.) Perspectives on Cognitive Science 3. NBU Press, Sofia (1997)
Google Scholar
Sarkar, P., Nagy, G.: Style consistent classification of isogenous patterns. IEEE Trans. Pattern Analysis and Machine Intelligence 27(1), 88–98 (2005)
Article Google Scholar
Veeramachaneni, S., Nagy, G.: Style context with second order statistics. IEEE Trans. Pattern Analysis and Machine Intelligence 27(1), 14–22 (2005)
Article Google Scholar
Carver, C.M.: American Regional Dialects: A Word Geography. University of Michigan Press, Ann Arbor (1987)
Google Scholar
Labov, W., Ash, S., Boberg, C.: Atlas of North American English. Mouton de Gruyter, Paris (2005)
Book Google Scholar
Hughes, A., Trudgill, P.: English Accents and Dialects: An Introduction to Social and Regional Varieties of British English. Edward Arnold, London (1987)
Google Scholar
Trudgill, P.: The Dialects of England. Blackwell, London (1999)
Google Scholar
Nerbonne, J., Kleiweg, P.: Lexical distance in LAMSAS. Computers and the Humanities 37(3), 339–357 (2003)
Article Google Scholar
Gooskens, C., Heeringa, W.: Perceptive evaluation of Levenshtein dialect distance measurements using Norwegian dialect data. Language Variation and Change 16(3), 189–207 (2004)
Article Google Scholar
Cheng, C.-C.: Measuring Relationship among Dialects: DOC [Dictionary on computer] and Related Resources. Computational Linguistics and Chinese Language Processing 2(1), 41–72 (1997)
Google Scholar
Heeringa, W., Braun, A.: The Use of the Almeida-Braun System in the Measurement of Dutch Dialect Distances. Computers and the Humanities 37(3), 257–271 (2003)
Article Google Scholar
Heeringa, W.: Measuring dialect pronunciation differences using Levenshtein distance. University of Groningen, Groningen (2004)
Google Scholar
Heggarty, P.A.: Measured Language: From First Principles to New Techniques for Putting Numbers on Language Similarity. Blackwell, Oxford (in prep.)
Google Scholar
Schneider, E.W., et al. (eds.): A Handbook of Varieties of English: A Multimedia Reference Tool. Mouton de Gruyter, Berlin (2005)
Google Scholar
Nagy, N.: Addenda to Categorization of phonemic dialect features in context (2005), http://pubpages.unh.edu/~ngn/papers/Context05/CONTEXT05_addenda
Wells, J.C. (ed.): Accents of English. Cambridge University Press, Cambridge (1982)
Google Scholar
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, Hoboken (1980)
Google Scholar
Day, W.H.E., Edelsbrunner, H.: Efficient algorithms for agglomerative hierarchical clustering methods. Journal of Classification 1(1), 7–24 (1984)
Article MATH Google Scholar
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice Hall, Englewood Cliffs (1988)
MATH Google Scholar
Theodoridis, S., Koutroumbas, K.: Pattern Recognition. Academic, NY (1999)
Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley-Interscience, Hoboken (2001)
MATH Google Scholar
Topchy, A., et al.: Adaptive Clustering Ensembles. In: Proc. ICPR, Cambridge (2004)
Google Scholar
Jain, A.K., et al.: Landscape of Clustering Algorithms. In: Proc. ICPR, Cambridge (2004)
Google Scholar
Redner, R.A., Walker, H.F.: Mixture densities, maximum likelihood, and the EM algorithm. SIAM Review 26(2), 195–235 (1984)
Article MathSciNet MATH Google Scholar
Topchy, A., Jain, A.K., Punch, W.: A Mixture Model for Clustering Ensembles. In: Proc. SIAM International Conference on Data Mining (SDM 2004), Florida (2004)
Google Scholar
Foulkes, P.: Current trends in British sociophonetics. Univ. of PA Working Papers in Linguistics: A Selection of Papers from NWAV 30 8(3), 75–86 (2002)
Google Scholar
Rabiner, L.R., Juang, B.H.: Fundamentals of Speech Recognition. Prenctice Hall, Englewood Cliffs (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

English Department, University of New Hampshire, Durham, NH, 03824, USA
Naomi Nagy
DocLab, ECSE, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
Xiaoli Zhang & George Nagy
Department of English Linguistics, Regensburg University, Regensburg, Germany
Edgar W. Schneider

Authors

Naomi Nagy
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoli Zhang
View author publications
You can also search for this author in PubMed Google Scholar
George Nagy
View author publications
You can also search for this author in PubMed Google Scholar
Edgar W. Schneider
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Human-Computer Interaction Institute, Carnegie Mellon University, 5000 Forbes Ave, 15213-3891, Pittsburgh, PA, USA
Anind Dey
Central and East European Center for Cognitive Science, New Bulgarian University, 21 Montevideo Street, 1618, Sofia, Bulgaria
Boicho Kokinov
Computer Science Department, Indiana University, 47405, Bloomington, IN, U.S.A.
David Leake
Department of Computer Science, University of Maine, 04469, Orono, Maine, USA
Roy Turner

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nagy, N., Zhang, X., Nagy, G., Schneider, E.W. (2005). A Quantitative Categorization of Phonemic Dialect Features in Context. In: Dey, A., Kokinov, B., Leake, D., Turner, R. (eds) Modeling and Using Context. CONTEXT 2005. Lecture Notes in Computer Science(), vol 3554. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508373_25

Download citation

DOI: https://doi.org/10.1007/11508373_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26924-3
Online ISBN: 978-3-540-31890-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics