Comparing the Stability of Different Clustering Results of Dialect Data

Haimerl, Edgar; Mucha, Hans-Joachim

doi:10.1007/978-3-540-70981-7_71

Edgar Haimerl³ &
Hans-Joachim Mucha⁴

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

3781 Accesses
2 Citations

Abstract

[Mucha and Haimerl (2005)] proposed an algorithm to determine the stability of clusters found in hierarchical cluster analysis (HCA) and to calculate the rate of recovery by which an element can be reassigned to the same cluster in successive classifications of bootstrap samples. As proof of the concept this algorithm was applied to quantitative linguistics data. These investigations used only HCA algorithms. This paper will take a broader look at the stability of clustering results, and it will take different cluster algorithms into account; e.g. we compare the stability values of partitions from HCA with results from partitioning algorithms. To ease the comparison, the same data set - from dialect research of Northern Italy, as in [Mucha and Haimerl (2005)] - will be used here.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

ASCOLI, G.I. (1873): Saggi ladini. Archivio glottologico italiano, 1, 1–556.
Google Scholar
BAUER, R. (2003): Dolomitenladinische Ähnlichkeitsprofile aus dem Gadertal; ein Werkstattbericht zur dialektometrischen Analyse des ALD-I. In: Ladinia XXVIXXVII (2002–2003), 209–250.
Google Scholar
FRALEY, C. and RAFTERY, A.E. (2002): MCLUST: Software for Model-Based Clustering, Density Estimation and Discriminant Analysis. Technical Report 415, Department of Statistics, University of Washington.
Google Scholar
GOEBL, H. (1984): Dialektometrische Studien anhand italoromanischer, rätoromanischer und galloromanischer Sprachmaterialien aus AIS und ALF. Bd.1–3. Max Niemeyer, Tübingen.
Google Scholar
GOEBL, H., BAUER, R. and HAIMERL, E. (1998): Atlante linguistico del ladino dolomitico e dei dialetti limitrofi 1a parte. Dr. Ludwig Reichert Verlag, Wiesbaden.
Google Scholar
HAIMERL, E. (1998): A Database Application for the Generation of Phonetic Atlas Maps. In: J. Nerbonne (Ed.): Linguistic Databases. CSLI, Stanford, 103–116.
Google Scholar
HUBERT, L.J. and ARABIE, P. (1985): Comparing Partitions. Journal of Classification, 2, 193–218.
Article Google Scholar
KAUFMAN, L. and ROUSSEEUW, P.J. (1990): Finding Groups in Data. Wiley, New York.
Book Google Scholar
MUCHA, H.-J. (2004): Automatic Validation of Hierarchical Clustering. In: J. Antoch (Ed.): Proceedings in Computational Statistics, COMPSTAT 2004, 16th Symposium. Physica, Heidelberg, 1535–1542.
Google Scholar
MUCHA, H.-J. and HAIMERL, E. (2005): Automatic Validation of Hierarchical Cluster Analysis with Application in Dialectometry. In: C. Weihs and W. Gaul (Eds.): Classification-The Ubiquitous Challenge, Springer, Berlin, 513–520.
Chapter Google Scholar
SPÄTH, H. (1980): Cluster Analysis Algorithms. Ellis Horwood Limited, Chichester.
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Institut für Romanistik, Universität Salzburg, Akademiestraße 24, A-5020, Salzburg, Austria
Edgar Haimerl
Weierstraß-Institut für Angewandte Analysis und Stochastik, D-10117, Berlin, Germany
Hans-Joachim Mucha

Authors

Edgar Haimerl
View author publications
You can also search for this author in PubMed Google Scholar
Hans-Joachim Mucha
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Business Administration and Economics, Bielefeld University, Universitätsstr. 25, 33501, Bielefeld, Germany
Reinhold Decker
Department of Economics, Freie Universität Berlin, Garystraße 21, 14195, Berlin, Germany
Hans -J. Lenz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Haimerl, E., Mucha, HJ. (2007). Comparing the Stability of Different Clustering Results of Dialect Data. In: Decker, R., Lenz, H.J. (eds) Advances in Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70981-7_71

Download citation

DOI: https://doi.org/10.1007/978-3-540-70981-7_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70980-0
Online ISBN: 978-3-540-70981-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics