Skip to main content

Comparing the Stability of Different Clustering Results of Dialect Data

  • Conference paper
Advances in Data Analysis

Abstract

[Mucha and Haimerl (2005)] proposed an algorithm to determine the stability of clusters found in hierarchical cluster analysis (HCA) and to calculate the rate of recovery by which an element can be reassigned to the same cluster in successive classifications of bootstrap samples. As proof of the concept this algorithm was applied to quantitative linguistics data. These investigations used only HCA algorithms. This paper will take a broader look at the stability of clustering results, and it will take different cluster algorithms into account; e.g. we compare the stability values of partitions from HCA with results from partitioning algorithms. To ease the comparison, the same data set - from dialect research of Northern Italy, as in [Mucha and Haimerl (2005)] - will be used here.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • ASCOLI, G.I. (1873): Saggi ladini. Archivio glottologico italiano, 1, 1–556.

    Google Scholar 

  • BAUER, R. (2003): Dolomitenladinische Ähnlichkeitsprofile aus dem Gadertal; ein Werkstattbericht zur dialektometrischen Analyse des ALD-I. In: Ladinia XXVIXXVII (2002–2003), 209–250.

    Google Scholar 

  • FRALEY, C. and RAFTERY, A.E. (2002): MCLUST: Software for Model-Based Clustering, Density Estimation and Discriminant Analysis. Technical Report 415, Department of Statistics, University of Washington.

    Google Scholar 

  • GOEBL, H. (1984): Dialektometrische Studien anhand italoromanischer, rätoromanischer und galloromanischer Sprachmaterialien aus AIS und ALF. Bd.1–3. Max Niemeyer, Tübingen.

    Google Scholar 

  • GOEBL, H., BAUER, R. and HAIMERL, E. (1998): Atlante linguistico del ladino dolomitico e dei dialetti limitrofi 1a parte. Dr. Ludwig Reichert Verlag, Wiesbaden.

    Google Scholar 

  • HAIMERL, E. (1998): A Database Application for the Generation of Phonetic Atlas Maps. In: J. Nerbonne (Ed.): Linguistic Databases. CSLI, Stanford, 103–116.

    Google Scholar 

  • HUBERT, L.J. and ARABIE, P. (1985): Comparing Partitions. Journal of Classification, 2, 193–218.

    Article  Google Scholar 

  • KAUFMAN, L. and ROUSSEEUW, P.J. (1990): Finding Groups in Data. Wiley, New York.

    Book  Google Scholar 

  • MUCHA, H.-J. (2004): Automatic Validation of Hierarchical Clustering. In: J. Antoch (Ed.): Proceedings in Computational Statistics, COMPSTAT 2004, 16th Symposium. Physica, Heidelberg, 1535–1542.

    Google Scholar 

  • MUCHA, H.-J. and HAIMERL, E. (2005): Automatic Validation of Hierarchical Cluster Analysis with Application in Dialectometry. In: C. Weihs and W. Gaul (Eds.): Classification-The Ubiquitous Challenge, Springer, Berlin, 513–520.

    Chapter  Google Scholar 

  • SPÄTH, H. (1980): Cluster Analysis Algorithms. Ellis Horwood Limited, Chichester.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Haimerl, E., Mucha, HJ. (2007). Comparing the Stability of Different Clustering Results of Dialect Data. In: Decker, R., Lenz, H.J. (eds) Advances in Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70981-7_71

Download citation

Publish with us

Policies and ethics