Generation of classification trees from variable weighted features

Barahona, Pedro; Bel-Enguix, Gemma; Dahl, Veronica; Jiménez-López, M. Dolores; Krippahl, Ludwig

doi:10.1007/s11047-013-9368-7

Generation of classification trees from variable weighted features

Published: 24 March 2013

Volume 13, pages 169–177, (2014)
Cite this article

Natural Computing Aims and scope Submit manuscript

Pedro Barahona¹,
Gemma Bel-Enguix²,
Veronica Dahl³,
M. Dolores Jiménez-López⁴ &
…
Ludwig Krippahl¹

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Trees are a useful framework for classifying entities whose attributes are, at least partially, related through a common ancestry, such as species of organisms, family members or languages. In some common applications, such as phylogenetic trees based on DNA sequences, relatedness can be inferred from the statistical analysis of unweighted attributes. The vast majority of mutations that survive across generations are evolutionarily neutral, which means that most genetic differences between species will have accumulated independently and randomly. In these cases, it is possible to calculate the tree from a precomputed matrix of distances. In other cases, such as with anatomical traits or languages, the assumption of random and independent differences does not hold, making it necessary to consider some traits to be more relevant than others for determining how related two entities are. In this paper, we present a constraint programming approach that can enforce consistency between bounds on the relative weight of each trait and tree topologies, so that the user can best determine which sets of traits to use and how the entities are likely to be related.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Estimation of Trees on the Basis of Pairwise Comparisons with Random Errors

On the information content of discrete phylogenetic characters

Article 16 December 2017

Information Metrics for Phylogenetic Trees via Distributions of Discrete and Continuous Characters

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Blockeel H, De Raedt L, Ramon J (1998) Top-down induction of clustering trees. In: Shavlik JW (ed) ICML’98. Proceedings of the fifteenth international conference on machine learning. Morgan Kaufmann Publishers, San Francisco, pp 55–63
Campbell L (2006a) Areal linguistics: a closer scrutiny. In: Matras Y, McMahon A, Vincent N (eds) Linguistic areas: convergence in historical and typological perspective. Palgrave Macmillan, Basingstoke, pp 1–31
Campbell L (2006) Areal linguistics. In: Brown K (ed) Encyclopedia of language and linguistics. Elsevier, Oxford, pp 454–460
Google Scholar
Cavalli-Sforza LL (2000) Genes, peoples and languages. North Point Press, New York
Google Scholar
Cavalli-Sforza LL, Feldman M (1981) Cultural transmission and evolution. Princeton University Press, Princenton
Google Scholar
Dahl V, Miralles JE (2012) Womb grammars: constraint solving for grammar induction. In: Sneyers J, Frühwirth T (eds) CHR’12. Proceedings of the 9th workshop on constraint handling rules. Technical Report CW 624. Department of Computer Science, K.U. Leuven, pp 32–40
Farris JS (1972) Estimating phylogenetic trees from distance matrices. Am Nat 106(951):645–668
Article Google Scholar
Fitch W, Margoliash E (1967) Construction of phylogenetic trees. Science 155(3760):279–284
Article Google Scholar
Hock HH (1986) Principles of historical linguistics. Mouton de Gruyter, Amsterdam
Google Scholar
Holman EW, Brown CH, Wichmann S, Müller A, Velupillai V, Hammarström H, Jung H, Bakker D, Brown P, Belyaev O, Urban M, Mailhammer R, List J-M, Egorov D (2011) Automated dating of the world’s language families based on lexical similarity. Curr Anthropol 52(6):841–875
Article Google Scholar
Kimura M (1968) Evolutionary rate at the molecular level. Nature 217(5129):624–626
Article Google Scholar
Makarenkov V (2001) T-Rex: reconstructing and visualizing phylogenetic trees and reticulation networks. Bioinformatics 17:664–668
Article Google Scholar
Quinlanm JR (1986) Induction of decision trees. Mach Learn 1(1):81–106
Google Scholar
Saitou N, Nei M (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4(4):406–425
Google Scholar
Schleicher A (1863) Die Darwinsche Theorie und die Sprachwissenschaft. Böhlau, Weimar
Google Scholar
Schmidt J (1872) Die Verwantschaftsverhältnisse der Indogermanischen Sprachen. Böhlau, Weimar
Google Scholar
Seiffert E et al (2009) Convergent evolution of anthropoid-like adaptations in Eocene Adapiform primates. Science 461:1118–1121
Google Scholar
Song JJ (ed) (2011) The Oxford handbook of linguistics typology. Oxford University Press, Oxford
Google Scholar

Download references

Acknowledgments

This paper has been supported by Project HP2008-0029 and by Portuguese national funds through FCT-Fundação para a Ciência e Tecnologia, under Project PTDC/EIA-CCO/115999/2009.

Author information

Authors and Affiliations

Departamento de Informática, Universidade Nova de Lisboa, Quinta da Torre, 2829-516, Caparica, Portugal
Pedro Barahona & Ludwig Krippahl
Laboratoire d’Informatique Fondamentale, CNRS-Aix-Marseille University, Avenue de Luminy, 163, 13288, Marseille, France
Gemma Bel-Enguix
Department of Computer Science, Simon Fraser University, 8888 University Drive, Burnaby, BC, Canada
Veronica Dahl
Research Group on Mathematical Linguistics, Universitat Rovira i Virgili, Av. Catalunya, 35, 43002, Tarragona, Spain
M. Dolores Jiménez-López

Authors

Pedro Barahona
View author publications
You can also search for this author inPubMed Google Scholar
Gemma Bel-Enguix
View author publications
You can also search for this author inPubMed Google Scholar
Veronica Dahl
View author publications
You can also search for this author inPubMed Google Scholar
M. Dolores Jiménez-López
View author publications
You can also search for this author inPubMed Google Scholar
Ludwig Krippahl
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to M. Dolores Jiménez-López.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Barahona, P., Bel-Enguix, G., Dahl, V. et al. Generation of classification trees from variable weighted features. Nat Comput 13, 169–177 (2014). https://doi.org/10.1007/s11047-013-9368-7

Download citation

Published: 24 March 2013
Issue Date: June 2014
DOI: https://doi.org/10.1007/s11047-013-9368-7

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Generation of classification trees from variable weighted features

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Estimation of Trees on the Basis of Pairwise Comparisons with Random Errors

On the information content of discrete phylogenetic characters

Information Metrics for Phylogenetic Trees via Distributions of Discrete and Continuous Characters

Explore related subjects

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now