Precise plant classification within genus level based on simulated annealing aided cloud classifier

doi:10.1016/j.eswa.2010.08.090

Expert Systems with Applications

Volume 38, Issue 4, April 2011, Pages 3009-3014

https://doi.org/10.1016/j.eswa.2010.08.090 Get rights and content

Abstract

This is a series research on plant numerical taxonomy, which provides a precise classification method for the description, discrimination, and review of proposals for new or revised plant species to be recognized as taxon units within the genus level. We firstly used all the available quantitative attributes to build cloud models for different sections. Then, the shortest path based simulated annealing algorithm (SPSA) was applied for optimizing these models. After these, the optimized models were validated by the previously used quantitative attribute data. Results showed that cloud models’ accuracy rates of Sect. Tuberculata, Sect. Oleifera and Sect. Paracamellia were 85.00%, 60.00%, 80.00%. And we found some interesting overlaps between the type species and ‘expected species’ that the selected expected species Camellia oleifera and Camellia brevistyla are also type species of Sect. Oleifera and Sect. Paracamellia, respectively. Here we suggest that the expected species be served as an illustration in plant numerical taxonomy. Based on the simulated annealing aided cloud classifier, the taxon hedges, associated with ‘expected species’, were setting to advance our common understanding of sections and improve our capability to recognize and discriminate plant species. These procedures provide a dynamic and practical way to publish new or revised descriptions of species and sections.

Research highlights

► Series application research of improved cloud classifier algorithm. ► Annealing algorithm combined with the cloud classifier. ► Simulated annealing to minimize the edge distances.

Introduction

Cloud theory is now a popular theory handling uncertainty based on the uncertain transition between qualitative concept and quantitative description (Li et al., 1998, Li et al., 1997, Li et al., 1998). Based on this theory, the cloud classifier has been developed recent years for adaptive linguistic hedge (Lu, Pi, Peng, Wang, & Zhang, 2009). This classifier represents a qualitative concept with three digital characteristics, expected value Ex, entropy σ and deviation D (Di et al., 1998a, Di et al., 1998b), which integrates the fuzziness and randomness of a linguistic term in a unified way. Our previously work (Lu et al., 2009) applied this classifier in the plant numerical taxonomy. In which, the particle swarm optimization algorithm (PSO) was used for optimizing the sections’ hedges. However, there is still some potential improvement left for us as the accurate rates is not high enough. In this work, we will apply the shortest path based simulated annealing algorithm (SPSA) in optimizing the cloud classifier.

SPSA is a new kind of dynamic multi-stage facility layout problem under dynamic business environment, in which new species may be added into, or old species may be removed from their original taxa. Since every section (or genus) has its own range, the distances of species to their ‘expected species’ (Lu et al., 2009) in each section (or genus) should get the global minimum values before the best classification being gained (Pi et al., 2009). Then, the hedges problem could be converted into a shortest path problem by studying its distance function and species adding/removing heuristic rules, and the corresponding mathematical model established for this problem (Dong, Wu, & Hou, 2009). Hence, the SPSA may have good performances in optimizing the cloud classifier.

In this research, the shortest path based simulated annealing aided cloud classifier (SPSACM) method is used for plant classification by analyzing leaf morphology and anatomy data which is partly from our previous work (Lin et al., 2008, Lu et al., 2008). Our purpose is to provide a basic tool in plant taxonomy.

Section snippets

Materials

Adult leaves fully exposed to sunlight of plants of the genus Camellia are collected from the International Camellia Species Garden in Jinhua city, including 15 species in Section Paracamellia Sealy: Camellia grijsii Hance, Camellia confuse Craib, Camellia kissi Wall., Camellia fluviatilis Hand.-Mazz., Camellia brevistyla Coh. St., Camellia hiemalis Nakai, Camellia obtusifolia Chang, Camellia maliflora Lindl, Camellia shensiensis Chang, Camellia puniceiflora Chang, Camellia tenii Sealy,

Different classification results based on different attributes

Fig. 2A shows a 3-D (based on three selective attributes) distribution of original data. After the classification procedure, Fig. 2B and C shows visible differences in the cloud models with different attribute base. Fig. 2D displays all available linguistic atoms generated by a series of linguistic atom generators in this research. These generators have different rules for analysis of quantity and qualitative data.

Cloud models in Fig. 2B, which are based on attributes with small weights (CMSW)

Discussion

In the plant numerical taxonomy, floras are used as most comprehensive tools for people to identify and distinguish plants (Brach & Song, 2005). Recent years, floras of large scope have been written by collaboration of many authors who collectively have examined thousands of plant samples and evaluated and incorporated information from dozens, or even hundreds, of publications (Wen, 1994). For the botanists edited these floras, two primary issues must be well considered: I. How should the

Conclusion

The proposed SPSACM method, based on attribute similarity, is extended from the cloud model and simulated annealing algorithm. We have firstly demonstrated by experiments that the taxonomic results based on the SPSACM method have shown the superiority performance over some related methods.

Then, the weight values of attributes are highly commended in establishing of flora keys.

Besides, we propose again the new nomenclature ‘expected species’: an included species that has the minimum sum of

Acknowledgements

The authors would like to thank Y.F. Huang and L.J. Ma for substantial help in data collection. Funding of Innovation Fund for the Master’s Academe of Zhejiang Normal University is also gratefully acknowledged.

References (26)

M. Dong et al.
Shortest path based simulated annealing algorithm for dynamic facility layout problem under dynamic business environment
Expert Systems with Applications
(2009)
D.Y. Li et al.
Knowledge representation and discovery based on linguistic atoms
Knowledge-based Systems
(1998)
H.F. Lu et al.
A particle swarm optimization-aided fuzzy cloud classifier applied for plant numerical taxonomy based on attribute similarity
Expert Systems with Applications
(2009)
F. Rindi et al.
Phylogenetic relationships and species circumscription in Trentepohlia and Printzina (Trentepohliales, Chlorophyta)
Molecular Phylogenetics and Evolution
(2009)
A.R. Brach et al.
ActKey: A Web-based interactive identification key program
Taxon
(2005)
A.R. Brach et al.
eFloras: New directions for online floras exemplified by the Flora of China Project
Taxon
(2006)
Chang, H. T. (1998). Theaceae. In Delectis Florae Reipublicae Popularis Sinicae Agendae Academiae Sinicae Edita (Ed.),...
M.J. Dallwitz
A general system for coding taxonomic information
Taxon
(1980)
M.J. Dallwitz
A comparison of matrix-based taxonomic identification systems with rule-based systems
Dallwitz, M. J., Paine, T. A., & Zurcher, E. J. (2000). Principles of interactive keys....

K.C. Di et al.

Knowledge representation and discovery in spatial databases based on cloud theory

International Archives of Photogrammetry and Remote Sensing

(1998)

K.C. Di et al.

Intelligent query in spatial databases based on cloud model

Heidorn, P. B. (2001). A tool for multipurpose use of online flora and fauna: The biological information browsing...

Cited by (6)

Cloud computing research: A review of research themes, frameworks, methods and future research directions
2018, International Journal of Information Management
Citation Excerpt :
The final theme concerns the domains in which cloud computing has been applied. The articles in this theme investigate the application of cloud computing in areas such as education (Le Roux & Evans, 2011; Sultan, 2010), e-Science (Pi et al., 2011), e-Government (Decman & Vintar, 2013; Zissis and Lekkas, 2012), green IT (Basmadjian, Meer, Lent, & Giuliani, 2012; Gottschalk & Kirn, 2013), mobile cloud computing (Fernando, Loke, & Rahayu, 2013; Thilakanathan et al., 2014) and knowledge management (Lai, Tam, & Chan, 2012; Sultan, 2013). The studies acknowledge the positive transformation cloud computing has brought to these domains.
This paper presents a meta-analysis of cloud computing research in information systems with the aim of taking stock of literature and their associated research frameworks, research methodology, geographical distribution, level of analysis as well as trends of these studies over the period of 7 years. A total of 285 articles from 67 peer review journals from year 2009 to 2015 were used in the analysis. The findings indicate that extant cloud computing literature tends to skew towards the technological dimension to the detriment of other under researched dimensions such as business, conceptualization and application domain. Whilst there has been a constant increase in cloud computing studies over the last seven years, a significant number of these studies have not been underpinned by theoretical frameworks and models. Also, majority of cloud computing studies utilized experiment and simulation as methods of enquiry as compared to the qualitative, quantitative, and mixed methodologies. This study contributes to cloud computing research by providing holistic insights into trends on themes, methodology, research framework, geographical focus and future research directions.
Camellia (Theaceae) classification with support vector machines based on fractal parameters and red, green, and blue intensity of leaves
2017, Bangladesh Journal of Plant Taxonomy
A novel method of uncertain data classification
2014, Journal of Computational Information Systems
Classification of Camellia species from 3 sections using leaf anatomical data with back-propagation neural networks and support vector machines
2013, Turkish Journal of Botany
Classification of Camellia (theaceae) species using leaf architecture variations and pattern recognition techniques
2012, PLoS ONE
Floral morphology resolves the taxonomy of Camellia L. (Theaceae) sect. Oleifera and sect. Paracamellia
2012, Bangladesh Journal of Plant Taxonomy

View full text

Article preview

Expert Systems with Applications

Abstract

Research highlights

Introduction

Section snippets

Materials

Different classification results based on different attributes

Discussion

Conclusion

Acknowledgements

References (26)

Shortest path based simulated annealing algorithm for dynamic facility layout problem under dynamic business environment

Expert Systems with Applications

Knowledge representation and discovery based on linguistic atoms

Knowledge-based Systems

A particle swarm optimization-aided fuzzy cloud classifier applied for plant numerical taxonomy based on attribute similarity

Expert Systems with Applications

Phylogenetic relationships and species circumscription in Trentepohlia and Printzina (Trentepohliales, Chlorophyta)

Molecular Phylogenetics and Evolution

ActKey: A Web-based interactive identification key program

Taxon

eFloras: New directions for online floras exemplified by the Flora of China Project

Taxon

A general system for coding taxonomic information

Taxon

A comparison of matrix-based taxonomic identification systems with rule-based systems

Knowledge representation and discovery in spatial databases based on cloud theory

International Archives of Photogrammetry and Remote Sensing

Intelligent query in spatial databases based on cloud model

Cited by (6)

Cloud computing research: A review of research themes, frameworks, methods and future research directions

Camellia (Theaceae) classification with support vector machines based on fractal parameters and red, green, and blue intensity of leaves

A novel method of uncertain data classification

Classification of Camellia species from 3 sections using leaf anatomical data with back-propagation neural networks and support vector machines

Classification of Camellia (theaceae) species using leaf architecture variations and pattern recognition techniques

Floral morphology resolves the taxonomy of Camellia L. (Theaceae) sect. Oleifera and sect. Paracamellia