Abstract
Trachoma, caused by repeated ocular infections with Chlamydia trachomatis whose vector is a fly, is an important cause of blindness in the world. We are presenting here an application of the Symbolic Data Analysis approach to an interventional study on trachoma conducted in Mali. This study was conducted to choose among three antibiotic strategies those with the best cost-effectiveness ratio and to find the demographic and environmental parameters on which we could try to intervene. The Symbolic Data Analysis approach aims at studying classes of individuals considered as new units. These units are described by variables whose values express for each class the variation of the values taken by each of its individuals. Finally, the results obtained are compared to those previously provided by multiple logistic regression analysis. Symbolic Data Analysis actually provides a new perspective on this study and suggests that some demographic, economics and environmental parameters are related to the disease and its evolution during the treatment, whatever the strategy. Moreover, it is shown that the efficiency of each strategy depends on environmental parameters.
Similar content being viewed by others
References
Afonso F, Haddad R, Toque C, Eliezer ES, Diday E (2014) User manual of the SYR Software. Syrokko internal publication, p 70
Billard L, Diday E (2003) From the statistics of data to the statistics of knowledge: symbolic data analysis. J Am Stat Assoc 98(462):470–487
Billard L, Diday E (2006) Symbolic data analysis: conceptual statistics and data mining. Wiley, Chichester, p 321
Bock H, Diday E (2000) Analysis of symbolic data. In: Bock D (ed) Exploratory methods for extracting statistical information from complex data. Springer, Heidelberg, p 425. ISBN 3-540-66619-2
Diday E (2011) Principal component analysis for categorical histogram data: some open directions of research. In: Fichet B, Piccolo D, Verde R, Vichi M (eds) Classification and multivariate analysis for complex data structures. Studies in classification, data analysis, and knowledge organization. Springer, Heidelberg, pp 3–15
Diday E (2013) Principal component analysis for bar charts and metabins tables. Stat Anal Data Min 6(5):403–430
Diday E, Afonso F, Haddad R (2013) The symbolic data analysis paradigm, discriminate discretization and financial application. In: Advances in Theory and Applications of High Dimensional and Symbolic Data Analysis, HDSDA 2013. Revue des Nouvelles Technologies de l’Information, vol E-25. pp 1–14
Diday E, Noirhomme M (2008) Symbolic data analysis and the SODAS software. Wiley, Chichester, p 457
Hosmer D, Lemeshow S (2000) Applied logistic regression. Wiley, New York. ISBN 0-471-61553-6
Lee J (1986) Insight on the use of multiple logistic regression analysis to estimate association between risk factor and disease occurrence. Int J Epidemiol 15:22–29
Schemann JF, Guinot C, Traore L, Sacko D, Zefack G, Dembele M, Diallo I, Malvy D (2007) Longitudinal evaluation of three azithromycin distribution strategies for treatment of trachoma in a sub-saharan African country, Mali. Acta Trop 101:40–53
Souza RMCR, Queiroz DCF, Cysneiros FJA (2011) Logistic regression based pattern classifiers for symbolic interval data. Pattern Anal Appl 14:273–282
WHO (1988) Programme for the prevention of blindness and deafness. In: Coding instructions for the WHO/PBL eye examination record (version iii). Tech. rep. World Health Organization, Genève
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Guinot, C., Malvy, D., Schémann, JF. et al. Strategies evaluation in environmental conditions by symbolic data analysis: application in medicine and epidemiology to trachoma. Adv Data Anal Classif 9, 107–119 (2015). https://doi.org/10.1007/s11634-015-0201-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11634-015-0201-2