Abstract:
Linking large disparate database systems at individual person based level for medical informatics and e-health research is a challenging task. In the interests of identif...Show MoreMetadata
Abstract:
Linking large disparate database systems at individual person based level for medical informatics and e-health research is a challenging task. In the interests of identifying influential determinants of effects of children's health and socioeconomic status on educational attainment, this paper links child health data records including birth records, school key stage attainment records, deprivation index scores etc. from multiple sources via an e-health infrastructure SAIL databank. Furthermore, a novel scheme of automatically identifying influential attributes from high dimensional data is presented. The proposed scheme applies the entropy regularisation and particle swarm optimisation (PSO) techniques to the construction of an optimal support vector machine (SVM) model. The novelty of the proposed scheme lies in that during learning process the importance of less influential attributes automatically approaches to zero, whilst the importance of the most important attributes turns to one, so that only the most influential attributes turn up in the final SVM model. What's more, the model selection, feature identification and dimensionality reduction are performed simultaneously in an integrated manner in one model structure. The experimental results have shown that the proposed method is efficient in performing dimensionality reduction and identifying the important determinants of the effects of children's health and socioeconomic status on educational attainment.
Date of Conference: 18-23 July 2010
Date Added to IEEE Xplore: 14 October 2010
ISBN Information: