Evaluating the Performance of the Multilayer Perceptron as a Data Editing Tool

Cubiles-de-la-Vega, Ma-Dolores; Silva-Ramírez, Esther-Lydia; Pino-Mejías, Rafael; López-Coello, Manuel

doi:10.1007/978-3-642-02478-8_163

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5517)

Included in the following conference series:

International Work-Conference on Artificial Neural Networks

Abstract

Knowledge discovery is usually carried out on data sets that contain errors in the form of inconsistent values. The activity of detecting and correcting logical inconsistencies in data sets is called data editing. Traditional tools for this task, such as the Fellegi-Holt methodology, require heavy intervention by subject matter experts. This paper discusses a methodological framework for developing an automated data editing process that can be carried out by a general nonlinear approximation model, such as an artificial neural network. We have performed an empirical evaluation of this approach on eight data sets, considering several hidden layer sizes and seven learning algorithms for the multilayer perceptron. The results suggest that this approach performs encouragingly and is a promising data cleaning tool.
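
As an illustration of the general idea only, the following minimal sketch trains a small multilayer perceptron to flag records that violate a logical edit. Everything in it is an assumption made for the example and is not taken from the paper: the edit rule (a person under 16 recorded as married), the synthetic records, the names `violates_edit`, `make_records` and `TinyMLP`, the single hidden layer of four tanh units, and the use of plain batch gradient descent in place of the seven learning algorithms the authors compare.

```python
import math
import random


def violates_edit(age, married):
    """Hypothetical edit rule: a person under 16 recorded as married is inconsistent."""
    return age < 16 and married == 1


def make_records(n, rng):
    """Build synthetic (features, label) pairs; label 1.0 marks an inconsistent record."""
    data = []
    for _ in range(n):
        age = rng.randint(0, 80)
        married = rng.randint(0, 1)
        label = 1.0 if violates_edit(age, married) else 0.0
        data.append(([age / 80.0, float(married)], label))
    return data


class TinyMLP:
    """One hidden tanh layer and a sigmoid output, trained by batch gradient descent."""

    def __init__(self, n_in, n_hidden, rng):
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def forward(self, x):
        """Return the hidden activations and the predicted probability of inconsistency."""
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        z = sum(w * hi for w, hi in zip(self.w2, h)) + self.b2
        return h, 1.0 / (1.0 + math.exp(-z))

    def loss(self, data):
        """Mean binary cross-entropy over the given records."""
        eps = 1e-12
        total = 0.0
        for x, y in data:
            _, p = self.forward(x)
            total -= y * math.log(p + eps) + (1.0 - y) * math.log(1.0 - p + eps)
        return total / len(data)

    def train(self, data, epochs, lr):
        n = len(data)
        for _ in range(epochs):
            gw1 = [[0.0] * len(row) for row in self.w1]
            gb1 = [0.0] * len(self.b1)
            gw2 = [0.0] * len(self.w2)
            gb2 = 0.0
            for x, y in data:
                h, p = self.forward(x)
                d_out = p - y  # cross-entropy gradient w.r.t. the output pre-activation
                gb2 += d_out
                for j, hj in enumerate(h):
                    gw2[j] += d_out * hj
                    d_hid = d_out * self.w2[j] * (1.0 - hj * hj)  # backprop through tanh
                    gb1[j] += d_hid
                    for i, xi in enumerate(x):
                        gw1[j][i] += d_hid * xi
            # Apply the averaged gradients once per epoch.
            self.b2 -= lr * gb2 / n
            for j in range(len(self.w2)):
                self.w2[j] -= lr * gw2[j] / n
                self.b1[j] -= lr * gb1[j] / n
                for i in range(len(self.w1[j])):
                    self.w1[j][i] -= lr * gw1[j][i] / n

    def flag(self, x, threshold=0.5):
        """Flag a record as inconsistent when the predicted probability reaches the threshold."""
        return self.forward(x)[1] >= threshold


rng = random.Random(0)
train_set = make_records(200, rng)
model = TinyMLP(n_in=2, n_hidden=4, rng=rng)
loss_before = model.loss(train_set)
model.train(train_set, epochs=300, lr=0.5)
loss_after = model.loss(train_set)
print("cross-entropy before training:", loss_before)
print("cross-entropy after training:", loss_after)
```

In this toy setting the labels come from a known rule, so the network is only asked to reproduce an edit that an expert could have written down. The case of interest in the abstract is the opposite one, where such rules are expensive to specify and a model is fitted to examples of consistent and inconsistent records instead.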

References

Fellegi, I., Holt, D.: Systematic Approach to Automatic Edit and Imputation. J. Am. Stat. Assoc. 71(353), 17–35 (1976)
Manzari, A.: Combining editing and imputation methods: an experimental application on population census data. J. R. Stat. Soc. Ser. A-Stat. Soc. 167(2), 295–307 (2004)
Petrakos, G., Conversano, C., Farmakis, G., et al.: New Ways of Specifying Data Edits. J. R. Stat. Soc. Ser. A-Stat. Soc. 167(2), 249–274 (2004)
Bishop, C.: Neural Networks for Pattern Recognition. Oxford University Press, Oxford (2005)
Nordbotten, S.: Editing Statistical Records by Neural Networks. Journal of Official Statistics 11(4), 391–411 (1995)
UCI Repository of Machine Learning Databases, http://www.ics.uci.edu/~mlearn/MLRepository.html

Author information

Authors and Affiliations

Department of Statistics and Operational Research, University of Seville, Av. Reina Mercedes s/n, 41012, Sevilla, Spain
Ma-Dolores Cubiles-de-la-Vega & Rafael Pino-Mejías
Department of Languages and Computer System, University of Cadiz, C/ Chile 1, 11003, Cadiz, Spain
Esther-Lydia Silva-Ramírez & Manuel López-Coello

Editor information

Editors and Affiliations

Departamento de Ingeniería Electrónica, Universitat Politècnica de Catalunya (UPC), E.T.S.I. de Telecomunicación, Campus Norte, Edificio C4, C/ Jordi Girona, 1-3, E08034, Barcelona, Spain
Joan Cabestany
Grupo ISIS, Dpto. Tecnología Electrónica ETSI Telecomunicación, Universidad de Málaga, Campus de Teatinos, 29071, Málaga, Spain
Francisco Sandoval
Department of Computer Architecture and Computer Technology, University of Granada, Spain
Alberto Prieto
Department of Informatics, University of Salamanca, Salamanca, Spain
Juan M. Corchado

About this paper

Cite this paper

Cubiles-de-la-Vega, MD., Silva-Ramírez, EL., Pino-Mejías, R., López-Coello, M. (2009). Evaluating the Performance of the Multilayer Perceptron as a Data Editing Tool. In: Cabestany, J., Sandoval, F., Prieto, A., Corchado, J.M. (eds) Bio-Inspired Systems: Computational and Ambient Intelligence. IWANN 2009. Lecture Notes in Computer Science, vol 5517. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02478-8_163

DOI: https://doi.org/10.1007/978-3-642-02478-8_163
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02477-1
Online ISBN: 978-3-642-02478-8
eBook Packages: Computer Science (R0)
