skip to main content
10.1145/1363686.1363905acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
poster

A new algorithm for data discretization and feature selection

Published:16 March 2008Publication History

ABSTRACT

Data discretization and feature selection are two important tasks that can be performed prior to the learning phase of data mining algorithms and can significantly reduce the processing effort of the learning algorithm. In this paper, we present a new algorithm, called Omega, for data preprocessing. Our proposed algorithm performs simultaneously data discretization and feature selection. Some experiments were performed to validate the effects of the preprocessing performed by the Omega algorithm in the results of the C4.5 algorithm (a well-known decision tree-based classifier). The results indicates that the proposed algorithm Omega is well-suited to both, data discretization and feature selection, being appropriate for data pre-processing.

References

  1. A. Asuncion and D. Newman. Uci repository (www.ics.uci.edu/mlearn/mlrepository.html). 2007.Google ScholarGoogle Scholar
  2. R. Kerber. Chimerge: Discretization of numeric attributes. In 10th Intl. Conf. on Artificial Intelligence, pages 123--128, 1992.Google ScholarGoogle Scholar
  3. K. Kira and L. A. Rendell. A practical approach for feature selection. In 9th Intl. Conf. on Machine Learning, pages 249--256, Aberdeen, Scotland, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. H. Liu and R. Setiono. Feature selection via discretization. Knowledge and Data Engineering, 9(4):642--645, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A new algorithm for data discretization and feature selection

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SAC '08: Proceedings of the 2008 ACM symposium on Applied computing
      March 2008
      2586 pages
      ISBN:9781595937537
      DOI:10.1145/1363686

      Copyright © 2008 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 March 2008

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      Overall Acceptance Rate1,650of6,669submissions,25%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader