skip to main content
10.1145/1401890.1401953acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Multi-class cost-sensitive boosting with p-norm loss functions

Published: 24 August 2008 Publication History

Abstract

We propose a family of novel cost-sensitive boosting methods for multi-class classification by applying the theory of gradient boosting to p-norm based cost functionals. We establish theoretical guarantees including proof of convergence and convergence rates for the proposed methods. Our theoretical treatment provides interpretations for some of the existing algorithms in terms of the proposed family, including a generalization of the costing algorithm, DSE and GBSE-t, and the Average Cost method. We also experimentally evaluate the performance of our new algorithms against existing methods of cost sensitive boosting, including AdaCost, CSB2, and AdaBoost.M2 with cost-sensitive weight initialization. We show that our proposed scheme generally achieves superior results in terms of cost minimization and, with the use of higher order p-norm loss in certain cases, consistently outperforms the comparison methods, thus establishing its empirical advantage.

References

[1]
N. Abe, B. Zadrozny, and J. Langford. An iterative method for multi-class cost-sensitive learning. In KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 3--11, New York, NY, USA, 2004. ACM.
[2]
C. L. Blake and C. J. Merz. UCI repository of machine learning databases. Department of Information and Computer Sciences, University of California, Irvine, 1998. http://www.ics.uci.edu/~mlearn/MLRepository.html.
[3]
P. Domingos. MetaCost: A general method for making classifiers cost sensitive. In Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining, pages 155--164. ACM Press, 1999.
[4]
Charles Elkan. Magical thinking in data mining: Lessons from coil challenge 2000. In Proceedings of the Seventh International Conference on Knowledge Discovery and Data Mining, pages 426--431. ACM Press, 2001.
[5]
W. Fan, S. J. Stolfo, J. Zhang, and P. K. Chan. AdaCost: Misclassification cost sensitive boosting. In Proceedings of the Sixteenth International Conference on Machine Learning, pages 97--105, 1999.
[6]
Y. Freund and R. E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119--139, 1997.
[7]
U. Knoll, G. Nakhaeizadeh, and B. Tausend. Cost-sensitive pruning of decision trees. In Proceedings of the Eight European Conference on Machine Learning, pages 383--386, 1994.
[8]
D. Margineantu. Methods for Cost-Sensitive Learning. PhD thesis, Department of Computer Science, Oregon State University, Corvallis, 2001.
[9]
L. Mason, J. Baxter, P. Barlett, and M. Frean. Boosting algorithms as gradient descent. In Advances in Neural Information Processing systems 12, pages 512--158, 2000.
[10]
L. Mason, J. Baxter, P. Bartlett, and M. Frean. Boosting algorithms as gradient descent in function space. Technical report, RSISE, Australian National University, 1999. http://wwwsyseng.anu.edu.au/~jon/papers/doom2.ps.gz.
[11]
J. Quinlan. C4.5: Programs for Machine Learning. San Mateo, CA: Morgan Kaufmann, 1993.
[12]
R. E. Schapire and Y. Singer. Improved boosting using confidence-rated predictions. Machine Learning, 37(3):297--336, 1999.
[13]
K. M. Ting. A comparative study of cost-sensitive boosting algorithms. In Proceedings of the Seventeenth International Conference on Machine Learning, pages 983--990, 2000.
[14]
I. H. Witten and E. Frank. Data Mining: Practical machine learning tools with Java implementations. Morgan Kaufmann, 2000.
[15]
B. Zadrozny and C. Elkan. Learning and making decisions when costs and probabilities are both unknown. In Proceedings of the Seventh International Conference on Knowledge Discovery and Data Mining, pages 204--213. ACM Press, 2001.
[16]
B. Zadrozny, J. Langford, and N. Abe. Cost-sensitive learning by cost-proportionate example weighting. In Proceedings of the Third IEEE International Conference on Data Mining, pages 435--442, 2003.
[17]
T. Zhang. Sequential greedy approximation for certain convex optimization problems. volume IT-49, pages 682--691, 2003.
[18]
Z. H. Zhou and X. Y. Liu. On multi-class cost sensitive learning. In Proceedings of the 21st National Conference on Artificial Intelligence, pages 567--572, 2006.

Cited By

View all
  • (2021)Cost-Sensitive Variable Selection for Multi-Class Imbalanced Datasets Using Bayesian NetworksMathematics10.3390/math90201569:2(156)Online publication date: 13-Jan-2021
  • (2020)Improving multi-class Boosting-based object detectionIntegrated Computer-Aided Engineering10.3233/ICA-200636(1-16)Online publication date: 3-Jul-2020
  • (2018)Second-Order Online Active Learning and Its ApplicationsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.277809730:7(1338-1351)Online publication date: 1-Jul-2018
  • Show More Cited By

Index Terms

  1. Multi-class cost-sensitive boosting with p-norm loss functions

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2008
    1116 pages
    ISBN:9781605581934
    DOI:10.1145/1401890
    • General Chair:
    • Ying Li,
    • Program Chairs:
    • Bing Liu,
    • Sunita Sarawagi
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 August 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. boosting
    2. cost-sensitive learning
    3. multi-class classification

    Qualifiers

    • Research-article

    Conference

    KDD08

    Acceptance Rates

    KDD '08 Paper Acceptance Rate 118 of 593 submissions, 20%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Cost-Sensitive Variable Selection for Multi-Class Imbalanced Datasets Using Bayesian NetworksMathematics10.3390/math90201569:2(156)Online publication date: 13-Jan-2021
    • (2020)Improving multi-class Boosting-based object detectionIntegrated Computer-Aided Engineering10.3233/ICA-200636(1-16)Online publication date: 3-Jul-2020
    • (2018)Second-Order Online Active Learning and Its ApplicationsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.277809730:7(1338-1351)Online publication date: 1-Jul-2018
    • (2015)CogBoost: Boosting for Fast Cost-Sensitive Graph ClassificationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2015.239111527:11(2933-2946)Online publication date: 1-Nov-2015
    • (2015)Cost-Sensitive Online Classification with Adaptive Regularization and Its ApplicationsProceedings of the 2015 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM.2015.51(649-658)Online publication date: 14-Nov-2015
    • (2014)Cost-Sensitive Online ClassificationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2013.15726:10(2425-2438)Online publication date: Oct-2014
    • (2014)Cost-sensitive learning for defect escalationKnowledge-Based Systems10.1016/j.knosys.2014.04.03366:1(146-155)Online publication date: 1-Aug-2014
    • (2013)A survey of cost-sensitive decision tree induction algorithmsACM Computing Surveys10.1145/2431211.243121545:2(1-35)Online publication date: 12-Mar-2013
    • (2012)Calibrated asymmetric surrogate lossesElectronic Journal of Statistics10.1214/12-EJS6996:noneOnline publication date: 1-Jan-2012
    • (2012)Cost-Sensitive Online ClassificationProceedings of the 2012 IEEE 12th International Conference on Data Mining10.1109/ICDM.2012.116(1140-1145)Online publication date: 10-Dec-2012
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media