Abstract
In this paper, we examine an emerging variation of the classification problem known as the inverse classification problem. Rather than predicting a label for a given record, we determine the feature values that should be used to create a record which will result in a desired class label. Such an approach is useful in applications where the goal is to determine a set of actions that will guide the data mining process toward a desired outcome, and it supports a variety of decision-support applications with pre-determined task criteria. We show that the inverse classification problem is a powerful and general model which encompasses a number of different criteria. We propose several algorithms for the inverse classification problem, all of which use an inverted list as the intermediate data structure for representation and classification. We validate our approach on a number of real datasets.
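To make the problem setting concrete, the following is a minimal, hypothetical Python sketch of inverse classification by brute-force search over candidate feature values with a simple nearest-neighbor classifier. It is not the inverted-list algorithm proposed in the paper; the classifier, data, and all names and parameters are illustrative assumptions.

import math
from collections import Counter
from itertools import product

def knn_predict(train, labels, record, k=3):
    """Majority label among the k training records nearest to `record`."""
    nearest = sorted(zip(train, labels),
                     key=lambda rl: math.dist(rl[0], record))[:k]
    return Counter(lbl for _, lbl in nearest).most_common(1)[0][0]

def inverse_classify(train, labels, fixed, free, candidates, target, k=3):
    """Search candidate values for the free features until the completed
    record is assigned the desired label.

    fixed      -- {feature index: known value}
    free       -- list of feature indices whose values must be chosen
    candidates -- {feature index: iterable of candidate values}
    target     -- desired class label
    """
    n = len(train[0])
    for combo in product(*(candidates[f] for f in free)):
        record = [0.0] * n
        for i, v in fixed.items():
            record[i] = v
        for f, v in zip(free, combo):
            record[f] = v
        if knn_predict(train, labels, record, k) == target:
            return dict(zip(free, combo))
    return None  # no candidate combination yields the desired label

# Tiny illustrative example: choose a value for feature 2 so that the
# completed record is classified as "high".
train = [[0.0, 0.0, 0.0], [1.0, 1.0, 1.0], [0.9, 1.0, 0.8], [0.1, 0.2, 0.0]]
labels = ["low", "high", "high", "low"]
print(inverse_classify(train, labels,
                       fixed={0: 0.9, 1: 0.8},
                       free=[2], candidates={2: [0.0, 0.5, 1.0]},
                       target="high"))
# -> {2: 0.0}

The proposed algorithms replace this exhaustive search with an inverted-list representation, but the sketch captures the input/output behavior of the problem.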
Additional information
This paper is an extended version of a paper published in the IEEE ICDE Conference, 2006.
Cite this article
Aggarwal, C.C., Chen, C. & Han, J. The Inverse Classification Problem. J. Comput. Sci. Technol. 25, 458–468 (2010). https://doi.org/10.1007/s11390-010-9337-x