Outlier Detection Using Rough Set Theory

Jiang, Feng; Sui, Yuefei; Cao, Cungen

doi:10.1007/11548706_9

Feng Jiang^22,23,
Yuefei Sui²² &
Cungen Cao²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3642))

Included in the following conference series:

International Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing

1705 Accesses

Abstract

In this paper, we suggest to exploit the framework of rough set for detecting outliers — individuals who behave in an unexpected way or feature abnormal properties. The ability to locate outliers can help to maintain knowledge base integrity and to single out irregular individuals. First, we formally define the notions of exceptional set and minimal exceptional set. We then analyze some special cases of exceptional set and minimal exceptional set. Finally, we introduce a new definition for outliers as well as the definition of exceptional degree. Through calculating the exceptional degree for each object in minimal exceptional sets, we can find out all outliers in a given dataset.

This work is supported by the National NSF of China (60273019 and 60073017), the National 973 Project of China (G1999032701), Ministry of Science and Technology (2001CCA03000) and the National Laboratory of Software Development Environment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Outlier detection for incomplete real-valued data via information entropy and class-consistent technology

Article 18 April 2024

A Rough Entropy-Based Weighted Density Outlier Detection Method for Two Universal Sets

Outliers Detection in Multi-label Datasets

References

Pawlak, Z.: Rough sets. International Journal of Computer and Information Sciences 11, 341–356 (1982)
Article MATH MathSciNet Google Scholar
Pawlak, Z.: Rough sets: Theoretical Aspects of Reasoning about Data. Kluwer Academic Publishers, Dordrecht (1991)
MATH Google Scholar
Pawlak, Z., Grzymala-Busse, J.W., Slowinski, R., Ziarko, W.: Rough sets. Comm. ACM 38, 89–95 (1995)
Article Google Scholar
Hawkins, D.: Identifications of Outliers. Chapman and Hall, London (1980)
Google Scholar
Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley & Sons, Chichester (1994)
MATH Google Scholar
Knorr, E., Ng, R.: A Unified Notion of Outliers: Properties and Computation. In: Proc. of the Int. Conf. on Knowledge Discovery and Data Mining, pp. 219–222 (1997)
Google Scholar
Knorr, E., Ng, R.: Algorithms for Mining Distance-based Outliers in Large Datasets. In: VLDB Conference Proceedings (1998)
Google Scholar
Knorr, E., Ng, R.: Finding intensional knowledge of distance-based outliers. In: Proc. of the 25th VLDB Conf. (1999)
Google Scholar
Angiulli, F., Pizzuti, C.: Fast outlier detection in high dimensional spaces. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS (LNAI), vol. 2431, pp. 15–226. Springer, Heidelberg (2002)
Chapter Google Scholar
Ramaswamy, S., Rastogi, R., Shim, K.: Efficient algorithms for mining outliers from large datasets. In: Proc. of the ACM SIGMOD Conf. (2000)
Google Scholar
Knorr, E., Ng, R., Tucakov, V.: Distance-based outliers: algorithms and applications. VLDB Journal: Very Large Databases 8(3-4), 237–253 (2000)
Article Google Scholar
Eskin, E., Arnold, A., Prerau, M., Portnoy, L., Stolfo, S.: A geometric framework for unsupervised anomaly detection: Detecting intrusions in unlabeled data. In: Data Mining for Security Applications (2002)
Google Scholar
Lane, T., Brodley, C.E.: Temporal sequence learning and data reduction for anomaly detection. ACM Transactions on Information and System Security 2(3), 295–331 (1999)
Article Google Scholar
Breunig, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: LOF: Identifying density-based local outliers. In: Proc. ACM SIGMOD Conf., pp. 93–104 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100080, P.R. China
Feng Jiang, Yuefei Sui & Cungen Cao
Graduate School of Chinese Academy of Sciences, Beijing, 100039, P.R. China
Feng Jiang

Authors

Feng Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Yuefei Sui
View author publications
You can also search for this author in PubMed Google Scholar
Cungen Cao
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Regina, Regina, SK, S4S 0A2 Canada, Polish-Japanese Institute of Information Technology, Koszykowa 86, 02-008 Warsaw, P.O. Box, Poland
Dominik Ślęzak
Department of Computer Science, University of Regina, S4S 0A2, Regina, Saskatchewan, Canada
JingTao Yao & Wojciech Ziarko &
Department of Electrical and Computer Engineering, University of Manitoba, R3T 5V6, Winnipeg, Manitoba, Canada
James F. Peters
College of Computer and Information Engineering, Hehan University, Henan, China
Xiaohua Hu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, F., Sui, Y., Cao, C. (2005). Outlier Detection Using Rough Set Theory. In: Ślęzak, D., Yao, J., Peters, J.F., Ziarko, W., Hu, X. (eds) Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. RSFDGrC 2005. Lecture Notes in Computer Science(), vol 3642. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11548706_9

Download citation

DOI: https://doi.org/10.1007/11548706_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28660-8
Online ISBN: 978-3-540-31824-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics