Skip to main content
Log in

FPGA supported rough set reduct calculation for big datasets

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

The rough sets theory developed by Prof. Z. Pawlak is one of the tools used in intelligent systems for data analysis and processing. In modern systems, the amount of the collected data is increasing quickly, so the computation speed becomes the critical factor.

One of the solutions to this problem is data reduction. Removing the redundancy in the rough sets can be achieved with the reduct. Most of the algorithms for reduct generation are only software implementations, resulting in many limitations coming from using the fixed word length, as well as consuming the time for fetching and processing of the instructions and data. These limitations make the software-based implementations relatively slow. Unlike software-based systems, hardware systems can process data much faster.

This paper presents FPGA and softcore CPU based device for large datasets reduct calculation using rough set methods. Presented architecture has been tested on two real datasets by downloading and running presented solutions inside FPGA. Tested datasets had 1 000 to 1 000 000 objects. For the research purpose, the algorithm was also implemented in C language and ran on a PC. The time of a reduct calculation in hardware and software was considered. The obtained results show an increase in the speed of data processing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Data Availability

In this paper, results were obtained for datasets: Poker Hand Dataset (Cattral et al., 2002) (created by Robert Cattral and Franz Oppacher) and data about children with insulin-dependent diabetes mellitus (type 1) (Stepaniuk, 2000) (created by Jaroslaw Stepaniuk).

Code Availability

The code is protected by the copyright law of the Bialystok Univeristy of Technology.

References

  • Banerjee, A., & Maji, P. (2019). Segmentation of bias field induced brain mr images using rough sets and stomped-t distribution. Information Sciences, 504, 520–545. https://doi.org/10.1016/j.ins.2019.07.027.

    Article  MathSciNet  Google Scholar 

  • Cattral, R., Oppacher, F., & Deugo, D. (2002). Evolutionary data mining with automatic rule generalization.

  • Chen, Y.S., & Cheng, C.H. (2013). Hybrid models based on rough set classifiers for setting credit rating decision rules in the global banking industry. Knowledge-Based Systems, 39, 224–239. https://doi.org/10.1016/j.knosys.2012.11.004; http://www.sciencedirect.com/science/article/pii/S0950705112003139. Accessed 12.02.2022.

  • Choromański, M., Grześ, T., & Hońko, P. (2019). Two fpga devices in the problem of finding minimal reducts. In K. Saeed, R. Chaki, & V. Janev (Eds.) Computer information systems and industrial management, lecture notes in computer science, vol. 11703, pp. 410-420. Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-28957-7_34.

  • Choromanski, M., Grzes, T., & Honko, P. (2020). Breadth search strategies for finding minimal reducts: towards hardware implementation. Neural Computing & Applications, 32 (18), 14801–14816. https://doi.org/10.1007/s00521-020-04833-7.

    Article  Google Scholar 

  • Dean, J., & Ghemawat, S. (2008). Mapreduce, simplified data processing on large clusters. Communication ACM, 51(1), 107–113. https://doi.org/10.1145/1327452.1327492.

    Article  Google Scholar 

  • Dua, D. (2017). Graff. C.: UCI machine learning repository. http://archive.ics.uci.edu/ml. Accessed 12.02.2022.

  • Grzes, T., & Kopczynski, M. (2019). Hardware implementation on field programmable gate array of two-stage algorithm for rough set reduct generation. In T. Mihálydeák, F. Min, G. Wang, M. Banerjee, I. Düntsch, Z. Suraj, & D. Ciucci (Eds.) Rough sets, pp. 495–506. Cham: Springer International Publishing.

  • Jiang, H., Zhan, J., Sun, B., & Alcantud, J.C.R. (2020). An madm approach to covering-based variable precision fuzzy rough sets: an application to medical diagnosis. International Journal of Machine Learning and Cybernetics, 11 (9), 2181–2207. https://doi.org/10.1007/s13042-020-01109-3.

    Article  Google Scholar 

  • Jing, S.Y., & Yang, J. (2020). High-performance attribute reduction on graphics processing unit. Journal of Experimental & Theoretical Artificial Intelligence, 32(6), 977–996. https://doi.org/10.1080/0952813X.2019.1710577.

    Article  Google Scholar 

  • Kanasugi, A., & Matsumoto, M. (2007). Design and implementation of rough rules generation from logical rules on fpga board. In M. Kryszkiewicz, J. F. Peters, H. Rybinski, & A. Skowron (Eds.) Rough sets and intelligent systems paradigms, lecture notes in computer science. vol. 4585, pp. 594–602. Berlin: Springer. https://doi.org/10.1007/978-3-540-73451-2_62.

  • Kanasugi, A., & Yokoyama, A. (2001). A basic design for rough set processor. https://doi.org/10.11517/pjsai.JSAI01.0.65.0.

  • Kopczynski, M., Grzes, T., & Stepaniuk, J. (2014). Fpga in rough-granular computing: Reduct generation. In Proceedings of the 2014 IEEE/WIC/ACM international joint conferences on web intelligence (WI) and intelligent agent technologies (IAT) - Vol. 02, WI-IAT ’14. pp. 364-370, USA: IEEE computer society.

  • Kopczynski, M., Grzes, T., & Stepaniuk, J. (2015). Computation of cores in big datasets: an fpga approach. In D. Ciucci, G. Wang, S. Mitra, & W. Z. Wu (Eds.) Rough sets and knowledge technology, lecture notes in computer science. vol. 11499, pp. 153-163, Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-22815-6_38.

  • Li, Y., Zhang, J., Mitra, H., & Yu, S. (2020). The systematic thinking ability of hardware/software co-design using fpga. 2020 IEEE frontiers in education conference (FIE), pp. 1–6. https://doi.org/10.1109/FIE44824.2020.9274284.

  • Marz, N., & Warren, J. (2015). Big data: principles and best practices of scalable realtime data systems, 1st edn. USA: Manning Publications Co.

    Google Scholar 

  • Pawlak, Z. (2004). Elementary rough set granules: toward a rough set processor. In Rough-neural computing: techniques for computing with words. Cognitive technologies, chapter 1, pp. 5–14. Berlin: Springer. https://doi.org/10.1007/978-3-642-18859-6_1.

  • Penmatsa, R.K.V., Kalidindi, A., & Mallidi, S.K.R. (2020). Feature reduction and optimization of malware detection system using ant colony optimization and rough sets. International Journal of Information Security and Privacy, 14(3, SI), 95–114. https://doi.org/10.4018/IJISP.2020070106.

    Article  Google Scholar 

  • Polkowski, L. (2017). Rough sets, rough mereology and uncertainty. In G. Wang, A. Skowron, Y. Yao, D. Slezak, & L. Polkowski (Eds.) Thriving Rough Sets: 10th anniversary-honoring professor Zdzislaw Pawlak’s life and legacy & 35 years of rough sets, studies in computational intelligence. vol. 708, pp. 49–85. https://doi.org/10.1007/978-3-319-54966-8_4.

  • Roy, P., Goswami, S., Chakraborty, S., Azar, A., & Dey, N. (2014). Image segmentation using rough set theory, a review. International Journal of Rough Sets and Data Analysis, 1, 62–74. https://doi.org/10.4018/ijrsda.2014070105.

    Article  Google Scholar 

  • Sowkuntla, P., & Prasad, P.S.V.S.S. (2020). Mapreduce based improved quick reduct algorithm with granular refinement using vertical partitioning scheme. Knowledge-Based Systems, vol. 189. https://doi.org/10.1016/j.knosys.2019.105104.

  • Stepaniuk, J. (2000). Knowledge discovery by application of rough set models, pp. 137–233. Heidelberg: Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-1840-6_5.

    Book  Google Scholar 

  • Stepaniuk, J. (2008). Rough-granular computing in knowledge discovery and data mining studies in computational intelligence Vol. 152. Berlin: Springer.

    MATH  Google Scholar 

  • Stepaniuk, J., Kopczynski, M., & Grzes, T. (2013). The first step toward processor for rough set methods. Fundamental Information, 127(1–4), 429–443. https://doi.org/10.3233/FI-2013-919.

    Article  Google Scholar 

  • Sun, G., Qi, X., & Zhang, Y. (2011). A fpga-based implementation of rough set theory. pp. 2561–2564. https://doi.org/10.1109/CCDC.2011.5968642.CitedBy2.

  • Sun, G., Wang, H., Lu, J., & He, X. (2013). A FPGA-based discretization algorithm of continuous attributes in rough set, Applied Mechanics and Materials, pp. 278-280 . www.scopus.com. Accessed 12.02.2022.

  • Sun, L., Xu, J., & Li, Y. (2014). A feature selection approach of inconsistent decision systems in rough set. Journal of Computers, 9, 1333–1340.

    Article  Google Scholar 

  • Tan, A., Wu, W.Z., Li, J., & Li, T. (2020). Reduction foundation with multigranulation rough sets using discernibility. Artificial Intelligence Review, 53(4), 2425–2452. https://doi.org/10.1007/s10462-019-09737-0.

    Article  Google Scholar 

  • Tiwari, K., & Kothari, A. (2015). Design and implementation of rough set co-processor on fpga. International Journal of Innovative Computing, Information and Control, 11(2), 641–656.

    Google Scholar 

  • Tiwari, K., & Kothari, A. (2016). Design of intelligent system for medical applications using rough set theory. International Journal of Data Mining, Modelling and Management, 8(3), 279–301. https://doi.org/10.1504/IJDMMM.2016.079069. Cited By 0.

    Article  Google Scholar 

  • Tiwari, K.S., & Kothari, A.G. (2011). Architecture and implementation of attribute reduction algorithm using binary discernibility matrix. In 2011 International conference on computational intelligence and communication networks, pp. 212–216. IEEE. https://doi.org/10.1109/CICN.2011.42.

  • Tiwari, K.S., & Kothari, A.G. (2014). Design and implementation of rough set algorithms on fpga: a survey international journal of advanced research in artificial intelligence vol. 3(9).

  • Tsumoto, S. (2013). Rough Sets and Medical Differential Diagnosis, pp. 605–621. Berlin: Springer. https://doi.org/10.1007/978-3-642-30344-9_23.

    Book  Google Scholar 

  • Wang, C., He, Q., Chen, D., & Hu, Q. (2014). A novel method for attribute reduction of covering decision systems. Information Sciences, 254, 181–196. 10.1016/j.ins.2013.08.057.

    Article  MathSciNet  MATH  Google Scholar 

  • Yin, L., Qin, L., Jiang, Z., & Xu, X. (2021). A fast parallel attribute reduction algorithm using Apache Spark. Knowledge-Based Systems, pp. 212. https://doi.org/10.1016/j.knosys.2020.106582.

  • Zhang, W.X., Mi, J.S., & Wu, W.Z. (2003). Approaches to knowledge reductions in inconsistent systems. International Journal Intelligence System, 18(9), 989–1000. https://doi.org/10.1002/int.10128.

    Article  MATH  Google Scholar 

  • Zhou, B., Cho, H., & Zhang, X. (2018). Scalable implementations of rough set algorithms: a survey. In IEA/AIE.

Download references

Acknowledgements

The work was supported by the grant WZ/WI-IIT/2/2020 from Bialystok University of Technology and funded with resources for research by the Ministry of Science and Higher Education in Poland.

Funding

The work was supported by the grant WZ/WI-IIT/2/2020 from Bialystok University of Technology and funded with resources for research by the Ministry of Science and Higher Education in Poland.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Maciej Kopczynski and Tomasz Grzes. The first draft of the manuscript was written by all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Maciej Kopczynski.

Ethics declarations

Consent for Publication

The participant has consented to the submission of the regular paper to the journal.

Competing interests

The authors have no financial or proprietary interests in any material discussed in this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kopczynski, M., Grzes, T. FPGA supported rough set reduct calculation for big datasets. J Intell Inf Syst 59, 779–799 (2022). https://doi.org/10.1007/s10844-022-00725-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-022-00725-5

Keywords

Navigation