skip to main content
research-article

A Novel Feature Selection Method for Risk Management in High-Dimensional Time Series of Cryptocurrency Market

Authors Info & Claims
Published:28 September 2023Publication History
Skip Abstract Section

Abstract

In this study, a novel approach for feature selection has been presented in order to overcome the challenge of classifying positive and negative risk prediction in the cryptocurrency market, which contains high fluctuation. This approach is based on maximizing information gain with simultaneously minimizing the similarity of selected features to achieve a proper feature set for improving classification accuracy. The proposed method was compared with other feature selection techniques, such as sequential and bidirectional feature selection, univariate feature selection, and least absolute shrinkage and selection operator. To evaluate the feature selection techniques, several classifiers were employed: XGBoost, k-nearest neighbor, support vector machine, random forest, logistic regression, long short-term memory, and deep neural networks. The features were elicited from the time series of Bitcoin, Binance, and Ethereum cryptocurrencies. The results of applying the selected features to different classifiers indicated that XGBoost and random forest provided better results on the time series datasets. Furthermore, the proposed feature selection method achieved the best results on two (out of three) cryptocurrencies. The accuracy in the best state varied between 55% to 68% for different time series. It is worth mentioning that preprocessed features were used in this research, meaning that raw data (candle data) were used to derive efficient features that can explain the problem and help the classifiers in predicting the labels.

REFERENCES

  1. [1] Melisa Ozdamar, Ahmet Sensoy, and Akdeniz Kevent. 2022. Retail vs institutional investor attention in the cryptocurrency market. Journal of International Financial Markets, Institutions and Money 81 (2022), 101674. DOI: Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Arunima Ghosh, Shashank Gupta, Amit Dua, and Neeraj Kumar. 2020. Security of cryptocurrencies in blockchain technology: State-of-art, challenges and future prospects. Journal of Network and Computer Applications 163 (2020), 102635. Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Hatem Brik, Jihene El Ouakdi, and Ftiti Zied. 2022. Roles of stable versus nonstable cryptocurrencies in Bitcoin market dynamics. Research in International Business and Finance 62 (2022), 101720. Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Mohil Mahesh, Kumar Patel, Sudeep Tanwar, Rajesh Gupta, and Kumar Neeraj. 2020. A deep learning-based cryptocurrency price prediction scheme for financial institutions. Journal of Information Security and Applications 55 (2020), 102583. Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Minqi Jiang, Jiapeng Liu, and Lu Zhang. 2021. An extended regularized Kalman filter based on genetic algorithm: Application to dynamic asset pricing models. The Quarterly Review of Economics and Finance 79 (2021), 2844. Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Amirizadeh Elham and Boostani Reza. 2021. CDEC: A constrained deep embedded clustering. International Journal of Intelligent Computing and Cybernetics 14, 4 (2021), 686701. .Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Boostani Reza, Karimzadeh Foroozan, and Nami Mohammad. 2017. A comparative review on sleep stage classification methods in patients and healthy individuals. Computer Methods and Programs in Biomedicine 140 (2017), 7791. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Moayedi Fatemeh, Azimifar Zohreh, Boostani Reza, and Katebi Serajoddin. 2010. Contourlet-based mammography mass classification using the SVM family. Computers in Biology and Medicine 40, 4 (2010), 373383. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Goshtasbi Narges, Boostani Reza, and Sanei Saeid. 2022. SleepFCN: A fully convolutional deep learning framework for sleep stage classification using single-channel electroencephalograms. IEEE Transactions on Neural Systems and Rehabilitation Engineering 30 (2022), 20882096. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Afshar Sara, Boostani Reza, and Sanei Saeid. 2021. A combinatorial deep learning structure for precise depth of anesthesia estimation from EEG signals. IEEE Journal of Biomedical and Health Informatics 25, 9 (2021), 34083415. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Rajabi Shahab, Roozkhos Pardis, and Motahari Farimani Nasser. 2022. MLP-based learnable window size for Bitcoin price prediction. Applied Soft Computing 129 (2022), 109584. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Jaquart Patrick, Dann David, and Weinhardt Christof. 2021. Short-term bitcoin market prediction via machine learning. The Journal of Finance and Data Science 7 (2021), 4566. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Chen Zheshi, Li Chunhong, and Sun Wenjun. 2020. Bitcoin price prediction using machine learning: An approach to sample dimension engineering. Journal of Computational and Applied Mathematics 365 (2020), 112395. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Atsalakis George, Atsalaki Ioanna G., Pasiouras Fotios, and Zopounidis Constantin. 2019. Bitcoin price forecasting with neuro-fuzzy techniques. European Journal of Operational Research 276 (2019), 770780. Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Yaohao Peng, Pedro Henrique Melo Albuquerque, Herbert Kimura, Cayan Atreio Portela, and Barcena Saavedra. 2021. Feature selection and deep neural networks for stock price direction forecasting using technical analysis indicators. Machine Learning with Applications 5 (2021), 100060. Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Tibshirani Robert. 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) (1996), 267288. Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Takeshi Emura, Shigeyuki Matsui, and Chen Hsuan-Yu. 2019. Compound.Cox: Univariate feature selection and compound covariate for predicting survival. Computer Methods and Programs in Biomedicine 168 (2019), 2137. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Cover Thomas M. and Hart Peter E.. 1967. Nearest neighbor pattern classification. IEEE Transactions on Information Theory 13, 1 (1967), 2127. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Cortes Corinna and Vapnik Vladimir. 1995. Support-vector networks. Machine Learning 20, 3 (1995), 273297. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. [20] TinKam Ho. 1998. The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis and Machine Intelligence 2, 8 (1995), 832844. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Hosmer David W. and Lemeshow Stanley. 2000. Applied Logistic Regression (2nd ed.). WileyGoogle ScholarGoogle ScholarCross RefCross Ref
  22. [22] Sagi Omer and Rokach Lior. 2021. Approximating XGBoost with an interpretable decision tree. Information Sciences 572 (2021), 522542. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Hashempour Sara, Boostani Reza, Mohammadi Mokhktar, and Sanei Saeid. 2022. Continuous scoring of depression from EEG signals via a hybrid of convolutional neural networks. IEEE Transactions on Neural Systems and Rehabilitation Engineering 30 (2022), 176183. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Dehghani Maryam, Mobaien Ali, and Boostani Reza. 2021. A deep neural network-based transfer learning to enhance the performance and learning speed of BCI systems. Brain-Computer Interfaces 8, 1-2 (2021), 1425. Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Afrasiabi Somayeh, Boostani Reza, Masnadi-Shirazi Mohammad Ali, and Nezam Tahereh. 2021. An EEG based hierarchical classification strategy to differentiate five intensities of pain. Expert Systems with Applications 180 (2021), 115010-1-14. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Modarres-Haghighi Parisa, Boostani Reza, Nami Mohammad, and Sanei Saeid. 2021. Quantification of pain severity using EEG-based functional connectivity. Biomedical Signal Processing and Control 69 (2021), 102840. Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Hossein Shakoor Mohammad, Boostani Reza, Sabeti Malihe, and Mohammadi Mokhtar. 2023. Feature selection and mapping of local binary pattern for texture classification. Multimedia Tools and Applications 82, 5 (2023), 76397676. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Ganjei Mohammad Ahmadi and Boostani Reza. 2022. A hybrid feature selection scheme for high-dimensional data. Engineering Applications of Artificial Intelligence 113 (2022), 104894. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Ganjei Mohammad Ahmadi and Boostani Reza. 2019. A fast hybrid feature selection method. 9th International Conference on Computer and Knowledge Engineering (ICCKE), Tehran (Iran), (2019), 611. DOI: Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Ijadi Maghsoodi Abtin. 2023. Cryptocurrency portfolio allocation using a novel hybrid and predictive big data decision support system. 115 (2023), 102787. Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Yi-Shuai Ren, Chao-Qun Ma, Xiao-Lin Kong, Konstantinos Baltas, and Qasim Zureigat. 2022. Past, present, and future of the application of machine learning in cryptocurrency research. Research in International Business and Finance 63 (2022), 101799. Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Novel Feature Selection Method for Risk Management in High-Dimensional Time Series of Cryptocurrency Market

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image Journal of Data and Information Quality
            Journal of Data and Information Quality  Volume 15, Issue 3
            September 2023
            326 pages
            ISSN:1936-1955
            EISSN:1936-1963
            DOI:10.1145/3611329
            Issue’s Table of Contents

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 28 September 2023
            • Online AM: 26 May 2023
            • Accepted: 17 April 2023
            • Revised: 10 April 2023
            • Received: 16 December 2022
            Published in jdiq Volume 15, Issue 3

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Full Text

          View this article in Full Text.

          View Full Text