Elsevier

Expert Systems with Applications

Volume 105, 1 September 2018, Pages 159-173
Expert Systems with Applications

A hybrid supervised semi-supervised graph-based model to predict one-day ahead movement of global stock markets and commodity prices

https://doi.org/10.1016/j.eswa.2018.03.037Get rights and content

Highlights

  • A hybrid supervised semi-supervised model for financial prediction is proposed.

  • The model uses both past data of markets being predicted and markets interactions.

  • A network construction algorithm for modeling markets interactions is proposed.

  • The network is constructed from the outset on the basis of prediction purpose.

  • Markets interactions can be more important than past data of markets in prediction.

Abstract

Market prediction has been an important machine learning research topic in recent decades. A neglected issue in prediction is having a model that can simultaneously pay attention to the interaction of global markets along historical data of the target markets being predicted. As a solution, we present a hybrid supervised semi-supervised model called HyS3 for direction of movement prediction. The graph-based semi-supervised part of HyS3 models the markets global interactions through a network designed with a novel continuous Kruskal-based graph construction algorithm called ConKruG. The supervised part of the model injects results extracted from each market’s historical data to the network whenever the hybrid model allows with an innovative conditional mechanism. The significance of higher prediction accuracy of HyS3 is comparing to other models is proved statistically against other models including supervised models and network-based semi-supervised predictions.

Introduction

Prediction of financial time series has been an interesting machine learning topic according to many surveys (Atsalakis, Valavanis, 2009, Rather, Sastry, Agarwal, 2017, Sapankevych, Sankar, 2009, Soni, 2011). In this research, we will focus on predicting one-day ahead, falls and rises of financial time series that is referred to as direction of movement. The financial time series used in this study are some famous stock market indexes across the world, the gold spot price of 10 AM London and prices of different types of crude oil. There are complex interrelations among these financial markets. Some models are developed to show these relations in the field of econophysics (Gao, An, Zhong, 2013, Liu, Tse, 2012, Mantegna, Stanley, 1999, Papana, Kyrtsou, Kugiumtzis, Diks, 2017). They look at the global market environment as a complex system and model it using networks. Network modeling of relations among direction of movements of these markets showed to be useful for improving performance of prediction in the works of Park and Shin (2013) and Shin, Hou, Park, Park, and Choi (2013). We will provide a novel hybrid supervised semi-supervised model called HyS3 for market prediction using network modeling. A novel algorithm is also developed using a continuous Kruskal-based approach for making a graph called ConKruG to model complex markets interactions. In Table 1 the advantages of HyS3 to other models in the literature are presented briefly. Further descriptions and examples of existing researches related to the points mentioned in Table 1 are given in the following part of the introduction.

It can be assumed that direction of movement prediction is a binary classification problem. There are two major methods to classify unknown directions in machine learning literature: supervised and semi-supervised. In supervised learning, there are enough historical data and the aim of the model is to learn the past patterns to predict future directions. In semi-supervised learning, there are a few known market directions for markets that revealed their prices information. Graph-based semi-supervised learning1 market prediction, models the interactions among markets with known or unknown direction with a graph. Zhu (2005) has explained graph-based semi-supervised models in his research. The known markets directions will be spread or propagated to markets with unknown directions. In this study, the earth is partitioned into four time zones in a way that for any given time, only markets in one time zone have revealed their close price information. Therefore, the direction of movement for them is known and the other zones have markets with unknown directions that can be predicted through a GSSL method. HyS3 uses ConKruG algorithm to make the graph. HyS3 uses a novel approach to inject the information from a supervised model (SVM2 in our case), to the ConKruG network.

Many researches in machine learning like Patel, Shah, Thakkar, and Kotecha (2015), Kia, Fathian, and Gholamian (2012), Kara, Boyacioglu, and Baykan (2011) and Yao and Tan (2000) tried to predict a stock index or commodity price simply by its historical data. Some other researches like Fathian and Kia (2012), Huang, Nakamori, and Wang (2005) and Thawornwong and Enke (2004) tried to improve the performance of financial predictions by using some external factors as inputs of their supervised models along with historical data of the target market. Park and Shin (2013) and Shin et al. (2013) used some stock prices data in South Korean market and commodity prices in a network-based semi-supervised model without paying attention to the historical data of the time series being predicted and they achieved better results compared to their rival models using only historical data of the target market. The question that arises here is when to use the past data of target markets and when to use the data from other markets to predict the future direction of a market. HyS3 model has a novel approach for deciding how and when to combine information from historical data of target markets and information achieved from global markets interactions.

Finding the external factor(s) that can help the prediction is sometimes a trial and error task. Sometimes having a prior knowledge of the research domain makes this task easier. Researchers choose some helping markets to add as input next to the historical data of the market being predicted and then test if these helping markets make the prediction performance increase (Jaruszewicz & Mańdziuk, 2004). Other times, researchers choose among a list of time series by the means of a feature reduction algorithm (Tsai & Hsiao, 2010). We show that by using a network structure of relations among external factors ,the model can use a large number of external factors according to their distance and position to the markets that are going to be predicted. The network can regulate the effect of external factors in prediction if made with a purposeful method like ConKruG.

As the surveys like Rather et al. (2017), Soni (2011) Atsalakis and Valavanis (2009) and Sapankevych and Sankar (2009) show, in most studies one or a short list of stock indexes or commodity price time series are selected to be predicted by a model. The lack of a general prediction model is seen in the literature. ConKruG network can model interactions among any number of markets for prediction with HyS3.

Using network structure for market prediction is a newly opened research topic and therefore there are not enough studies up to now. Kia, Haratizadeh, and Heshmati (2016), Park and Shin (2013) and Shin et al. (2013) constructed networks that seemed to be helpful in modeling the complex relations among movements of financial time series. They first assumed that a special distance function or a special rule derived from their datasets may be helpful in constructing a prediction network and then tested their assumption. In our approach, ConKruG designs the network gradually by adding any helpful link for the prediction aim.

Another aspect of the research, is the time period of prediction. Narayan, Sharma, and Thuraisamy (2015) discuss the importance of data frequency in sampling. Noisy nature of financial time series makes it difficult to predict direction for short period frequencies. Therefore many models try to predict a threshold of change in the direction (Homm, Breitung, 2012, Roy, Sarkar, 2013) or use longer time periods like week or month (Boyacioglu & Avci, 2010) or simply try to predict the trend by smoothing the time series with moving averages or other methods (Park & Shin, 2013). In this paper, we construct a model for predicting daily direction of movement for everyday in markets across the globe despite the difficulties of improving the accuracy in the noisy financial environment and short time period.

The structure of the paper is as follows: in the next section some preliminaries about the GSSL and SVM are explained. At the end of the section two, we discuss our most important rival models prediction algorithms in brief. The third section explains our hybrid proposed model, HyS3 in detail and describes the proposed novel graph construction algorithm, ConKruG. The final section, explains the model settings and parameters, presents the results and discusses their interpretation. The last section concludes and has some suggestions for further researches.

Section snippets

Preliminaries and related works

In our model, we modify the GSSL algorithm to inject the probabilities vector output from SVM. Therefore, a brief description of general forms of GSSL algorithms and SVM are presented. Then the works of researchers in Park and Shin (2013) and Shin et al. (2013) studies are explained to be compared with our method and results in later sections.

HyS3 model

An explanation of HyS3, the proposed hybrid supervised semi-supervised prediction is presented in this section. Fig. 4 presents an overview of our approach. The sub-systems of the model are numbered for better explanation in text.

The first block of the model is data preparation block. It prepares data as described in Section 3.1 and divides the dataset. Some parts of the model like ConKruG only use train and validation datasets. It is worth mentioning that prediction models (when trained or

Experimental settings and results

The evaluation results of our proposed model is the accuracy of direction prediction. The number of two classes of rise and fall movements are balanced with an acceptable approximation for stock indexes and commodity prices. The statistical significance of the higher accuracy of our model is presented by an appropriate statistical test.

The codes of ConKruG algorithm and HyS3 model are in github repository (Kia, Haratizadeh, & Shouraki, 2018) with LGPL v3.0 license. Other models that led to the

Conclusion and future researches

A better prediction can be gained by using global markets information together with historical data of the markets being predicted. Capability of graph-based semi-supervised methods to use global market information for prediction depends on the constructed network that models markets interactions. A network that is built from the beginning of the construction process by looking at the type of problem that it seeks to solve is more suitable for prediction. We created an innovative network

References (50)

  • J. Patel et al.

    Predicting stock and stock price index movement using trend deterministic data preparation and machine learning techniques

    Expert Systems with Applications

    (2015)
  • ShinH. et al.

    Prediction of movement direction in crude oil prices based on semi-supervised learning

    Decision Support Systems

    (2013)
  • ShinK.-S. et al.

    An application of support vector machines in bankruptcy prediction model

    Expert Systems with Applications

    (2005)
  • S. Thawornwong et al.

    The adaptive selection of financial and economic variables for use with artificial neural networks

    Neurocomputing

    (2004)
  • TsaiC.-F. et al.

    Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches

    Decision Support Systems

    (2010)
  • WengB. et al.

    Stock market one-day ahead movement prediction using disparate data sources

    Expert Systems with Applications

    (2017)
  • YaoJ. et al.

    A case study on using neural networks to perform technical forecasting of Forex

    Neurocomputing

    (2000)
  • J.R. Anderson

    The architecture of cognition

    (2013)
  • M.S. Baghshah et al.

    Scalable semi-supervised clustering by spectral kernel learning

    Pattern Recognition Letters

    (2014)
  • G.C. Cawley et al.

    On over-fitting in model selection and subsequent selection bias in performance evaluation

    Journal of Machine Learning Research

    (2010)
  • ChangC.-C. et al.

    LIBSVM: A library for support vector machines

    ACM Transactions on Intelligent Systems and Technology (TIST)

    (2011)
  • O. Chapelle et al.

    Semi-supervised learning (Chapelle, O. et al., Eds.; 2006) [book reviews]

    IEEE Transactions on Neural Networks

    (2009)
  • T.H. Cormen

    Introduction to algorithms

    (2009)
  • C. Cortes et al.

    Support vector machine

    Machine Learning

    (1995)
  • M. Fathian et al.

    Exchange rate prediction with multilayer perceptron neural network using gold price as external factor

    Management Science Letters

    (2012)
  • Cited by (34)

    • GCNET: Graph-based prediction of stock price movement using graph convolutional network

      2022, Engineering Applications of Artificial Intelligence
      Citation Excerpt :

      Taking into account the parameters of the network, they developed an AutoRegressive Integrated Moving Average (ARIMA) model and predicted the value of the S&p500 index. In Kia et al. (2018) a graph structure is introduced in which the nodes represent the markets, and the edges represent strong correlations between the market index time series. The nodes whose corresponding indices are already known, are labeled while the nodes whose corresponding indices are still unknown, due to the time differences between different time zones, are predicted using a label spreading algorithm.

    • Forecasting the overnight return direction of stock market index combining global market indices: A multiple-branch deep learning approach

      2022, Expert Systems with Applications
      Citation Excerpt :

      Marschinski and Kantz (2002) and Kwon and Yang (2008) employ the transfer entropy to investigate the relationships among different SMIs and confirm the existence of information transfer. Furthermore, forecasting the SMI movements based on international SMIs has become a hot research topic in recent years, and there is a large body of literature that uses international SMIs as the inputs of machine learning techniques to forecast the target SMI’s movements (Hoseinzade & Haratizadeh, 2019; Karaca et al., 2020; Kia et al., 2018; Malagrino et al., 2018). There are generally three types of methods for forecasting a stock market with these multisource data: (1) conventional machine learning techniques (Zhong & Enke, 2017), (2) ensemble learning techniques (Weng et al., 2018), and (3) deep learning techniques (Long et al., 2019).

    View all citing articles on Scopus
    View full text