Abstract
Profitable trading requires forecasting future prices, and a forecasting model can only be trusted once its predictions have been verified against past data. A short trading history therefore presents a problem. We show, both theoretically and experimentally, that the history of some financial assets can be reconstructed quite accurately, and we apply this to forecast the past price movements of exchange-traded funds (ETFs). The problem is acute in practice: a number of very liquid ETFs can be traded with minimal slippage, but their available history is too short for systematic traders to test their trading models. To forecast historical ETF prices we used stocks with a longer available history. In some cases we created multiple model instances, each using a different number of stocks; as soon as a stock's history became unavailable, we switched to a different model. We compared this approach and eight other methods on a set of US ETFs ranging from the S&P 500 to uranium. The experimental study showed expectation maximisation with covariance matrix normalisation to be the best method for this task.
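The model-switching idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses ordinary least squares on returns rather than any of the nine methods compared in the paper, and the function name `backcast_returns`, the NaN convention for missing history, and the array layout are all assumptions made for illustration.

```python
import numpy as np

def backcast_returns(etf, stocks):
    """Fill missing ETF returns from stock returns (illustrative only).

    etf    : 1-D array of ETF returns, NaN where the ETF has no history.
    stocks : 2-D array (time x stocks) of stock returns, NaN where a
             stock has no history.
    For each date with a missing ETF return, a separate least-squares
    model is fitted on the stocks available on that date, so the model
    changes whenever a stock's history runs out.
    """
    etf = np.asarray(etf, dtype=float)
    stocks = np.asarray(stocks, dtype=float)
    filled = etf.copy()
    known = ~np.isnan(etf)
    models = {}  # availability pattern -> fitted coefficients (or None)

    for t in np.flatnonzero(~known):
        avail = ~np.isnan(stocks[t])
        if not avail.any():
            continue  # no stock data on this date, leave the gap
        key = tuple(avail)
        if key not in models:
            # Train on dates where the ETF and all selected stocks exist.
            rows = known & ~np.isnan(stocks[:, avail]).any(axis=1)
            if rows.sum() <= avail.sum():
                models[key] = None  # too few observations to fit
            else:
                X = np.column_stack(
                    [np.ones(rows.sum()), stocks[rows][:, avail]]
                )
                coef, *_ = np.linalg.lstsq(X, etf[rows], rcond=None)
                models[key] = coef
        coef = models[key]
        if coef is not None:
            # Intercept plus weighted sum of the available stock returns.
            filled[t] = coef[0] + stocks[t, avail] @ coef[1:]
    return filled
```

Caching one fitted model per availability pattern mirrors the idea of keeping several model instances with different numbers of stocks. A real study would also need to guard against look-ahead bias and unstable weights, which this sketch ignores.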
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Raudys, A., Sirvydis, L., Lisovskij, K. (2012). Synthetic History for Exchange Traded Funds. In: Abramowicz, W., Kriksciuniene, D., Sakalauskas, V. (eds) Business Information Systems. BIS 2012. Lecture Notes in Business Information Processing, vol 117. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30359-3_20
DOI: https://doi.org/10.1007/978-3-642-30359-3_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30358-6
Online ISBN: 978-3-642-30359-3
eBook Packages: Computer Science, Computer Science (R0)