Skip to main content
Log in

A network-based data mining approach to portfolio selection via weighted clique relaxations

  • Published:
Annals of Operations Research Aims and scope Submit manuscript

Abstract

We introduce a new network-based data mining approach to selecting diversified portfolios by modeling the stock market as a network and utilizing combinatorial optimization techniques to find maximum-weight s-plexes in the obtained networks. The considered approach is based on the weighted market graph model, which is used for identifying clusters of stocks according to a correlation-based criterion. The proposed techniques provide a new framework for selecting profitable diversified portfolios, which is verified by computational experiments on historical data over the past decade. In addition, the proposed approach can be used as a complementary tool for narrowing down a set of “candidate” stocks for a diversified portfolio, which can potentially be analyzed using other known portfolio selection techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Algorithm 1
Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Abello, J., Resende, M. G. C., & Sudarsky, S. (2002). Massive quasi-clique detection. In S. Rajsbaum (Ed.), LATIN 2002: theoretical informatics (pp. 598–612). London: Springer.

    Chapter  Google Scholar 

  • Balasundaram, B., Butenko, S., & Hicks, I. V. (2011). Clique relaxations in social network analysis: the maximum k-plex problem. Operations Research, 59, 133–142.

    Article  Google Scholar 

  • Boginski, V., Butenko, S., & Pardalos, P. (2003). On structural properties of the market graph. In A. Nagurney (Ed.), Innovations in financial and economic networks (pp. 29–45). Northampton: Edward Elgar.

    Google Scholar 

  • Boginski, V., Butenko, S., & Pardalos, P. (2005). Statistical analysis of financial networks. Computational Statistics & Data Analysis, 48(2), 431–443.

    Article  Google Scholar 

  • Boginski, V., Butenko, S., & Pardalos, P. (2006). Mining market data: a network approach. Computers & Operations Research, 33(11), 3171–3184.

    Article  Google Scholar 

  • Bomze, I. M., Budinich, M., Pardalos, P. M., & Pelillo, M. (1999). The maximum clique problem. In D.-Z. Du & P. M. Pardalos (Eds.), Handbook of combinatorial optimization (pp. 1–74). Dordrecht: Kluwer Academic.

    Chapter  Google Scholar 

  • Elton, E. J., & Gruber, M. J. (1997). Modern portfolio theory, 1950 to date. Journal of Banking & Finance, 21, 1743–1759.

    Article  Google Scholar 

  • Grane, A., & Veiga, H. (2010). Wavelet-based detection of outliers in financial time series. Computational Statistics & Data Analysis, 54, 2580–2593.

    Article  Google Scholar 

  • Harary, F., & Ross, I. C. (1957). A procedure for clique detection using the group matrix. Sociometry, 20(3), 205–215.

    Article  Google Scholar 

  • Kim, T., & White, H. (2004). On more robust estimation of skewness and kurtosis. Finance Research Letters, 1, 56–73.

    Article  Google Scholar 

  • Luce, R. D. (1950). Connectivity and generalized cliques in sociometric group structure. Psychometrika, 15(2), 169–190.

    Article  Google Scholar 

  • Luce, R. D., & Perry, A. D. (1949). A method of matrix analysis of group structure. Psychometrika, 14(2), 95–116.

    Article  Google Scholar 

  • Luenberger, D. G. (1998). Investment science. New York: Oxford University Press.

    Google Scholar 

  • Mantegna, R. N., & Stanley, H. E. (2000). An introduction to econophysics: correlations and complexity in finance. Cambridge: Cambridge University Press.

    Google Scholar 

  • Markowitz, H. M. (1952). Portfolio selection. The Journal of Finance, 7, 77–91.

    Google Scholar 

  • McClosky, B. (2011) Clique relaxations. In: J. Cochran (Ed.), Encyclopedia of operations research and management science. New York: Wiley.

    Google Scholar 

  • Mokken, R. J. (1979). Cliques, clubs and clans. Quality and Quantity, 13, 161–173.

    Article  Google Scholar 

  • Newman, M. (2003). The structure and function of complex networks. SIAM Review, 45, 167–256.

    Article  Google Scholar 

  • Östergȧrd, P. R. J. (1999). A new algorithm for the maximum-weight clique problem. Electronic Notes in Discrete Mathematics, 3, 153–156.

    Article  Google Scholar 

  • Pattillo, J., Youssef, N., & Butenko, S. (2013). On clique relaxation models in network analysis. European Journal of Operational Research, 226, 9–18.

    Article  Google Scholar 

  • Pena, D. (2001). Outliers, influential observations and missing data. In D. Pena, G. Tiao, & R. Tsay (Eds.), A course in time series (pp. 136–170). New York: Wiley.

    Google Scholar 

  • Seidman, S. B., & Foster, B. L. (1978). A graph theoretic generalization of the clique concept. The Journal of Mathematical Sociology, 6, 139–154.

    Article  Google Scholar 

  • Shirokikh, O., Pastukhov, G., Boginski, V., & Butenko, S. (2013). Computational study of the U.S. stock market evolution: a rank correlation-based network model. Computational Management Science. doi:10.1007/s10287-012-0160-4.

    Google Scholar 

  • Trukhanov, S., Balasubramaniam, C., Balasundaram, B., & Butenko, S. (2013, working paper). Exact algorithms for hard node deletion problems in graphs.

  • Yu, H., Paccanaro, A., Trifonov, V., & Gerstein, M. (2006). Predicting interactions in protein networks by completing defective cliques. Bioinformatics, 22, 823–829.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vladimir Boginski.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Boginski, V., Butenko, S., Shirokikh, O. et al. A network-based data mining approach to portfolio selection via weighted clique relaxations. Ann Oper Res 216, 23–34 (2014). https://doi.org/10.1007/s10479-013-1395-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10479-013-1395-3

Keywords

Navigation