Three $$l_1$$ Based Nonconvex Methods in Constructing Sparse Mean Reverting Portfolios

Long, Xiaolong; Solna, Knut; Xin, Jack

doi:10.1007/s10915-017-0578-5

Three $l_1$ Based Nonconvex Methods in Constructing Sparse Mean Reverting Portfolios

Published: 20 October 2017

Volume 75, pages 1156–1186, (2018)
Cite this article

Journal of Scientific Computing Aims and scope Submit manuscript

Xiaolong Long¹,
Knut Solna² &
Jack Xin²

204 Accesses
Explore all metrics

Abstract

We study the problem of constructing sparse and fast mean reverting portfolios. The problem is motivated by convergence trading and formulated as a generalized eigenvalue problem with a cardinality constraint (d’Aspremont in Quant Finance 11(3):351–364, 2011). We use a proxy of mean reversion coefficient, the direct Ornstein–Uhlenbeck estimator, which can be applied to both stationary and nonstationary data. In addition, we introduce three different methods to enforce the sparsity of the solutions. One method uses the ratio of $l_1$ and $l_2$ norms and the other two use $l_1$ norm. We analyze various formulations of the resulting non-convex optimization problems and develop efficient algorithms to solve them for portfolio sizes as large as hundreds. By adopting a simple convergence trading strategy, we test the performance of our sparse mean reverting portfolios on both synthetic and historical real market data. In particular, the $l_1$ regularization method, in combination with quadratic program formulation as well as difference of convex functions and least angle regression treatment, gives fast and robust performance on large out-of-sample data set.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A novel regularization-based optimization approach to sparse mean-reverting portfolios selection

Article 27 January 2023

An Iterative Approach to Ill-Conditioned Optimal Portfolio Selection

Article Open access 18 November 2019

Numerical Solution of the Regularized Portfolio Selection Problem

References

Barekat, F., Yin, K., Caflisch, R.E., Osher, S.J., Lai, R., Ozolins, V.: Compressed wannier modes found from an $L_1$ regularized energy functional. arXiv preprint arXiv:1403.6883 (2014)
Box, G.E., Tiao, G.C.: A canonical analysis of multiple time series. Biometrika 64(2), 355–365 (1977)
Article MathSciNet MATH Google Scholar
Box, M., Davies, D., Swann, W.H., Australia, I.: Non-linear Optimization Techniques, vol. 5. Oliver & Boyd, Edinburgh (1969)
Google Scholar
Chow, S.-N., Yang, T.-S., Zhou, H.: Global optimizations by intermittent diffusion. National Science Council Tunghai University Endowment Fund for Academic Advancement Mathematics Research Promotion Center, p. 121 (2009)
Cuturi, M., D’aspremont, A.: Mean reversion with a variance threshold. In: Proceedings of the 30th International Conference on Machine Learning (ICML-13), pp. 271–279 (2013)
d’Aspremont, A.: Identifying small mean-reverting portfolios. Quant. Finance 11(3), 351–364 (2011)
Article MathSciNet MATH Google Scholar
d’Aspremont, A., El Ghaoui, L., Jordan, M.I., Lanckriet, G.R.: A direct formulation for sparse PCA using semidefinite programming. SIAM Rev. 49(3), 434–448 (2007)
Article MathSciNet MATH Google Scholar
Efron, B., Hastie, T., Johnstone, I., Tibshirani, R., et al.: Least angle regression. Ann. Stat. 32(2), 407–499 (2004)
Article MathSciNet MATH Google Scholar
Engle, R.F., Granger, C.W.: Co-integration and error correction: representation, estimation, and testing. Econom. J. Econom. Soc. 55, 251–276 (1987)
Esser, E., Lou, Y., Xin, J.: A method for finding structured sparse solutions to nonnegative least squares problems with applications. SIAM J. Imaging Sci. 6(4), 2010–2046 (2013)
Article MathSciNet MATH Google Scholar
Fogarasi, N., Levendovszky, J.: Improved parameter estimation and simple trading algorithm for sparse, mean reverting portfolios. In: Annales Universitatis Scientiarium Budapestinensis, Sectio Computatorica, vol. 37, pp. 121–144 (2012)
Horst, R., Thoai, N.V.: DC programming: overview. J. Optim. Theory Appl. 103(1), 1–43 (1999)
Article MathSciNet MATH Google Scholar
Hotelling, H.: Relations between two sets of variates. Biometrika 28(3–4), 321–377 (1936)
Article MATH Google Scholar
Hoyer, P.O.: Non-negative matrix factorization with sparseness constraints. J. Mach. Learn. Res. 5, 1457–1469 (2004)
MathSciNet MATH Google Scholar
Hu, Y., Long, H.: Least squares estimator for Ornstein–Uhlenbeck processes driven by $\alpha $-stable motions. Stoch. Process. Appl. 119(8), 2465–2480 (2009)
Article MathSciNet MATH Google Scholar
Ji, H., Li, J., Shen, Z., Wang, K.: Image deconvolution using a characterization of sharp images in wavelet domain. Appl. Comput. Harmonic Anal. 32(2), 295–304 (2012)
Article MathSciNet MATH Google Scholar
Jolliffe, I.T., Trendafilov, N.T., Uddin, M.: A modified principal component technique based on the lasso. J. Comput. Gr. Stat. 12(3), 531–547 (2003)
Article MathSciNet Google Scholar
Journée, M., Nesterov, Y., Richtárik, P., Sepulchre, R.: Generalized power method for sparse principal component analysis. J. Mach. Learn. Res. 11, 517–553 (2010)
MathSciNet MATH Google Scholar
Krishnan, D., Tay, T., Fergus, R.: Blind deconvolution using a normalized sparsity measure. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 233–240. IEEE (2011)
Lai, R., Osher, S.: A splitting method for orthogonality constrained problems. J. Sci. Comput. 58(2), 431–449 (2014)
Article MathSciNet MATH Google Scholar
Le Thi, H., Huynh, V., Pham Dinh, T.: Convergence analysis of DC algorithm for DC programming with subanalytic data. Annals of Operations Research, Technical Report, LMI, INSA-Rouen (2009)
Le Thi, H.A., Tao, P.D.: Solving a class of linearly constrained indefinite quadratic problems by DC algorithms. J. Glob. Optim. 11(3), 253–285 (1997)
Article MATH Google Scholar
Natarajan, B.K.: Sparse approximate solutions to linear systems. SIAM J. Comput. 24(2), 227–234 (1995)
Article MathSciNet MATH Google Scholar
Ngo, T.T., Bellalij, M., Saad, Y.: The trace ratio optimization problem. SIAM Rev. 54(3), 545–569 (2012)
Article MathSciNet MATH Google Scholar
Ozoliņš, V., Lai, R., Caflisch, R., Osher, S.: Compressed modes for variational problems in mathematics and physics. Proc. Natl. Acad. Sci. 110(46), 18368–18373 (2013)
Article MathSciNet MATH Google Scholar
Ozoliņš, V., Lai, R., Caflisch, R., Osher, S.: Compressed plane waves yield a compactly supported multiresolution basis for the laplace operator. Proc. Natl. Acad. Sci. 111(5), 1691–1696 (2014)
Article MathSciNet MATH Google Scholar
Richtárik, P., Takáč, M., Ahipaşaoğlu, S.D.: Alternating maximization: unifying framework for 8 sparse PCA formulations and efficient parallel codes. arXiv preprint arXiv:1212.4137 (2012)
Sriperumbudur, B.K., Torres, D.A., Lanckriet, G.R.: Sparse eigen methods by dc programming. In: Proceedings of the 24th International Conference on Machine Learning, pp. 831–838. ACM (2007)
Tao, P.D., Le Thi, H.A.: Convex analysis approach to DC programming: theory, algorithms and applications. Acta Math. Vietnam. 22(1), 289–355 (1997)
MathSciNet MATH Google Scholar
Tao, P.D., Le Thi, H.A.: A DC optimization algorithm for solving the trust-region subproblem. SIAM J. Optim. 8(2), 476–505 (1998)
Article MathSciNet MATH Google Scholar
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58, 267–288 (1996)
Yin, P., Esser, E., Xin, J.: Ratio and difference of l1 and l2 norms and sparse representation with coherent dictionaries. Commun. Inf. Syst. 14(2), 87–109 (2014)
MathSciNet Google Scholar
Yu, J.: Bias in the estimation of the mean reversion parameter in continuous time models. J. Econom. 169(1), 114–122 (2012)
Article MathSciNet MATH Google Scholar
Zou, H., Hastie, T., Tibshirani, R.: Sparse principal component analysis. J. Comput. Gr. Stat. 15(2), 265–286 (2006)
Article MathSciNet Google Scholar

Download references

Acknowledgements

We would like to thank Dr. Wuan Luo for bringing reference [6] to our attention and for helpful communication.

Author information

Authors and Affiliations

8700 Pershing Dr., Playa Del Rey, CA, 90293, USA
Xiaolong Long
Department of Mathematics, University of California at Irvine, Irvine, CA, 92697, USA
Knut Solna & Jack Xin

Authors

Xiaolong Long
View author publications
You can also search for this author inPubMed Google Scholar
Knut Solna
View author publications
You can also search for this author inPubMed Google Scholar
Jack Xin
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Xiaolong Long.

Additional information

Dedicated to the memory of our good friend and colleague Ernie Esser.

This work was partially supported by NSF Grants DMS-1222507, DMS-1522383, and IIS-1632935.

Appendices

Appendix A: Semi-definite Relaxation Method Derived From $l_1/l_2$

By setting $X = xx^T$, then the problem (12) is equivalent to

$$\begin{aligned} \begin{array}{clcr} \min _{X} &{} \mathrm {trace}(AX)/\mathrm {trace}(BX)\\ s.t. &{} \begin{array}[t]{ll} \frac{1^T |X| 1}{\mathrm {trace}(X)} \le m^2\\ rank(X) = 1\\ X \succeq 0 \end{array} \end{array} \end{aligned}$$

where |X| means we take the absolute value for each entry of X.

Then after a change of variables:

$$\begin{aligned} Y = \frac{X}{\mathrm {trace}(BX)},\quad z = \frac{1}{\mathrm {trace}(BX)} \end{aligned}$$

and dropping the rank constraint, the previous problem can be written as a semidefinite programming problem:

$$\begin{aligned} \begin{array}{clcr} \min _{Y} &{} \mathrm {trace}(AY)\\ s.t. &{} \begin{array}[t]{ll} 1^T|Y|1\le m^2 z\\ \mathrm {trace}(Y) - z = 0\\ \mathrm {trace}(BY) = 1\\ Y \succeq 0\\ \end{array} \end{array} \end{aligned}$$

(19)

If we set $\mathrm {Card}(x) = k = m^2$, this is exactly the semidefinite relaxation in [6].

Appendix B: Estimation of the Matrices A and B

For the estimations below, we assume that each column of the data matrix S represents an asset and its mean is 0. Its size is $l\times n$, so we have l observations and n assets.

We define $S_c$ and $S_f$ in the following way:

$$\begin{aligned} S_c= & {} \left( \begin{array}{c} S_1^T \\ \vdots \\ S_{l-1}^T \end{array} \right) =\left( \begin{array}{ccc} S_{11}&{} \ldots &{} S_{1n} \\ \vdots &{} \ddots &{}\vdots \\ S_{l-1,1}&{} \ldots &{} S_{l-1,n}\end{array} \right) \\ S_f= & {} \left( \begin{array}{c} S_2^T \\ \vdots \\ S_{l}^T \end{array} \right) =\left( \begin{array}{ccc} S_{21}&{} \ldots &{} S_{2n} \\ \vdots &{} \ddots &{}\vdots \\ S_{l1}&{} \ldots &{} S_{ln}\end{array} \right) \end{aligned}$$

where $S_{ti}$ is the value at time t of the ith asset.

1.1 Appendix B.1: Predictability

In [11], the authors discussed several methods in estimating $\beta $ and $\Gamma $ in VAR(1) model (4). In most cases, the number of the observations of assets values l is greater than the number of assets n. Under this case and previous assumptions, we could use the following estimates:

$$\begin{aligned} {\hat{\beta }} = (S_{c}^T S_{c})^{-1}(S^T_{c}S_{f})) \quad {\hat{\Gamma }}= \frac{1}{l-1}S^T_{c}S_{c} \end{aligned}$$

Therefore, the matrices in problem (7) can be estimated as:

$$\begin{aligned} {\hat{A}} = {\hat{\beta }}^T {\hat{\Gamma }}{\hat{\beta }}\quad {\hat{B}} = {\hat{\Gamma }} \end{aligned}$$

1.2 Appendix B.2: Direct OU Estimator

Using a similar method as B.1, we estimate the matrices A and B as follows:

$$\begin{aligned} {\hat{A}} = \frac{1}{l-1} \left( S_{c}^T S_{f}+S_{f}^T S_{c}\right) \quad {\hat{B}} = \frac{2}{l-1} S_{c}^T S_{c} \end{aligned}$$

Rights and permissions

Reprints and permissions

About this article

Cite this article

Long, X., Solna, K. & Xin, J. Three $l_1$ Based Nonconvex Methods in Constructing Sparse Mean Reverting Portfolios. J Sci Comput 75, 1156–1186 (2018). https://doi.org/10.1007/s10915-017-0578-5

Download citation

Received: 21 September 2015
Revised: 07 October 2017
Accepted: 10 October 2017
Published: 20 October 2017
Issue Date: May 2018
DOI: https://doi.org/10.1007/s10915-017-0578-5

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Three \(l_1\) Based Nonconvex Methods in Constructing Sparse Mean Reverting Portfolios

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A novel regularization-based optimization approach to sparse mean-reverting portfolios selection

An Iterative Approach to Ill-Conditioned Optimal Portfolio Selection

Numerical Solution of the Regularized Portfolio Selection Problem

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendices

Appendix A: Semi-definite Relaxation Method Derived From \(l_1/l_2\)

Appendix B: Estimation of the Matrices A and B

1.1 Appendix B.1: Predictability

1.2 Appendix B.2: Direct OU Estimator

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Subscribe and save

Buy Now

Three \(l_1\) Based Nonconvex Methods in Constructing Sparse Mean Reverting Portfolios

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A novel regularization-based optimization approach to sparse mean-reverting portfolios selection

An Iterative Approach to Ill-Conditioned Optimal Portfolio Selection

Numerical Solution of the Regularized Portfolio Selection Problem

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendices

Appendix A: Semi-definite Relaxation Method Derived From \(l_1/l_2\)

Appendix B: Estimation of the Matrices A and B

1.1 Appendix B.1: Predictability

1.2 Appendix B.2: Direct OU Estimator

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Subscribe and save

Buy Now