Abstract
The ℓ1,∞ norm and the ℓ1,2 norm are well-known tools for joint regularization in Group-Lasso methods. While the ℓ1,2 version has been studied in detail, open questions remain regarding the uniqueness of solutions and the efficiency of algorithms for the ℓ1,∞ variant. For the latter, we characterize the conditions for uniqueness of solutions, present a simple test for uniqueness, and derive a highly efficient active-set algorithm that can handle input dimensions in the millions. We compare both variants of the Group-Lasso in its two most common application scenarios: the first is obtaining sparsity on the level of groups in "standard" prediction problems; the second is multi-task learning, where the aim is to solve many learning problems in parallel that are coupled via the Group-Lasso constraint. We show that both versions perform quite similarly in "standard" applications. However, a clear distinction between the variants emerges in multi-task settings, where the ℓ1,2 version consistently outperforms its ℓ1,∞ counterpart in terms of prediction accuracy.
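To make the two penalties concrete, the following is a minimal NumPy sketch, not the authors' implementation: the weight vector, the grouping, and the helper name group_penalties are illustrative. The ℓ1,∞ penalty sums, over groups, the largest absolute coefficient of each group, while the ℓ1,2 penalty sums the Euclidean norms of the groups.

```python
import numpy as np

def group_penalties(w, groups):
    """Compute both mixed-norm Group-Lasso penalties for a weight
    vector w partitioned into groups (each group a list of indices).

    l1_inf: sum over groups of the max absolute coefficient (ell_1,inf).
    l1_2:   sum over groups of the Euclidean norm (ell_1,2).
    """
    l1_inf = sum(np.max(np.abs(w[g])) for g in groups)
    l1_2 = sum(np.linalg.norm(w[g]) for g in groups)
    return l1_inf, l1_2

# Illustrative example: a 6-dimensional weight vector split into 3 groups.
w = np.array([0.5, -1.0, 0.0, 2.0, 0.3, -0.3])
groups = [[0, 1], [2, 3], [4, 5]]
print(group_penalties(w, groups))
```

In both cases the outer ℓ1 sum over groups is what drives entire groups of coefficients to zero; the two variants differ only in how coefficients are aggregated within a group.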
© 2010 Springer-Verlag Berlin Heidelberg
Cite this paper
Vogt, J.E., Roth, V. (2010). The Group-Lasso: ℓ1,∞ Regularization versus ℓ1,2 Regularization. In: Goesele, M., Roth, S., Kuijper, A., Schiele, B., Schindler, K. (eds) Pattern Recognition. DAGM 2010. Lecture Notes in Computer Science, vol 6376. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15986-2_26
Print ISBN: 978-3-642-15985-5
Online ISBN: 978-3-642-15986-2