Improving Random Projections Using Marginal Information

Li, Ping; Hastie, Trevor J.; Church, Kenneth W.

doi:10.1007/11776420_46

Ping Li²⁰,
Trevor J. Hastie²⁰ &
Kenneth W. Church²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4005))

Included in the following conference series:

International Conference on Computational Learning Theory

2936 Accesses

Abstract

We present an improved version of random projections that takes advantage of marginal norms. Using a maximum likelihood estimator (MLE), margin-constrained random projections can improve estimation accuracy considerably. Theoretical properties of this estimator are analyzed in detail.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Random projections for Bayesian regression

Article Open access 19 November 2015

Random Projections with Bayesian Priors

Random Projections for Large-Scale Regression

References

Vempala, S.S.: The Random Projection Method. American Mathematical Society, Providence, RI (2004)
Google Scholar
Arriaga, R., Vempala, S.: An algorithmic theory of learning: Robust concepts and random projection. In: Proc. of FOCS, pp. 616–623 (1999) (Also to appear in Machine Learning)
Google Scholar
Dasgupta, S.: Learning mixtures of gaussians. In: Proc. of FOCS, New York, pp. 634–644 (1999)
Google Scholar
Fradkin, D., Madigan, D.: Experiments with random projections for machine learning. In: Proc. of KDD, Washington, DC, pp. 517–522 (2003)
Google Scholar
Fern, X.Z., Brodley, C.E.: Random projection for high dimensional data clustering: A cluster ensemble approach. In: Proc. of ICML, Washington, DC, pp. 186–193 (2003)
Google Scholar
Balcan, M.-F., Blum, A., Vempala, S.S.: On kernels, margins, and low-dimensional mappings. In: Ben-David, S., Case, J., Maruoka, A. (eds.) ALT 2004. LNCS, vol. 3244, pp. 194–205. Springer, Heidelberg (2004)
Chapter Google Scholar
Papadimitriou, C.H., Raghavan, P., Tamaki, H., Vempala, S.: Latent semantic indexing: A probabilistic analysis. In: Proc. of PODS, Seattle, WA, pp. 159–168 (1998)
Google Scholar
Achlioptas, D., McSherry, F., Schölkopf, B.: Sampling techniques for kernel methods. In: Proc. of NIPS, Vancouver, BC, Canada, pp. 335–342 (2001)
Google Scholar
Bingham, E., Mannila, H.: Random projection in dimensionality reduction: Applications to image and text data. In: Proc. of KDD, San Francisco, CA, pp. 245–250 (2001)
Google Scholar
Charikar, M.S.: Similarity estimation techniques from rounding algorithms. In: Proc. of STOC, Montreal, Quebec, Canada, pp. 380–388 (2002)
Google Scholar
Ravichandran, D., Pantel, P., Hovy, E.: Randomized algorithms and NLP: Using locality sensitive hash function for high speed noun clustering. In: Proc. of ACL, Ann Arbor, MI, pp. 622–629 (2005)
Google Scholar
Liu, K., Kargupta, H., Ryan, J.: Random projection-based multiplicative data perturbation for privacy preserving distributed data mining. IEEE Transactions on Knowledge and Data Engineering 18, 92–106 (2006)
Article Google Scholar
Johnson, W.B., Lindenstrauss, J.: Extensions of Lipschitz mapping into Hilbert space. Contemporary Mathematics 26, 189–206 (1984)
MathSciNet MATH Google Scholar
Indyk, P., Motwani, R.: Approximate nearest neighbors: Towards removing the curse of dimensionality. In: Proc. of STOC, Dallas, TX, pp. 604–613 (1998)
Google Scholar
Achlioptas, D.: Database-friendly random projections: Johnson-Lindenstrauss with binary coins. Journal of Computer and System Sciences 66, 671–687 (2003)
Article MathSciNet MATH Google Scholar
Bartlett, M.S.: Approximate confidence intervals, II. Biometrika 40, 306–317 (1953)
MathSciNet MATH Google Scholar
Small, C.G., Wang, J., Yang, Z.: Eliminating multiple root problems in estimation. Statistical Science 15, 313–341 (2000)
Article MathSciNet Google Scholar
Lehmann, E.L., Romano, J.P.: Testing Statistical Hypothesis, 3rd edn. Springer, New York (2005)
Google Scholar
Li, P., Paul, D., Narasimhan, R., Cioffi, J.: On the distribution of SINR for the MMSE MIMO receiver and performance analysis. IEEE Trans. Inform. Theory 52, 271–286 (2006)
Article MathSciNet Google Scholar
Li, P., Hastie, T.J., Church, K.W.: Margin-constrained random projections and very sparse random projections. Technical report, Department of Statistics, Stanford University (2006)
Google Scholar
Li, P., Church, K.W., Hastie, T.J.: A sketched-based sampling algorithm on sparse data. Technical report, Department of Statistics, Stanford University (2006)
Google Scholar
Shenton, L.R., Bowman, K.: Higher moments of a maximum-likelihood estimate. Journal of Royal Statistical Society B 25, 305–317 (1963)
MathSciNet MATH Google Scholar
Ferrari, S.L.P., Botter, D.A., Cordeiro, G.M., Cribari-Neto, F.: Second and third order bias reduction for one-parameter family models. Stat. and Prob. Letters 30, 339–345 (1996)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Statistics, Stanford University, Stanford, CA, 94305, USA
Ping Li & Trevor J. Hastie
Microsoft Research, One Microsoft Way, Redmond, WA, 98052, USA
Kenneth W. Church

Authors

Ping Li
View author publications
You can also search for this author in PubMed Google Scholar
Trevor J. Hastie
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth W. Church
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ICREA and Department of Economics, Universitat Pompeu Fabra, Ramon Trias Fargas 25-27, 08005, Barcelona, Spain
Gábor Lugosi
Ruhr-Universität Bochum, Germany
Hans Ulrich Simon

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, P., Hastie, T.J., Church, K.W. (2006). Improving Random Projections Using Marginal Information. In: Lugosi, G., Simon, H.U. (eds) Learning Theory. COLT 2006. Lecture Notes in Computer Science(), vol 4005. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11776420_46

Download citation

DOI: https://doi.org/10.1007/11776420_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35294-5
Online ISBN: 978-3-540-35296-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Improving Random Projections Using Marginal Information

Abstract

Access this chapter

Preview

Similar content being viewed by others

Random projections for Bayesian regression

Random Projections with Bayesian Priors

Random Projections for Large-Scale Regression

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Improving Random Projections Using Marginal Information

Abstract

Access this chapter

Preview

Similar content being viewed by others

Random projections for Bayesian regression

Random Projections with Bayesian Priors

Random Projections for Large-Scale Regression

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation