Skip to main content

Principal Sample Analysis for Data Ranking

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11489))

Abstract

Because of the ever growing amounts of data, challenges have appeared for storage and processing, making data reduction still an important field of study. Numerosity reduction or prototype selection is one of the primary methods of data reduction. In this paper, we propose some possible improvements for Principal Sample Analysis (PSA) which is a numerosity reduction algorithm. The improvements are PSA in Hilbert space, improving its time complexity using anchor points, sample size estimation using PAC learning, and PSA for regression and clustering tasks.

B. Ghojogh—Supervised by Dr. Mark Crowley and Dr. Fakhri Karray, Department of Electrical and Computer Engineering, {mcrowley, karray}@uwaterloo.ca.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Garcia, S., Derrac, J., Cano, J., Herrera, F.: Prototype selection for nearest neighbor classification: taxonomy and empirical study. IEEE Trans. Pattern Anal. Mach. Intell. 34(3), 417–435 (2012)

    Article  Google Scholar 

  2. Triguero, I., Derrac, J., Garcia, S., Herrera, F.: A taxonomy and experimental study on prototype generation for nearest neighbor classification. IEEE Trans. Syst. Man Cybern. Part C 42(1), 86–100 (2012)

    Article  Google Scholar 

  3. Ghojogh, B., Crowley, M.: Principal sample analysis for data reduction. In: 2018 IEEE International Conference on Big Knowledge (ICBK), pp. 350–357. IEEE (2018)

    Google Scholar 

  4. Mika, S., Ratsch, G., Weston, J., Scholkopf, B., Mullers, K.R.: Fisher discriminant analysis with kernels. In: Proceedings of the 1999 IEEE signal processing society workshop on Neural networks for signal processing IX, pp. 41–48. IEEE (1999)

    Google Scholar 

  5. Ghojogh, B., Ghojogh, A., Crowley, M., Karray, F.: Fitting a mixture distribution to data: tutorial. arXiv preprint arXiv:1901.06708 (2019)

  6. Vapnik, V.: The Nature of Statistical Learning Theory. Statistics for Engineering and Information Science. Springer, Heidelberg (2000). https://doi.org/10.1007/978-1-4757-3264-1

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Benyamin Ghojogh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ghojogh, B. (2019). Principal Sample Analysis for Data Ranking. In: Meurs, MJ., Rudzicz, F. (eds) Advances in Artificial Intelligence. Canadian AI 2019. Lecture Notes in Computer Science(), vol 11489. Springer, Cham. https://doi.org/10.1007/978-3-030-18305-9_62

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-18305-9_62

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-18304-2

  • Online ISBN: 978-3-030-18305-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics