Skip to main content

Fitting Aggregation Functions to Data: Part II - Idempotization

  • Conference paper
  • First Online:
Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2016)

Abstract

The use of supervised learning techniques for fitting weights and/or generator functions of weighted quasi-arithmetic means – a special class of idempotent and nondecreasing aggregation functions – to empirical data has already been considered in a number of papers. Nevertheless, there are still some important issues that have not been discussed in the literature yet. In the second part of this two-part contribution we deal with a quite common situation in which we have inputs coming from different sources, describing a similar phenomenon, but which have not been properly normalized. In such a case, idempotent and nondecreasing functions cannot be used to aggregate them unless proper pre-processing is performed. The proposed idempotization method, based on the notion of B-splines, allows for an automatic calibration of independent variables. The introduced technique is applied in an R source code plagiarism detection system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bartoszuk, M., Beliakov, G., Gagolewski, M., James, S.: Fitting aggregation functions to data: part I - linearization and regularization. In: Carvalho, J.P., Lesot, M.-J., Kaymak, U., Vieira, S., Bouchon-Meunier, B., Yager, R.R. (eds.) IPMU 2016, Part II. CCIS, vol. 611, pp. 767–779. Springer, Heidelberg (2016)

    Google Scholar 

  2. Bartoszuk, M., Gagolewski, M.: A fuzzy R code similarity detection algorithm. In: Laurent, A., Strauss, O., Bouchon-Meunier, B., Yager, R.R. (eds.) IPMU 2014, Part III. CCIS, vol. 444, pp. 21–30. Springer, Heidelberg (2014)

    Google Scholar 

  3. Bartoszuk, M., Gagolewski, M.: Detecting similarity of R functions via a fusion of multiple heuristic methods. In: Alonso, J., Bustince, H., Reformat, M. (eds.) Proceedings of IFSA/Eusflat 2015, pp. 419–426. Atlantic Press (2015)

    Google Scholar 

  4. Beliakov, G.: Monotone approximation of aggregation operators using least squares splines. Int. J. Uncertainty Fuzziness Knowl. Based Syst. 10, 659–676 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  5. Beliakov, G.: How to build aggregation operators from data. Int. J. Intell. Syst. 18, 903–923 (2003)

    Article  MATH  Google Scholar 

  6. Beliakov, G.: Learning weights in the generalized OWA operators. Fuzzy Optim. Decis. Making 4, 119–130 (2005)

    Article  MathSciNet  MATH  Google Scholar 

  7. Beliakov, G.: Construction of aggregation functions from data using linear programming. Fuzzy Sets Syst. 160, 65–75 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  8. Beliakov, G., Bustince, H., Calvo, T.: A Practical Guide to Averaging Functions. STUDFUZZ, vol. 329. Springer, Heidelberg (2016)

    Google Scholar 

  9. Beliakov, G., Pradera, A., Calvo, T.: Aggregation Functions: A Guide for Practitioners. STUDFUZZ, vol. 221. Springer, Heidelberg (2007)

    MATH  Google Scholar 

  10. Beliakov, G., Warren, J.: Appropriate choice of aggregation operators in fuzzy decision support systems. IEEE Trans. Fuzzy Syst. 9(6), 773–784 (2001)

    Article  Google Scholar 

  11. Gagolewski, M.: Data Fusion: Theory, Methods, and Applications. Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland (2015)

    Google Scholar 

  12. Hansen, N.: The CMA evolution strategy: a comparing review. In: Lozano, J., Larranga, P., Inza, I., Bengoetxea, E. (eds.) Towards a New Evolutionary Computation. STUDFUZZ, vol. 192, pp. 75–102. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  13. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York (2013)

    MATH  Google Scholar 

  14. Mesiar, R., Mesiarová-Zemánková, A.: The ordered modular averages. IEEE Trans. Fuzzy Syst. 19(1), 42–50 (2011)

    Article  Google Scholar 

  15. R Development Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2016). http://www.R-project.org

  16. Schumaker, L.: Spline Functions: Basic Theory. Cambridge University Press, Cambridge (2007)

    Book  MATH  Google Scholar 

Download references

Acknowledgments

This study was supported by the National Science Center, Poland, research project 2014/13/D/HS4/01700.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marek Gagolewski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Bartoszuk, M., Beliakov, G., Gagolewski, M., James, S. (2016). Fitting Aggregation Functions to Data: Part II - Idempotization. In: Carvalho, J., Lesot, MJ., Kaymak, U., Vieira, S., Bouchon-Meunier, B., Yager, R. (eds) Information Processing and Management of Uncertainty in Knowledge-Based Systems. IPMU 2016. Communications in Computer and Information Science, vol 611. Springer, Cham. https://doi.org/10.1007/978-3-319-40581-0_63

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-40581-0_63

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-40580-3

  • Online ISBN: 978-3-319-40581-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics