Balancing High-Dimensional Datasets with Complex Layers

Bobrowski, Leon

doi:10.1007/978-3-031-34204-2_6

Leon Bobrowski ORCID: orcid.org/0000-0003-4735-2460^21,22

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1826))

Included in the following conference series:

International Conference on Engineering Applications of Neural Networks

929 Accesses

Abstract

Unbalanced datasets generate difficulties in designing good classification models because those classes that are represented by the most numerous training sets are harmfully preferred. For this reason, learning sets are often balanced by adding some synthetic feature vectors or by reducing the most numerous learning sets.

High-dimensional learning sets give possibility to design complex layer of linear classifiers. Such layers can also be used for balancing purposes. In this approach, averaging of a small number of feature vectors is partially complemented by averaging vertices based on balanced feature subsets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Feature Selection with Class Hierarchy for Imbalance Problems

Imbalanced Data Classification Based on Feature Selection Techniques

Classification on Imbalanced Data Sets, Taking Advantage of Errors to Improve Performance

References

Duda, O.R., Hart, P.E., Stork, D.G.: Pattern Classification. J. Wiley, New York (2001)
MATH Google Scholar
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Heidelberg (2006)
Google Scholar
Chawla, N.V., et al.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Article MATH Google Scholar
Bobrowski, L.: complex layers of formal neurons. In: Engineering Applications of Neural Networks - EAAAI/EANN 2022, EANN 2022, pp. 81–89. Springer, Heidelberg (2022)
Google Scholar
Bobrowski, L.: Data Exploration and Linear Separability, pp. 1–172. Lambert Academic Publishing, Saarbrücken 2019
Google Scholar
Boser, B.E., Guyon, I., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop of Computational Learning Theory, vol. 5, pp. 144–152. ACM, Pittsburgh (1992)
Google Scholar
Bobrowski, L.: Computing on vertices in data mining, pp. 1–19. Data mining, Intech Open (2021)
Google Scholar
Bobrowski, L., Łukaszuk, T.: Relaxed linear separability (RLS) approach to feature (gene) subset selection. In: Xia, X (ed.) Selected Works in Bioinformatics, INTECH, pp. 103–118 (2011)
Google Scholar
Bobrowski, L., Zabielski, P.: Classification model with collinear grouping of features. J. Inf. Telecommun. 7(1), 73–88 (2023)
Google Scholar

Download references

Acknowledgments

The presented study was supported by the grant WZ/WI-IIT/4/2023 from the Bialystok University of Technology and funded from the resources for research by the Polish Ministry of Science and Higher Education.

Author information

Authors and Affiliations

Faculty of Computer Science, Bialystok University of Technology, Wiejska 45A, Bialystok, Poland
Leon Bobrowski
Institute of Biocybernetics and Biomedical Engineering, PAS, Warsaw, Poland
Leon Bobrowski

Authors

Leon Bobrowski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Leon Bobrowski .

Editor information

Editors and Affiliations

Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
University of Piraeus, Piraeus, Greece
Ilias Maglogiannis
University of Leon, León, Spain
Serafin Alonso
Teesside University, Middlesbrough, UK
Chrisina Jayne
University of the West of England, Bristol, UK
Elias Pimenidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bobrowski, L. (2023). Balancing High-Dimensional Datasets with Complex Layers. In: Iliadis, L., Maglogiannis, I., Alonso, S., Jayne, C., Pimenidis, E. (eds) Engineering Applications of Neural Networks. EANN 2023. Communications in Computer and Information Science, vol 1826. Springer, Cham. https://doi.org/10.1007/978-3-031-34204-2_6

Download citation

DOI: https://doi.org/10.1007/978-3-031-34204-2_6
Published: 07 June 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34203-5
Online ISBN: 978-3-031-34204-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics