loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Eva K. Lee 1 ; 2 and Brent Egan 3 ; 4

Affiliations: 1 Center for Operations Research in Medicine and Healthcare, The Data and Analytic Innovation Institute, Atlanta, U.S.A. ; 2 Georgia Institute of Technology, Atlanta, U.S.A. ; 3 American Medical Association, Chicago, U.S.A. ; 4 Medical University of South Carolina, U.S.A.

Keyword(s): Multi-stage Classification, Multi-group Classification, Imbalanced Data, Discriminant Analysis via Mixed Integer Program, Cardiovascular Disease, Multi-Site Knowledge Discovery, Best Practice, Alzheimer’s Disease, Mild Cognitive Impairment, Early Diagnosis of Dementia, Machine Learning for Evidence-based Practice, Branch-and-Bound.

Abstract: We present a multi-stage, multi-group classification framework that incorporates discriminant analysis via mixed integer programming (DAMIP) with an exact combinatorial branch-and-bound (BB) algorithm and a fast particle swarm optimization (PSO) for feature selection for classification. By utilizing a reserved judgment region, DAMIP allows the classifier to delay making decisions on ‘difficult-to-classify’ observations and develop new classification rules in a later stage. Such a design works well for mixed (poorly separated) data that are difficult to classify without committing a high percentage of misclassification errors. We also establish variant DAMIP models that enable problem-specific fine tuning to establish proper misclassification limits and reserved judgement levels that facilitate efficient management of imbalanced groups. This ensures that minority groups with relatively few entities are treated equally as the majority groups. We apply the framework to two real-life med ical problems: (a) multi-site treatment outcome prediction for best practice discovery in cardiovascular disease, and (b) early disease diagnosis in predicting subjects into normal cognition, mild cognitive impairment, and Alzheimer’s disease groups using neuropsychological tests and blood plasma biomarkers. Both problems involve poorly separated data and imbalanced groups in which traditional classifiers yield low prediction accuracy. The multi-stage BB-PSO/DAMIP manages the poorly separable imbalanced data well and returns interpretable predictive results with over 80% blind prediction accuracy. Mathematically, DAMIP is NP-complete with its classifier proven to be universally strongly consistent. Hence, DAMIP has desirable solution characteristics for machine learning purposes. Computationally, DAMIP is the first multi-group, multi-stage classification model that simultaneously includes a reserved judgment capability and the ability to constrain misclassification rates within a single model. The formulation includes constraints that transform the features from their original space to the group space, serving as a dimension reduction mechanism. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.116.36.192

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Lee, E. and Egan, B. (2022). A Multi-stage Multi-group Classification Model: Applications to Knowledge Discovery for Evidence-based Patient-centered Care. In Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022) - KDIR; ISBN 978-989-758-614-9; ISSN 2184-3228, SciTePress, pages 95-108. DOI: 10.5220/0011557200003335

@conference{kdir22,
author={Eva K. Lee. and Brent Egan.},
title={A Multi-stage Multi-group Classification Model: Applications to Knowledge Discovery for Evidence-based Patient-centered Care},
booktitle={Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022) - KDIR},
year={2022},
pages={95-108},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011557200003335},
isbn={978-989-758-614-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022) - KDIR
TI - A Multi-stage Multi-group Classification Model: Applications to Knowledge Discovery for Evidence-based Patient-centered Care
SN - 978-989-758-614-9
IS - 2184-3228
AU - Lee, E.
AU - Egan, B.
PY - 2022
SP - 95
EP - 108
DO - 10.5220/0011557200003335
PB - SciTePress