Skip to main content
Log in

Group penalized logistic regression differentiates between benign and malignant ovarian tumors

  • Data analytics and machine learning
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Ovarian cancer is one of the most common types of cancer in women. Correct differentiation between benign and malignant ovarian tumors is of immense importance in medical fields. In this paper, we introduce group penalized logistic regressions to enhance diagnosis accuracy. Firstly, we divide 349 ovarian cancer patients into two sets: one for learning model parameters, and the other for assessing prediction performance, and select 46 variables from 49 traits as the predictor vector to construct GLASSO/GSCAD/GMCP penalized logistic regressions with 11 groups. Secondly, we develop group coordinate descent (GCD) algorithm and its specific pseudo code to simultaneously complete group selection and group estimation, introduce the tenfold cross validation (CV) procedure to select the relatively optimal tuning parameter, and apply the testing set and Youden index to obtain class probability estimator and class index information. Finally, we compute the accuracy, precision, specificity, sensitivity, F1-score and the area under ROC curve (AUC) to assess the prediction performance to the proposed group penalized methods, and found that GLASSO/GSCAD/GMCP penalized logistic regressions outperform three machine learning methods (ANN, artificial neural network; SVM, support vector machine; XGBoost, eXtreme gradient boosting) and three deep learning methods (CNN, convolutional neural network; DNN, deep neural network; RNN, recurrent neural network) in terms of accuracy, F1-score and AUC.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Data availability

The data sets used in this study are available from https://www.kaggle.com/saurabhshahane/predict-ovarian-cancer.

References

Download references

Funding

Hu’s research was supported by the Fifth Batch of Excellent Talent Support Program of Chongqing Colleges and University (68021900601), the Natural Science Foundation of CQ CSTC (cstc.2018jcyjA2073), the Program for the Chongqing Statistics Postgraduate Supervisor Team (yds183002), Chongqing Social Science Plan Project (2019WT59), Science and Technology Research Program of Chongqing Education Commission (KJZD-M202100801), Mathematic and Statistics Team from Chongqing Technology and Business University (ZDPTTD201906) and Open Project from Chongqing Key Laboratory of Social Economy and Applied Statistics (KFJJ2022056).

Author information

Authors and Affiliations

Authors

Contributions

XH provided the basic idea, improved the initial writing, and completed the main revisions. YX collected data, provided the figures and the tables, and completed the initial writing and the part revisions. YY and HJ took part in the program writing for the original version.

Corresponding author

Correspondence to Ying Xie.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethic approval

This is an observational study and does not require ethics approval.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hu, X., Xie, Y., Yang, Y. et al. Group penalized logistic regression differentiates between benign and malignant ovarian tumors. Soft Comput 27, 18565–18584 (2023). https://doi.org/10.1007/s00500-023-09231-4

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-023-09231-4

Keywords

Navigation