Abstract
This paper discusses the concepts of interpretability and explainability and outlines desiderata for robust interpretability. It then describes a neural network model that meets all of these criteria, with the addition of global faithfulness.
This is achieved by efficient estimation of a General Additive Neural Network seeded by a conventional Multilayer Perceptron (MLP): the dependence on individual variables and on pairwise interactions is distilled from the MLP so that these effects can be represented within the structure of a General Additive Model. This makes the logic of the model clear and transparent to users across the complete input space: the model is self-explaining.
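A minimal sketch of this two-stage distillation is given below, under stated assumptions: the reference point, placeholder hyperparameters, and the use of L1-penalised logistic regression for term selection are choices made here for illustration, and pairwise interaction terms are omitted for brevity.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier
from sklearn.linear_model import LogisticRegression

def univariate_partial_response(mlp, X, j, ref):
    """Logit of the trained MLP when only feature j varies, with the
    remaining inputs clamped at the reference point."""
    Xj = np.tile(ref, (X.shape[0], 1))
    Xj[:, j] = X[:, j]
    p = np.clip(mlp.predict_proba(Xj)[:, 1], 1e-6, 1 - 1e-6)
    return np.log(p / (1 - p))  # partial response on the logit scale

# Stage 1: fit a conventional MLP on standardised inputs X (n x d) and
# binary labels y (hyperparameters here are placeholders).
mlp = MLPClassifier(hidden_layer_sizes=(10,), max_iter=2000).fit(X, y)

# Stage 2: distil the univariate effects into additive basis functions.
ref = np.median(X, axis=0)  # reference point: an assumption in this sketch
Phi = np.column_stack([univariate_partial_response(mlp, X, j, ref)
                       for j in range(X.shape[1])])

# Stage 3: sparse (L1) logistic regression over the partial responses keeps
# only the terms that matter, yielding a self-explaining additive model.
gam = LogisticRegression(penalty="l1", solver="liblinear").fit(Phi, y)
selected = np.flatnonzero(gam.coef_[0])  # indices of retained features
```

Because the final model is a sum of one-dimensional (and, in the full method, two-dimensional) response functions, each term can be plotted and inspected directly, which is what makes the logic transparent across the whole input space.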
The modelling approach used in this paper derives the partial responses from the MLP, resulting in the Partial Response Network (PRN). Its application is illustrated in a medical context using the CTU-UHB Cardiotocography intrapartum database (n = 552) to infer the features associated with caesarean deliveries. This is the first application of the PRN to this data set, and it is shown that the self-explaining model achieves discrimination performance comparable to that of Random Forests previously applied to the same data set. The classes are highly imbalanced, with a prevalence of caesarean sections of 8.33%. The resulting model uses 4 of the 8 possible features and has an AUROC of 0.69 [CI 0.60, 0.77] estimated by 4-fold cross-validation. Its performance and features are also compared with those of a Sparse Additive Model (SAM), which has an AUROC of 0.72 [CI 0.64, 0.80]; this difference is not statistically significant, but the SAM requires all of the features.
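The evaluation protocol stated here can be sketched as follows, continuing from the variables X and y above. The use of stratified folds and a percentile bootstrap for the confidence interval are assumptions; the abstract does not state how the intervals were computed.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold, cross_val_predict
from sklearn.neural_network import MLPClassifier
from sklearn.metrics import roc_auc_score

# Pooled out-of-fold probabilities from 4-fold cross-validation.
cv = StratifiedKFold(n_splits=4, shuffle=True, random_state=0)
clf = MLPClassifier(hidden_layer_sizes=(10,), max_iter=2000)
oof = cross_val_predict(clf, X, y, cv=cv, method="predict_proba")[:, 1]
print("AUROC:", roc_auc_score(y, oof))

# Percentile bootstrap over the pooled out-of-fold predictions (assumed).
rng = np.random.default_rng(0)
aucs = []
for _ in range(1000):
    idx = rng.integers(0, len(y), len(y))
    if len(np.unique(y[idx])) == 2:   # resample must contain both classes
        aucs.append(roc_auc_score(y[idx], oof[idx]))
print("95% CI:", np.percentile(aucs, [2.5, 97.5]))
```

Stratification matters here because, with a prevalence of 8.33%, an unstratified fold of n = 552 could contain very few positive cases.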
For clinical utility by risk stratification, the odds ratio for caesarean section vs. not at the prevalence threshold is 3.97 for the PRN, compared with 3.14 for the SAM. When compared for consistency, parsimony, stability and scalability, the two models have complementary properties.
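A sketch of how this risk-stratification metric can be computed, continuing from y and oof above; the Haldane correction of 0.5 is an assumption added here to guard against empty cells.

```python
import numpy as np

def odds_ratio_at_prevalence(y_true, p_hat):
    """Odds ratio for the outcome when predicted risk is dichotomised
    at the class prevalence (8.33% in this data set)."""
    thr = y_true.mean()                      # prevalence threshold
    high = p_hat >= thr
    a = np.sum(high & (y_true == 1)) + 0.5   # +0.5: Haldane correction,
    b = np.sum(high & (y_true == 0)) + 0.5   # an assumption to avoid
    c = np.sum(~high & (y_true == 1)) + 0.5  # division by zero
    d = np.sum(~high & (y_true == 0)) + 0.5
    return (a * d) / (b * c)

print("OR at prevalence threshold:", odds_ratio_at_prevalence(y, oof))
```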
Cite this paper
Lisboa, P.J.G., Ortega-Martorell, S., Jayabalan, M., Olier, I. (2020). Efficient Estimation of General Additive Neural Networks: A Case Study for CTG Data. In: Koprinska, I., et al. ECML PKDD 2020 Workshops. ECML PKDD 2020. Communications in Computer and Information Science, vol 1323. Springer, Cham. https://doi.org/10.1007/978-3-030-65965-3_29