A Novel Descriptor and Molecular Graph-Based Bimodal Contrastive Learning Framework for Drug Molecular Property Prediction

He, Zhengda; Chen, Linjie; Lv, Hao; Zhou, Rui-ning; Xu, Jiaying; Chen, Yadong; Hu, Jianhua; Gao, Yang

doi:10.1007/978-981-99-4749-2_60

Zhengda He^13,14,
Linjie Chen¹⁴,
Hao Lv¹⁴,
Rui-ning Zhou¹⁴,
Jiaying Xu¹⁴,
Yadong Chen¹⁴,
Jianhua Hu¹⁴ &
…
Yang Gao¹³

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14088))

Included in the following conference series:

International Conference on Intelligent Computing

1310 Accesses

Abstract

In AI drug discovery, molecular property prediction is critical. Two main molecular representation methods in molecular property prediction models, descriptor-based and molecular graph-based, offer complementary information, but face challenges like representation conflicts and training imbalances when combined. To counter these issues, we propose a two-stage training process. The first stage employs a self-supervised contrastive learning scheme based on descriptors and graph representations, which pre-trains the encoders for the two modal representations, reducing bimodal feature conflicts and promoting representational consistency. In the second stage, supervised learning using target attribute labels is applied. Here, we design a multi-branch predictor architecture to address training imbalances and facilitate decision fusion. Our method, compatible with various graph neural network modules, has shown superior performance on most of the six tested datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Rajpurkar, P., Chen, E., Banerjee, O., et al.: AI in health and medicine. Nat. Med. 28(1), 31–38 (2022)
Article Google Scholar
Rabaan, A.A., Alhumaid, S., Mutair, A.A., et al.: Application of artificial intelligence in combating high antimicrobial resistance rates. Antibiotics 11(6), 784 (2022)
Article Google Scholar
Fang, X., Liu, L., Lei, J., et al.: Geometry-enhanced molecular representation learning for property prediction. Nature Mach. Intell. 4(2), 127–134 (2022)
Article Google Scholar
Asada, M., Miwa, M., Sasaki, Y.: Using drug descriptions and molecular structures for drug–drug interaction extraction from literature. Bioinformatics 37(12), 1739–1746 (2021)
Article Google Scholar
Kurotani, A., Kakiuchi, T., Kikuchi, J.: Solubility Prediction from Molecular Properties and Analytical Data Using an In-phase Deep Neural Network (Ip-DNN), ACS omega (2021)
Google Scholar
Alves, A.H.R., Cerri, R.: A two-step model for drug-target interaction prediction with predictive bi-clustering trees and XGBoost. In: 2022 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2022)
Google Scholar
Wei, Y., Li, S., Li, Z., et al.: Interpretable-ADMET: a web service for ADMET prediction and optimization based on deep neural representation. Bioinformatics 38(10), 2863–2871 (2022)
Article Google Scholar
Wieder, O., et al.: A compact review of molecular property prediction with graph neural networks, Drug Discovery Today: Technologies (2020)
Google Scholar
Rong, Y., Bian, Y., Xu, T., et al.: Self-supervised graph transformer on large-scale molecular data. Adv. Neural. Inf. Process. Syst. 33, 12559–12571 (2020)
Google Scholar
Lovrić, M., Molero, J.M., Kern, R.: PySpark and RDKit: moving towards big data in cheminformatics. Mol. Inf. 38(6), 1800082 (2019)
Article Google Scholar
Yap, C.W.: PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J. Comput. Chem. 32(7), 1466–1474 (2011)
Article MathSciNet Google Scholar
Abu-Dief, A.M., El-Metwaly, N.M., Alzahrani, S.O., et al.: Structural, conformational and therapeutic studies on new thiazole complexes: drug-likeness and MOE-simulation assessments. Res. Chem. Intermediates 47, 1979–2002 (2021)
Google Scholar
Li, Z., Liu, F., Yang, W., et al.: A survey of convolutional neural networks: analysis, applications, and prospects. IEEE Trans. Neural Networks Learn. Syst. (2021)
Google Scholar
Busbridge, D., Sherburn, D., Cavallo, P., Hammerla, N.Y.: Relational graph attention networks, arXiv preprint arXiv:1904.05811 (2019)
Xiong, Z., et al.: Pushing the boundaries of molecular representation for drug discovery with the graph attention mechanism. J. Med. Chem. 63(16), 8749–8760 (2019)
Article Google Scholar
Chithrananda, S., Grand, G., Ramsundar, B.: Chemberta: large-scale self-supervised pretraining for molecular property prediction, arXiv preprint arXiv:2010.09885 (2020)
Hu, W., Liu, B., Gomes, J., et al.: Strategies for pre-training graph neural networks. In: International Conference on Learning Representations (ICLR) (2020)
Google Scholar
Li, P., et al.: Learn molecular representations from large-scale unlabeled molecules for drug discovery, arXiv preprint arXiv:2012.11175 (2020)
Jiang, D., et al.: Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models. J. Cheminform. 13(1), 1–23 (2021)
MathSciNet Google Scholar
Bai, P., Miljković, F., John, B., et al.: Interpretable bilinear attention network with domain adaptation improves drug–target prediction. Nature Mach. Intell., 1–11 (2023)
Google Scholar
Liu, S., Demirel, M.F., Liang, Y.: N-gram graph: Simple unsupervised representation for graphs, with applications to molecules. Advances in neural information processing systems, 32 (2019)
Google Scholar
Honda, S., Shi, S., Ueda, H.R.: Smiles transformer: Pre-trained molecular fingerprint for low data drug discovery, arXiv preprint arXiv:1911.04738 (2019)
He, K., Fan, H., Wu, Y., et al.: Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9729–9738 (2020)
Google Scholar
Wang, Y., Wang, J., Cao, Z., et al.: Molecular contrastive learning of representations via graph neural networks. Nature Mach. Intell. 4(3), 279–287 (2022)
Article Google Scholar
Yang, K., et al.: Analyzing learned molecular representations for property prediction. J. Chem. Inf. Model. 59(8), 3370–3388 (2019)
Article Google Scholar
Rahaman, O., Gagliardi, A.: Deep learning total energies and orbital energies of large organic molecules using hybridization of molecular fingerprints. J. Chem. Inf. Model. 60(12), 5971–5983 (2020)
Article Google Scholar

Download references

Acknowledgements

Supported by grants from the National Natural Science Foundation of China (No. 81973182); National Science Foundation of China (No. 61806092); Jiangsu Natural Science Foundation (No. BK20180326); “Double First-Class” University project from China Pharmaceutical University (Program No. CPU2018GF02).

Author information

Authors and Affiliations

Nanjing University, Nanjing, Jiangsu, China
Zhengda He & Yang Gao
China Pharmaceutical University, Nanjing, Jiangsu, China
Zhengda He, Linjie Chen, Hao Lv, Rui-ning Zhou, Jiaying Xu, Yadong Chen & Jianhua Hu

Authors

Zhengda He
View author publications
You can also search for this author in PubMed Google Scholar
Linjie Chen
View author publications
You can also search for this author in PubMed Google Scholar
Hao Lv
View author publications
You can also search for this author in PubMed Google Scholar
Rui-ning Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Jiaying Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yadong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jianhua Hu
View author publications
You can also search for this author in PubMed Google Scholar
Yang Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yang Gao .

Editor information

Editors and Affiliations

Department of Computer Science, Eastern Institute of Technology, Zhejiang, China
De-Shuang Huang
University of Wollongong, North Wollongong, NSW, Australia
Prashan Premaratne
Zhengzhou University of Light Industry, Zhengzhou, China
Baohua Jin
Zhong Yuan University of Technology, Zhengzhou, China
Boyang Qu
University of Ulsan, Ulsan, Korea (Republic of)
Kang-Hyun Jo
Department of Computer Science, Liverpool John Moores University, Liverpool, UK
Abir Hussain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

He, Z. et al. (2023). A Novel Descriptor and Molecular Graph-Based Bimodal Contrastive Learning Framework for Drug Molecular Property Prediction. In: Huang, DS., Premaratne, P., Jin, B., Qu, B., Jo, KH., Hussain, A. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2023. Lecture Notes in Computer Science, vol 14088. Springer, Singapore. https://doi.org/10.1007/978-981-99-4749-2_60

Download citation

DOI: https://doi.org/10.1007/978-981-99-4749-2_60
Published: 30 July 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-4748-5
Online ISBN: 978-981-99-4749-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics