Skip to main content
Log in

Size-intensive descriptors

  • Published:
Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Abstract

Non-linear effects in multi-linear structure–property (QSPR) models are sometimes included by using descriptors transformed by mathematical functions such as the square root or logarithm. Less commonly, products of two descriptors are used to account for cross dependencies. As described here, simple division of descriptors by chemical sample size (e.g. molecular weight, length, area or volume) creates size-intensive descriptors (alternatively, intrinsic descriptors) that are independent of the size of the chemical sample described, weakly correlated with the original descriptor, and important contributors to the best QSPR models. In our automated QSPRs, size-intensive descriptors in competition with their extensive descriptors are frequently selected as the best descriptors in the models with the highest r 2. Examples of QSPR models that use size-intensive descriptors are given, the lack of correlation of descriptors with their size-intensive version is demonstrated, and their physical significance is discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. A trivial example of a useless size-intensive descriptor is molecular weight divided by molecular weight which is intensive, but also devoid of all information.

References

  1. Todeschini R, Consonni V (2000) Handbook of molecular descriptors. Wiley-VCH

  2. Karleson M (2000) Molecular descriptors in QSAR/QSPR. Wiley-Interscience, New York

    Google Scholar 

  3. Stanton DT, Jurs PC (1990) Development and use of charged partial surface area structural descriptors in computer-assisted quantitative structure-property relationship studies. Anal Chem 62:2323–2329

    Article  CAS  Google Scholar 

  4. Kier LB (1986) Molecular connectivity in structure-activity analysis. Wiley, New York

    Google Scholar 

  5. Kier LB, Hall L (1976) Molecular connectivity in chemistry and drug research. Academic Press, New York

    Google Scholar 

  6. Kier LB (1999) Molecular structure description: the electrotopological state. Academic Press, New York

    Google Scholar 

  7. Stuper AJ, Jurs PC (1976) J Chem Inf Comput Sci 2:99

    Article  Google Scholar 

  8. Jurs PC, Chou JT, Yuan M (1979) In: Olson RC, Christoffersen RE (eds) Computer-assisted drug design. American Chemical Society, Washington, DC, pp 103–129

    Google Scholar 

  9. CODESSA, www.codessa-pro.com/descriptors/. Accessed 13 Aug 2007

  10. Purvis GD III (1994) The chemical sample: a fundamental object for molecular modeling. J Chem Inf Comput Sci 34:17–21

    CAS  Google Scholar 

  11. Katritzky AR, Lobanov VS, Karelson M (1998) Normal boiling points for organic compounds: correlation and prediction by a quantitative structure-property relationship. J Chem Inf Comput Sci 38:28–41

    Article  CAS  Google Scholar 

  12. Katritzky AR, Mu L, Lobanov VS, Karelson M (1996) Correlation of boiling points with molecular structure. 1. A training set of 298 diverse organics and test set of 9 simple inorganics. J Phys Chem 100:10400–10407

    Article  CAS  Google Scholar 

  13. Bartlett RJ, Purvis GD (1978) Int J Quantum Chem 14:561

    Article  CAS  Google Scholar 

  14. Bartlett RJ, Purvis GD III (1980) Molecular applications of coupled cluster and many-body perturbation methods. Phys Scr 21:255–265

    Article  CAS  Google Scholar 

  15. Wessel MD, Jurs PC, Tolan JW, Muskal SM (1998) Prediction of human intestinal absorption of drug compounds from molecular structure. J Chem Inf Comput Sci 38:726–735

    Article  CAS  Google Scholar 

  16. Hou TJ, Xu XJ (2003) ADME: evaluation in drug discovery. 3. modeling blood-brain barrier partitioning using simple molecular descriptors. J Chem Inf Comput Sci 43:2137–2152

    Article  CAS  Google Scholar 

  17. CAChe Worksystem Pro 6.1, Fujitsu Computer Systems, Beaverton, OR, 97007, (2007)

  18. Randic M (1975) J Am Chem Soc 97:6606–6615

    Article  Google Scholar 

  19. Guha R (2007) Chemical informatics functionality in R. J Stat Softw 18(5), supplemental material, http://www.jstatsoft.org/. Accessed 20 July 2007

  20. Stanton DT, Jurs PC, Hicks MG (1991) Computer-assisted prediction of normal boiling points of furans, tetrahydrofurans, and thophenes. J Chem Inf Comput Sci 31:301–310

    CAS  Google Scholar 

  21. Egolf LM, Wessel MD, Jurs PC (1994) Prediction of boiling points and critical temperatures of industrially important organic compounds from molecular structure. J Chem Inf Comput Sci 34:947–956

    Article  CAS  Google Scholar 

  22. Goll E, Jurs P (1999) Prediction of the normal boiling points of organic compounds from molecular structures with a computational neural network model. J Chem Inf Comput Sci 39(6):974–983

    Article  CAS  Google Scholar 

  23. Lowell H Hall LH, Story CT (1996) Boiling point and critical temperature of a heterogeneous data set: QSAR with atom type electrotopological state indices using artificial neural networks. J Chem Inf Comput Sci 36:1004–1014

    Google Scholar 

  24. Liang C, Gallagher D (1997) Prediction of physical & chemical properties by quantitative structure-property relationships: water solubility prediction. Am Lab March:34–40

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to George D. Purvis III.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Purvis, G.D. Size-intensive descriptors. J Comput Aided Mol Des 22, 461–468 (2008). https://doi.org/10.1007/s10822-008-9209-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10822-008-9209-4

Keywords