MTASpec software for calculating the vibrational IR and Raman spectra of large molecules at ab initio level,☆☆

https://doi.org/10.1016/j.cpc.2021.108175Get rights and content

Abstract

Fragmentation-based (FB) methods have been developed for enabling ab initio calculations on large molecules and clusters. However, a well-benchmarked FB-based utility, for calculating the vibrational spectra is not available. The present article reports a software package MTASpec, based on the FB-molecular tailoring approach (MTA), for computing the single point energy followed by vibrational IR and Raman spectra for spatially extended molecular systems. Accuracy and efficiency of MTASpec are assessed vis-à-vis their full calculation counterparts for some medium- to large-sized molecular systems using HF, DFT and MP2 theory employing large basis sets. The code is fully automated for use on linux platform, with the Gaussian suite of software at the back-end. It is envisaged that the MTASpec package would enable spectral studies of molecular systems containing ∼100 atoms and/or ∼10000 basis functions employing correlated theories with computational economy.

Program summary

Program title: MTASpec

CPC Library link to program files: https://doi.org/10.17632/m5b5zhxkfh.1

Licensing provisions: GPLv3

Programming language: Python, Fortran, C and Csh

External routines/libraries: Gaussian software [1]

Nature of problem: Single point energy, harmonic IR and Raman spectra computation for spatially extended molecular systems.

Solution method: The Fragment-based method, molecular tailoring approach (MTA) is employed for enabling economic and efficient ab initio computation of the vibrational spectra of large molecules and clusters.

References

  • [1]

    Gaussian 16, Revision A.03, M.J. Frisch, G.W. Trucks, H.B. Schlegel, G.E. Scuseria, M.A. Robb, J.R. Cheeseman, G. Scalmani, V. Barone, G.A. Petersson, H. Nakatsuji, X. Li, M. Caricato, A.V. Marenich, J. Bloino, B.G. Janesko, R. Gomperts, B. Mennucci, H.P. Hratchian, J.V. Ortiz, A.F. Izmaylov, J.L. Sonnenberg, D. Williams-Young, F. Ding, F. Lipparini, F. Egidi, J. Goings, B. Peng, A. Petrone, T. Henderson, D. Ranasinghe, V.G. Zakrzewski, J. Gao, N. Rega, G. Zheng, W. Liang, M. Hada, M. Ehara, K. Toyota, R. Fukuda, J. Hasegawa, M. Ishida, T. Nakajima, Y. Honda, O. Kitao, H. Nakai, T. Vreven, K. Throssell, J.A. Montgomery, Jr., J.E. Peralta, F. Ogliaro, M.J. Bearpark, J.J. Heyd, E.N. Brothers, K.N. Kudin, V.N. Staroverov, T.A. Keith, R. Kobayashi, J. Normand, K. Raghavachari, A.P. Rendell, J.C. Burant, S.S. Iyengar, J. Tomasi, M. Cossi, J.M. Millam, M. Klene, C. Adamo, R. Cammi, J.W. Ochterski, R.L. Martin, K. Morokuma, O. Farkas, J.B. Foresman, and D.J. Fox, Gaussian, Inc., Wallingford CT, 2016.

Introduction

Chemistry deals with the exploration of structures, properties and reactions of molecules. This involves the calculation and characterization of structures and relative energies not only of stable forms, but also of other stationary points on the potential energy surface, corresponding to reactive intermediates and transition structures. However, due to the required intricate experimental setup and/or rapid chemical phenomena, experimental spectroscopic scrutiny of molecules may be prohibitively difficult [1]. On the other hand, with the advent of powerful computational hardware and software, quantum chemical (QC) theories have turned out to be a major source of obtaining detailed information on molecular structure and reactivity. Apart from the calculation of energies, such high-level QC investigations offer valuable information on the vibrational frequencies and spectra of molecules.

Pursuing such ab initio-level QC investigations, however, requires large computational resources [2]. The computational cost of a QC method is determined by the level of theory and number of basis functions, N. The Hartree-Fock (HF) and Density Functional Theory (DFT) computations typically scale O(N3 or N4) whereas correlated theories e.g. Møller-Plesset second- order perturbation theory (MP2) and coupled-cluster singles and doubles with perturbative triples viz. CCSD(T) scale as O(N5) and O(N7) respectively [3], [4]. With this in view, computing molecular properties employing correlated ab initio methods indeed proves to be a non-trivial task for large molecules. Furthermore, exploring spectral features of large molecular systems employing off-the-shelf hardware is generally not feasible [5].

To overcome the computational power requirements and to cut down the scaling complexities of the QC methods, several fragmentation-based [FB] methods have been proposed. These were inspired by the work of Christoffersen et al. in the early 1970s [6] followed by the studies by the groups of Yang and Gadre in the 1990s [7], [8]. These methods are summarized in References [9], [10], [11]. The central theme of all FB methods is to divide the parent system into subsystems such that the latter can be readily treated computationally, although the details of fragmentation differ. It is noteworthy to mention that all FB routes follow certain approximations due to which canonical full calculations (FC) which demand huge computational resources, can be circumvented effectively. However, most of the FB methods are generally restricted only to energetics and geometry optimization. The accurate and well-benchmarked theoretical treatment of vibrational IR and Raman spectra for large molecules require extensive computations (viz. the hessian matrix, derivatives of dipole moment and polarizability tensor etc.). We summarize below the attempts reported in the literature for the calculation of vibrational spectra employing FB methods.

In 2008, Li and co-workers demonstrated the Generalized Energy-Based Fragmentation (GEBF) [12] approach for geometry optimization and vibrational spectra of large molecules. In the same year, Gadre and coworkers employed the Molecular Tailoring Approach (MTA) for hessian and frequency calculation of large molecules [13]. Bieler et al. [14] proposed the Cartesian Tensor Transfer Method (CTTM) for calculating the vibrational spectra of polypeptides in 2011. The research groups of Kitaura and Fedorov [15] developed and applied fragmentation-based Fragment Molecular Orbital (FMO) for targeting large biomolecules. In 2014, the use of FMO was demonstrated [15] for the estimation of vibrational IR and Raman spectra for crambin, a protein, and similar large molecules. In 2015, Raghavachari and co-workers [16] reported similar development for computing vibrational IR and Raman spectra within Molecules-In-Molecules (MIM) procedure. Most of these methods are demonstrated for prototype molecules of moderate size and at the HF/DFT level of theory. In the case of MP2 theory, vibrational IR and Raman spectra of large molecules reported in the literature are restricted to either small molecules or are done with some approximations, e.g. resolution of identity (RI) method [17]. A clear-cut comparison with the conventional calculation of vibrational spectra and a discussion regarding the accuracy and efficiency of FB-based methods is also rather scantily found in the literature.

In general, FB methods are restricted to the developer's research group due to their intricate understanding and/or programming implementation. Thus, a well-benchmarked general utility within FB methods, is not available for uninitiated users. In this light, Collins and Bettens [10] have remarked that “the fragmentation methods should move ahead benchmarking phase and to create, as well as disseminate user-friendly programs”. In a similar spirit, Raghavachari and Saha [11] in their article on FB methods stated that “fragmentation schemes are user-specific, requiring substantial user-intervention, precluding their use as broad black box code”.

With this in view, the present work reports the development of the MTA-based black-box code, MTASpec, facilitating ab initio investigations on spatially extended large molecular structures. This package is aimed at the implementation of HF/DFT as well as correlated theories for the exploration of energetics and spectral features of large molecules/molecular clusters assisted by grafting procedure. The software is mostly written using python (version 2.7) as a programming language to be deployed on Linux-based machine/s with pre-installed Gaussian [18] software. In its present version, MTASpec can be utilized for an accurate estimation of energies/spectra (vibrational IR and Raman) of large, closed-shell molecular systems. We trust that systems containing up to 500 first- or second-row atoms and up to 2000 basis functions could be treated at HF/hybrid DFT theory by the present version. Further, molecules containing up to 100 first- or second-row atoms and up to 1000 basis functions could be treated at the MP2 level theory.

Section snippets

MTASpec procedure for single point energy and harmonic vibrational frequency calculations: theory

Within the molecular tailoring approach (MTA), a spatially extended molecule under investigation is virtually cut up in a set of overlapping fragments. This set of fragments is subjected to ab initio computation instead of the spatially extended large parent system. On account of this, computational expenses are steeply reduced. Within the MTA procedure, initial fragments are made by putting a sphere on all non-hydrogen atoms. These fragments are merged, based on distance, with subsequent

MTASpec: program details

MTASpec is an automated program that runs in five parts listed below, facilitating the calculation of vibrational IR and Raman spectra of large molecules within MTA.

  • (i)

    Automatic fragmentation of large molecules.

  • (ii)

    MTA and FC calculations at LB to get the energy and Hessian matrices (HM).

  • (iii)

    MTA calculation at HB to get energy and HM.

  • (iv)

    Calculation of energy followed by vibrational frequency and intensities using synthesized HM.

  • (v)

    Visualization of IR and Raman spectrum along with normal modes.

The program will

Testing and benchmarking of MTASpec software

To demonstrate the efficacy and accuracy of the software MTASpec, prototype closed-shell medium to large-sized molecular systems containing up to 170 (mostly) first-row atoms are chosen. Table 1 enlists the test cases under study along with the number of atoms, level of theory and basis sets employed for the calculations. Within harmonic approximation, vibrational IR and Raman spectra are calculated at HF, DFT (with prototype B3LYP and BP86 functionals). Furthermore, the software is

Concluding remarks

Calculating the vibrational IR and Raman spectra of large molecules employing correlated ab initio theory is a herculean task. In the current study, we present a home-built, fully automated fragmentation-based MTASpec software meant for this purpose. The software is well benchmarked by comparing energetics and vibrational IR and Raman spectra generated by MTASpec with the respective FC counterparts, for various test systems. Since the errors in MTASpec energies are indeed very small (sub mH),

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

SRG is grateful to the National Supercomputing Mission (NSM), New Delhi for financial assistance under the project CORP:DG:3187. The support and computational resources provided by PARAM Shivay Facility under the NSM, Government of India at the Indian Institute of Technology, BHU, Varanasi are gratefully acknowledged.

References (33)

  • R.E. Christoffersen et al.

    Chem. Phys. Lett.

    (1969)
  • M. Hochlaf

    Phys. Chem. Chem. Phys.

    (2017)
  • E. Apra et al.
  • A. Szabo et al.

    Modern Quantum Chemistry

    (1989)
  • I.N. Levine

    Quantum Chemistry

    (2009)
  • S. Yoo et al.

    J. Phys. Chem. Lett.

    (2010)
  • W. Yang

    Phys. Rev. A

    (1991)
  • S.R. Gadre et al.

    J. Phys. Chem.

    (1994)
  • M.S. Gordon et al.

    Chem. Rev.

    (2012)
  • M.A. Collins et al.

    Chem. Rev.

    (2015)
  • K. Raghavachari et al.

    Chem. Rev.

    (2015)
  • W. Hua et al.

    J. Phys. Chem.

    (2008)
  • A.P. Rahalkar et al.

    J. Chem. Phys.

    (2008)
  • N.S. Bieler et al.

    J. Chem. Theory Comput.

    (2011)
  • H. Nakata et al.

    J. Chem. Theory Comput.

    (2014)
  • K.V.J. Jose et al.

    Mol. Phys.

    (2015)
  • Cited by (14)

    View all citing articles on Scopus

    The review of this paper was arranged by Prof. Stephan Fritzsche.

    ☆☆

    This paper and its associated computer program are available via the Computer Physics Communications homepage on ScienceDirect (http://www.sciencedirect.com/science/journal/00104655).

    View full text