Filename: b316260e

Back | Show only these items: | Tests of second-generation and third-generation density functionals for thermochemical kinetics

1
Tests of second-generation and third-generation density functionals for thermochemical kinetics

Type: Goal | Advantage: None | Novelty: None | ConceptID: Goa1

We report tests of second- and third-generation density functionals, for pure density functional theory (DFT) and hybrid DFT, against the BH6 representative barrier height database and the AE6 representative atomization energy database, with augmented, polarized double and triple zeta basis sets.

Type: Goal | Advantage: None | Novelty: None | ConceptID: Goa1

The pure DFT methods tested are G96LYP, BB95, PBE, mPWPW91, VSXC, HCTH, OLYP, and OPW91 and the hybrid DFT methods tested are B1B95, PBE0, mPW1PW91, B97-1, B98, MPW1K, B97-2, and O3LYP.

Type: Object | Advantage: None | Novelty: New | ConceptID: Obj1

The performance of these methods is tested against each other as well as against first-generation methods (BP86, BLYP, PW91, B3PW91, and B3LYP).

Type: Object | Advantage: None | Novelty: New | ConceptID: Obj2

We conclude that the overall performance of the second-generation DFT methods is considerably better than the first-generation methods.

Type: Conclusion | Advantage: None | Novelty: None | ConceptID: Con1

The MPW1K method is very good for barrier height calculations, and none of the pure DFT methods outperforms any of the hybrid DFT methods for kinetics.

Type: Conclusion | Advantage: None | Novelty: None | ConceptID: Con2

The B1B95, VSXC, B98, OLYP and O3LYP methods perform best for atomization energies.

Type: Conclusion | Advantage: None | Novelty: None | ConceptID: Con3

Using a mean mean unsigned error criterion (MMUE) that involves two sizes of basis sets (both with polarization and diffuse functions) and averages mean unsigned errors in barrier heights and in atomization energy per bond, we find that VSXC has the best performance among pure functionals, and B97-2, MPW1K, and B1B95 have the best performance of all hybrid functionals tested.

Type: Conclusion | Advantage: None | Novelty: None | ConceptID: Con4

Hybrid density functional theory¹ (mixing Hartree–Fock exchange with pure DFT) has become generally recognized as the electronic structure method of choice for calculations on large systems.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac1

Methods of this type can be justified theoretically by the adiabatic connection theory,² and hence they are sometimes called adiabatic connection methods.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac1

The adiabatic connection theory indicates that more accurate results can be obtained by replacing some DFT exchange by Hartree–Fock exchange.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac2

Hartree–Fock exchange does not suffer from the self-interaction error of DFT, which can be very important when hydrogen atoms are present, and this is one way to understand why mixing in exact exchange can reduce the error; however, DFT exchange is inseparable from DFT dynamical correlation, and it introduces some static correlation,³ so the optimum fraction of HF exchange is not 100%.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac2

The performance of hybrid DFT for thermochemistry is well documented.^1,2,4–21

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac3

At present, despite the successes of hybrid DFT, one is not completely satisfied for several reasons.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac3

First, the method is not systematically improvable.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac3

Second, the mixture of Hartree–Fock theory into DFT restricts the choice of algorithms such that the most efficient computational strategies used for pure DFT are inapplicable.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac3

Third, the most generally successful hybrid DFT methods are less accurate for kinetics than for thermochemistry.^16,20,22,23

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac3

Since Hartree–Fock theory usually overestimates barrier heights for chemical reactions, and pure DFT usually underestimates them,⁶ many workers noticed that more accurate results can be obtained for kinetics by increasing the fraction of Hartree–Fock exchange;²⁴ however, it was questionable whether this approach was physically meaningful.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac4

It was soon learned that the popular B3LYP hybrid method,^25–27 with 20% Hartree–Fock exchange, still systematically underestimates barrier heights,^22,23 but raising the fraction of Hartree–Fock exchange usually deteriorated the quality of the prediction of the theory for other quantities more rapidly than it increased the quality of barrier height predictions.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac4

However the hybrid mPW1PW91 method⁹ is more stable than the popular B3LYP hybrid method when the fraction of Hartree–Fock exchange is increased,²³ and this observation was used to optimize the MPW1K method for kinetics.²³

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac4

MPW1K gives remarkably accurate barrier heights with only slight deterioration of reaction energies.^20,23,28

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac4

The number of available density functionals is increasing rapidly.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac5

Boese et al²¹. have pointed out that “Many DFT users are overwhelmed by the sheer number of functionals and possibilities….

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac5

Very often because of sheer user inertia, first-generation functionals are applied rather than more accurate second-generation functionals….

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac5

Meanwhile, systematic studies on the dependency of the basis set and functionals remain sparse.” Furthermore, with a few exceptions,^{12,16,18,20,23,28–30} those systematic tests that are available are dominated by thermochemistry, and much less attention has been paid to quantities like barrier heights that are important for kinetics.

Type: Motivation | Advantage: None | Novelty: None | ConceptID: Mot1

Recently, a representative database of six barrier heights was developed³¹ such that the errors calculated for this database correlate extremely well with errors calculated for a much larger database^20,32 of 44 barrier heights.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac6

The small database is called BH6.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac6

The small size of this database makes it more straightforward to test a wide assortment of second-generation density functionals for kinetics, and the representative character of the database makes us expect that the conclusions are consistent with what would be concluded from tests against a much larger database.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac6

The same paper³¹ developed a representative database of atomization energies, containing six molecules and called AE6, such that performance on this database is indicative of performance on a much larger 109 molecule database.^19,20

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac6

The average number of bonds per molecule in AE6 is 4.833.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac6

In the present paper we report errors in the AE6 data base as mean signed error (MSE) and mean unsigned error (MUE) in kcal mol^–1 per bond by dividing the total MSE and MUE by 4.833.

Type: Result | Advantage: None | Novelty: None | ConceptID: Res1

Small representative databases can play an important role in allowing a wide variety of methods to be tested on the same data.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac7

Small databases will not necessarily uncover the interesting difficult cases, but it is reasonable to ask first how a method performs for typical cases.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac7

One advantage of the AE6 and BH6 databases is that they correspond to electronic energy contributions (including nuclear repulsion), exclusive of vibrational zero point energy and vibrational–rotational thermal energies.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac7

Thus they can be used to test electronic energy calculations without the complication of vibrational energy considerations.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac7

All calculations in this paper are single-point calculations at QCISD/MG3 geometries, where QCISD is quadratic configuration interaction with single and double excitations,³³ and MG3 is the modified^34,35 G3Large³⁶ basis set.

Type: Model | Advantage: None | Novelty: None | ConceptID: Mod1

The MG3 basis set,³⁴ also called G3LargeMP2,³⁵ is the same as 6-311++G(3d2f,2df,2p)³⁷ for H–Si, but improved³⁶ for atoms heavier than Si.

Type: Model | Advantage: None | Novelty: None | ConceptID: Mod1

The QCISD/MG3 greometries for all molecules in BH6 and AE6 can be obtained from the database website of our group.³⁸

Type: Model | Advantage: None | Novelty: None | ConceptID: Mod1

The effect of spin–orbit coupling is also added to open shell systems from a compendium given else where.³⁹

Type: Model | Advantage: None | Novelty: None | ConceptID: Mod1

We used the GAUSSIAN03⁴⁰ program to test all pure and hybrid DFT methods except OLYP and O3LYP⁴¹ (acronyms for density functionals are explained below).

Type: Method | Advantage: None | Novelty: Old | ConceptID: Met1

We found that O3LYP in GAUSSIAN03 cannot reproduce the atomization energies published in the original^41,42 O3LYP paper, and therefore O3LYP and OLYP calculations were carried out with the PQS ab initio program developed by Parallel Quantum Solutions.⁴³

Type: Method | Advantage: None | Novelty: Old | ConceptID: Met2

Baker and Pulay¹⁸ used PQS to assess O3LYP and OLYP for organic reactions.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac8

Note that the local correlation functional in O3LYP and OLYP is Vosko, Wilk, and Nusair's correlation functional V (VWN5),⁴⁴ while in B3LYP the local correlation functional is the Vosko, Wilk, and Nusair's correlation functional III (VWN3).⁴⁴

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac8

In the printed part of this article we will give MSE and MUE for BH6 and MSE per bond and MUE per bond for AE6 with two highly recommended basis sets, namely a recommended^19,28 augmented polarized double zeta set, 6-31+G(d,p),^45,46 and a recommended augmented polarized triple zeta set, MG3S.

Type: Model | Advantage: None | Novelty: None | ConceptID: Mod2

In tables 6-31+G(d,p) is abbreviated DIDZ (desert-island double zeta).

Type: Model | Advantage: None | Novelty: None | ConceptID: Mod2

The MG3S basis¹⁹ is the same as MG3 except it omits diffuse functions on hydrogens.

Type: Model | Advantage: None | Novelty: None | ConceptID: Mod2

The supporting information gives the following additional information that may be of interest to specialists: root-mean square errors, results for four more basis sets (6-31G(d), 6-31+G(2d,p), 6-311G(3d,2pd), and 6-311+G(2df,2p)), and results for the subsets of the databases that contain only H–F, where the latter values exclude molecules and barrier heights for systems that include Si and S. We simply comment that the trends in relative performance on the full AE6 and BH6 are mirrored in these subsets, and the conclusions about relative performance drawn from the two recommended basis sets are similar to those for the other four basis sets.

Type: Model | Advantage: None | Novelty: None | ConceptID: Mod2

With regard to comparison of basis sets, the results in supporting information support our previous conclusion¹⁹ that inclusion of diffuse functions on atoms heavier than H make the results more generally reliable.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac9

These tables are also consistent with our previous conclusion, based on previous results,^19,20,23 that if one were confined to a desert island with only one valence double zeta basis set and one valence triple zeta basis set, 6-31+G(d,p) and MG3S would be two very excellent choices (kudos to the Pople group^36,37,45,46 for their work in optimizing the basis functions in these basis sets).

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac9

Table 1 gives results for first-generation functionals, both pure DFT (X = 0) and hybrid DFT (X > 0).

Type: Result | Advantage: None | Novelty: None | ConceptID: Res2

In particular, the first-generation functionals we tested are BP86, BLYP, PW91, B3LYP and B3PW91.

Type: Object | Advantage: None | Novelty: None | ConceptID: Obj1

BP86 is a pure DFT method using Becke's 1988 gradient corrected exchange functional (B)²⁵ and Perdew's 1986 gradient corrected correlation functional (P86).⁴⁷

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac10

When one combines Becke's 1988 gradient corrected exchange functional (B) with Lee, Yang, and Parr's gradient corrected correlation functional (LYP),²⁶ one obtains the BLYP method.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac10

PW91 is a pure DFT method that incorporates Perdew and Wang's 1991 gradient corrected exchange and correlation functional.^48,49

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac10

B3PW91 is Becke's three parameter hybrid DFT method which includes 20% Hartree–Fock exchange.¹

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac10

B3LYP is the most popular hybrid DFT method; it was developed by Stephens et al²⁷. by following Becke's three-parameter hybrid DFT strategy.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac10

Table 2 gives some results for DFT methods involving second-generation functionals (mPW1PW91 and MPW1K) that have already been widely tested against our databases.

Type: Result | Advantage: None | Novelty: None | ConceptID: Res3

Barone and Adamo⁹ developed mPW1PW91 as a Becke-style one-parameter hybrid functional using their modified Perdew–Wang (mPW or MPW) exchange functional, Perdew and Wang's 1991 (PW91) correlation functional, and 25% of Hartree–Fock exchange.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac11

MPW1K is a method optimized against a kinetics database by our group;²³ it uses the same functionals as mPW1PW91, but the fraction of Hartree–Fock exchange is 42.8%.

Type: Background | Advantage: None | Novelty: None | ConceptID: Bac11

Table 3 and Table 4 give results for second-generation and third-generation pure and hybrid DFT functionals that have not previously been tested against our databases.

Type: Result | Advantage: None | Novelty: None | ConceptID: Res4

The tests in Table 3 and Table 4 are the main new results in the present paper.

Type: Result | Advantage: None | Novelty: None | ConceptID: Res4

By testing so many methods against the same representative databases, including barrier heights, we can put the new functionals in better perspective and thereby provide guidance to the “many DFT users” mentioned in the third paragraph as well as to developers of new density functionals and computational strategists.

Type: Object | Advantage: Yes | Novelty: New | ConceptID: Obj2

In particular, Tables 3 and 4 include the following second-generation methods (in alphabetical order):