Processing math: 100%

Average-of-configuration open-shell Hartree-Fock

This is a short introduction to the theory behind average-of-configuration open-shell Hartree-Fock as implemented in DIRAC. For a more complete description the reader may consult chapter 3 of the PhD thesis of Jørn Thyssen [Thyssen2004] .

It should first be noted that there is no restricted open-shell Hartree-Fock (ROHF) code in DIRAC. The reason is that spin-orbit interaction couples spin and spatial degrees of freedom and make the formalism much more complicated since one cannot exploit spin symmetry alone for fixing the expansion coefficients in the reference configuration state function (CSF) which serves as the trial function.

Instead of optimizing the energy for a single open-shell state, we shall optimize the energy for a limited set of open-shell states.

Energy expression

Suppose that we have a set of Ndet of Slater determinants, constituting our N-particle basis. We next construct and diagonalize a CI matrix in this basis. This gives Ndet solutions of the form

|ΨI=NdetP=1|ΦPCPI

We will now find the set of orbitals which minimizes the average energy

Eav=1NdetNdetI=1ΨP|H|ΨP

Inserting the expansion of the solutions in terms of Slater determinants and using the fact that the expansion coefficients CPI are elements of a unitary matrix we obtain

Eav=1NdetNdetP=1NdetQ=1ΦP|H|ΦQNdetI=1CPICQI=1NdetNdetP=1ΦP|H|ΦP

showing that the average can also be taken over the N-particle basis itself.

Introducing open shells and active electrons

The above average energy expression is a functional of the orbitals entering the Slater determinants. We will make a distinction between :

  • inactive orbitals, present in all Slater determinants, represented by indices ijkl
  • active orbitals, present in some, but not all Slater determinants, represented by indices uvxy
  • secondary orbitals, not present in any Slater determinant, represented by indices abcd

We shall also employ indices pqrs for general orbitals.

In order to generate our N-particle basis for averaging we will distribute the orbitals into a number of shells. Each shell S is specified by MS orbital and NS electrons. Inactive and secondary shells have NS=MS and NS=0, respectively, whereas active shells have NS<MS. We generate our N-electron basis by distributing all active electrons in all possible ways within their respective shells. The total energy can then be written in terms of orbitals rather than Slater determinants as

Eav=Sfs[pShpp+12TQTpp+12(aS1)QSpp]

where we have introduced

  • the fractional occupation fS=NSMS of shell S
  • the coupling coefficient aS
    • aS=MS(N21)NS(M21) for fs0
    • aS=1 for fs=0
  • two-electron contributions QSpq=fsrS<pr||qr>

Orbital rotations

We will use a exponential parametrization for the rotation of orbitals

˜ϕp=qϕq[exp(κ)]pq

where κ is an anti-Hermitian matrix to ensure unitarity of the transformation. The exponential parametrization allows for unconstrained optimization (no Lagrange multipliers). It also allows the easy identification of redundant variational parameters, that is, parameters whose variation does not change the energy. In this particular case one finds that rotations within shells are redundant and the corresponding matrix elements κpq can be set to zero.

Gradient elements and off-diagonal blocks of the Fock matrix

The generally non-zero elements of the gradient vector are:

  • inactive-secondary rotations:
gia=hai+TQTai=FIaia
  • active-secondary rotations:
gua=fU[FIau+(aU1)QUau];uU
  • inactive-active rotations:
giu=(1fU)[FIui+fUαUQUui];uU
  • inter-shell active-active rotations
guv=(fUfV)FIvu(αU1)fUQUvu(αV1)fVQVvu;uU, vV

where we have introduced

αS=1aS1fS

These gradient elements allow the definition of the off-diagonal elements of the Fock matrix:

  • inactive-secondary block:
  • active-secondary block:
  • inactive-active block:
  • inter-shell active-active rotations UV:

Diagonal blocks of the Fock matrix

The diagonal blocks of the Fock matrix are a priori not related to gradient elements and there is therefore freedom of choice in their specification. The specific choice will not affect the total energy, but will affect orbitals energies as well as convergence of the AOC HF calculation.

In order to obtain a meaningful definition of the diagonal blocks of the Fock matrix we will consider an extension of Koopmans’ theorem to average-of-configuration Hartree-Fock, that is, we consider average energy after removal of an electron from a specific shell T and using the same orbital set as for the original N-electron system.

The energy difference becomes

ENavEN1av=1MTtT[htt+SQStt+(aT1)QTtt]

If we now define the diagonal block of the Fock matrix corresponding to shell T as

Fpq=hpq+SQSpq+(aT1)QTpq

the ionization potential associated with the electron removal becomes

IP=EN1avENav=1MTtTεt

In the case of a degenerate shell we simply find

IP=EN1avENav=εt,tT

identical to the original Koopmans’ theorem, whereas one in the general case gets an average over the orbital energies of the shell.

Based on these observations we define the diagonal blocks of the AOC Fock matrix as

  • inactive-inactive block:
Fij=FIij
  • secondary-secondary block:
Fab=FIab
  • active-active block:
Fuv=FIuv+(aU1)QUuv;u,vU

These are the definitions employed in DIRAC12 and onwards (and also the definition found in the thesis of Jørn Thyssen [Thyssen2004]). In previous versions the term (aU1)QUuv was missing from the active-active block. Since QUuu is positive and for an open shell aU<1 removal of this term tend to shift orbital energies of the open shell upwards.

Convergence problems typically occur when orbital energies between shells have similar values such that the selection of occupied orbitals for the construction of the Fock matrix becomes ambiguous. In a closed-shell system this will for instance happen when the HOMO-LUMO gap closes. The definition of the active-active block in pre-DIRAC12 version (which was in fact an unintended omittal) can in some instances lead to improved convergence. More specifically, this happens when the orbitals of an open shell and the closed shell (or another open shell) are almost degenerate. However, such situations are often symptomatic for a wrong choice of partitioning of orbitals into closed and open shells. Furthermore, the definition of the active-active block in pre-DIRAC12 versions tend to close the HOMO-LUMO gap which may hamper convergence.

Level shift

Whenever there is almost degeneracy of orbitals between different shells the recommended strategy is to exploit the freedom in the definition of diagonal blocks of the Fock matrix and introduce a level shift λ, that is

FUuvFUuv+λδuv

The level shift of secondary (virtual) orbitals is controlled by the keyword .LSHIFT, whereas open shells can be shifted using the keyword .OLEVEL.