Electronic Structure Theory

A. Electronic structure theory describes the motions of the electrons and produces energy surfaces

The shapes and geometries of molecules, their electronic, vibrational, and rotational energy levels and wavefunctions, as well as the interactions of these states with electromagnetic fields lie within the realm of structure theory.
1. The Underlying Theoretical Basis
In the Born-Oppenheimer model of molecular structure, it is assumed that the electrons move so quickly that they can adjust their motions essentially instantaneously with respect to any movements of the heavier and slower moving atomic nuclei. This assumption motivates us to view the electrons moving in electronic wave functions (orbitals within the simplest and most commonly employed theories) that smoothly "ride" the molecule's atomic framework. These electronic functions are found by solving a Schrödinger equation whose Hamiltonian He contains the kinetic energy Te of the electrons, the Coulomb repulsions among all the molecule's electrons Vee, the Coulomb attractions Ven among the electrons and all of the molecule's nuclei treated with these nuclei held clamped, and the Coulomb repulsions Vnn among all of these nuclei. The electronic wave functions \(y_k\) and energies \(E_k\) that result from solving the electronic Schrödinger equation \(H_e \;y_k \;=\; E_k\; y_k\) thus depend on the locations \(Q_i\) at which the nuclei are sitting. That is, the \(E_k \)and \(y_k \) are parametric functions of the coordinates of the nuclei. These electronic energies' dependence on the positions of the atomic centers cause them to be referred to as electronic energy surfaces such as that depicted below for a diatomic molecule where the energy depends only on one interatomic distance R. For non-linear polyatomic molecules having N atoms, the energy surfaces depend on 3N-6 internal coordinates and thus can be very difficult to visualize. A slice through such a surface (i.e., a plot of the energy as a function of two out of 3N-6 coordinates) is shown below and various features of such a surface are detailed. The Born-Oppenheimer theory of molecular structure is soundly based in that it can be derived from a starting point consisting of a Schrödinger equation describing the kinetic energies of all electrons and of all N nuclei plus the Coulomb potential energies of interaction among all electrons and nuclei. By expanding the wavefunction Y that is an eigenfunction of this full Schrödinger equation in the complete set of functions \(y_k\) and then neglecting all terms that involve derivatives of any \(y_k\) with respect to the nuclear positions \(Q_i\), one can separate variables such that:
1. The electronic wavefunctions and energies must obey
\(H_e \;y_k \;=\; E_k\; y_k\)
2. The nuclear motion (i.e., vibration/rotation) wavefunctions must obey
\( (T_N + E_k ) c_{k,L} = E_{k,L} c_{k,L} \) ,
where TN is the kinetic energy operator for movement of all nuclei. That is, each and every electronic energy state, labeled k, has a set, labeled L, of vibration/rotation energy levels \(E_{k,L}\) and wavefunctions \(c_{k,L}\). Because the Born-Oppenheimer model is obtained from the full Schrödinger equation by making approximations (e.g., neglecting certain terms that are called non-adiabatic coupling terms), it is not exact. Thus, in certain circumstances it becomes necessary to correct the predictions of the Born-Oppenheimer theory (i.e., by including the effects of the neglected non-adiabatic coupling terms using perturbation theory).
For example, when developing a theoretical model to interpret the rate at which electrons are ejected from rotationally/vibrationally hot \(\text{NH}^-\) ions, we had to consider coupling between two states having the same total energy:
1. \(^2\)P NH\(^-\) in its v=1 vibrational level and in a high rotational level (e.g., J >30) prepared by laser excitation of vibrationally "cold" NH\(^-\) in v=0 having high J (due to natural Boltzmann populations); see the figure below, and
2. \(^3\)S NH neutral plus an ejected electron in which the NH is in its v=0 vibrational level (no higher level is energetically accessible) and in various rotational levels (labeled N).
Because NH has an electron affinity of 0.4 eV, the total energies of the above two states can be equal only if the kinetic energy KE carried away by the ejected electron obeys: \[\text{KE} = \text{E}_{vib/rot} (\text{NH}^- (v=1, J)) - \text{E}_{vib/rot} (\text{NH} (v=0, N)) - 0.4 eV\]
In the absence of any non-adiabatic coupling terms, these two isoenergetic states would not be coupled and no electron detachment would occur. It is only by the anion converting some of its vibration/rotation energy and angular momentum into electronic energy that the electron that occupies a bound N2p orbital in \(\text{NH}^-\) can gain enough energy to be ejected. Energies of \(\text{NH}^-\) and of NH pertinent to the autodetachment of v=1, J levels of \(\text{NH}^-\) formed by laser excitation of v=0 J'' \(\text{NH}^-\) .
2. What is Learned from an Electronic Structure Calculation? The knowledge gained via structure theory is great. The electronic energies Ek (Q) allow one to determine (see my book Energetic Principles of Chemical Reactions) the geometries and relative energies of various isomers that a molecule can assume by finding those geometries \(Q_i \) at which the energy surface \(E_k\) has minima \(\frac{\partial\text{E}_k}{\partial\text{Q}_i} = 0\), with all directions having positive curvature (this is monitored by considering the so-called Hessian matrix \(H_{i,j}=\frac{\partial^2\text{E}_k}{\partial_i Q_j}\) if none of its eigenvalues are negative, all directions have positive curvature). Such geometries describe stable isomers, and the energy at each such isomer geometry gives the relative energy of that isomer. Professor Berny Schlegel at Wayne State has been one of the leading figures whose efforts are devoted to using gradient and Hessian information to locate stable structures and transition states. Professor Peter Pulay has done as much as anyone to develop the theory that allows us to compute the gradients and Hessians appropriate to the most commonly used electronic structure methods. His group has also pioneered the development of so-called local correlation methods which focus on using localized orbitals to compute correlation energies in a manner that scales less severely with system size than when delocalized canonical molecular orbitals are employed. There may be other geometries on the \(E_k\) energy surface at which all "slopes" vanish \(\frac{\partial\text{E}_k}{\partial\text{Q}_i} = 0\), but at which not all directions possess positive curvature. If the Hessian matrix has only one negative eigenvalue there is only one direction leading downhill away from the point \(Q_i\) of zero force; all the remaining directions lead uphill from this point. Such a geometry describes that of a transition state, and its energy plays a central role in determining the rates of reactions which pass through this transition state. At any geometry \(Q_i\), the gradient or slope vector having components \(\frac{\text{E}_k}{\text{Q}_i}\) provides the forces (\(F_i = -\frac{\text{E}_k}{\text{Q}_i}\) ) along each of the coordinates \(Q_i \). These forces are used in molecular dynamics simulations (see the following section) which solve the Newton F = ma equations and in molecular mechanics studies which are aimed at locating those geometries where the F vector vanishes (i.e., the stable isomers and transition states discussed above). Also produced in electronic structure simulations are the electronic wavefunctions \(y_k\) and energies \(E_k\) of each of the electronic states. The separation in energies can be used to make predictions about the spectroscopy of the system. The wavefunctions can be used to evaluate properties of the system that depend on the spatial distribution of the electrons in the system. For example, the \(z^-\) component of the dipole moment of a molecule mz can be computed by integrating the probability density for finding an electron at position r multiplied by the z- coordinate of the electron and the electron's charge e: mz = Ú e \(y_k\)* \(y_k\) z dr . The average kinetic energy of an electron can also be computed by carrying out such an average-value integral: Ú \(y_k\)* (-\(\frac{h^2}{2m_e^2}\) ) \(y_k\) dr. The rules for computing the average value of any physical observable are developed and illustrated in popular undergraduate text books on physical chemistry (e.g., Atkins text or Berry, Rice, and Ross) and in graduate level texts (e.g., Levine, McQuarrie, Simons and Nichols).
Not only can electronic wavefunctions tell us about the average values of all physical properties for any particular state (i.e., \(y_k\) above), but they also allow us to tell how a specific "perturbation" (e.g., an electric field in the Stark effect, a magnetic field in the Zeeman effect, light's electromagnetic fields in spectroscopy) can alter the specific state of interest. For example, the perturbation arising from the electric field of a photon interacting with the electrons in a molecule is given within the so-called electric dipole approximation (see, for example, Simons and Nichols, Ch. 14) by: \[H_{pert} =S_j e^2 r_j\cdot \textbf{E} (t)\]
where E is the electric field vector of the light, which depends on time t in an oscillatory manner, and ri is the vector denoting the spatial coordinates of the ith electron. This perturbation, Hpert acting on an electronic state \(y_k\) can induce transitions to other states \(y^{\prime}_k\) with probabilities that are proportional to the square of the integral \[U^\prime y^{\prime}_k* H_{pert} y_k d\textbf{r} \] So, if this integral were to vanish, transitions between \(y_k\) and \(y^{\prime}_k\) would not occur, and would be referred to as "forbidden". Whether such integrals vanish or not often is determined by symmetry. For example, if \(y_k\) were of odd symmetry under a plane of symmetry sv of the molecule, while \(y^{\prime}_k\) were even under sv , then the integral would vanish unless one or more of the three cartesian components of the dot product \(r_j\cdot\)E were odd under sv The general idea is that for the integral to not vanish the direct product of the symmetries of \(y_k\) and of \(y^{\prime}_k\) must match the symmetry of at least one of the symmetry components present in Hpert . Professor Poul Jørgensen has been especially involved in developing such so-called response theories for perturbations that may be time dependent (e.g., as in the interaction of light's electromagnetic radiation).

3. Present Day Challenges in Structure Theory

a. Orbitals form the starting point; what are the orbitals?
The full N-electron Schrödinger equation governing the movement of the electrons in a molecule is \begin{align*} -\bigg[\frac{h^2}{2m_e}S_{i=1} \nabla^2_i - \frac{S_a S_i Z_a e^2}{r_{ia}} + \frac{\text{S}_{i,j}{e^2}}{r_{ij}} \bigg] y = E y \end{align*} In this equation, i and j label the electrons and a labels the nuclei. Even at the time this material was written, this equation had been solved only for the case N=1 (i.e., for H, He+ , Li2+ , Be3+ , etc.). What makes the problem difficult to solve for other cases is the fact that the Coulomb potential e2/rij acting between pairs of electrons depends upon the coordinates of the two electrons ri and rj in a way that does not allow the separations of variables to be used to decompose this single 3N dimensional second-order differential equation into N separate 3-dimensional equations. However, by approximating the full electron-electron Coulomb potential Si,j e2 /rij by a sum of terms, each depending on the coordinates of only one electron Si, V(ri ), one arrives at a Schrödinger equation [-h2 /2me Si=1 i2 - Sa Si Za e2 /ria + Si V(ri)] y = E y which is separable. That is, by assuming that y (r1 , r2 , ... rN ) = f1 (r1 ) f2 (r2) ... fN (rN), and inserting this ansatz into the approximate Schrödinger equation, one obtains N separate Schrödinger equations: \begin{equation*} \big[-\frac{\hbar^2}{2m_e} \nabla_i^2 -\frac{S_a Z_a e^2}{r_{ia}} + V\left(r_i\right)\big] f_i = E_i f_i \end{equation*} one for each of the N so-called orbitals fi whose energies Ei are called orbital energies. It turns out that much of the effort going on in the electronic structure area of theoretical chemistry has to do with how one can find the "best" effective potential V(r); that is, the V(r), which depends only on the coordinates r of one electron, that can best approximate the true pairwise additive Coulomb potential experienced by an electron due to the other electrons. The simplest and most commonly used approximation for V(r) is the so-called Hartree-Fock (HF) potential: V(r) fi (r) = Sj [ Ú|fj (r')|2 e2 /|r-r'| dr' fi (r) - Úfj*(r') fj (r') e2 /|r-r'| dr' fj (r) ]. This potential, when acting on the orbital fi , can be viewed as multiplying fi by a sum of potential energy terms (which is what makes it one-electron additive), each of which consists of two parts: a. An average Coulomb repulsion Ú |fj (r')|2 e2 /|r-r'| dr' between the electron in fi with another electron whose spatial charge distribution is given by the probability of finding this electron at location r' if it resides in orbital fj : |fj (r')|2 . b. A so-called exchange interaction between the electron in fi with the other electron that resides in fj.. The sum shown above runs over all of the orbitals that are occupied in the atom or molecule. For example, in a Carbon atom, the indices i and j run over the two 1s orbitals, the two 2s orbitals and the two 2p orbitals that have electrons in them (say 2px and 2py ) The potential felt by one of the 2s orbitals is obtained by setting fi = 2s, and summing j over j=1s, 1s, 2s, 2s, 2px , 2py . The term Ú |1s(r')|2 e2 /|r-r'| dr' 2s(r) gives the average Coulomb repulsion between an electron in the 2s orbital and one of the two 1s electrons; Ú |2px(r')|2 e2 /|r-r'| dr' 2s(r) gives the average repulsion between the electron in the 2px orbital and an electron in the 2s orbital; and Ú |2s(r')|2 e2 /|r-r'| dr' 2s(r) describes the Coulomb repulsion between one electron in the 2s orbital and the other electron in the 2s orbital. The exchange interactions, which arise because electrons are Fermion particles whose indistinguishability must be accounted for, have analogous interpretations. For example, Ú 1s*(r') 2s(r') e2 /|r-r'| dr' 1s(r) is the exchange interaction between an electron in the 1s orbital and the 2s electron; Ú 2px*(r') 2s(r') e2 /|r-r'| dr' 2px(r) is the exchange interaction between an electron in the 2px orbital and the 2s electron; and Ú 2s*(r') 2s(r') e2 /|r-r'| dr' 2s(r) is the exchange interaction between a 2s orbital and itself (note that this interaction exactly cancels the corresponding Coulomb repulsion Ú |2s(r')|2 e2 /|r-r'| dr' 2s(r), so one electron does not repel itself in the Hartree-Fock model). There are two primary deficiencies with the Hartree-Fock approximation: a. Even if the electrons were perfectly described by a wavefunction y (r1 , r2 , ... rN ) = f1 (r1 ) f2 (r2) ... fN (rN) in which each electron occupied an independent orbital and remained in that orbital for all time, the true interactions among the electrons Si,j e2 /rij are not perfectly represented by the sum of the average interactions. b. The electrons in a real atom or molecule do not exist in regions of space (this is what orbitals describe) for all time; there are times during which they must move away from the regions of space they occupy most of the time in order to avoid collisions with other electrons. For this reason, we say that the motions of the electrons are correlated (i.e., where one electron is at one instant of time depends on where the other electrons are at that same time). Let us consider the implications of each of these two deficiencies. b. The imperfections in the orbital-level picture are substantial To examine the difference between the true Coulomb repulsion between electrons and the Hartree-Fock potential between these same electrons, the figure shown below is useful. In this figure, which pertains to two 1s electrons in a Be atom, the nucleus is at the origin, and one of the electrons is placed at a distance from the nucleus equal to the maximum of the 1s orbital's radial probability density (near 0.13 Å). The radial coordinate of the second is plotted along the abscissa; this second electron is arbitrarily constrained to lie on the line connecting the nucleus and the first electron (along this direction, the inter-electronic interactions are largest). On the ordinate, there are two quantities plotted: (i) the Hartree-Fock (sometimes called the self-consistent field (SCF) potential) Ú |1s(r')|2 e2 /|r-r'| dr', and (ii) the so-called fluctuation potential (F), which is the true coulombic e2/|r-r'| interaction potential minus the SCF potential. As a function of the inter-electron distance, the fluctuation potential decays to zero more rapidly than does the SCF potential. However, the magnitude of F is quite large and remains so over an appreciable range of inter-electron distances. Hence, corrections to the HF-SCF picture are quite large when measured in kcal/mole. For example, the differences DE between the true (state-of-the-art quantum chemical calculation) energies of interaction among the four electrons in Be (a and b denote the spin states of the electrons) and the HF estimates of these interactions are given in the table shown below in eV (1 eV = 23.06 kcal/mole). Orb. Pair 1sa1sb 1sa2sa 1sa2sb 1sb2sa 1sb2sb 2sa2sb DE in eV 1.126 0.022 0.058 0.058 0.022 1.234 These errors inherent to the HF model must be compared to the total (kinetic plus potential) energies for the Be electrons. The average value of the kinetic energy plus the Coulomb attraction to the Be nucleus plus the HF interaction potential for one of the 2s orbitals of Be with the three remaining electrons -15.4 eV; the corresponding value for the 1s orbital is (negative and) of even larger magnitude. The HF average Coulomb interaction between the two 2s orbitals of 1s22s2 Be is 5.95 eV. This data clearly shows that corrections to the HF model represent significant fractions of the inter-electron interaction energies (e.g., 1.234 eV compared to 5.95- 1.234 = 4.72 eV for the two 2s electrons of Be), and that the inter-electron interaction energies, in turn, constitute significant fractions of the total energy of each orbital (e.g., 5.95 -1.234 eV = 4.72 eV out of -15.4 eV for a 2s orbital of Be). The task of describing the electronic states of atoms and molecules from first principles and in a chemically accurate manner (± 1 kcal/mole) is clearly quite formidable. The HF potential takes care of "most" of the interactions among the N electrons (which interact via long-range coulomb forces and whose dynamics requires the application of quantum physics and permutational symmetry). However, the residual fluctuation potential is large enough to cause significant corrections to the HF picture. This, in turn, necessitates the use of more sophisticated and computationally taxing techniques to reach the desired chemical accuracy. c. Going beyond the simplest orbital model is sometimes essential What about the second deficiency of the HF orbital-based model? Electrons in atoms and molecules undergo dynamical motions in which their coulomb repulsions cause them to "avoid" one another at every instant of time, not only in the average-repulsion manner that the mean-field models embody. The inclusion of instantaneous spatial correlations among electrons is necessary to achieve a more accurate description of atomic and molecular electronic structure. Some idea of how large the effects of electron correlation are and how difficult they are to treat using even the most up-to-date quantum chemistry computer codes was given above. Another way to see the problem is offered in the figure shown below. Here we have displayed on the ordinate, for Helium's 1S (1s2) state, the probability of finding an electron whose distance from the He nucleus is 0.13Å (the peak of the 1s orbital's density) and whose angular coordinate relative to that of the other electron is plotted on the absissa. The He nucleus is at the origin and the second electron also has a radial coordinate of 0.13 Å. As the relative angular coordinate varies away from 0 deg, the electrons move apart; near 0 deg, the electrons approach one another. Since both electrons have the same spin in this state, their mutual Coulomb repulsion alone acts to keep them apart. What this graph shows is that, for a highly accurate wavefunction (one constructed using so-called Hylleraas functions that depend explicitly on the coordinates of the two electrons as well as on their interparticle distance coordinate) that is not of the simple orbital product type, one finds a "cusp" in the probability density for finding one electron in the neighborhood of another electron with the same spin. The probability plot for the Hylleraas function is the lower dark line in the above figure. In contrast, this same probability density, when evaluated for an orbital-product wavefunction (e.g., for the Hartree-Fock function) has no such cusp because the probability density for finding one electron at r, q, f is independent of where the other electron is (due to the product nature of the wavefunction). The Hartree-Fock probability, which is not even displayed above, would thus, if plotted, be flat as a function of the angle shown above. Finally, the graph shown above that lies above the Hylleraas plot and that has no "sharp" cusp was extracted from a configuration interaction wavefunction for He obtained using a rather large correlation consistent polarized valence quadruple atomic basis set. Even for such a sophisticated wavefunction (of the type used in many state of the art ab initio calculations), the cusp in the relative probability distribution is clearly not well represented. d. For realistic accuracy, improvements to the orbital picture are required Although highly accurate methods do exist for handling the correlated motions of electrons (e.g., the Hylleraas method mentioned above), they have not proven to be sufficiently computationally practical to be of use on atoms and molecules containing more than a few electrons. Hence, it is common to find other methods employed in most chemical studies in which so-called correlated wavefunctions are used. By far, the most common and widely used class of such wavefunctions involve using linear combinations of orbital product functions (actually, one must use so-called antisymmetrized orbital products to properly account for the fact that Fermion wavefunctions such as those describing electrons are odd under permutations of the electrons' labels): Y = S J CJ| fJ1 fJ2 fJ3 ...fJ(N-!) fJN |, with the indices J1, J2, ..., JN labeling the spin-orbitals and the coefficients CJ telling how much of each particular orbital product to include. As an example, one could use Y = C1 |1sa 1sb | - C2 [|2pza2pzb | - |2pxa2pxb | -2pya2pyb |] as a wavefunction for the 1S state of He (the last three orbital products are combined to produce a state that is spherically symmetric and thus has L = 0 electronic angular momentum just as the |1sa1sb| state does). Using a little algebra, and employing the fact that the orbital products |f1 f2 | = (2)-1/2 [ f1 f2 - f2 f1 ] are really antisymmetric products, one can show that the above He wavefunction can be rewritten as follows: Y = C1/3 {|fz a f'z b| - |fx a f'xb| - |fya f'y b| }, where fz = 1s + (3C2/C1)1/2 2pz and f'z = 1s - (3C2/C1)1/2 2pz , with analogous definitions for fx , f'x , fy , and f'y . The physical interpretation of the three terms ({|fz a f'z b| , |fx a f'x b| , and |fy a f'y b| ) is that |fz a f'z b| describes a contribution to Y in which one electron of a spin resides in a region of space described by fz while the other electron of b spin is in a region of space described by f'z , and analogously for |fxa f'x b| and |fy a f'y b|. Such a wavefunction thus allows the two electrons to occupy different regions of space since each orbital f in a pair is different from its partner f'. The extent to which the orbital differ depends on the C2/C1 ratio which, in turn, is governed by how strong the mutual repulsions between the two electrons are. Such a pair of so-called polarized orbitals is shown in the figure below. e. Density Functional Theory (DFT) These approaches provide alternatives to the conventional tools of quantum chemistry. The CI, MCSCF, MPPT/MBPT, and CC methods move beyond the single-configuration picture by adding to the wave function more configurations whose amplitudes they each determine in their own way. This can lead to a very large number of CSFs in the correlated wave function, and, as a result, a need for extraordinary computer resources. The density functional approaches are different. Here one solves a set of orbital-level equations \[\bigg[ -\frac{h^2}{2m_e} - \frac{S_A Z_A e^2}{\big|r-R_A\big|} + \frac{\tilde{U^\prime}\left(r^\prime\right)e^2}{\big|r-r^\prime\big|} dr^\prime + U\left(r\right) \bigg] f_i = e_i f_i\] in which the orbitals {fi} 'feel' potentials due to the nuclear centers (having charges ZA), Coulombic interaction with the total electron density r(r'), and a so-called exchange-correlation potential denoted U(r'). The particular electronic state for which the calculation is being performed is specified by forming a corresponding density r(r'). Before going further in describing how DFT calculations are carried out, let us examine the origins underlying this theory. The so-called Hohenberg-Kohn theorem states that the ground-state electron density r(r) describing an N-electron system uniquely determines the potential V(r) in the Hamiltonian H = Sj {-h2/2mej2 + V(rj) + e2/2 Skj 1/rj,k }, and, because H determines the ground-state energy and wave function of the system, the ground-state density r(r) determines the ground-state properties of the system. The proof of this theorem proceeds as follows:
a. r(r) determines N because Ú r(r) d3r = N.
b. Assume that there are two distinct potentials (aside from an additive constant that simply shifts the zero of total energy) V(r) and V'(r) which, when used in H and H', respectively, to solve for a ground state produce E0, Y (r) and E0', Y'(r) that have the same one-electron density: Ú |Y|2 dr2 dr3 ... drN = r(r)= Ú |Y'|2 dr2 dr3 ... drN .
c. If we think of Y' as trial variational wave function for the Hamiltonian H, we know that \begin{equation*} E_0 \left = \left + \tilde{U}_r\left(r\right) [V(r) - V'(r)] d3r = E0' + Ú r(r) [V(r) - V'(r)] d3r. \end{equation*}
d. Similarly, taking Y as a trial function for the H' Hamiltonian, one finds that E0' < E0 + Ú r(r) [V'(r) - V(r)] d3r. e. Adding the equations in c and d gives E0 + E0' < E0 + E0', a clear contradiction. Hence, there cannot be two distinct potentials V and V' that give the same ground-state r(r). So, the ground-state density r(r) uniquely determines N and V, and thus H, and therefore Y and E0. Furthermore, because Y determines all properties of the ground state, then r(r), in principle, determines all such properties. This means that even the kinetic energy and the electron-electron interaction energy of the ground-state are determined by r(r). It is easy to see that Ú r(r) V(r) d3r = V[r] gives the average value of the electron-nuclear (plus any additional one-electron additive potential) interaction in terms of the ground-state density r(r), but how are the kinetic energy T[r] and the electron-electron interaction Vee[r] energy expressed in terms of r? The main difficulty with DFT is that the Hohenberg-Kohn theorem shows that the ground-state values of T, Vee , V, etc. are all unique functionals of the ground-state r (i.e., that they can, in principle, be determined once r is given), but it does not tell us what these functional relations are. To see how it might make sense that a property such as the kinetic energy, whose operator -h2 /2me 2 involves derivatives, can be related to the electron density, consider a simple system of N non-interacting electrons moving in a three-dimensional cubic "box" potential. The energy states of such electrons are known to be E = (h2/2meL2) (nx2 + ny2 +nz2 ), where L is the length of the box along the three axes, and nx , ny , and nz are the quantum numbers describing the state. We can view nx2 + ny2 +nz2 = R2 as defining the squared radius of a sphere in three dimensions, and we realize that the density of quantum states in this space is one state per unit volume in the nx , ny , nz space. Because nx , ny , and nz must be positive integers, the volume covering all states with energy less than or equal to a specified energy E = (h2/2meL2) R2 is 1/8 the volume of the sphere of radius R: F(E) = 1/8 (4p/3) R3 = (p/6) (8meL2E/h2)3/2 . Since there is one state per unit of such volume, F(E) is also the number of states with energy less than or equal to E, and is called the integrated density of states. The number of states g(E) dE with energy between E and E+dE, the density of states, is the derivative of F: g(E) = dF/dE = (p/4) (8meL2/h2)3/2 E1/2 . If we calculate the total energy for N electrons, with the states having energies up to the so-called Fermi energy (i.e., the energy of the highest occupied molecular orbital HOMO) doubly occupied, we obtain the ground-state energy: = (8p/5) (2me/h2)3/2 L3 EF5/2. The total number of electrons N can be expressed as N = 2 g(e)dE= (8p/3) (2me/h2)3/2 L3 EF3/2, which can be solved for EF in terms of N to then express E0 in terms of N instead of EF: E0 = (3h2/10me) (3/8p)2/3 L3 (N/L3)5/3 . This gives the total energy, which is also the kinetic energy in this case because the potential energy is zero within the "box", in terms of the electron density r (x,y,z) = (N/L3). It therefore may be plausible to express kinetic energies in terms of electron densities r(r), but it is by no means clear how to do so for "real" atoms and molecules with electron-nuclear and electron-electron interactions operative. In one of the earliest DFT models, the Thomas-Fermi theory, the kinetic energy of an atom or molecule is approximated using the above kind of treatment on a "local" level. That is, for each volume element in r space, one assumes the expression given above to be valid, and then one integrates over all r to compute the total kinetic energy: TTF[r] = Ú (3h2/10me) (3/8p)2/3 [r(r)]5/3 d3r = CF Ú [r(r)]5/3 d3r , where the last equality simply defines the CF constant (which is 2.8712 in atomic units). Ignoring the correlation and exchange contributions to the total energy, this T is combined with the electron-nuclear V and Coulombic electron-electron potential energies to give the Thomas-Fermi total energy: E0,TF [r] = CF Ú [r(r)]5/3 d3r + Ú V(r) r(r) d3r + e2/2 Ú r(r) r(r')/|r-r'| d3r d3r', This expression is an example of how E0 is given as a local density functional approximation (LDA). The term local means that the energy is given as a functional (i.e., a function of r) which depends only on r(r) at points in space but not on r(r) at more than one point in space. Unfortunately, the Thomas-Fermi energy functional does not produce results that are of sufficiently high accuracy to be of great use in chemistry. What is missing in this theory are a. the exchange energy and b. the correlation energy; moreover, the kinetic energy is treated only in the approximate manner described. In the book by Parr and Yang, it is shown how Dirac was able to address the exchange energy for the 'uniform electron gas' (N Coulomb interacting electrons moving in a uniform positive background charge whose magnitude balances the charge of the N electrons). If the exact expression for the exchange energy of the uniform electron gas is applied on a local level, one obtains the commonly used Dirac local density approximation to the exchange energy: Eex,Dirac[r] = - Cx Ú [r(r)]4/3 d3r, with Cx = (3/4) (3/p)1/3 = 0.7386 in atomic units. Adding this exchange energy to the Thomas-Fermi total energy E0,TF [r] gives the so-called Thomas-Fermi-Dirac (TFD) energy functional. Because electron densities vary rather strongly spatially near the nuclei, corrections to the above approximations to T[r] and Eex.Dirac are needed. One of the more commonly used so-called gradient-corrected approximations is that invented by Becke, and referred to as the Becke88 exchange functional: Eex(Becke88) = Eex,Dirac[r] -g Ú x2 r4/3 (1+6 g x si\(\text{NH}^-\)1(x))-1 dr, where x =r-4/3 |r|, and g is a parameter chosen so that the above exchange energy can best reproduce the known exchange energies of specific electronic states of the inert gas atoms (Becke finds g to equal 0.0042). A common gradient correction to the earlier T[r] is called the Weizsacker correction and is given by dTWeizsacker = (1/72)(h/me) Ú |r(r)|2/r(r) dr. Although the above discussion suggests how one might compute the ground-state energy once the ground-state density r(r) is given, one still needs to know how to obtain r. Kohn and Sham (KS) introduced a set of so-called KS orbitals obeying the following equation: {-1/22 + V(r) + e2/2 Ú r(r')/|r-r'| dr' + Uxc(r) }fj = ej fj , where the so-called exchange-correlation potential Uxc (r) = dExc[r]/dr(r) could be obtained by functional differentiation if the exchange-correlation energy functional Exc[r] were known. KS also showed that the KS orbitals {fj} could be used to compute the density r by simply adding up the orbital densities multiplied by orbital occupancies nj : r(r) = Sj nj |fj(r)|2. (here nj =0,1, or 2 is the occupation number of the orbital fj in the state being studied) and that the kinetic energy should be calculated as T = Sj nj . The same investigations of the idealized 'uniform electron gas' that identified the Dirac exchange functional, found that the correlation energy (per electron) could also be written exactly as a function of the electron density r of the system, but only in two limiting cases- the high-density limit (large r) and the low-density limit. There still exists no exact expression for the correlation energy even for the uniform electron gas that is valid at arbitrary values of r. Therefore, much work has been devoted to creating efficient and accurate interpolation formulas connecting the low- and high- density uniform electron gas expressions (see Appendix E in the Parr and Wang book for further details). One such expression is EC[r] = Ú r(r) ec(r) dr, where ec(r) = A/2{ln(x/X) + 2b/Q tan-1(Q/(2x+b)) -bx0/X0 [ln((x-x0)2/X) +2(b+2x0)/Q tan-1(Q/(2x+b))] is the correlation energy per electron. Here x = rs1/2 , X=x2 +bx+c, X0 =x02 +bx0+c and Q=(4c - b2)1/2, A = 0.0621814, x0= -0.409286, b = 13.0720, and c = 42.7198. The parameter rs is how the density r enters since 4/3 prs3 is equal to 1/r; that is, rs is the radius of a sphere whose volume is the effective volume occupied by one electron. A reasonable approximation to the full Exc[r] would contain the Dirac (and perhaps gradient corrected) exchange functional plus the above EC[r], but there are many alternative approximations to the exchange-correlation energy functional. Currently, many workers are doing their best to "cook up" functionals for the correlation and exchange energies, but no one has yet invented functionals that are so reliable that most workers agree to use them. To summarize, in implementing any DFT, one usually proceeds as follows: 1. An atomic orbital basis is chosen in terms of which the KS orbitals are to be expanded. 2. Some initial guess is made for the LCAO-KS expansion coefficients Cj,a: fj = Sa Cj,a ca. 3. The density is computed as r(r) = Sj nj |fj(r)|2 . Often, r(r) is expanded in an atomic orbital basis, which need not be the same as the basis used for the fj, and the expansion coefficients of r are computed in terms of those of the fj . It is also common to use an atomic orbital basis to expand r1/3(r) which, together with r, is needed to evaluate the exchange-correlation functional's contribution to E0. 4. The current iteration's density is used in the KS equations to determine the Hamiltonian {-1/2 2 + V(r) + e2/2 Ú r(r')/|r-r'| dr' + Uxc(r) }whose "new" eigenfunctions {fj} and eigenvalues {ej} are found by solving the KS equations. 5. These new fj are used to compute a new density, which, in turn, is used to solve a new set of KS equations. This process is continued until convergence is reached (i.e., until the fj used to determine the current iteration's r are the same fj that arise as solutions on the next iteration. 6. Once the converged r(r) is determined, the energy can be computed using the earlier expression E [r] = Sj nj + Ú V(r) r(r) dr + e2/2 Ú r(r)r(r')/|r-r'|dr dr'+ Exc[r]. In closing this section, it should once again be emphasized that this area is currently undergoing explosive growth and much scrutiny. As a result, it is nearly certain that many of the specific functionals discussed above will be replaced in the near future by improved and more rigorously justified versions. It is also likely that extensions of DFT to excited states (many workers are actively pursuing this) will be placed on more solid ground and made applicable to molecular systems. Because the computational effort involved in these approaches scales much less strongly with basis set size than for conventional (SCF, MCSCF, CI, etc.) methods, density functional methods offer great promise and are likely to contribute much to quantum chemistry in the next decade. There is a nice DFT web site established by the Arias research group at Cornell devoted to a DFT project involving highly efficient computer implementation within object-oriented programming. f. Efficient and widely distributed computer programs exist for carrying out electronic structure calculations The development of electronic structure theory has been ongoing since the 1940s. At first, only a few scientists had access to computers, and they began to develop numerical methods for solving the requisite equations (e.g., the Hartree-Fock equations for orbitals and orbital energies, the configuration interaction equations for electronic state energies and wavefunctions). By the late 1960s, several research groups had developed reasonably efficient computer codes (written primarily in Fortran with selected subroutines that needed to run especially efficiently in machine language), and the explosive expansion of this discipline was underway. By the 1980s and through the 1990s, these electronic structure programs began to be used by practicing "bench chemists" both because they became easier to use and because their efficiency and the computers' speed grew (and cost dropped) to the point at which modest to large molecules could be studied at reasonable cost and effort. Even with much faster computers, there remain severe bottlenecks to extending ab initio quantum chemistry tools to larger and larger molecules (and to extended systems such as polymers, solids, and surfaces). Two of the most difficult issues involve the two-electron integrals (ci cj |1/r1,2| ck cl ). Nearly all correlated electronic structure methods express the electronic energy E (as well as its gradient and second derivative or Hessian) in terms of integrals taken over the molecular orbitals, not the basis atomic orbitals. This usually then requires that the integrals be first evaluated in terms of the basis orbitals and subsequently transformed from the basis orbital to the molecular orbital representation using the LCAO-MO expansion fj = Sa Cj,a ca. For example, one such step in the transformation involves computing Sa Cj,a (ca cj |1/r1,2| ck cl ) = (fi cj |1/r1,2| ck cl ). Four such one-index transformations must performed to eventually obtain the (fi fj |1/r1,2| fk fl ) integrals. Given a set of M basis orbitals, there are ca. M4/8 integrals (ca cj |1/r1,2| ck cl ). Each one-index transformation step requires ca. M5 calculations (i.e., to form the sum of products such as Sa Cj,a (ca cj |1/r1,2| ck cl ). Hence the task of forming these integrals over the molecular orbitals scales as the fifth power of M. At present, more electronic structure calculations are performed by non-theorists than by practicing theoretical chemists. This is largely due to the proliferation of widely used computer programs. This does not mean that all that needs to be done in electronic structure theory is done. The rates at which improvements are being made in the numerical algorithms used to solve the problems as well as at which new models are being created remain as high as ever. For example, Professor Rich Friesner has developed and Professor Emily Carter has implemented for correlated methods a highly efficient way to replace the list of two-electron integrals (fi fj |1/r1,2| fk fl ), which number N4 , where N is the number of atomic orbital basis functions, by a much smaller list (fi fj |l) from which the original integrals can be rewritten as: (fi fj |1/r1,2| fk fl ) = Sg (fi (g)fj (g)) Ú dr fk(r) fl (r)/|r-g| . A key feature of our theoretical approach is the use of modern renormalization group and multi-scale ideas. These enable us to extend the range of simulation from the simple to the complex, and from the small to the very large. Some current phenomena under study include: (i) Energy and electron transfer in conjugated polymers: specifically photosynthetic carotenoids, optoelectronic polymers, and carbon nanotubes, (ii) Spin couplings in multiple-transition metal systems, including iron-sulfur proteins and molecular magnets. (iii) Lattice models of high Tc superconductors. Electronic Structure of Molecular Magnetism Molecular magnetism is currently a ``hot'' area in chemical physics because of the technological promise of colossal magneto-resistant materials compounds and super-paramagnetic molecules. We are interested in developing a better fundamental understanding of magnetism that will allow us to predict the behavior of systems like these in an ab initio way. For example, one should be able to extract the Heisenberg exchange parameters (even for challenging oxo-bridged transition metal compounds) by simulating the response of the system to localized magnetic fields. We are also interested in extending the commonly used Heisenberg Hamiltonian to include spin orbit interactions in a local manner. This would be useful, for example, if one is interested in assembling a large molecule out of smaller building blocks - by knowing the preferred axis of each fragment one could potentially extract the magnetic axis and anisotropy of the larger compound. Modeling Real-Time Electron Dynamics Ab initio methods tend to focus the lion's share of attention on the description of electronic structure. However, there are a variety of systems where a focus on the electron dynamics is extremely fruitful. On the one hand, there are systems where it is the motion of the electrons that is interesting. This is true, for example, in conducting organic polymers and crystals - where it is charge migration that leads conductivity - and in photosynthetic and photovoltaic systems - where excited state energy transfer determines the efficiency. Also, in a very deep way, dynamic simulations can offer improved pictures of static phenomena. Here, our attention is focused on the fluctuation-dissipation theorem, an exact relation between the static correlation function and the time-dependent response of the system, and on semiclassical techniques, which provide a simple ansatz for approximating quantum results using essentially classical information.

Molecular Dynamics

Molecular and chemical dynamics describes the motions of the atoms within the molecule and the surrounding solvent The collisions among molecules and resulting energy transfers among translational, vibrational, rotational, and electronic modes, as well as chemical reactions that occur intramolecularly or in bi-molecular encounters lie within the realm of molecular and chemical dynamics theory. The vibrational and rotational motions that a molecule's nuclei undergo on any one of the potential energy surfaces \(E_k Q\)) is also a subject for molecular dynamics and provides a logical bridge to the subject of molecular vibration-rotation spectroscopy. 1. Classical Newtonian Dynamics Can Often Be Used For any particular molecule with its electrons occupying one of its particular electronic states, the atomic centers (i.e., the N nuclei) undergo translational, rotational, and vibrational movements. The translational and rotational motions do not experience any forces and are thus "free motions" unless (1) surrounding solvent or lattice species are present or (2) an external electric or magnetic field is applied. Either of the latter influences will cause the translations and rotations to experience potential energies that depend on the location of the molecule's center of mass and the molecule's orientation in space, respectively. In contrast, the vibrational coordinates of a molecule experience forces that result from the dependence of the electronic state energy Ek on the internal coordinates Fi = - dEk({Q})/dQi. By expressing the kinetic energy T for internal vibrational motions and the corresponding potential energy V= Ek({Q}) in terms of 3N-6 internal coordinates {Qk}, classical equations of motion based on the so-called Lagrangian L = T - V: d/dt d[T-V]/di = d[T-V]/dQi can be developed. If surrounding solvent species or external fields are present, equations of motion can also be developed for the three center of mass coordinates R and the three orientational coordinates W . To do so one must express the molecule-solvent intermolecular potential energy in terms of R and W, and the translational and orientational kinetic energies must also be written in terms of the time rates of change of R and W. The equations of motion discussed above are classical. Most molecular dynamics theory and simulations are performed in this manner. When light atoms such as H, D, or He appear, it is often essential to treat the equations of motion that describe their motions (vibrations as well as translations and rotations in the presence of solvent or lattice surroundings) using the Schrödinger equation instead. This is a much more difficult task. a. A collision between an atom and a diatomic molecule Let us consider an example to illustrate the classical and quantum treatments of dynamics. In particular, consider the collision of an atom A with a diatomic molecule BC with all three atoms constrained to lie in a plane. I. The coordinates in which the kinetic energy has no cross terms Here the three atoms have a total of 2N = 6 coordinates because they are constrained to lie in the X,Y plane. The center of mass of the three atoms x = (mA xA + mB xB + mC xC )/M y = (mA yA + mB yB + mC yC )/M requires two coordinates to specify, which leaves 2N-2 = 4 coordinates to describe the internal and overall orientational arrangement of the three atoms. It is common (the reason will be explained below) in such triatomic systems to choose the following specific set of internal coordinates: 1. r, the distance between two of the atoms (usually the two that are bound in the A + BC collision); 2. R, the distance of the third atom (A) from the center of mass of the first two atoms (BC); 3. a and b , two angles between the r and R vectors and the laboratory-fixed X axis, respectively. II. The Hamiltonian or total energy In terms of these coordinates, the total energy (i.e., the Hamiltonian) can be expressed as follows: H = 1/2 m' 2 + 1/2 m 2 + 1/2 m'R22 + 1/.2 m r22 + 1/2 M (2 + 2 ) + V. Here, m is the reduced mass of the BC molecule (m = mBmC/(mB +mc )) M is the mass of the entire molecule M = mA + mB + mC , and m' is the reduced mass of A relative to BC (m' = mAmBC/(mA +mBC )). The potential energy V is a function of R, r, and q = a + b the angle between the r and R vectors. III. The conjugate momenta To eventually make a connection to the quantum mechanical Hamiltonian, it is necessary to rewrite the above H in terms of the coordinates and their so-called conjugate momenta. For any coordinate Q, the conjugate momentum PQ is defined by PQ = L/ . For example, PR = m', Pr = m , Pa = m' R2, Pb = m r2 allow H to be rewritten as H = PR2/2m' + Pr2 /2m + Pa2 /2m'R2 + Pb2 /2mr2 + (PX2 + PY2 )/2M +V. Because the potential V contains no dependence on X or Y, the center of mass motion (i.e., the kinetic energy (PX 2 + PY 2 )/2M) will time evolve in a straight-line trajectory with no change in its energy. As such, it can be removed from further consideration. The total angular momentum of the three atoms about the Z- axis can be expressed in terms of the four remaining coordinates as follows: LZ = m' R2 - m r2 = Pa - Pb . Since this total angular momentum is a conserved quantity (i.e., for each trajectory, the value of LZ remains unchanged throughout the trajectory), one can substitute this expression for Pa and rewrite H in terms of only three coordinates and their momenta: H = PR2 /2m' + pr2 /2m + (LZ -Pb )2 /2m'R2 + Pb2 /2mr2 +V. Notice that Pb is the angular momentum of the BC molecule (Pb = m r2 db/dt). IV. The equations of motion From this Hamiltonian (or the Lagrangian L = T - V), one can obtain, using d/dt d[T-V]/di = d[T-V]/dQi , the equations of motion for the three coordinates and three momenta: dPR /dt = L/R = -V/R - (LZ -Pb )2 /m'R3 , dPr /dt = -V/r - Pb2 /mr3 , dPb /dt = -V/b = -V/q , dR/dt = PR /m', dr/dt = Pr /m , db/dt = Pb /mr2 . V. The initial conditions These six classical equations of motion that describe the time evolution of r, R, b, PR , pr , and Pb can then be solved numerically as discussed earlier. The choice of initial values of the coordinates and momenta will depend on the conditions in the A + BC collision that one wishes to simulate. For example, R will be chosen as very large, and PR will be negative (reflecting inward rather than outward relative motion) and with a magnitude determined by the energy of the collision PR2 /2m' = Ecoll.. If the diatomic BC is initially in a particular vibrational state, say v, the coordinate r will be chosen from a distribution of values given as the square of the v-state's vibrational wavefunction |yv (r)|2 . The value of pr can then be determined within a sign from the energy ev of the v state: pr2 /2m + VBC = ev , where VBC is the BC molecule's potential energy as a function of r. Finally, the angle b can be selected from a random distribution within 02 ]-1/2 exp (-(R-R0 )2 /(42 )). Professor Rick Heller Here, the parameter 2 gives the uncertainty or "spread" along the R- degree of freedom for this wavefunction, defined as the mean squared displacement away from the average coordinate R0 , since it can be shown for the above function that Ú (R-R0)2 yR* (R) yR (R) R2 dR = 2 and that R0 = Ú R yR *(R) yR (R) R2 dR. It can also be shown that the parameter PR0 is equal to the average value of the momentum along the R coordinate: Ú yR (R)*{- i h yR/R } R2 dR = PR0, and that the uncertainty in the momentum along the R coordinate: = Ú yR (R)*{- i h yRR + PR0 yR (R) }2 R2 dR is related to the uncertainty in the R-coordinate by = h2/4. Note that for these coherent state wavefunctions, the uncertainties fulfill the general Heisenberg uncertainty criterion for any coordinate Q and its conjugate momentum P: > h2/4 but have the smallest possible product of uncertainties. In this sense, the coherent state function is the closest possible to a classical description (where uncertainties are absent). II. Time propagation of the wavefunction must be effected- the basis expansion approach The difficult part of the propagation process involves applying the time evolution operator exp (- iHt/h) to this initial wavefunction. The most powerful tools for applying this operator fall under the class of methods termed Feynman path-integral methods (see the links to Professors Greg Voth, Nancy Makri, and Jim Doll). However, let us first examine an approach that has been used for a longer time, for example in the hands of Professors George Schatz and John Light. a. Expanding the wavefunction in a basis If, alternatively, one prefers to attempt to solve the time-independent Schrödinger equation H\(y_k\) = Ek \(y_k\) the most widely used approaches involve expanding the unknown y(R,r,a,b) in a basis consisting of products of functions of R, functions of r, and functions of a and of b: \(y_k\) = Sn,v,L,M Ck (n,v,L,M) Fn (R) yv(r) yL (b) yM (a). This expansion is what one would use in the case of a non-reactive collision. If a chemical reaction (e.g., A + BC Æ AB + C) is to be examined, then one must add to the sum of terms given above another sum which contains products of functions of the so-called exit-channel coordinates (r, t, g, and f). In this case, one uses \(y_k\) = Sn,v,L,M Ck (n,v,L,M) Fn (R) yv(r) yL (b) yM (a) + Sn,v,L,M Ck (n,v,L,M) Fn (r) yv(t) yL (g) yM (f). Of course, the yv , yL, Fn, and yM functions relating to the exit-channel are different than those for the entrance channel (e.g., yv (r) would be a AB-molecule vibrational wavefunction, but yv (r) would be a BC-molecule vibrational function). As will be seen from the discussion to follow shortly, expanding y in this manner and substituting into the Schrödinger equation produces a matrix eigenvalue equation whose eigenvalues are the Ek and the eigenvectors are the expansion coefficients Ck (n,v,L,M). Given any wavefunction y(R,r,a,b;t=0) at t=0 (e.g., the one shown above may be appropriate for a collision of A with a BC molecule in a specified vibrational/rotational state), one can then express this wavefunction at later times as follows: y(R,r,a,b;t) = Sk \(y_k\) exp(-i Ekt/h) \(y_k\) * y(R,r,a,b;t=0) dr . This expression arises when one uses the facts that: 1. the {\(y_k\) } form a complete set of functions so one can expand y(R,r,a,b;t=0) in this set, and 2. the time evolution operator exp (-i Ht/h) can be applied to any eigenfunction \(y_k\) and produces exp (-i Ht/h) \(y_k\) = exp (-i Ekt/h) \(y_k\) . So, it is possible to follow the time development of an initial quantum wavefunction by first solving the time-independent Schrödinger equation for all of the {\(y_k\) } and then expressing the initial wavefunction in terms of the eigenfunctions (i.e, the integral Ú \(y_k\) * y(R,r,a,b;t=0) dr is nothing but the expansion coefficient of y(R,r,a,b;t=0) in terms of the \(y_k\) ) after which the wavefunction at a later time can be evaluated by the expression y(R,r,a,b;t) = Sk \(y_k\) exp(-i Ekt/h) Ú \(y_k\) * y(R,r,a,b;t=0) dr. b. The resulting matrix eigenvalue problem Let us now return to see how the expansion of y in the manner \(y_k\) = Sn,v,L,M Ck (n,v,L,M) Fn (R) yv(r) yL (b) yM (a) (or the generalization in which both entrance- and exit-channel product functions are used) produces a matrix eigenvalue problem. Substituting this expansion into Hy =Ey and subsequently multiplying through by the complex conjugate of one particular Fn' (R) yv' (r) yL' (b) yM' (a)and integrating over R, r, b, and a one obtains a matrix eigenvalue equation: H Ck = Ek Ck in which the eigenvector is the vector of expansion coefficients Ck (n.v,L,M) and the H matrix has elements H(n,v,L,M|n',v',L',M') = < Fn' (R) yv' (r) yL' (b) yM' (a)| H | Fn (R) yv(r) yL (b) yM (a)> given in terms of integrals over the basis functions with the above Hamiltonian operator in the middle. Notice that in writing the integral on the right-hand side of the above equation, the so-called Dirac notation has been used. In this notation, any integral with a wavefunction on the right, a complex conjugated wavefunction on the left, and an operator in the middle is written as follows: ÚY*Opfdq = < Y|Op|f> c. The dimension of the matrix scales as the number of basis functions to number of coordinates power What makes the solution of the four-variable Schrödinger equation much more difficult than the propagation of the four-coordinate (plus four-momenta) Newtonian equations? The Hamiltonian differential operator is non-separable because V depends on R, r, and q (which is related to b and a by q = a + b ) in a manner that can not be broken apart into an R-dependent piece plus an r-dependent piece plus a q-dependent piece. As a result, the four dimensional second-order partial differential equation can not be separated into four second-order ordinary differential equations. This is what makes its solution especially difficult. To make progress solving the four-dimensional Schrödinger equation on a computer, one is thus forced either to 1. expand y (R,r,a,b) as S C(n,v,L,M) Fn (R) yv(r) yL (b) yM (a), and then solve the resultant matrix eigenvalue problem, or to 2. represent y (R,r,a,b) on a grid of points in R, in r, and in a and b space and use finite-difference expressions (such as [F(R+dR) + F(R-dR) -2F(R)]/d2 to represent d2F(R)/dR2 ) to also express the derivatives on this same grid to ultimately produce a sparse matrix whose eigenvalues must be found. In either case, one is left with the problem of finding eigenvalues of a large matrix. In, the former case, the dimension of the H matrix is equal to the product of the number of Fn (R) functions used in the expansion times the number of yv(r) functions used times the number of yL (b) used and times the number of yM (a) functions. In the latter case, the dimension of H is equal to the number of grid points along the R coordinate times the number of grid points along r times the number along b and times the number along a. It is the fact that the dimension of H grows quartically with the size of the basis or grid set used (because there are four coordinates in this problem; for a molecule with N atoms, the dimension of H would grow as the number of grid points or basis functions to the 3N-6 power!) and that the computer time needed to find all of the eigenvalues of a matrix grows as the cube of the dimension of the matrix (to find one eigenvalue requires time proportional to the square of the dimension) that makes the solution of this quantum problem very difficult. Let us consider an example. Suppose that a grid of 100 points along the R-coordinate and 100 points along the r-coordinate were used along with say only 10 points along the a and b-coordinates. This would produce a (albeit very sparse) H matrix of dimension 106 . To find just one eigenvalue of such a large matrix on a 100 Mflop desktop workstation would require of the order of several times (106)2 /(100x106 ) sec, or at least 100 minutes. To find all 106 eigenvalues would require 106 times as long. III. Time propagation of the wavefunction can be effected using Feynman path integrals instead Given a wavefunction at t=0, y(R,r,a,b;t=0), it is possible to propagate it forward in time using so-called path integral techniques that Richard Feynman pioneered and which have become more popular and commonly used in recent years (see links to Professors Greg Voth, Jim Doll, and Nancy Makri). In these approaches, one divides the time interval between t=0 and t=t into N small increments of length t = t/N, and then expresses the time evolution operator as a product of N short-time evolution operators: exp[-i Ht/h ] = {exp[-i Ht/h ]}N . For each of the short time steps t , one then approximates the propagator as exp[-i Ht/h] = exp[-i Vt/2h] exp[-i Tt/h] exp[-i Vt/2h], where V and T are the potential and kinetic energy operators that appear in H = T + V. It is possible to show that the above approximation is valid up to terms of order t4 . So, for short times (i.e., small t ), these so-called symmetric split operator approximations to the propagator should be accurate. The time evolved wavefunction y (t) can then be expressed as y (t) = { exp[-i Vt/2h] exp[-i Tt/h] exp[-i Vt/2h]}N y (t=0). The potential V is (except when external magnetic fields are present) a function only of the coordinates {qj } of the system, while the kinetic term T is a function of the momenta {pj } (assuming cartesian coordinates are used). By making use of the completeness relations for eigenstates of the coordinate operator 1 = dqj| qj> < qj| and inserting this identity N times (once between each power of exp[-i Vt/2h] exp[-i Tt/h] exp[-i Vt/2h]), the expression given above for y (t) can be rewritten as follows: y (qN ,t)= dqN-1 dqN-2 . . . dq1 dq0 exp{(-it/2h)[V(qj) + V(qj-1)]} < qj| exp(-itT / h ) |qj-1>y (q0 ,0). Then, by using the analogous completeness identity for the momentum operator 1 = 1 / h dpj| pj>< pj | one can write < qj| exp(-itT / h ) |qj-1> = 1 / hdp < qj|p > exp(-ip2t / 2mh ) < p|qj-1 >. Finally, by using the fact that the momentum eigenfunctions |p>, when expressed as functions of coordinates q are given by < qj|p > = (1/2p)1/2 exp(ipq/h), the above integral becomes < qj | exp(-itT / h) |qj-1> = 1/2 ph dp exp(-ip2 t / 2mh) exp[ip(qj - qj - 1)]. This integral over p can be carried out analytically to give < qj | exp(-itT / h) |qj-1> =exp[im(qj - qj - 1)2 / 2ht]. When substituted back into the multidimensional integral for y (qN ,t), we obtain y (qN ,t)= dqN-1 dqN-2 . . . dq1 dq0 exp{(-it/2h)[V(qj) + V(qj-1)]} exp[im(qj - qj-1)2 /2ht] y (qo,0) or y (qN ,t) = dqN-1 dqN-2 . . . dq1 dq0 exp{i/h [m(qj - qj-1)2 / 2t - t(V(qj) + V(qj-1))/2]} y (q0 , t=0). Why are such quantities called path integrals? The sequence of positions q0 , q2 , ... qN describes a "path" connecting q0 to qN . By integrating over all of the intermediate positions q1 , q2 ,... qN-1 one is integrating over all such paths. Further insight into the meaning of the above is gained by first realizing that (qj - qj-1)2 = (qj - qj-1)2 t = Tdt is the representation, within the N discrete time steps of length t, of the integral of Tdt over the jth time step, and that [V(qj) + V(qj-1)] = V(q)dt is the representation of the integral of Vdt over the jth time step. So, for any particular path (i.e., any specific set of q0 , q1, , ... qN-1 , qN values), the sum over all N such terms [m(qj - qj-1)2 / 2t - t(V(qj) + V(qj-1))/2] is the integral over all time from t=0 until t=t of the so-called Lagrangian L = T - V: [m(qj - qj-1)2 / 2t - t(V(qj) + V(qj-1))/2] = Ldt. This time integral of the Lagrangian is called the "action" S in classical mechanics. Hence, the N-dimensional integral in terms of which y (qN ,t) is expressed can be written as y (qN ,t) = exp{i / h dt L } y (q0 ,t=0). Here, the notation "all paths" is realized in the earlier version of this equation by dividing the time axis from t=0 to t=t into N equal divisions, and denoting the coordinates of the system at the jth time step by qj . By then allowing each qj to assume all possible values (i.e., integrating over all possible values of qj ), one visits all possible paths that begin at q0 at t=0 and end at qN at t=t. By forming the classical action S S = dtL for each path and then summing exp(iS/h) y ( q0 , t=0) over all paths and multiplying by , one is able to form y (qN ,t). dqN-1 dqN-2 . . . dq1 dq0 exp{i/h [m(qj - qj-1)2 / 2t - t(V(qj) + V(qj-1))/2]} The difficult step in implementing this Feynman path integral method in practice involves how one identifies all paths connecting q0 , t=0 to qN , t. Each path contributes an additive term involving the complex exponential of the quantity [m(qj - qj-1)2 / 2t - t(V(qj) + V(qj-1))/2] This sum of many, many (actually, an infinite number) oscillatory exp(iS/h) = cos (S/h) + i sin(S/h) terms is extremely difficult to evaluate because of the tendency of contributions from one path to cancel those of another path. The evaluation of such sums remains a very active research subject. The most commonly employed approximation to this sum involves finding the path(s) for which the action S= [m(qj - qj-1)2 / 2t - t(V(qj) + V(qj-1))/2] is smallest because such paths produce the lowest frequency oscillations in exp(iS/h), and thus are not subject to cancellation by contributions from other paths. The path(s) that minimize the action S are, in fact the classical paths. That is, they are the paths that the system whose quantum wavefunction is being propagated in time would follow if the system were undergoing classical Newtonian mechanics subject to the conditions that the system be at q0 at t=0 and at qN at t=t. In this so-called semi-classical approximation to the propagation of the initial wavefunction using Feynman path integrals, one finds all classical paths that connect q0 at t=0 and at qN at t=t, and one evaluates the action S for each such path. One then applies the formula y (qN ,t) = exp{i / hdt L } y (q0 ,t=0) but includes in the sum only the contribution from the classical path(s). In this way, one obtains an approximate quantum propagated wavefunction via a procedure that requires knowledge of only classical propagation paths. b. How is information extracted from a quantum dynamics simulation? Once an initial quantum wavefunction has been propagated for a time long enough for the event of interest to have occurred (e.g., long enough for an A + BC Æ AB + C reactive collision to take place in the above example problem), one needs to interpret the wavefunction in terms of functions that are applicable to the final-state. In the example considered here, this means interpreting y(R, r, a, b; t) in terms of exit-channel basis states that describe an intact AB molecule with a C atom moving away from it (i.e., the terms Sn,v,L,M Ck (n,v,L,M) Fn (r) yv(t) yL (g) yM (f)). As explained earlier when the classical trajectory approach was treated, to describe this final-state configuration of the three atoms, one uses different coordinates than R,r,a,b that were used for the initial state. The appropriate coordinates are shown below in the figure. That is, one projects the long-time wavefunction y(R,r,a,b;t) = Sk \(y_k\) exp(-i Ekt/h) Ú \(y_k\) * y(R,r,a,b;t=0) dr onto a particular exit-channel product function Fn (r) yv(t) yL (g) yM (f) to obtain the amplitude A of that exit channel in the final wavefunction: A = < Fn (r) yv(t) yL (g) yM (f)| y(R,r,a,b;t)> = Sk < Fn (r) yv(t) yL (g) yM (f)| \(y_k\)> exp(-iEkt/h) Ú \(y_k\) * y(R,r,a,b;t=0) dr. Using the earlier expansion for \(y_k\) , one sees that < Fn (r) yv(t) yL (g) yM (f)| \(y_k\)> = Ck (n,v,L,M) for the exit channel, so the modulus squared of A, which gives the probability of finding the AB + C system in the particular final state Fn (r) yv(t) yL (g) yM (f), is given by: |A|2 = | Sk Ck (n,v,L,M) exp(-i Ekt/h) Ú \(y_k\) * y(R,r,a,b;t=0) dr |2 . This result can be interpreted as saying that the probability of finding the state Fn (r) yv(t) yL (g) yM (f) is computed by (1) first, evaluating the projection of the initial wavefunction along each eigenstate (these components are the Ú \(y_k\) * y(R,r,a,b;t=0) dr), (2) multiplying by the projection of the specified final wavefunction along each eigenstate (the < Fn (r) yv(t) yL (g) yM (f)| \(y_k\)> = Ck (n,v,L,M)), (3) multiplying by a complex phase factor exp(-i h Ek t) that details how each eigenstate evolves in time, and (4) summing all such products after which (5) taking the modulus squared of the entire sum. 3. Present Day Challenges in Chemical Dynamics In addition to the people specifically mentioned in this text for whom I have already provided web links, I encourage you to look at the following web pages for further information: a. Large Biomolecules and Polymers Following the motions of large molecules even using classical Newton equations involves special challenges. First, there is the fact that large molecules contain more atoms, so the Newtonian equations that must be solved involve many coordinates (N) and momenta. Because the potential energy functions,and thus the Newtonian forces, depend on the relative positions of the atoms, of which there are N(N-1)/2 , the computer time needed to carry out a classical trajectory varies as (at least) the square of the number of atoms in the molecule. Moreover, to adequately sample an ensemble of initial coordinates and momenta appropriate to a large molecule, one needs to run many trajectories. That is, one needs to choose a range initial values for each of the molecule's internal coordinates and momenta; the number of such choices is proportional to N. The net result is that computer time proportional to N3 (or worse) is needed,and this becomes a major challenge for large biomolecules or polymers. One way this problem is being addressed is to use parallel computers which allow one to run trajectories with different initial conditions on different processors. Another problem that is special to large molecules involves the force field itself (see, for example, the web pages of Professors Andy McCammon, Charles Brooks and the late Peter Kollman, as well as of Biosym and CHARMm). Usually, the potential energy is expressed as a sum of terms involving: 1. bond stretching potentials of either harmonic V(r) = 1/2 k (r-re)2 or Morse form V(r) = De (1-exp(-b(r-re)))2 2. bond angle bending potentials of harmonic form depending on the angles between any three atoms A, B, C that are bonded to one another (e.g., the H-C-H angles in -CH2 - groups) 3. dihedral angles (e.g., the H-C-C-H angle in -H2C-CH2- groups 4. Van der Waals interactions V(r) = Ar-12 - Br-6 among all pairs of atoms (these potentials allow for repulsions among, for example, the H atoms of a methyl group in H3C-CH2-CH2-CH2-CH2-C H3 to repel the other H atoms as this "floppy" molecule moves into geometries that bring such groups into strong steric interaction) 5. electrostatic attraction (for opposite charges) and repulsions (for like charges) among charged or highly polar groups (e.g., between pairs of phosphate groups in a biopolymer, between a Na+ ion and a phosphate group, or between a charged group and a neighboring H2O molecule) 6. polarization of the electronic clouds of the large solute molecule or of surrounding solvent molecules induced by ionic or highly polar groups approaching highly polarizable parts of the molecule. One of the earliest pioneers in applying fundamental statistical mechanics, molecular mechanics, and solvation modelling theories to bio-molecules is Professor Arieh Warshel (below). His groups more recent interests include the dynamics of photobiological processes, enzyme catalysis and protein action, as well as the study of basic chemical reactions in solutions. The primary difficulties in using such a multi-parameterized force field are 1. that the values of the parameters characterizing each kind of potential have to be obtained either by requiring the results of a simulation to "fit" some experimental data or by extracting these parameters from ab initio quantum chemical calculations of the interaction potentials on model systems; 2. these developments and calibrations of the parameters in such force fields are still undergoing modifications and improvements, so they are not yet well established for a wide range of functional groups, solvents, ionic strengths, etc. One more difficulty that troubles most large molecule simulations is the wide range of time scales that must be treated when solving the Newton equations. For example, in attempting to monitor the uncoiling of a large biomolecule, which may occur over a time scale of ca. 10-5 to 1 sec, one is usually forced to "freeze" the much faster motions that occur (i.e., to treat the C-H, O-H, and N-H bonds, which oscillate over ca.10-14 sec, as rigid). If one were to attempt to follow the faster motions, one would have to take time steps in solving the Newton equations of ca. 10-14 sec, as a result of which the uncoiling event would require 109 to 1014 time steps. Even with a computer that could perform 109 floating point operations per second (i.e., a one gigaflop computer), and assuming that a single time step would use at least 1000 floating point operations (which is very optimistic), a single such trajectory would use 103 to 108 seconds of computer time. This is simply unrealistic. As a result, one is forced to freeze (i.e., by constraining to fixed geometries) the faster motions so that the Newton equations can be solved only for the slower motions. It is an area of current research development to invent new methods that allow such multiple time scale issues to be handled in a more efficient and accurate manner. b. Mixed Classical and Quantum Dynamics One of the most pressing issues in chemical reaction dynamics involves how to treat a very large molecular system that is undergoing a chemical reaction or a photochemical event (i.e., a change that affects its electronic structure). Because the system's bonding and other attributes of its electronic wavefunction are changing during such a process, one is required to use quantum mechanical methods. However, such methods are simply impractical (because their computer time, memory, and disk space needs scale as the number of electrons in the system to the fourth (or higher) power) to use on very large molecules. If the chemical reaction and/or photon excitation is localized within a small portion of the molecular system (that may involve solvent molecules too), there are so-called quantum mechanics/molecular mechanics (QMMM) methods currently under much development that can be employed. In these methods, one uses an explicit quantum chemical (ab initio or semi-empirical) method to handle that part of the system that is involved in the electronic process. For example, in treating the tautomerization of formamide H2 N-CHO to formamidic acid HN=CHOH in aqueous solution, one must treat the six atoms in the formamide explicitly using quantum chemistry tools. The surrounding water molecules (H2O)n could then be handled using a classical force field (i.e., the formamide-water and water-water interaction potentials as well as the internal potential energy function of each H2O molecule could be expressed as a classical potential like those described earlier in this section). If, on the other hand, one wished to treat one or more of the H2O solvent molecules as actively involved in promoting the tautomerization (as shown in the figure below) one would use quantum chemistry tools to handle the formamide plus the actively involved water molecule(s), and use classical potentials to describe the interactions of the remaining solvent molecules with these active species and among one another. At present, much effort is being devoted to developing efficient and accurate ways to combine quantum treatment of part of the system with classical treatment of the remainder of the system. Professor Chris Cramer has been very active developing new ways to model the influence of surrounding solvent molecules on chemical species whose spectroscopy or reactions one wishes to study. c. The Car-Parrinello Method As discussed earlier, it is often important to treat the electronic degrees of freedom, at least for the electrons involved in a chemical reaction or in a photon absorption or emission event, at a quantum level while still, at least for practical reasons, treating the time evolution of the nuclei classically. A novel approach developed by Car and Parrinello, and reviewed very nicely by Remler and Madden, incorporates the task of optimizing the electronic wavefunction into the Newtonian dynamics equations used to follow the nuclear movements. The LCAO-MO coefficients Ci,k relating molecular orbital fi to atomic orbital ck are assumed to minimize an energy function E[{Ci,k}] which can be a Hartree-Fock or DFT-type function, for example. At each time step during which the Newtonian equations ma d2Xa /dt2 = - E/Xa are propagated, a corresponding propagation equation is used to follow the time evolution of the {Ci,k } coefficients. The latter equation was put forth by Car and Parrinello by defining a (fictitious) kinetic energy T = Si, k 1/2 m |dCi,k/dt|2 in which the time derivatives of the Ci,k coefficients are viewed as analogs to the time rate of changes of the molecule's Cartesian coordinates {Xa}, and m is a (fictitious mass introduced to give this expression the units of energy. Then, using the dependence of the electronic energy E on the {Ci,k } to define the potential energy V = E[{Ci,k}], and then introducing a (fictitious) Lagrangian L = T-V whose classical action S = Ú L dt is made stationary with respect to variations in the {Ci,k } coefficients but subject to the constraint that the orbitals {fi} remain orthonormal: di,j = S k,l C*i,k Sk,l Cj,l the following dynamical equations are obtained for the Ci,k coefficients: m d2Ci,k /dt2 = -E/Ci,k - Sj,l li,j Sk,. Cj,l . The left hand side of this equation gives the mass m multiplied by the "acceleration" of the Ci,k variable. The right side gives the "force" -E/Ci,k acting on this variable, and - Sj,l li,j Sk,. Cj,l is the coupling force that connects Ci.k to the other Cj,l variables (this term arises, with its Lagrange multipliers li,j , from the constraints di,j = S k,l C*i,k Sk,l Cj,l relating to the orthonormality of the molecular orbitals. The result is that the time evolution of the Ci,k coefficients can be followed by solving an equation that is exactly the same in form as the Newtonian equations ma d2Xa /dt2 = - E/Xa used to follow the nuclear positions. Of course, one wonders whether the idea used by Car and Parrinello to obtain the classical-like equations for the time development of the Ci,k coefficients is valid. These equations were obtained by making stationary, subject to orthonormality constraints, an action. However, if one were to set the mass m equal to zero, the Lagrangian L would reduce to L = T-V = -V, since the (fictitious) kinetic energy would vanish. Since the Ci,k coefficients, in a conventional quantum chemical study, would be chose to make E[{Ci.k }] stationary, it can be shown that making the action stationary when T vanishes is equivalent to making V stationary, which, in turn, makes E[{Ci,k}] stationary. So, the Car-Parrinello approach is valid, but one has to be aware of the restriction to the ground-state (since E[{Ci,k }] is made stationary with no constraint of orthogonality to lower-energy states) and one must realize that finite-time-step propagations with non-zero m render the {Ci,k} amplitudes obtained via classical propagation not equivalent to those obtained by minimizing E at each geometry. Before closing this section on dynamics, I wish to bring to the readers' attention several other leading researchers in this exciting area of theoretical chemistry by showing photos of them below.

Statistical Mechanics

Statistical mechanics provides the framework for studying large collections of molecules and tells us how to average over positions and velocities to properly simulate the laboratory distribution of molecules When dealing with a sample containing a large number (e.g., billions) of molecules, it is impractical to attempt to monitor the time evolution of the coordinates and momenta (or analogous quantum state populations) of the individual constituent molecules. Especially for systems at or near equilibrium, these properties of individual molecules may vary irregularly on very short time scales (as the molecules undergo frequent collisions with neighbors), but the average behavior (e.g, average translational energy or average population of a particular vibrational energy level) changes much more gently with time. Such average properties can be viewed as averages for any one molecule over a time interval during which many collisions occur or as averages over all the molecules in the sample at any specific time. By focusing on the most probable distribution of the total energy available to a sample containing many molecules and by asking questions that relate to the average properties of the molecules, the discipline of statistical mechanics provides a very efficient and powerful set of tools (i.e., equations) for predicting and simulating the properties of such systems. In a sense, the machinery of statistical mechanics allow one to describe the "most frequent" behavior of large molecular systems; that is how the molecules are moving and interacting most of the time. Fluctuations away from this most probable behavior can also be handled as long as these fluctuations are small. 1. The framework of statistical mechanics provides efficient equations for computing thermodynamic properties from molecular properties a. The Boltzmann population equation The primary outcome of asking what is the most probable distribution of energy among a large number N of molecules within a container of volume V that is maintained at a specified temperature T is the most important equation in statistical mechanics, the Boltzmann population formula: Pj = Wj exp(- Ej /kT)/Q, where Ej is the energy of the jth quantum state of the system (which is the whole collection of N molecules), T is the temperature in K, Wj is the degeneracy of the jth state, and the denominator Q is the so-called partition function: Q = Sj Wj exp(- Ej /kT). The classical mechanical equivalent of the above quantum Boltzmann population formula for a system with M coordinates (collectively denoted q and M momenta denoted p) is: P(q,p) = h-M exp (- H(q, p)/kT)/Q, where H is the classical Hamiltonian, and the partition function Q is Q = h-M Ú exp (- H(q, p)/kT) dq dp . b. The limit for systems containing many molecules Notice that the Boltzmann formula does not say that only those states of a given energy can be populated; it gives non-zero probabilities for populating all states from the lowest to the highest. However, it does say that states of higher energy Ej are disfavored by the exp (- Ej /kT) factor, but if states of higher energy have larger degeneracies Wj (which they usually do), the overall population of such states may not be low. That is, there is a competition between state degeneracy, which tends to grow as the state's energy grows, and exp (-Ej /kT) which decreases with increasing energy. If the number of particles N is huge, the degeneracy W grows as a high power (say M) of E because the degeneracy is related to the number of ways the energy can be distributed among the N molecules; in fact, M grows at least as fast as N. As a result of W growing as EM , the product function P(E) = EM exp(-E/kT) has the form shown below (for M=10). By taking the derivative of this function P(E) with respect to E, and finding the energy at which this derivative vanishes, one can show that this probability function has a peak at E* = MkT, and that at this energy value, P(E*) = (MkT)M exp(-M), By then asking at what energy E' the function P(E) drops to exp(-1) of this maximum value P(E*) P(E') = exp(-1) P(E*), one finds E' = MkT (1+ ). So the width of the P(E) graph, measured as the change in energy needed to cause P(E) to drop to exp(-1) of its maximum value divided by the value of the energy at which P(E) assumes this maximum value is (E'-E*)/E* =. This width gets smaller and smaller as M increases. So, as the number N of molecules in the sample grows, which causes M to grow as discussed earlier, the energy probability functions becomes more and more sharply peaked about the most probable energy E*. It is for the reasons just shown that for so-called macroscopic systems, in which N (and hence M) is extremely large (i.e., 10L with L being ca. 10-24), only the most probable distribution of the total energy among the N molecules need be considered that the equations of statistical mechanics are so useful. Certainly, there are fluctuations (as evidenced by the finite width of the above graph) in the energy content of the molecular system about its most probable value. However, these fluctuations become less and less important as the system size (i.e., N) becomes larger and larger. c. The connection with thermodynamics What are some of these equations? The first is the fundamental Boltzmann population formula shown earlier: Pj = Wj exp(- Ej /kT)/Q. Using this result, it is possible to compute the average energy of a system = Sj Pj Ej , and to show that this quantity can be recast (try to do this derivation; it is not that difficult) as = kT2 (lnQ/T)N,V . If the average pressure

is defined as the pressure of each quantum state pj = (Ej /V)N multiplied by the probability Pj for accessing that quantum state, summed over all such states, one can show that

= Sj (Ej /V)N Wj exp(- Ej /kT)/Q = kT(lnQ/V)N,T . Without belaboring the point much further, it is possible to express all of the usual thermodynamic quantities in terms of the partition function Q. The average energy and average pressure are given above; the average entropy is given as = k lnQ + kT(lnQ/N)V,T . So, if one were able to evaluate the partition function Q for N molecules in a volume V at a temperature T, either by summing the quantum-state degeneracy and exp(-Ej/kT) factors Q = Sj Wj exp(- Ej /kT) or by evaluating the classical phase-space integral (phase space is the collection of coordinates and conjugate momenta) Q = h-M Ú exp (- H(q, p)/kT) dq dp, one could then compute all thermodynamic properties of the system. This is the essence of how statistical mechanics provides the tools for connecting the molecule-level properties, which ultimately determine the Ej and the Wj, to the macroscopic properties such as , ,

, etc. 2. Statistical mechanics gives equations for probability densities in coordinate and momentum space Not only is statistical mechanics useful for relating thermodynamic properties to molecular behavior but it is also necessary to use in molecular dynamics simulations. When one attempts, for example, to simulate the reactive collisions of an A atom with a BC molecule to produce AB + C, it is not appropriate to consider a single classical or quantal collision between A and BC. Why? Because in any laboratory setting, 1. The A atoms are moving toward the BC molecules with a distribution of relative speeds. That is, within the sample of molecules (which likely contains 1010 or more molecules), some A + BC pairs have low relative kinetic energies when they collide, and others have high kinetic energies. There is a probability distribution P(EKE ) for this relative kinetic energy. 2. The BC molecules are not all in the same rotational (J) or vibrational (v) state. There is a probability distribution function P(J,v) describing the fraction of BC molecules that are in a particular J state and a particular v state. 3. When the A and BC molecules collide with a relative motion velocity vector v, they do not all hit "head on". Some collisions have small impact parameter b (the closest distance from A to the center of mass of BC if the collision were to occur with no attractive or repulsive forces), and some have large b-values (see below). The probability function for these impact parameters is P(b) = 2p b db, which is simply a statement of the geometrical fact that larger b-values have more geometrical volume element than smaller b-values. So, to simulate the entire ensemble of collisions that occur between A atoms and BC molecules in various J, v states and having various relative kinetic energies EKE and impact parameters b, one must: 1. run classical trajectories (or quantum propagations) for a large number of J, v, EKE , and b values, 2. with each such trajectory assigned an overall weighting (or importance) of Ptotal = P(EKE ) P(J,v) 2pb db. 3. Present Day Challenges in Statistical Mechanics In addition to the scientists whose work is discussed explicitly below or later in this section, the reader is encouraged to visit the following web sites belonging to other leaders in the area of statistical mechanics. I should stress that the "dividing line" between chemical dynamics and statistical mechanics has become less clear in recent years, especially as dynamics theory has become more often applied to condensed-media systems containing large numbers of molecules. For this reason, one often finds researchers who are active in the chemical dynamics community producing important work in statistical mechanics and vice versa. One of the most active research areas in statistical mechanics involves the evaluation of so-called time correlation functions. The correlation function C(t) is defined in terms of two physical operators A and B, a time dependence that is carried by a Hamiltonian H via exp(-iHt/h), and an equilibrium average over a Boltzmann population exp(-bH)/Q. The quantum mechanical expression for C(t) is C(t) = Sj exp(-bEj)/Q, while the classical mechanical expression is C(t) = Ú dq Ú dp A(q(0),p(0)) B(q(t),p(t)) exp(-bH(q(0),p(0)))/Q, where q(0) and p(0) are the values of all the coordinates and momenta of the system at t=0 and q(t) and p(t) are their values, according to Newtonian mechanics, at time t. An example of a time correlation function that relates to molecular spectroscopy is the dipole-dipole correlation function: \(C(t) = S_j \ exp(-bEj)/Q, for which A and B are both the electric dipole interaction e•m between the photon's electric field and the molecule's dipole operator. The Fourier transform of this particular C(t) relates to the absorption intensity for light of frequency w: I(w) µÚ dt C(t) exp(iwt). The computation of correlation functions involves propagating either wavefunctions or classical trajectories. In the quantum case, one faces many of the same difficulties discussed earlier in Sec. III.B.2. However, one aspect of the task faced in the equilibrium averaging process that is also included in C(t) makes this case somewhat easier than in the quantum dynamics case. To illustrate, consider the time propagation issue contained in the quantum definition of C(t). One is faced with 1. propagating |yj > from t=0 up to time t, using exp(-iHt/h) |yj > and then multiplying by B 2. propagating A+ |yj > from t=0 up to time t, using exp(-iHt/h)A+ |yj >. The exp(-bH) operator can be combined with the first time propagation step as follows: exp(-iHt/h) |yj > exp(-bEj)/Q = exp(-iHt/h) exp(-bH) |yj >/Q =exp(-i/h[t+bh/i]H) |yj>/Q. Doing so introduces a propagation in complex time from t = 0 +bh/i to t = t + bh/i. The Feynman path integral techniques can now be used to carry out this propagation. One begins, as before, by dividing the time interval into N discrete steps exp[-i Ht/h ] = \(e^{-i\frac{H t}{Nh}}\)N . and then utilizing the same kind of short-time split propagator methodology described earlier in our discussion of quantum dynamics and Feynman path integrals. Unlike the real-time propagation case, which is plagued by having to evaluate sums of oscillatory functions, the complex-time propagations that arise in statistical mechanics (through the introduction of t = t + bh/i) are less problematic. That is, the quantity exp[-i Ejt/Nh ] = exp[-i Ejt/Nh ] exp[- Ejb/N] contains an exponential "damping factor" exp[- Ejb/N] in the complex-time case that causes the evaluation of correlation functions to be less difficult than the evaluation of real-time wavefunction propagation. For a good overview of how time correlation functions relate to various molecular properties, I urge you to look at McQuarrie's text book on Statistical Mechanics. Other areas of active research in statistical mechanics tend to involve systems for which the deviations from "ideal behavior" (e.g., dilute gases whose constituents have weak intermolecular forces, nearly harmonic highly ordered solids, weakly interacting dilute liquid mixtures, species adsorbed to surfaces but not interacting with other adsorbed species, highly ordered polymeric materials) are large. Such systems include glasses, disordered solids, polymers that can exist in folded or disentangled arrangements, concentrated electrolyte soltions, solvated ions near electrode surfaces, and many more. In addition to those scientists already mentioned above, below, I show several of the people who are pursuing research aimed at applying statistical mechanics to such difficult problems.

Subdisciplines of Theoretical Chemistry

Electronic Structure Theory

A. Electronic structure theory describes the motions of the electrons and produces energy surfaces

Molecular Dynamics

Statistical Mechanics