Skip to main content
. 2020 Sep 17;11:4691. doi: 10.1038/s41467-020-18282-2

Table 1.

Comparison of the quantum mechanics (QM) and machine learning (ML) formalism.

QM ML
Consider an archetypal case, which does not reduce the generality, a solid with one orbital i per atomic site i. In tight-binding formalism using the hopping integrals tij, the probability of transition from the orbital i to orbital j, the Hamiltonian reads: H=i,jtijij. The energy levels of the system are the eigenvalues of Schrödinger equation: Hm=ϵmm. We are interested in the estimation of the local energy ϵi associated with the atom i. Consider that we have learned the sample covariance matrix Σb of M data points, xmRD. The data are centred to mean zero. The mth element of the descriptor space can be written in an initial basis as xm=xmii. The eigenelement of Σb is {λm,vm}. We are interested in the statistical distance di of the data point xi.
H=i,jtijij (t.1) Σb=i,j1M1mMxmixmjij (t.2)
H=mϵmmm (t.3) Σb=mλmvmvm (t.4)
E = ∑mdϵn(ϵ)ϵδ(ϵ − ϵm) (t.5) Tr(Σb) = ∑mdλλδ(λ − λm) (t.6)
ρi(ϵ)=mim2δ(ϵϵm) (t.7) ρi(λ)=mxivm2δ(λλm) (t.8)
ϵi=dϵρi(ϵ)ϵn(ϵ) (t.9) di2=dλρi(λ)1λ (t.10)
p(ϵi)exp(βϵi)forβ0 (t.11) p(xi)exp(di2/2) (t.12)

Commonly used quantum mechanics (QM) formalism of local energies (on left) is compared with ML formalism of sample covariance matrix and statistical distances (on right). To emphasize the similarities between the two approaches, we adopt the QM bra-ket notation for statistical distance. The data points of the descriptor space are the ket vectors x=xRD×1, whereas the bra vectors are the transposed vectors x=xTR1×D. ρi and ρλ are the local density of states and variance, respectively, for the state i / data point xi. p(ϵi) is the probability of the state i in the limit of high temperature, where the Fermi-Dirac distribution becomes classical Boltzmann distribution. p(xi) is the marginal likelihood of the data point xi.