. 2020 Sep 17;11:4691. doi: 10.1038/s41467-020-18282-2

Table 1.

Comparison of the quantum mechanics (QM) and machine learning (ML) formalism.

QM		ML
Consider an archetypal case, which does not reduce the generality, a solid with one orbital $∣i⟩$ per atomic site i. In tight-binding formalism using the hopping integrals t_ij, the probability of transition from the orbital $∣i⟩$ to orbital $∣j⟩$ , the Hamiltonian reads: $H = \sum_{i, j} t_{i j} ∣i⟩ ⟨j∣$ . The energy levels of the system are the eigenvalues of Schrödinger equation: $H ∣m⟩ = ϵ_{m} ∣m⟩$ . We are interested in the estimation of the local energy $ϵ_{i_{⋆}}$ associated with the atom i_⋆.		Consider that we have learned the sample covariance matrix Σ_b of M data points, $x_{m} \in R^{D}$ . The data are centred to mean zero. The m^th element of the descriptor space can be written in an initial basis as $∣ x_{m} ⟩ = \sum x_{m}^{i} ∣ i ⟩$ . The eigenelement of $Σ_{b}$ is ${λ_{m}, ∣ v_{m} ⟩}$ . We are interested in the statistical distance $d_{i_{⋆}}$ of the data point $∣ x_{i_{⋆}} ⟩$ .
$H = \sum_{i, j} t_{i j} ∣i⟩ ⟨j∣$	(t.1)	$Σ_{b} = \sum_{i, j} (\frac{1}{M - 1} \sum_{m}^{M} x_{m}^{i} x_{m}^{j}) ∣i⟩ ⟨j∣$	(t.2)
$H = \sum_{m} ϵ_{m} ∣m⟩ ⟨m∣$	(t.3)	$Σ_{b} = \sum_{m} λ_{m} ∣v_{m}⟩ ⟨v_{m}∣$	(t.4)
E = ∑_m∫dϵn(ϵ)ϵδ(ϵ − ϵ_m)	(t.5)	Tr(Σ_b) = ∑_m∫dλλδ(λ − λ_m)	(t.6)
$ρ_{i_{⋆}} (ϵ) = \sum_{m} {∣⟨ i_{⋆} ∣ m ⟩∣}^{2} δ (ϵ - ϵ_{m})$	(t.7)	$ρ_{i_{⋆}} (λ) = \sum_{m} ∣ ⟨ x_{i_{⋆}} ∣ v_{m} ⟩ ∣^{2} δ (λ - λ_{m})$	(t.8)
$ϵ_{i_{⋆}} = \int d ϵ ρ_{i_{⋆}} (ϵ) ϵ n (ϵ)$	(t.9)	$d_{i_{⋆}}^{2} = \int d λ ρ_{i_{⋆}} (λ) \frac{1}{λ}$	(t.10)
$p (ϵ_{i_{⋆}}) \propto \exp (- β ϵ_{i_{⋆}}) for β \to 0$	(t.11)	$p (x_{i_{⋆}}) \propto \exp (- d_{i_{⋆}}^{2} / 2)$	(t.12)

Commonly used quantum mechanics (QM) formalism of local energies (on left) is compared with ML formalism of sample covariance matrix and statistical distances (on right). To emphasize the similarities between the two approaches, we adopt the QM bra-ket notation for statistical distance. The data points of the descriptor space are the ket vectors $∣x⟩ = x \in R^{D \times 1}$ , whereas the bra vectors are the transposed vectors $⟨x∣ = x^{T} \in R^{1 \times D}$ . $ρ_{i_{⋆}}$ and ρ_λ are the local density of states and variance, respectively, for the state $∣i_{⋆}⟩$ / data point $∣ x_{i_{⋆}} ⟩$ . $p (ϵ_{i_{⋆}})$ is the probability of the state $∣i_{⋆}⟩$ in the limit of high temperature, where the Fermi-Dirac distribution becomes classical Boltzmann distribution. $p (x_{i_{⋆}})$ is the marginal likelihood of the data point $∣ x_{i_{⋆}} ⟩$ .