Deep elastic strain engineering of bandgap through machine learning

Zhe Shi; Evgenii Tsymbalov; Ming Dao; Subra Suresh; Alexander Shapeev; Ju Li

doi:10.1073/pnas.1818555116

. 2019 Feb 15;116(10):4117–4122. doi: 10.1073/pnas.1818555116

Deep elastic strain engineering of bandgap through machine learning

Zhe Shi ^a,^b,¹, Evgenii Tsymbalov ^c,¹, Ming Dao ^a, Subra Suresh ^d,², Alexander Shapeev ^c,², Ju Li ^a,^b,²

PMCID: PMC6410806 PMID: 30770444

Significance

Deforming a material to a large extent without inelastic relaxation can result in unprecedented properties. However, the optimal deformation state is buried within the vast continua of choices available in the strain space. Here we advance a unique and powerful strategy to circumvent conventional trial-and-error methods, and adopt artificial intelligence techniques for rationally designing the most energy-efficient pathway to achieve a desirable material property such as the electronic bandgap. The broad framework for tailoring any target figure of merit, for any material using machine learning, opens up opportunities to adapt elastic strain engineering of properties and performance in devices and systems in a controllable and efficient manner, for potential applications in microelectronics, optoelectronics, photonics, and energy technologies.

Keywords: electronic band structure, bandgap engineering, first-principles calculation, neural network, semiconductor materials

Abstract

Nanoscale specimens of semiconductor materials as diverse as silicon and diamond are now known to be deformable to large elastic strains without inelastic relaxation. These discoveries harbinger a new age of deep elastic strain engineering of the band structure and device performance of electronic materials. Many possibilities remain to be investigated as to what pure silicon can do as the most versatile electronic material and what an ultrawide bandgap material such as diamond, with many appealing functional figures of merit, can offer after overcoming its present commercial immaturity. Deep elastic strain engineering explores full six-dimensional space of admissible nonlinear elastic strain and its effects on physical properties. Here we present a general method that combines machine learning and ab initio calculations to guide strain engineering whereby material properties and performance could be designed. This method invokes recent advances in the field of artificial intelligence by utilizing a limited amount of ab initio data for the training of a surrogate model, predicting electronic bandgap within an accuracy of 8 meV. Our model is capable of discovering the indirect-to-direct bandgap transition and semiconductor-to-metal transition in silicon by scanning the entire strain space. It is also able to identify the most energy-efficient strain pathways that would transform diamond from an ultrawide-bandgap material to a smaller-bandgap semiconductor. A broad framework is presented to tailor any target figure of merit by recourse to deep elastic strain engineering and machine learning for a variety of applications in microelectronics, optoelectronics, photonics, and energy technologies.

Nanostructured materials can withstand extremely large deformation without mechanical relaxation or failure compared with their conventional counterparts, opening up a vast parameter space for rational engineering of material properties by tensorial elastic strain. The electronic, optical, thermal, and chemical properties of crystals are functions of the six-dimensional elastic strain tensor $ε$ ( $ε_{1} \equiv ε_{11}$ , $ε_{2} \equiv ε_{22}$ , $ε_{3} \equiv ε_{33}$ , $ε_{4} \equiv ε_{23}$ , $ε_{5} \equiv ε_{13}$ , $ε_{6} \equiv ε_{12}$ following the so-called Voigt notation), which provides a continuously tunable set of variables analogous to the chemical composition of a seven-element alloy. Electronic bandgap $E_{g}$ opens or closes with $ε$ , resulting in drastic alteration of the electrical, thermal, optical, and magnetic characteristics (1). With the proliferation of ultrastrength nanostructured materials that can sustain a wide range of nonhydrostatic and potentially dynamically varying stresses (2), and various miniaturization-enabled means of applying $ε$ (3), a historical window of opportunity has now opened up to scan a vast unexplored space for the development of materials and devices with desirable combinations of physical and functional properties (4). For example, while it is well known that unstrained Si has an electronic bandgap of 1.1 eV, we know that, when subjected to an equibiaxial strain of 5%, it would have a different bandgap. Furthermore, a 5% tensile strain on Si would produce a different bandgap from a 5% shear strain. At large strains, all these differently strained pure Si crystals would not behave as the unstrained “typical silicon.” An added benefit is that with strain engineering, it is in principle possible to dynamically change the mechanical actuation, and switch between these differently strained materials, something that bandgap engineering by chemical means such as molecular beam epitaxy cannot accomplish. Not only the value of $E_{g}$ , but also its character (e.g., direct or indirect), and the topological features of a band structure can be changed with $ε$ before the ideal strain surface [a five-dimensional (5D) surface] $f (ε_{1}, ε_{2}, ε_{3}, ε_{4}, ε_{5}, ε_{6}) = 0$ in six dimensions (6D) is reached (5).

Over the past two decades, elastic strain engineering (ESE) has achieved one substantial commercial success (6): strained silicon technology, where a biaxial elastic strain of the order of 1% applied to a thin channel of silicon enhances the mobility of charge carriers by more than 50% and increases central processing unit (CPU) clock speed correspondingly. Recent studies have shown that nanowires of silicon can sustain a tensile elastic strain of as much as 16% (7), while nanoscale needles of diamond can be bent to a local maximum tensile elastic strain in excess of 9% (8). As we show in this paper, if we are able to exploit the ability of Si and C to deform up to strains of these magnitudes under certain conditions, there exist much greater possibilities than what is currently realized for engineering of band structure and bandgap for a wide variety of electronic, optoelectronic, and photonic materials employed in communication, information, and energy applications that impact every aspect of modern life (9).

ESE seeks to identify metastable states of matter for optimizing functional properties and performance. A strained material is in a state of higher energy than when it is in a stress-free state, characterized by the strain-energy density $h$ which is measured in units of meV/ $Å^{3}$ . Therefore, addressing the following question is at the heart of ESE: What is the energy cost $(h)$ to achieve the desired property change? Consider the challenges of reducing the bandgap of Si from 1.1 eV in its stress-free state to 0 eV in a metal-like state, or converting diamond from an ultrawide-bandgap material into a wide or even medium-bandgap material so that the full potential of its many appealing characteristics for microelectronics and optoelectronics could be realized. To achieve the above transitions in the most efficient manner, it is important to design $ε$ through the most optimal combination of its normal and shear components.

To address the foregoing question, we resort to deep ESE which exploits the latest advances in artificial intelligence and multiscale modeling. To set the scene, consider a situation where it is desirable to examine all possible combinations of the components of $ε$ , over a range of potential interest, say between −10 and +10% in each strain component. Here, say that the objective is to determine the least energetically expensive route to alter the bandgap of a material by a desired amount. Although ab initio calculations such as those involving many-body corrections can provide accurate energy-band results, the scope of such calculations is somewhat limited to about 1,000 strain points because of high computational cost. On the other hand, by discretizing $ε$ with a regular grid comprising 20 nodes separated at each 1% strain interval over the strain range of −10 to +10%, the computational model would entail about $10^{8}$ band structures, up to five orders of magnitude higher computational requirement than what can be reasonably achieved presently. To overcome these difficulties, we present here a general method that combines machine learning (ML) and ab initio calculations to identify pathways to ESE. This method invokes artificial neural networks (NNs) to predict, to a reasonable degree of accuracy, material properties as functions of the various input strain combinations on the basis of only a limited amount of data. We also demonstrate the potential of our method for bandgap engineering with specific calculations for perfect crystals of Si and diamond. These two materials bookend the wide spectrum of current possibilities and potential opportunities for optimizing the performance of semiconductor materials and devices. Si, on the one hand, represents the most widely used and commercially successful semiconductor material. Diamond, on the other hand, represents the most appealing ultrawide-bandgap material due to its extremely high thermal conductivity and hardness, high electron/hole mobilities and saturation drift velocities, and breakdown field (10). Tuning bandgap, and more broadly the band structure, through deep ESE provides opportunities for tapping into the many appealing figures of merit for device performance of any material. Moreover, we choose Si, the most versatile electronic material, to demonstrate that our ML machinery is capable of predicting important physical phenomena such as indirect-to-direct bandgap transition and semiconductor-to-semimetal transition. We also visualize silicon’s “paleolith”-like isobandgap surfaces in strain space, akin to the yield surface commonly used to describe the plastic deformation of metallic materials, but with sharp ridges and corners that reflect band-edge cross-overs.

Results

ML and Density of States of Bandgap.

We aim to describe the electronic bandgap and band structure as functions of strain by training ML models on first-principles density-functional theory (DFT) data. This approach leads to reasonably accurate training with much fewer computed data than fine-grid ab initio calculations and a fast evaluation time. The DFT calculations were conducted in two settings: a large, computationally inexpensive Perdew–Burke–Ernzerhof (10) (PBE) dataset obtained for fitting and a small but accurate many-body GW [G, Green’s function; W, screened Coulomb interaction (11)] dataset for correction. As depicted in Fig. 1A, the strain tensor and/or the $k$ -point coordinates are fed into different ML models as input to fit or make predictions about energy eigenvalues or bandgap. Table 1 demonstrates the accuracy of these models on the PBE data, the best of which is attained by the NN. The data fusion technique (12, 13) is adopted to further improve the learning outcome of bandgap. The resulting model allows the prediction of bandgap to reach an extremely high accuracy of 8 meV in the mean absolute error (MAE), as shown in Fig. 1B and SI Appendix, Table S1. The successful combination of the quantitative advantage of PBE and the qualitative advantage of GW results in a bandgap-prediction model with a level of accuracy comparable to experiments.

Fig. 1. — (A) ML workflow with NN. For a typical bandgap-prediction task, the input contains the strain information only and the target is either $E_{g}^{PBE}$ or $E_{g}^{GW}$ . In the data fusion process, the bandgap predicted from fitting the PBE dataset is also taken in as an input to fit the GW bandgap. For the whole band structure fitting task, the input contains both strain information and the k-point coordinates and the target is the energy dispersion $ε_{n} (k; ε)$ , where $n$ is the band index, $k$ is the wavevector, and $ε$ is the crystal strain tensor. The hidden-layer structures of the two associated deep NNs are also depicted. (B) Better bandgap-fitting results measured by MAE are yielded by data fusion compared with the sole use of $ε$ as input to fit GW data. (*Inset*) Data-fusion-based learning of the difference between $E_{g}^{PBE}$ and $E_{g}^{GW}$ . Ensemble methods on decision-tree classifiers including gradient boosting regression (GBR) and random forest regression (RFR), Lagrange interpolation and NN are adopted for ML fitting. (C) Reachable bandgap values for various $h$ within the whole deformation space for silicon. The region where the strained silicon has a direct bandgap is colored in red. The circle at $h$ = 1.35 meV/ $Å^{3}$ indicates the lowest energy penalty for the semiconductor-to-metal transition. (D) Diamond bandgap envelope extending toward the small-bandgap semiconductor region. The upper- and lower-envelope functions are indicated by black and red dots, respectively. The arrows on the horizontal axes in C and D indicate reachable $h$ by the in situ experiments (7, 8).

Table 1.

Root-mean-squared error for various ML algorithms for the bandgap and band structure prediction tasks from PBE data for silicon (in units of electron volts)

ML input	ML algorithms			ML target
ML input	GBR	RFR	NN	ML target
$ε^{3 D}$	0.0367	0.0247	0.0049	Bandgap
$ε^{6 D}$	0.0743	0.0781	0.0264	Bandgap
$k$ and $ε^{6 D}$ VB	0.1125	0.1078	0.0131	$ϵ_{n} (k; ε)$
$k$ and $ε^{6 D}$ CB	0.1593	0.1555	0.0184	$ϵ_{n} (k; ε)$

Open in a new tab

$ε^{3 D}$ and $ε^{6 D}$ denote three-normal-strains deformation and general deformation cases, respectively. For all of the details on ML and DFT methodology, optimization, and implementation, see Methods and SI Appendix, Notes S1 and S2 and Figs. S1 and S2.

In ESE experiments, the objective is to identify the highest or lowest bandgap that can be achieved through the expenditure of a certain elastic strain energy density $(h)$ defined as

h (ε) \equiv \frac{E (ε) - E^{0}}{V^{0}},

[1]

where $E (ε)$ is the total energy of the cell deformed by strain $ε$ , and $E^{0}$ and $V^{0}$ are the total energy and volume of the undeformed cell, respectively. Here, we data-mine the 6D deformation by ML the bandgap distribution and the elastic strain energy density against $ε$ . The many-to-many relation between $h (ε)$ and the bandgap $E_{g} (ε)$ is shown in Fig. 1 C and D. In the stress-free equilibrium state, silicon has a bandgap of 1.1 eV; with an increase in strain energy density, a variety of possible bandgaps emerge. Even silicon with as little strain energy density as 0.2 meV/ $Å^{3}$ can become quite a different material from the stress-free silicon. As $h$ further increases, the largest allowable bandgap drops and an “envelope” forms, as evidenced by the change of maximal and minimal bandgap reachable under a fixed $h$ . The shading of the envelope regions in Fig. 1 C and D reflects the distribution of the available bandgap. A darker shading qualitatively indicates that the amount of possible strains to achieve a specific bandgap at a given $h$ is higher. Outside the envelope the shading color is white, meaning that the corresponding bandgap is not attainable. Mathematically, we can define the cumulative “density of states” of bandgap as

c (E_{g}'; h') \equiv \int_{h (ε) < h'} d^{6} ε δ (E_{g}' - E_{g} (ε)) = \int d^{6} ε δ (E_{g}' - E_{g} (ε)) H (h' - h (ε)),

[2]

where $d^{6} ε \equiv d ε_{1} d ε_{2} d ε_{3} d ε_{4} d ε_{5} d ε_{6}$ in the 6D strain space, $δ (\cdot)$ is the Dirac delta function, and $H (\cdot)$ is the Heaviside step function. We then define the density of states of bandgap (DOB) at $h'$ by taking the derivative of $c (E_{g}'; h')$ with respect to $h'$ :

ρ (E_{g}'; h') \equiv \frac{\partial c (E_{g}'; h')}{\partial h'} = \int d^{6} ε δ (E_{g}' - E_{g} (ε)) δ (h' - h (ε)) .

[3]

The meaning of DOB can be described by considering all possible elastically strained states within the $(h - \frac{d h}{2}, h + \frac{d h}{2})$ energy interval, and the resultant distribution of bandgaps arising from these states. The DOB function $ρ (E_{g}; h)$ offers a blueprint for determining which bandgaps are accessible at what energy cost. One can use the definition (3) not only for the electronic bandgap, but also generally for any scalar property that will provide an easy-to-visualize map for deep ESE such as the thermoelectric figure of merit $z T$ , Baliga’s figure of merit (14), Curie temperature, etc. (4). An upper-envelope function $E_{g}^{upper} (h)$ and lower-envelope function $E_{g}^{lower} (h)$ can also be defined based on $ρ (E_{g}; h)$ :

E_{g}^{upper} (h) \equiv {max supp}_{E_{g}} (ρ (E_{g}; h)), E_{g}^{lower} (h) \equiv {min supp}_{E_{g}} (ρ (E_{g}; h)),

[4]

which are rendered as black and red dotted lines in Fig. 1 C and D, so the nonzero DOB falls within $(E_{g}^{lower} (h), E_{g}^{upper} (h))$ . In deep ESE, $E_{g}^{lower} (h)$ also indicates the path to obtain the fastest change in $E_{g}$ . For instance, if the goal is to reduce the bandgap of silicon from 1.1 eV as fast as possible, with the least cost of elastic energy, the red-dotted line in Fig. 1C (which is further detailed in Fig. 2A) $E_{g}^{lower} (h)$ offers the best design of the strain tensor $ε$ to achieve this goal.

Fig. 2. — (A) The most energy-efficient strain pathway to reach the zero-bandgap state, i.e., the lower-envelope function $E_{g}^{lower} (h)$ in silicon corresponding to the red-dotted line in Fig. 1C. The zero-bandgap state (open red circle on the horizontal axis of Fig. 1C) corresponds to the deformation case of $ε_{1} = 0.5522 %$ , $ε_{2} = - 1.2582 %$ , $ε_{3} = - 1.036 %$ , $ε_{4} = - 1.9168 %$ , $ε_{5} = 0.7411 %$ , and $ε_{6} = 1.6878 %$ . (B) GW band structure associated with this deformation. The fractional coordinates for the three high-symmetry points along the selected $k$ path are (0.5, 0, 0), (0, 0, 0), and (0.5, 0, 0.5), respectively.

It is seen from Fig. 1 C and D that, with the application of a relatively small amount of mechanical energy, the overall distribution of Si bandgap shifts downward. This means that by modulating the tensorial strain (shear/tension/compression combinations) in multiple directions, strained silicon becomes capable of absorbing a different part of the electromagnetic spectrum than when it is in a stress-free state. It was also found that at 1.35 meV/ $Å^{3}$ the bandgap of Si can vanish, corresponding to the minimum energy required for semiconductor-to-metal transition in the whole 6D strain space (see Fig. 2B for the band structure, which corresponds to the red circle in Fig. 1C). Fig. 2A further illustrates that silicon’s “most energy efficient path to metallization” is actually a curved path in the strain space: The initial fastest-descent direction for $E_{g}$ (at h = 0) is quite different from when $E_{g}$ hits zero at h = 1.35 meV/ $Å^{3}$ and thus linear perturbation theory such as the deformation potential theory (15) is not expected to work well in deep-strain space. It is not straightforward yet to achieve this complex optimal strain state in 6D experimentally, despite Feynman’s prophecy to use “a hundred tiny hands” (3). To provide experimental guidance, we further implemented our ML model in experimentally feasible uniaxial strain cases. It is found that $〈 111 〉$ crystal direction is the most energy-efficient uniaxial strain direction for Si bandgap engineering (SI Appendix, Fig. S3). A complete ranking of the common crystal directions in terms of their ability to lower Si bandgap can be found in SI Appendix, Note S3. In the case of diamond, deep ESE provides an opportunity to reduce its bandgap to a level comparable to that of InAs. Our results thus demonstrate that by straining diamond in the most optimal way, it can be transformed to mimic the properties of a lower-bandgap semiconductor while almost preserving its own uniqueness such as high strength and thermal conductivity, thereby paving the way for designing hitherto unexplored combinations of material characteristics.

Another important issue for optical applications pertains to whether the bandgap is direct or indirect. This direct bandgap envelope is a subset of DOB. We define the density of direct bandgaps (DOD) in parallel to [2]–[4], but with $E_{direct g}$ instead of $E_{g}$ , to obtain DOD $ρ_{d} (E_{direct g}; h)$ and its bounds $E_{direct g}^{upper} (h)$ , $E_{direct g}^{lower} (h)$ . Obviously, if direct bandgaps exist at any strain, for that strain there will be

(E_{direct g}^{lower} (h), E_{direct g}^{upper} (h)) \subseteq (E_{g}^{lower} (h), E_{g}^{upper} (h)) .

[5]

Our deep ESE model found within experimentally accessible strain range that the indirect-to-direct bandgap transition takes place in silicon in the high- $h$ region and a minimum strain energy density $h_{d}^{min}$ around 15.4 meV/ $Å^{3}$ exists for the direct bandgap to appear (the red region in Fig. 1C):

h_{d}^{min} = {min supp}_{h} (E_{direct g}^{upper} (h) - E_{direct g}^{lower} (h)) .

[6]

This little “island” of DOD within the ocean of DOB can be achieved by applying $ε_{1} = ε_{2} = ε_{3} \geq 9.3 %$ .

The conventional way to modulate electronic properties in semiconductors is the so-called compositional grading technique. Through varying the stoichiometry of an alloy semiconductor, as for example by molecular beam epitaxy, a graded bandgap can be produced (16). This method of tweaking the material property is conceptually based on chemical alloying, whereby the chemical composition is tuned in an alloy melt to produce desirable strength or ductility. Invoking this approach, conventional bandgap engineering resorted to chemical alloying such as ${GaAl}_{1 - x} {As}_{x}$ or ${Ga}_{1 - x} {In}_{x}$ As (17). However, we have demonstrated here that the stress-free situation is usually not the optimal state for a figure of merit, and elastic strains allow the bandgap to exhibit many more possible values so that each pure material candidate should occupy a much larger hyperspace enabled through the achievable 6D strain space. The more general bandgap engineering approach could utilize gradients in both composition and strain to achieve the desired band alignment.

Exploring Bandgap Ridgelines in Strain Space.

Here we choose the most widely used semiconductor material, Si, as an example to demonstrate the generality and flexibility of our method. Since the full 6D strain space does not allow for easy visualization, we restrict ourselves to tensile and compressive normal strains only $(ε_{4} = ε_{5} = ε_{6} = 0)$ for illustration purposes. Note that combinations of tensile and compressive strains can be used to generate shear strains in the material even though not all shear strains are considered. Fig. 3A illustrates the isosurface for Si bandgap, i.e., the set of points in the strain space where the bandgap equals some given value, for different $E_{g}$ levels obtained by our high-throughput NN model. The most striking visual feature of this $E_{g}$ isosurface in $ε_{1} ε_{2} ε_{3}$ space is its piecewise smoothness. There are cusp singularities of different order: ridgelines where two smooth pieces of the $E_{g}$ isosurface meet, and corners where three ridgelines meet. These singularities are characterized by discontinuities in the slope (but not value) of the isosurface in the strain space due to band cross-over or even band topology change. Such cusp features also exist in $E_{g}$ isosurface in the general- $ε_{1} ε_{2} ε_{3} ε_{4} ε_{5} ε_{6}$ space, although they are more difficult to visualize directly. One can mathematically define these nonsmooth features on the 5D isosurface (embedded in 6D) as nth-order ridges $(E_{g})$ if they are differentiable in 5-n directions, while sustaining a change in slope in the other n directions in the strain space.

Fig. 3. — (A) Bandgap isosurfaces for silicon in the $ε_{1} ε_{2} ε_{3}$ strain space appear to have the paleolith shape for every $E_{g}$ level. The main corners $(χ, μ, α_{j}, β_{j})$ of an isosurface at $E_{g} =$ 0.9 eV are indicated by different colors and the “carapaces” are distinguished by their associated k-space CBM labels. The red triangular faces indicate the direct-bandgap region at different $E_{g}$ levels. As bandgap increases, the area for the red triangle eventually shrinks to a single $χ$ point. GW model was used. (B) Bandgap isosurface shown through the $ε_{1} - ε_{2}$ projection of Si at 1 eV level with GW data. The $χ$ point corresponds to the direct-bandgap case and it splits into three at small $E_{g}$ as shown in A. (C) Zero-bandgap isosurface in the strain space based on GW data. The blue point corresponds to the strain-free state; red points are strains with the least $h$ of 1.65 meV/ $Å^{3}$ on this isosurface. (D) Strain-space coordinates of the bandgap isosurface corners (defined as in A) as a function of the bandgap level. The maximum bandgap possible in this strain space is about $1.24$ eV, and it is reached at a triaxial strain of 6.5%. In the cases where three $χ$ -type points exist, $b$ equals the average coordinate of them.

Since both the crystal structure and deformation tensor have symmetries, and the bandgap as a function of strain is invariant with respect to some of them, the “paleolith”-like $E_{g}$ isosurface (in analogy to the Tresca yield surface in strength of materials) has the following symmetry structure:

i)
The points $μ$ (the most “compressive” hydrostatic strain point on the $E_{g}$ isosurface) and $χ$ (most “tensile” hydrostatic strain point on the $E_{g}$ isosurface) lie on the $ε_{1} = ε_{2} = ε_{3}$ line. We thus denote their strain-space coordinates by $(a, a, a)$ and $(b, b, b)$ , respectively. At small or moderate $E_{g}$ , $χ$ splits and gives rise to a topologically triangular region $χ_{1} χ_{2} χ_{3}$ as shown in Fig. 3A. It will later be shown these $χ$ -type points form the direct bandgap region on the $E_{g}$ isosurface.
ii)
The points $α_{j} (j = 1,2,3)$ form a regular triangle which lies in a plane orthogonal to the $ε_{1} = ε_{2} = ε_{3}$ line. Their coordinates are denoted by $(c, d, d), (d, c, d)$ , and $(d, d, c)$ , respectively.
iii)
The points $β_{j} (j = 1,2,3)$ also form a regular triangle which lies in a plane orthogonal to the $ε_{1} = ε_{2} = ε_{3}$ line. Their coordinates are denoted by $(f, e, e), (e, f, e)$ , and $(e, e, f)$ , respectively.

The shape of the isosurface is similar for both PBE and GW bandgaps, although the specific strain values may differ for the same PBE and GW bandgap levels. It was found that the easiest way [with the least $h (ε^{3 D})$ ] to obtain the 0-eV bandgap without any shear strain is to apply a normal strain of −3.86 and 4.36% along any two of the three $〈 100 〉$ directions while leaving the third $〈 100 〉$ direction undeformed. Therefore, there are six strain cases that are equivalent, as indicated by red dots in Fig. 3C. The position of the vertices of the $E_{g}$ isosurface in the strain space is the function of selected bandgap value, and the detailed relationship between the bandgap and the strains is shown in Fig. 3D. According to our PBE + GW model, the maximum bandgap reachable by strained silicon is 1.24 eV under a hydrostatic tensile strain of 6.5%. It should be noted that silicon strained to such an extent can nearly reach the maximum theoretical efficiency, known as the Shockley–Queisser limit (18), of a single p-n junction solar cell, demonstrating possible application of ESE in solar energy conversion devices.

The formation of the $E_{g}$ isosurfaces, such as the ones in Fig. 3A, is due to the relative position of the valence band maximum (VBM) and the conduction band minimum (CBM). Despite different shape variations of the two energy bands, modulating elastic strain provides possibilities for the VBM and CBM to differ by the same amount with respect to the vacuum level. For undeformed silicon with a bandgap of 1.1 eV, the VBM is located at the $Γ$ point and the CBM lies on the straight line (the $Δ$ line) in the k space and is positioned at about $85 %$ of the way from the Brillouin zone center to the zone boundary (19). Under 3D deformation, the cubic crystal symmetry of Si is lifted and we follow the k-point labeling scheme explained in SI Appendix, Note S1 and Fig. 1 to describe band extrema positions. It is found that VBM remains at $Γ$ irrespective of deformation whereas the position of CBM can be greatly affected by external strains. Using the geometry of the $E_{g}$ isosurface as a visualization tool, we identify four types of k-space transition in CBM that may happen across the ridgelines on the isosurface.

Starting with the strain points on the lower faces separated by $μ - α_{j}$ ridgelines of the $E_{g}$ isosurface in Fig. 3A, we found that the CBM retains roughly the same relative position along the “ $Δ$ ”-type line as in the undeformed case, and that crossing the ridgelines only switches CBM among $Δ_{1} = (0, k_{1}, k_{1})$ , $Δ_{2} = (k_{1}, 0, k_{1})$ , and $Δ_{3} = (k_{1}, k_{1}, 0)$ , where $k_{1} \approx 0.425$ . In other words, μ−α₁ ridgeline corresponds to Δ₂/Δ₃ transition, μ−α₂ ridgeline corresponds to Δ₁/Δ₃ transition, μ−α₃ ridgeline corresponds to Δ₁/Δ₂ transition, and we can indeed label each carapace by its CBM character Δ₁, Δ₂, Δ₃. We term this transition occurring in the small strain region as the $Δ$ switching. In this case, the linear deformation potential theory can be used to describe the strain effects on the band extremum (15). However, investigation of the large deformation points on its upper faces in Fig. 3A reveals that the CBM would not retain its location and major changes would happen.

Our ML model captures the occurrence of “ $L$ - $Δ$ ” transition across the $β_{i} - α_{j}$ ridgelines where the CBM changes to “ $L$ ” points in k space: L₁ = (0.5, 0, 0), L₂ = (0, 0.5, 0), L₃ = (0, 0, 0.5); see Fig. 4 A and B, where for example, “Δ₃ carapace” changes to “L₁ carapace” across the α₁−β₃ ridgeline, and “Δ₃ carapace” changes to “L₂ carapace” across the α₂−β₃ ridgeline. None of the ridgelines or carapaces (e.g., Δ₃ carapace bound by μ−α₁−β₃−α₂−μ) are truly flat. The large, nonperturbative deformation makes the conventional theory ineffective in predicting it. Moving further toward $χ$ in the strain space, CBM would remain at $L$ and a cross-over of the $χ_{2} - β_{j}$ ridgelines is referred to as an $L$ switching. Indirect-to-direct bandgap transition occurs near the upper tip of the paleolith-like isosurface where CBM appears at $Γ$ , as shown in Fig. 4C. This can be explained by the competition between drops of different band edges. In general, as strain increases, the band edge at both $Γ$ and $L$ would decrease. As a result of high strains, the energy decrease at $Γ$ is faster and eventually the bandgap becomes direct, as shown in Fig. 4D. In this case, we transition for example from the L₁ carapace (α₁−β₃−χ₃−χ₂−β₂−α₁ in Fig. 3A) to “Γ carapace” (χ₁−χ₂−χ₃−χ₁ in Fig. 3A) across the χ₂−χ₃ ridgeline. When the strained Si turns into a direct-bandgap semiconductor, it would exhibit a significant enhancement in its optical transitions around the fundamental adsorption edge compared with an undeformed Si, due to the elimination of phonon involvement to facilitate adsorption or emission. As absorbance increases exponentially with thickness in a material, a solar cell based on direct bandgap Si with high adsorption coefficient would require much less thickness to absorb the same amount of light, paving the way for the design of lightweight high-efficiency solar cells. SI Appendix, Table S2 summarizes all of the details of the k-space transitions, thus resolving the conduction band properties exhaustively for a wide range of strains.

Fig. 4. — Illustration of k-space transition in Si predicted by deep ESE. All of the transitions are verified by GW calculations. (A and B) Representation of the $Δ$ - $L$ transition. (B and C) The indirect-to-direct transition. The CBM (red arrows) locates at k point (0.433, 0.433, 0), (0.5, 0, 0), and (0, 0, 0) respectively. (D) The enlarged band structure around Fermi energy shows the competition of the three possible CBM positions. The three nonshear-strain cases for A–C are (−0.23, 1.84, 3.45%), (4.63, 8.23, 9.22%), and (9.85, 9.31, 9.4%), corresponding to points on the different faces of the bandgap isosurface in Fig. 3.

Incremental Fitting.

We next show that our NN-based surrogate models can successfully learn from several datasets and assimilate them. This capability is becoming increasingly important with the spread of materials property databases that collect data from different studies (20). The incremental training of the NN starts from the same weights but is done on the extended dataset with the additional data included. We also increase the learning rate of stochastic gradient descent algorithm and regularizers (dropout rate and weight regularization) to circumvent limitations arising from the same local minima of the loss function established during the training on the initial dataset. This allows the model to not only handle additional training on the incoming data appended to a database but to do it much faster than from scratch.

Numerical experiments conducted on the NN model demonstrate that incremental fitting of the models effectively reduces the error on a new dataset, see SI Appendix, Table S3. Such incrementally fitted models are, thus, equally applicable to the bandgap approximation and various optimization tasks. Moreover, these models may be reused when shifting to other materials such as Ge, since the implicit insights about symmetries, transitions, and extreme cases are stored in the parameters of NN. Training the model for the other material starting from the weights for Si would significantly reduce the time and amount of data needed due to knowledge transfer, also referred to as transfer learning (21), leading to rapid development of versatile surrogate models for ESE.

Discussion

ML models provide an efficient way of representing electronic band structure allowing for studies and accurate ESE predictions of a variety of physical phenomena such as band warping, degeneracy lifting, indirect-to-direct bandgap transition, and semiconductor-to-metal transition. In previous studies, bandgap engineering was conducted largely by tuning only one or two strain components. Our ML methods are capable of exploring the full spectrum of possibilities by efficiently analyzing highly nonlinear relations between electronic band dispersion and the strain tensor. To this end, the electronic band structure of silicon is accurately captured from ML through only a limited amount of calculations. Employing deep-NN algorithms, the bandgap of Si can be fitted as a function of strain within milli-electron-volt accuracy.

In prior approaches of analytically describing strain effects by traditional means, the linear deformation potential theory has often been invoked and its insufficiency at large deformation cases (Fig. 2A) makes it impossible to map out the entire strain space. By contrast, the general and systematic ML framework we demonstrate here makes the problem of representing the bandgap, and more broadly, the band structure, as a function of 6D strain computationally tractable. Many avenues remain for the application of our models on multiple fronts. Among these we mention the extension of the model to increasingly complex material structures, predicting their bandgap and band structure, and phonon and photonic band structure.

Different strains may result in the same bandgap, and in seeking a specific bandgap, or any other materials figure of merit, one should choose the strain with a minimal effort required given the nonuniqueness of choice of a given target property or figure of merit. For this purpose, the DOB envelope we developed here is essential in understanding and fully utilizing deep ESE. In our work, we use the elastic strain energy density as a scalar metric or “norm” of the strain tensor for rationally choosing the ESE route that requires the least energy metastability and corresponds to the safest deformation manner in principle. For example, we have demonstrated that our model is able to locate the most energy-efficient pathway in the entire strain space to transform silicon from a semiconductor to a metal or to convert diamond from an ultrawide-bandgap material to a wide or even small-bandgap semiconductor. Latest advances in methods to apply large strains have included wide adoption of microelectromechanical systems and nanoelectromechanical systems, in situ indentation techniques, and nano-cantilever-beam bending (7, 8) and anviling (22) on materials across different size scales. The growing variety of technologies available to apply strains in a precisely controlled manner through mechanical, electrical, magnetic, thermal, and other means also promises the design of experiments to impose and tune different components of strains (23–26). Thanks to the expanding maturity of available tools, experimental implementation of the ESE approaches identified here for the 6D strain space is a next step in advancing further progress in this field. The distinctive ML model we propose here thus offers a potentially powerful method in guiding the design of approaches for a wide variety of semiconductor materials including silicon and diamond that could lead to performance improvement in applications as diverse as flexible electronics (27), nanomechanical resonators (28), optical fibers (23), and energy storage systems (29).

Methods

First-Principles Calculations.

Details for DFT simulations are in SI Appendix, Note S2.

ML.

NN and tree-based ensemble algorithms were adopted. More details are in SI Appendix, Note S2.

Data Fusion.

Details for data fusion are in SI Appendix, Note S2.

Supplementary Material

Supplementary File

pnas.1818555116.sapp.pdf^{(885.9KB, pdf)}

Acknowledgments

The authors thank Dr. Wenbin Li and Dr. Xiaohui Liu. The computation works were performed on supercomputers at Massachusetts Institute of Technology (MIT) and Skolkovo Institute of Science and Technology (Skoltech). S.S. acknowledges support from Nanyang Technological University through the Distinguished University Professorship. The work is supported by the Skoltech Next Generation Program 2016-7/NGP (a Skoltech—MIT joint project).

Footnotes

Conflict of interest statement: The authors have filed a patent based on the research presented in this paper.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1818555116/-/DCSupplemental.

References

1.Gilman JJ. Electronic Basis of the Strength of Materials. 1st Ed Cambridge Univ Press; Cambridge, UK: 2008. [Google Scholar]
2.Zhu T, Li J. Ultra-strength materials. Prog Mater Sci. 2010;55:710–757. [Google Scholar]
3.Feynman RP. There’s plenty of room at the bottom. Eng Sci. 1960;23:22–36. [Google Scholar]
4.Li J, Shan Z, Ma E. Elastic strain engineering for unprecedented materials properties. MRS Bull. 2014;39:108–114. [Google Scholar]
5.Qian X, Liu J, Fu L, Li J. Solid state theory. Quantum spin Hall effect in two-dimensional transition metal dichalcogenides. Science. 2014;346:1344–1347. doi: 10.1126/science.1256815. [DOI] [PubMed] [Google Scholar]
6.Bedell SW, Khakifirooz A, Sadana DK. Strain scaling for CMOS. MRS Bull. 2014;39:131–137. [Google Scholar]
7.Zhang H, et al. Approaching the ideal elastic strain limit in silicon nanowires. Sci Adv. 2016;2:e1501382. doi: 10.1126/sciadv.1501382. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Banerjee A, et al. Ultralarge elastic deformation of nanoscale diamond. Science. 2018;360:300–302. doi: 10.1126/science.aar4165. [DOI] [PubMed] [Google Scholar]
9.Tsao JY, et al. Ultrawide-bandgap semiconductors: Research opportunities and challenges. Adv Electron Mater. 2018;4:1600501. [Google Scholar]
10.Perdew JP, Burke K, Ernzerhof M. Generalized gradient approximation made simple. Phys Rev Lett. 1996;77:3865–3868. doi: 10.1103/PhysRevLett.77.3865. [DOI] [PubMed] [Google Scholar]
11.Aryasetiawan F, Gunnarsson O. The GW method. Rep Prog Phys. 1998;61:237–312. [Google Scholar]
12.Ramakrishnan R, Dral PO, Rupp M, von Lilienfeld OA. Big data meets quantum chemistry approximations: The Δ-machine learning approach. J Chem Theory Comput. 2015;11:2087–2096. doi: 10.1021/acs.jctc.5b00099. [DOI] [PubMed] [Google Scholar]
13.Khaleghi B, Khamis A, Karray FO, Razavi SN. Multisensor data fusion: A review of the state-of-the-art. Inf Fusion. 2013;14:28–44. [Google Scholar]
14.Baliga BJ. Semiconductors for high‐voltage, vertical channel field‐effect transistors. J Appl Phys. 1982;53:1759–1764. [Google Scholar]
15.Bardeen J, Shockley W. Deformation potentials and mobilities in non-polar crystals. Phys Rev. 1950;80:72–80. [Google Scholar]
16.Capasso F. Compositionally graded semiconductors and their device applications. Annu Rev Mater Sci. 1986;16:263–291. [Google Scholar]
17.Chang KYS, von Lilienfeld OA. AlxGa1-xAs crystals with direct 2 eV band gaps from computational alchemy. Phys Rev Mater. 2018;2:073802. [Google Scholar]
18.Shockley W, Queisser HJ. Detailed balance limit of efficiency of p-n junction solar cells. J Appl Phys. 1961;32:510–519. [Google Scholar]
19.Jenkins DP. Calculations on the band structure of silicon. Proc Phys Soc A. 1956;69:548–555. [Google Scholar]
20.Jain A, et al. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Mater. 2013;1:011002. [Google Scholar]
21.Pan SJ, Yang Q. A survey on transfer learning. IEEE Trans Knowl Data Eng. 2010;22:1345–1359. [Google Scholar]
22.Li B, et al. Diamond anvil cell behavior up to 4 Mbar. Proc Natl Acad Sci USA. 2018;115:1713–1717. doi: 10.1073/pnas.1721425115. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Healy N, et al. Extreme electronic bandgap modification in laser-crystallized silicon optical fibres. Nat Mater. 2014;13:1122–1127. doi: 10.1038/nmat4098. [DOI] [PubMed] [Google Scholar]
24.Feng J, Qian X, Huang C-W, Li J. Strain-engineered artificial atom as a broad-spectrum solar energy funnel. Nat Photonics. 2012;6:866–872. [Google Scholar]
25.Aage N, Andreassen E, Lazarov BS, Sigmund O. Giga-voxel computational morphogenesis for structural design. Nature. 2017;550:84–86. doi: 10.1038/nature23911. [DOI] [PubMed] [Google Scholar]
26.Lian H, Christiansen AN, Tortorelli DA, Sigmund O, Aage N. Combined shape and topology optimization for minimization of maximal von Mises stress. Struct Multidiscipl Optim. 2017;55:1541–1557. [Google Scholar]
27.Grumstrup EM, et al. Reversible strain-induced electron-hole recombination in silicon nanowires observed with femtosecond pump-probe microscopy. Nano Lett. 2014;14:6287–6292. doi: 10.1021/nl5026166. [DOI] [PubMed] [Google Scholar]
28.Ovartchaiyapong P, Lee KW, Myers BA, Jayich ACB. Dynamic strain-mediated coupling of a single diamond spin to a mechanical resonator. Nat Commun. 2014;5:4429. doi: 10.1038/ncomms5429. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Yu D, Feng J, Hone J. Elastically strained nanowires and atomic sheets. MRS Bull. 2014;39:157–162. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

pnas.1818555116.sapp.pdf^{(885.9KB, pdf)}

[r1] 1.Gilman JJ. Electronic Basis of the Strength of Materials. 1st Ed Cambridge Univ Press; Cambridge, UK: 2008. [Google Scholar]

[r2] 2.Zhu T, Li J. Ultra-strength materials. Prog Mater Sci. 2010;55:710–757. [Google Scholar]

[r3] 3.Feynman RP. There’s plenty of room at the bottom. Eng Sci. 1960;23:22–36. [Google Scholar]

[r4] 4.Li J, Shan Z, Ma E. Elastic strain engineering for unprecedented materials properties. MRS Bull. 2014;39:108–114. [Google Scholar]

[r5] 5.Qian X, Liu J, Fu L, Li J. Solid state theory. Quantum spin Hall effect in two-dimensional transition metal dichalcogenides. Science. 2014;346:1344–1347. doi: 10.1126/science.1256815. [DOI] [PubMed] [Google Scholar]

[r6] 6.Bedell SW, Khakifirooz A, Sadana DK. Strain scaling for CMOS. MRS Bull. 2014;39:131–137. [Google Scholar]

[r7] 7.Zhang H, et al. Approaching the ideal elastic strain limit in silicon nanowires. Sci Adv. 2016;2:e1501382. doi: 10.1126/sciadv.1501382. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r8] 8.Banerjee A, et al. Ultralarge elastic deformation of nanoscale diamond. Science. 2018;360:300–302. doi: 10.1126/science.aar4165. [DOI] [PubMed] [Google Scholar]

[r9] 9.Tsao JY, et al. Ultrawide-bandgap semiconductors: Research opportunities and challenges. Adv Electron Mater. 2018;4:1600501. [Google Scholar]

[r10] 10.Perdew JP, Burke K, Ernzerhof M. Generalized gradient approximation made simple. Phys Rev Lett. 1996;77:3865–3868. doi: 10.1103/PhysRevLett.77.3865. [DOI] [PubMed] [Google Scholar]

[r11] 11.Aryasetiawan F, Gunnarsson O. The GW method. Rep Prog Phys. 1998;61:237–312. [Google Scholar]

[r12] 12.Ramakrishnan R, Dral PO, Rupp M, von Lilienfeld OA. Big data meets quantum chemistry approximations: The Δ-machine learning approach. J Chem Theory Comput. 2015;11:2087–2096. doi: 10.1021/acs.jctc.5b00099. [DOI] [PubMed] [Google Scholar]

[r13] 13.Khaleghi B, Khamis A, Karray FO, Razavi SN. Multisensor data fusion: A review of the state-of-the-art. Inf Fusion. 2013;14:28–44. [Google Scholar]

[r14] 14.Baliga BJ. Semiconductors for high‐voltage, vertical channel field‐effect transistors. J Appl Phys. 1982;53:1759–1764. [Google Scholar]

[r15] 15.Bardeen J, Shockley W. Deformation potentials and mobilities in non-polar crystals. Phys Rev. 1950;80:72–80. [Google Scholar]

[r16] 16.Capasso F. Compositionally graded semiconductors and their device applications. Annu Rev Mater Sci. 1986;16:263–291. [Google Scholar]

[r17] 17.Chang KYS, von Lilienfeld OA. AlxGa1-xAs crystals with direct 2 eV band gaps from computational alchemy. Phys Rev Mater. 2018;2:073802. [Google Scholar]

[r18] 18.Shockley W, Queisser HJ. Detailed balance limit of efficiency of p-n junction solar cells. J Appl Phys. 1961;32:510–519. [Google Scholar]

[r19] 19.Jenkins DP. Calculations on the band structure of silicon. Proc Phys Soc A. 1956;69:548–555. [Google Scholar]

[r20] 20.Jain A, et al. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Mater. 2013;1:011002. [Google Scholar]

[r21] 21.Pan SJ, Yang Q. A survey on transfer learning. IEEE Trans Knowl Data Eng. 2010;22:1345–1359. [Google Scholar]

[r22] 22.Li B, et al. Diamond anvil cell behavior up to 4 Mbar. Proc Natl Acad Sci USA. 2018;115:1713–1717. doi: 10.1073/pnas.1721425115. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r23] 23.Healy N, et al. Extreme electronic bandgap modification in laser-crystallized silicon optical fibres. Nat Mater. 2014;13:1122–1127. doi: 10.1038/nmat4098. [DOI] [PubMed] [Google Scholar]

[r24] 24.Feng J, Qian X, Huang C-W, Li J. Strain-engineered artificial atom as a broad-spectrum solar energy funnel. Nat Photonics. 2012;6:866–872. [Google Scholar]

[r25] 25.Aage N, Andreassen E, Lazarov BS, Sigmund O. Giga-voxel computational morphogenesis for structural design. Nature. 2017;550:84–86. doi: 10.1038/nature23911. [DOI] [PubMed] [Google Scholar]

[r26] 26.Lian H, Christiansen AN, Tortorelli DA, Sigmund O, Aage N. Combined shape and topology optimization for minimization of maximal von Mises stress. Struct Multidiscipl Optim. 2017;55:1541–1557. [Google Scholar]

[r27] 27.Grumstrup EM, et al. Reversible strain-induced electron-hole recombination in silicon nanowires observed with femtosecond pump-probe microscopy. Nano Lett. 2014;14:6287–6292. doi: 10.1021/nl5026166. [DOI] [PubMed] [Google Scholar]

[r28] 28.Ovartchaiyapong P, Lee KW, Myers BA, Jayich ACB. Dynamic strain-mediated coupling of a single diamond spin to a mechanical resonator. Nat Commun. 2014;5:4429. doi: 10.1038/ncomms5429. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r29] 29.Yu D, Feng J, Hone J. Elastically strained nanowires and atomic sheets. MRS Bull. 2014;39:157–162. [Google Scholar]

PERMALINK

Deep elastic strain engineering of bandgap through machine learning

Zhe Shi

Evgenii Tsymbalov

Ming Dao

Subra Suresh

Alexander Shapeev

Ju Li

Significance

Abstract