Skip to main content
Scientific Data logoLink to Scientific Data
. 2020 Nov 11;7:387. doi: 10.1038/s41597-020-00723-8

A band-gap database for semiconducting inorganic materials calculated with hybrid functional

Sangtae Kim 1,#, Miso Lee 1,#, Changho Hong 1, Youngchae Yoon 1, Hyungmin An 1, Dongheon Lee 1, Wonseok Jeong 1, Dongsun Yoo 1, Youngho Kang 2, Yong Youn 1,, Seungwu Han 1,
PMCID: PMC7658987  PMID: 33177500

Abstract

Semiconducting inorganic materials with band gaps ranging between 0 and 5 eV constitute major components in electronic, optoelectronic and photovoltaic devices. Since the band gap is a primary material property that affects the device performance, large band-gap databases are useful in selecting optimal materials in each application. While there exist several band-gap databases that are theoretically compiled by density-functional-theory calculations, they suffer from computational limitations such as band-gap underestimation and metastable magnetism. In this data descriptor, we present a computational database of band gaps for 10,481 materials compiled by applying a hybrid functional and considering the stable magnetic ordering. For benchmark materials, the root-mean-square error in reference to experimental data is 0.36 eV, significantly smaller than 0.75–1.05 eV in the existing databases. Furthermore, we identify many small-gap materials that are misclassified as metals in other databases. By providing accurate band gaps, the present database will be useful in screening materials in diverse applications.

Subject terms: Electronic structure, Computational methods, Electronic properties and materials, Magnetic properties and materials


Measurement(s) band gap • semiconducting inorganic material
Technology Type(s) computational modeling technique
Sample Characteristic - Environment material entity

Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.13083980

Background & Summary

The band gap (Eg) is a fundamental quantity that directly relates to usability of materials in optical, electronic, and energy applications. For instance, in photovoltaic devices, materials with a direct Eg of ∼1.3 eV1,2, corresponding to the Shockley-Queisser limit, are favored as photo-absorbers that maximize the solar-cell efficiency. In power electronics, semiconductors with Eg ≥ 3 eV are employed to sustain high electric fields3. To increase the figure of merit in thermoelectric devices, materials with Eg of 10 kBTop where kB and Top are the Boltzmann constant and operating temperature, respectively, are selected4. Given the central role of Eg, a database of Eg over a wide range of materials can expedite the material selection in specific applications by factoring out suboptimal candidates rapidly. Currently, popular material databases such as the Materials Project5, the Automatic Flow of Materials Discovery Library (AFLOW)6, the Open Quantum Materials Database (OQMD)7, and the Joint Automated Repository for Various Integrated Simulations (JARVIS)8 provide theoretical Eg for up to one million inorganic materials. However, most of them were obtained by semilocal functionals with a generalized gradient approximation (GGA), which underestimates Eg by typically 30–40%9. (MatDB10 provides accurate quasi-particle band gaps, but the number of data is limited to hundreds.) To compensate for this, AFLOW provides adjusted Eg using a linear fit to experimental data11. However, such a universal correction may not address element-dependent error fluctuations. We note that JARVIS provides Eg based on meta-GGA12, which significantly improves the accuracy. As a related issue, many small-gap semiconductors such as Ge, InAs, PdO, Zn3As2, and Ag2O are misclassified as metals, which can affect selection of narrow-gap semiconductors in IR sensors13, for instance. (In JARVIS, some of these errors are resolved by meta-GGA.) Besides the underestimation of Eg, all the databases consider only the ferromagnetic ordering for spin-polarized systems due to computational convenience, which can cause significant errors in Eg of antiferromagnetic materials. For instance, the antiferromagnetic NiO has an experimental Eg of 4.3 eV14, but the computational Eg ranges over 2.2–2.6 eV in the ferromagnetic ordering and GGA functional57 while the correct antiferromagnetic ordering produces 4.5 eV within the hybrid functional.

Addressing limitations in the existing material databases, we herein report a theoretical dataset of fundamental and optical Eg computed by employing a hybrid functional and identifying the stable magnetic ordering, thus providing more accurate Eg than the existing databases. For the high-throughput computational workflow, we employ ‘Automated Ab initio Modeling of Materials Property Package’ (AMP2)15, which is a fully automated package for density functional theory (DFT) calculations of crystalline properties. AMP2 addresses the band-gap underestimation in semilocal functionals with the help of a hybrid functional, thereby producing a more accurate Eg, even if the material is incorrectly metallic within the semilocal functional. Furthermore, the package finds the antiferromagnetic ground state based on an effective Ising model. The present database focuses on materials with 0 eV < Eg < 5 eV, which covers most semiconducting materials. The target materials are selected from Inorganic Crystal Structure Database (ICSD)16 and partly filtered by information from the Materials Project database. In total, the database collects Eg for 10,481 materials that encompass most inorganic solids with Eg ranging between 0 and 5 eV. For 116 benchmark materials, the root-mean-square error (RMSE) with respect to experimental data is 0.36 eV, significantly smaller than 0.75–1.05 eV in the existing databases. The resulting data are available online at figshare17 or SNUMAT18.

Methods

High-throughput methodology: AMP2

The present database is constructed by employing AMP2 which is an automation script operating VASP1921. Starting only with the initial crystalline structure, AMP2 provides band structure, Eg, effective mass, density of states (DOS) and dielectric constants of the crystal by automatically pipelining computational procedures. To summarize computational settings relevant in the present work, we employ GGA developed by Perdew-Burke-Ernzerhof (PBE)22 for the exchange-correlation functional for structural relaxation and identifying band edges. The Eg is obtained by ‘one-shot’ hybrid functional (specifically, HSE0623 (simply HSE hereafter)) calculations in which the package estimates Eg from HSE eigenvalues at k points of band edges found with PBE (crystal structures are also fixed to those relaxed by PBE). In the previous study24, it was demonstrated that band edges from PBE and HSE lie at the same k points, which is confirmed again in the present work with Si, SrS, BAs, BeS, AlAs, AgI, AgGaTe2, ZnSiAs2, and ZnIn2Se4. In addition, the small structural differences between PBE and HSE25 would not affect the band gap significantly, except for small-gap semiconductors (see below). (This is also the case for systems that go from metallic in PBE to insulating in HSE.) This supports that the one-shot scheme can produce Eg close to the full hybrid calculations. If the material is identified as a metal within PBE, AMP2 inspects DOS, and if DOS at the Fermi level normalized by the valence band (DF/DVB) is less than a threshold, the package further tests a possible gap-opening by the one-shot hybrid calculation. The PBE+U method is applied on 3d orbitals26 only when the material has a finite Eg. About pseudopotentials, we mostly employ those without any tags in the VASP database, which tends to reduce the number of valence electrons. For further details, we refer to the original publication15.

Computational parameters used in the present work follow the default setting of AMP2 except that the package applies the PBE+U method on Ce 4f levels with the U value of 4 eV27. (The pseudopotentials for La and Ce treat f levels as valence.) Furthermore, for compounds including Tl, Pb, and Bi, Eg is recalculated by including the spin-orbit coupling (SOC) when the default Eg without SOC is smaller than 1 eV. (The band edges are also resought with SOC.) This is because typical SOC corrections of ∼0.5 eV would be critical in these cases.

In identifying the stable collinear magnetic ordering, AMP2 applies a genetic algorithm to the Ising model28. This approach finds the stable magnetic ordering correctly for many compounds. However, the original formulation requires a large supercell to isolate exchange interactions from periodic images, which costs significant computational resources and also suffers from ill-convergence in electronic iterations. To resolve this, we here develop an alternative method in obtaining exchange parameters. First, we choose a minimal supercell under the following two conditions: (i) A magnetic site α and its periodic images in other supercells are apart more than 5 Å (cutoff range for magnetic interactions). (ii) If two magnetic sites α and β (not necessarily belong to the same supercell) are within 5 Å, then the distance between α and β′, a periodic image of β (β′ ≠ β) is longer than 5 Å except when α-β and α-β′ are symmetrically equivalent. Within the Ising model, the total energy of the supercell (E) can be expressed as follows:

E=E0+ImJI(NI,PNI,A), 1

where E0 is the base energy excluding the magnetic interaction, and I is the index for independent exchange interactions (total m interactions) with the maximum range of 5 Å and the exchange parameter of JI. In Eq. (1), NI,P and NI,A are the numbers of parallel and antiparallel spin pairs within the supercell corresponding to the interaction I, respectively. Then, based on the ferromagnetic configuration (all spin-up), diverse spin configurations are sampled by spin-flipping a magnetic pair (both atoms) or a certain magnetic site. The number of resulting equations is larger than m and an optimal {E0, JI} can be obtained by the pseudoinverse method. We find that this approach produces essentially the same parameters as the original scheme but is more reliable and efficient.

Selection of materials

Figure 1 schematizes the workflow of constructing the database. Starting from the ICSD, we only consider compounds consisting of elements with atomic number (Z) < 84. Among the lanthanides, we limit the elements to La and Ce. We remove structural duplicates and structures with partially occupied sites, and also omit large primitive cells that contain more than 40 atoms. For unary and binary compounds, all the structures are calculated with AMP2. For ternary and higher compounds, we utilize information on Eg and DOS in the Materials Project database (calculated by PBE) to filter out materials that are likely to be metallic or large-gap insulators. To be specific, we exclude materials with EgGGA bigger than 3 eV since they are likely to have EgHSE larger than 5 eV. (Compiling data of 4,421 compounds from the previous screening studies24,2931, we find that 99.7% of materials with EgHSE < 5 eV have EgGGA < 3 eV.) We also include metallic materials with DF/DVB < 0.8 for possible gap opening (see above; a larger threshold is used because of low-resolution DOS in the Materials Project). If a Materials Project data has incomplete entries for Eg or DOS, the material is included in the computation list. In this way, we could factor out 5,059 materials from the list of ternary and higher compounds. Finally, we calculate 21,353 materials with AMP2. After computation, we collect 10,481 materials with finite Eg (unary: 63, binary: 1,919, ternary: 5,074, quaternary: 2,804, quinary: 573, and higher: 48).

Fig. 1.

Fig. 1

The computational workflow for collecting the dataset. Numbers in the parentheses indicate material counts.

Data Records

All the calculated properties for 10,481 compounds can be downloaded from the Figshare Repository17. The whole data including metals are also uploaded to SNUMAT (www.snumat.com), which provides easy search and visualization of materials through its own interactive interface. SNUMAT also supports REST API32 for users to search the materials with authorization. The authorization token expires 24 hours after they are issued.

File format

The data are stored in the JSON format. The name of the file is X_ICSD#.json, where X is chemical formula and ICSD# is the ICSD collection code of the initial structure used for calculation. Each JSON file includes final relaxation structure information, EgGGA, EgHSE, and DOS. Table 1 summarizes keys for metadata.

Table 1.

Description of metadata keys in JSON file.

Key Type Description
SNUMAT_id string ID in the SNUMAT
ICSD_number int ICSD collection code
Band_gap_GGA float Calculated fundamental band gap in GGA (eV)
Band_gap_GGA_optical float Calculated direct band gap in GGA (eV)
Band_gap_HSE float Calculated fundamental band gap in HSE (eV)
Band_gap_HSE_optical float Calculated direct band gap in HSE (eV)
Direct_or_indirect string Type of band gap in GGA (direct or indirect)
Direct_or_indirect_HSE string Type of band gap in HSE (direct or indirect)
Structure_rlx string Relaxed structure information (VASP POSCAR format)
Space_group_rlx int Space group number of relaxed structure
Magnetic_ordering string Magnetic ordering of final structure
SOC boolean Spin-orbit coupling (True or False)

Graphical representation of the data

In Fig. 2, we present the distribution of EgGGA and EgHSE for 10,481 materials. Most materials with EgHSE > 5 eV (663 cases) are unary or binary compounds for which AMP2 is applied to the whole structure dataset from ICSD.

Fig. 2.

Fig. 2

Distribution of EgGGA and EgHSE. Top and right are occurrence histograms of EgGGA and EgHSE, respectively.

Technical Validation

Comparison to experimental measurements and other databases

In Fig. 3a, we compare experimental and theoretical values for 116 benchmark materials with experimental Eg between 0 and 5 eV. The list of compounds is shown in Online-only Table 1. For comparison, theoretical results from other databases are also shown in Fig. 3b–d. The RMSE values are 0.36 eV (present work), 1.05 eV (Materials Project), 0.93 eV (AFLOW), 0.75 eV (AFLOW fitted), 1.02 eV (OQMD), and 0.85 eV (JARVIS). (The meta-GGA values of 19 materials, mostly with small Eg, are missing in JARVIS.) This confirms that the present database provides more accurate Eg than the existing databases on average. In particular, we correctly identify the semiconducting nature for small-gap semiconductors such as AgSbTe2, CdO, CoP3, Cu3AsSe4, Cu3SbS4, Cu3SbSe4, CuFeS2, Ge, Mg2Sn, RhSb3, and ZnSnSb2, which are mostly misreported as metals in other databases. In addition, other databases exhibit pronounced errors for every antiferromagnetic material (CuFeS2, CuO, FeF2, MnO, MnTe, and NiO) because these materials are considered as ferromagnetic or non-magnetic. (For non-magnetic materials in Online-only Table 1, the Eg calculated with pure PBE (without +U and SOC) by AMP2 agrees well with those from Materials Project (the mean absolute error is 0.034 eV).)

Fig. 3.

Fig. 3

Comparison of Eg for benchmark materials between experimental and theoretical data from (a) this work, (b) Materials Project, (c) AFLOW and (d) OQMD and JARVIS. AFLOW-fitted values are obtained from Egfit=1.34EgGGA+0.913 eV.

Online-only Table 1.

The list of materials and their band gaps from experiment, present work, Materials Project, OQMD, AFLOW, AFLOW (fitted), and JARVIS.

Name ICSD Space group Exp. Band gap
Present work Materials Project OQMD AFLOW AFLOW (fitted) JARVIS
Mg2Pb 104846 225 0.1 0 0 0 0 0 0.32
InSb 640411 216 0.16 0.01 0 0.26 0 0 0.13
Bi2Te3 15753 166 0.21 0.02 0.53 0.52 0.28 1.29 -
PbTe 63098 225 0.26 0.42 0.87 0.97 0.8 1.99 1.4
CdSnAs2 16737 122 0.26 0.04 0.05 0.3 0 0 -
Sb2Te3 193341 166 0.3 0.43 0.01 0.21 0.04 0.97 -
Cu3SbSe4 400652 121 0.31 0.11 0 0 0 0 -
Te 161690 152 0.33 0.69 0.41 0.5 0.15 1.12 0.62
Bi2Se3 617072 166 0.35 0.23 0.54 0.42 0.49 1.58 -
Mg2Sn 104869 225 0.36 0.09 0 0 0 0 0.24
InAs 610687 216 0.36 0.1 0 0.37 0 0 0.4
PbSe 63096 225 0.37 0.36 0.47 0.69 0.4 1.45 0.96
ZnSnSb2 42669 122 0.4 0.36 0 0 0 0 0.32
CoP3 624594 204 0.43 0.11 0 0 0 0 -
CdSb 52830 61 0.46 0.52 0.14 0.19 0.09 1.03 -
Cu3SbS4 2857 121 0.46 0.32 0 0 0 0 -
PbS 62190 225 0.5 0.47 0.47 0.69 0.47 1.54 1.3
ZnSb 43265 61 0.5 0.63 0.04 0.2 0.17 1.14 -
CuFeS2 60166 122 0.53 1.6 0 0 0 0 0
CdGeAs2 153593 122 0.53 0.09 0.08 0.34 0.03 0.96 0.52
CoSb3 153504 204 0.63 0.75 0.17 0.22 0.15 1.12 -
ZnSnAs2 18203 122 0.65 0.55 0.01 0.2 0 0 0.8
Ge 41980 227 0.67 0.17 0 0 0 0 0.61
GaSb 41675 216 0.67 0.41 0 0.37 0 0 0.59
CoAs3 31111 204 0.69 0.52 0 0.15 0 0 -
InN 109463 186 0.7 0.5 0 0.28 0 0 0.76
Mg2Ge 52283 225 0.74 0.54 0.19 0.24 0.19 1.17 0.58
TlSe 44706 140 0.74 0.55 0.25 0 0.11 1.06 0.74
Mg2Si 104864 225 0.77 0.59 0.24 0.28 0.24 1.23 0.62
RhSb3 650248 204 0.8 0.16 0 0.15 0 0 -
CdO 181294 225 0.8 0.76 0 0 0 0 1.31
CuGaTe2 28738 122 0.82 0.96 0.2 0.4 0.43 1.49 0.04
RhAs3 34052 204 0.85 0 0 0 0 0 -
ZnGeAs2 16735 122 0.85 0.84 0.05 0.34 0.17 1.14 1.07
HgIn2Te4 25652 82 0.86 1.12 0.52 0.61 0.56 1.67 1.35
CuInSe2 73351 122 0.86 0.74 0.02 0.16 0 0 0.64
Cu3AsSe4 610361 121 0.88 0.05 0 0 0 0 -
Zn3As2 44091 137 0.93 0.54 0 0 0.13 1.09 -
CuInTe2 169048 122 0.95 0.71 0.02 0.24 0.25 1.25 1.04
AgInSe2 28751 122 0.96 0.74 0.06 0.16 0.4 1.45 1
CuGaSe2 247513 122 0.96 1.08 0.04 0.25 0.4 1.45 1.39
AgGaSe2 28748 122 1.1 1.24 0.25 0.41 0.7 1.86 1.56
Si 51688 227 1.12 1.19 0.62 0.77 0.61 1.74 1.28
AgGaTe2 28749 122 1.15 0.9 0.19 0.32 0.52 1.62 -
Cu2CdSnS4 238144 121 1.16 0.97 0 0.21 0.1 1.05 -
CdSnP2 22183 122 1.17 0.9 0.27 0.51 0.14 1.1 1.21
InSe 185172 194 1.17 1.18 0.46 0.55 0.43 1.5 1.43
IrSb3 640958 204 1.18 0.61 0.05 0.21 0 0 0.24
AglnS2 28750 122 1.18 1.26 0.38 0.49 0.92 2.16 1.62
ZnIn2Te4 25650 82 1.2 1.51 0.82 0.92 0.93 2.17 1.77
CuInS2 66865 122 1.2 1.22 0.01 0.42 0.4 1.45 0.69
Cu3AsS4 14285 31 1.24 0.96 0.01 0 0.2 1.18 -
Cdln2Te4 25651 82 1.26 1.49 0.84 0.92 0.9 2.12 1.72
InP 53105 216 1.27 1.12 0.47 0.74 0.58 1.69 1.39
GaAs 41981 216 1.35 0.95 0.19 0.57 0.3 1.32 1.32
ZnGa2Te4 290911 82 1.35 1.72 1.01 1.18 1.11 2.41 1.74
CuO 16025 15 1.4 1.94 0 0 0 0 0.01
CdTe 93944 216 1.44 1.63 0.59 0.87 0.71 1.87 1.64
HgIn2Se4 25649 82 1.45 1.38 0.62 0.72 0.67 1.81 1.64
ZnSnP2 22179 122 1.45 1.53 0.7 0.92 0.68 1.83 1.69
MnTe 174028 194 1.46 1.47 0 0 0 0 0
BAs 43871 216 1.5 1.86 1.21 1.42 1.2 2.53 1.93
Se (gray) 164264 152 1.5 1.76 1.06 0.88 0.98 2.23 1.71
CdTe 620518 186 1.5 1.67 0.62 0.87 0.71 1.92 1.76
Cdln2Se4 151954 82 1.55 1.77 0.95 1.03 1.04 2.31 2.16
CdSiAs2 22187 122 1.6 1.14 0.4 0.69 0.46 1.53 1.33
AlSb 44325 216 1.6 1.88 1.23 1.38 1.23 2.57 1.78
AgGaS2 23698 122 1.66 1.99 0.96 1.09 1.5 2.94 2.36
GaTe 635512 12 1.69 1.88 1.06 0.93 0.89 2.12 1.72
ZnSiAs2 22184 122 1.7 1.67 0.89 1.06 1.12 2.42 1.85
CdSe 41826 186 1.74 1.47 0.57 0.77 0.69 1.84 2.06
CdGeP2 100467 122 1.8 1.33 0.64 0.9 0.7 1.85 1.65
ZnIn2Se4 25647 82 1.82 1.83 0.98 1.12 1.13 2.44 2.22
HgGa2Se4 188545 82 1.95 1.72 0.91 1.05 0.95 2.2 2.08
GaSe 63122 194 2.02 1.77 1.23 0.97 0.81 2.01 2.14
CuAlTe2 28735 122 2.06 1.88 1.02 1.2 1.29 2.65 1.82
BP 615154 216 2.1 1.97 1.24 1.37 1.25 2.59 1.91
AlAs 185081 216 2.16 2.15 1.52 1.63 1.5 2.94 2.28
ZnGeP2 16734 122 2.2 1.83 1.19 1.37 1.25 2.59 1.93
CdSiP2 22188 122 2.2 1.99 1.43 1.4 1.41 2.81 2.02
ZnGa2Se4 44887 82 2.2 2.34 1.39 1.54 1.55 3 2.84
AgI 164963 216 2.22 2.49 1.37 1.48 1.98 3.58 2.87
GaP 77087 216 2.24 2.43 1.59 1.75 1.64 3.13 2.37
ZnTe 185141 216 2.26 2.19 1.08 1.45 1.48 2.9 2.23
ZnSiP2 22190 122 2.3 1.92 1.36 1.44 1.38 2.77 2.03
β-SiC 603798 216 2.3 2.26 1.39 1.59 1.37 2.76 2.31
AgAlTe2 28746 122 2.35 1.84 1.05 1.2 1.36 2.75 1.99
CdS 154186 186 2.42 2.38 1.12 0.77 1.25 2.6 2.6
CdGa2Se4 2287 82 2.43 2.22 1.35 1.45 1.41 2.82 2.72
AlP 24490 216 2.45 2.33 1.63 1.75 1.63 3.11 2.56
AgAlSe2 28745 122 2.5 2.17 1.14 1.33 1.56 3.02 2.36
ZnSe 185134 216 2.58 2.54 1.17 1.5 1.7 3.2 2.63
La2S3 15151 62 2.73 1.75 1.03 1.2 0.92 2.16 -
AgI 62789 186 2.63 2.52 1.4 1.55 1.98 3.61 2.85
CuAlSe2 28734 122 2.65 2.08 0.9 1.07 1.24 2.58 2.51
BeTe 53945 216 2.8 2.69 2.02 2.15 2.02 3.63 2.82
HgGa2S4 189737 82 2.84 2.57 1.57 0.72 1.62 3.1 2.83
α-SiC 164429 186 2.86 2.94 2.04 2.15 2.02 3.63 3.08
CuBr 23989 216 2.91 2.12 0.49 0.62 1.13 2.44 1.55
Cul 33724 216 2.95 2.56 1.14 1.29 1.65 3.14 2.2
Al2Se3 14373 9 3.1 2.77 1.81 1.86 1.76 3.28 2.85
CuCl 23988 216 3.17 2.33 0.56 0.64 1.28 2.64 1.58
CeO2 184584 225 3.2 3.36 1.87 2.1 2.42 4.17 2.01
ZnO 154486 186 3.2 2.66 0.73 1.04 1.82 3.37 2.47
GaN 153890 186 3.34 2.94 1.74 2.09 1.91 3.49 3.08
CuAlS2 189083 122 3.35 3.04 1.69 1.88 2.05 3.67 2.49
FeF2 14143 136 3.4 3.63 3.38 0.42 2.63 4.46 0
ZnGa2S4 44886 82 3.4 3.31 2.27 2.41 2.44 4.2 3.86
CdGa2S4 106362 82 3.44 3.12 2.12 1.45 2.21 3.89 3.71
SnO2 154960 136 3.47 2.29 0.65 1.2 0 0 2.39
ZnS 77090 216 3.54 3.61 2.02 2.32 2.68 4.52 3.59
BeSe 616419 216 3.61 3.54 2.69 2.82 2.67 4.51 3.88
MnO 9864 225 3.9 2.85 1.72 0.42 2.62 4.44 0
SrS 651054 225 4.1 3.46 2.5 2.59 2.5 4.28 3.61
BeS 44724 216 4.17 4.1 3.15 3.23 3.14 5.15 4.38
NiO 43740 225 4.5 4.53 2.31 2.62 2.22 3.91 0.01
RMSE 0.36 1.05 1.02 0.93 0.75 0.85

The experimental data were adopted from a CRC Handbook39 except AgAlSe240, AgAlTe240, AgGaTe240, CdO241, CdSnP242, CuAlS243, CuAlTe244, HgIn2Se445, and InN46 for which more reliable measurements are cited, and CeO247 and La2S348 which represent lanthanides.

In most cases, the present database provides Eg that agrees well with experiment. However, there are some materials with large errors of ≥0.5 eV such as AgAlTe2, Cu3AsSe4, CuAlSe2, CuBr, CuCl, CuFeS2, CuO, Ge, IrSb3, La2S3, MnO, RhAs3, RhSb3, SnO2, SrS, and ZnO. For small-gap materials such as Cu3AsSe4, Ge, and IrSb3, Eg is sensitive to the lattice parameters that are slightly overestimated by PBE. Employing experimental lattice parameters or those relaxed within HSE significantly improves the results15. For Cu-bearing materials, it is known that HSE often exhibits substantial errors in Eg due to nonlocal screening effects in Cu, which requires GW calculations33,34. We also note that van der Waals interactions are not described by semilocal functionals, and lattice parameters can be overestimated in layered structures such transition-metal dichalcogenides35. This can significantly affect Eg, and so care is needed in referring to Eg in layered materials. The present results do not consider finite-temperature effects on Eg, which can be significant in some materials, for example, hybrid perovskites36. More generally, Eg dataset with the ultimate theoretical accuracy would be obtained by the quasiparticle approaches such as GW or Bethe-Salpeter equations37,38.

Acknowledgements

This work was supported by Nano·Material Technology Development Program through the National Research Foundation of Korea (NRF-2020M3A7C2095492) and Creative Materials Discovery Program through the NRF funded by the Ministry of Science and ICT (2017M3D1A1040689). The computations were carried out at Korea Institute of Science and Technology Information (KISTI) National Supercomputing Center (KSC-2020-CRE-0064).

Online-only Table

Author contributions

S.K., M.L., C.H., H.A., D.L., W.J., D.Y. and Y. Youn performed the high-throughput calculations and data collections. Y. Yoon constructed the SNUMAT site. S.K., M.L., Y.K., Y. Youn, and S.H. analyzed the data and wrote the manuscript. S.H. coordinated the whole work. All authors reviewed and commented on the manuscript.

Code availability

The AMP2 package used for constructing the present database is available at https://github.com/MDIL-SNU/AMP2 and was released under a GPLv3 (GNU General Public License). The package requires pre-installation of numpy, scipy, spglib, and PyYAML modules. Detailed guidelines and examples can be found in the manual (https://amp2.readthedocs.io/en/latest/).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Sangtae Kim, Miso Lee.

These authors jointly supervised this work: Yong Youn, Seungwu Han.

Contributor Information

Yong Youn, Email: yybbyb1@gmail.com.

Seungwu Han, Email: hansw@snu.ac.kr.

References

  • 1.Shockley W, Queisser HJ. Detailed Balance Limit of Efficiency of p-n Junction Solar Cells. J. Appl. Phys. 1961;32:11. doi: 10.1063/1.1736034. [DOI] [Google Scholar]
  • 2.Kang Y, Youn Y, Han S, Park J, Oh C-S. Computational Screening of Indirect-Gap Semiconductors for Potential Photovoltaic Absorbers. Chem. Mater. 2019;31:4072–4080. doi: 10.1021/acs.chemmater.9b00708. [DOI] [Google Scholar]
  • 3.Gorai P, McKinney RW, Haegel NM, Zakutayev A, Stevanovic V. A computational survey of semiconductors for power electronics. Energy & Environ. Sci. 2019;12:3338–3347. doi: 10.1039/C9EE01529A. [DOI] [Google Scholar]
  • 4.Sofo JO, Mahan GD. Optimum band gap of a thermoelectric material. Phys. Rev. B. 1994;49:4565–4570. doi: 10.1103/PhysRevB.49.4565. [DOI] [PubMed] [Google Scholar]
  • 5.Jain A, et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Mater. 2013;1:011002. doi: 10.1063/1.4812323. [DOI] [Google Scholar]
  • 6.Curtarolo S, et al. AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations. Comput. Mater. Sci. 2012;58:227–235. doi: 10.1016/j.commatsci.2012.02.002. [DOI] [Google Scholar]
  • 7.Saal JE, Kirklin S, Aykol M, Meredig B, Wolverton C. Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD) JOM. 2013;65:1501–1509. doi: 10.1007/s11837-013-0755-4. [DOI] [Google Scholar]
  • 8.Choudhary K, et al. Computational screening of high-performance optoelectronic materials using OptB88vdW and TB-mBJ formalisms. Sci. Data. 2018;5:180082. doi: 10.1038/sdata.2018.82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Park S, Lee B, Jeon SH, Han S. Hybrid functional study on structural and electronic properties of oxides. Curr. Appl. Phys. 2011;11:S337–S340. doi: 10.1016/j.cap.2010.09.008. [DOI] [Google Scholar]
  • 10.Lany S. Band-structure calculations for the 3d transition metal oxides in GW. Phys. Rev. B. 2013;87:085112. doi: 10.1103/PhysRevB.87.085112. [DOI] [Google Scholar]
  • 11.Setyawan W, Gaume RM, Lam S, Feigelson RS, Curtarolo S. High-Throughput Combinatorial Database of Electronic Band Structures for Inorganic Scintillator Materials. ACS Comb. Sci. 2011;13:382–390. doi: 10.1021/co200012w. [DOI] [PubMed] [Google Scholar]
  • 12.Tran F, Blaha P. Accurate Band Gaps of Semiconductors and Insulators with a Semilocal Exchange-Correlation Potential. Phys. Rev. Lett. 2009;102:226401. doi: 10.1103/PhysRevLett.102.226401. [DOI] [PubMed] [Google Scholar]
  • 13.Seiler DG. C. L. L. International conference on narrow-gap semiconductors and related materials. J. Res. Natl. Inst. Standards Technol. 1990;95:13. doi: 10.6028/jres.095.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Sawatzky GA, Allen JW. Magnitude and Origin of the Band Gap in NiO. Phys. Rev. Lett. 1984;53:2339–2342. doi: 10.1103/PhysRevLett.53.2339. [DOI] [Google Scholar]
  • 15.Youn Y, et al. AMP2: A fully automated program for ab initio calculations of crystalline materials. Comput. Phys. Commun. 2020;256:107450. doi: 10.1016/j.cpc.2020.107450. [DOI] [Google Scholar]
  • 16.FIZ Karlsruhe Inorganic Crystal Structure Database. https://icsd.products.fiz-karlsruhe.de/. 16.02 version.
  • 17.Kim S, 2020. A band-gap database for semiconducting inorganic materials calculated with hybrid functional. figshare. [DOI] [PMC free article] [PubMed]
  • 18.SNUMAT: SNU material data center. https://www.snumat.com/. Accessed: 2020-08-17.
  • 19.Kresse G, Furthmüller J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 1996;6:15–50. doi: 10.1016/0927-0256(96)00008-0. [DOI] [PubMed] [Google Scholar]
  • 20.Kresse G, Furthmüller J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B. 1996;54:11169–11186. doi: 10.1103/PhysRevB.54.11169. [DOI] [PubMed] [Google Scholar]
  • 21.Blöchl PE. Projector augmented-wave method. Phys. Rev. B. 1994;50:17953–17979. doi: 10.1103/PhysRevB.50.17953. [DOI] [PubMed] [Google Scholar]
  • 22.Perdew JP, Burke K, Ernzerhof M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 1996;77:3865–3868. doi: 10.1103/PhysRevLett.77.3865. [DOI] [PubMed] [Google Scholar]
  • 23.Heyd J, Scuseria GE, Ernzerhof M. Hybrid functionals based on a screened Coulomb potential. The. J. Chem. Phys. 2003;118:8207–8215. doi: 10.1063/1.1564060. [DOI] [Google Scholar]
  • 24.Yim K, et al. Novel high-k dielectrics for next-generation electronic devices screened by automated ab initio calculations. NPG Asia Mater. 2015;7:e190. doi: 10.1038/am.2015.57. [DOI] [Google Scholar]
  • 25.Heyd J, Peralta JE, Scuseria GE, Martin RL. Energy band gaps and lattice parameters evaluated with the Heyd-Scuseria-Ernzerhof screened hybrid functional. The. J. Chem. Phys. 2005;123:174101. doi: 10.1063/1.2085170. [DOI] [PubMed] [Google Scholar]
  • 26.Wang L, Maxisch T, Ceder G. Oxidation energies of transition metal oxides within the GGA+U framework. Phys. Rev. B. 2006;73:195107. doi: 10.1103/PhysRevB.73.195107. [DOI] [Google Scholar]
  • 27.Da Silva JLF, Ganduglia-Pirovano MV, Sauer J, Bayer V, Kresse G. Hybrid functionals applied to rare-earth oxides: The example of ceria. Phys. Rev. B. 2007;75:045121. doi: 10.1103/PhysRevB.75.045121. [DOI] [Google Scholar]
  • 28.Lee K, Youn Y, Han S. Identification of ground-state spin ordering in antiferromagnetic transition metal oxides using the Ising model and a genetic algorithm. Sci. Technol. Adv. Mater. 2017;18:246–252. doi: 10.1080/14686996.2017.1300046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Yim K, et al. Computational discovery of p-type transparent oxide semiconductors using hydrogen descriptor. npj Comput. Mater. 2018;4:17. doi: 10.1038/s41524-018-0073-z. [DOI] [Google Scholar]
  • 30.Lee M, Youn Y, Yim K, Han S. High-throughput ab initio calculations on dielectric constant and band gap of non-oxide dielectrics. Sci. Reports. 2018;8:14794. doi: 10.1038/s41598-018-33095-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Youn Y, et al. Large-Scale Computational Identification of p-Type Oxide Semiconductors by Hierarchical Screening. Chem. Mater. 2019;31:5475–5483. doi: 10.1021/acs.chemmater.9b00816. [DOI] [Google Scholar]
  • 32.Fielding RT. Principled Design of the Modern Web Architecture. ACM Transactions on Internet Technol. 2002;2:36. doi: 10.1145/514183.514185. [DOI] [Google Scholar]
  • 33.Wang Y, et al. Electronic structures of Cu2O, Cu4O3, and CuO: A joint experimental and theoretical study. Phys. Rev. B. 2016;94:245418. doi: 10.1103/PhysRevB.94.245418. [DOI] [Google Scholar]
  • 34.van Veenendaal MA, Eskes H, Sawatzky GA. Strong nonlocal contributions to Cu 2p photoelectron spectroscopy. Phys. Rev. B. 1993;47:11462–11469. doi: 10.1103/PhysRevB.47.11462. [DOI] [PubMed] [Google Scholar]
  • 35.Ramasubramaniam A, Naveh D, Towe E. Tunable band gaps in bilayer transition-metal dichalcogenides. Phys. Rev. B. 2011;84:205325. doi: 10.1103/PhysRevB.84.205325. [DOI] [Google Scholar]
  • 36.Wiktor J, Rothlisberger U, Pasquarello A. Predictive determination of band gaps of inorganic halide perovskites. Phys. Chem. Lett. 2017;8:5507. doi: 10.1021/acs.jpclett.7b02648. [DOI] [PubMed] [Google Scholar]
  • 37.Schilfgaarde MV, Kotani T, Faleev S. Quasiparticle self-consistent GW theory. Phys. Rev. Lett. 2006;96:226402. doi: 10.1103/PhysRevLett.96.226402. [DOI] [PubMed] [Google Scholar]
  • 38.Rohlfing M, Louie SG. Electron-hole excitations and optical spectra from first principles. Phys. Rev. B. 2000;62:4927. doi: 10.1103/PhysRevB.62.4927. [DOI] [Google Scholar]
  • 39.CRC Handbook of Chemistry and Physics, 86th Edition Edited by David R. Lide. (CRC Press, 2005).
  • 40.Tell B, Shay J, Kasper H. Some properties of AgAlTe2, AgGaTe2, and AgInTe2. Phys. Rev. B. 1974;9:5203–5208. doi: 10.1103/PhysRevB.9.5203. [DOI] [Google Scholar]
  • 41.Ashrafi ABMA, Kumano H, Suemune I, Ok Y-W, Seong T-Y. Single-crystalline rocksalt CdO layers grown on GaAs (001) substrates by metalorganic molecular-beam epitaxy. Appl. Phys. Lett. 2001;79:470–472. doi: 10.1063/1.1387258. [DOI] [Google Scholar]
  • 42.Shay JL, Buehler E, Wernick JH. Electroreflectance Study of the Energy-Band Structure of CdSnP2. Phys. Rev. B. 1970;2:4104–4109. doi: 10.1103/PhysRevB.2.4104. [DOI] [Google Scholar]
  • 43.Robbins M, Phillips JC, Lambrecht VG, Laboratories B, Hill M. Solid solution formation in the systems CuMIIIX2-AgMIIIX2 where MIII=Al, Ga, In and X2=S, Se. J. Phys. Chem. Solids. 1973;34:5. doi: 10.1016/S0022-3697(73)80210-0. [DOI] [Google Scholar]
  • 44.Benchouk K, et al. New buffer layers, large band gap ternary compounds: CuAlTe2. The Eur. Phys. J. Appl. Phys. 2000;10:9–14. doi: 10.1051/epjap:2000114. [DOI] [Google Scholar]
  • 45.Radautsan SI, Tiginyanu IM. Defect Engineering in II–III2–VI4 and Related Compounds. Jpn. J. Appl. Phys. 1993;32:5. doi: 10.7567/JJAPS.32S3.5. [DOI] [Google Scholar]
  • 46.Davydov, V. Y., Ivanov, S. V. & Mudryi, A. V. Absorption and Emission of Hexagonal InN. Evidence of Narrow Fundamental Band Gap. Phys. Status Solidi (b)229, 3, https://doi.org/10.1002/1521-3951(200202)229:3<R1::AID-PSSB99991>3.0.CO;2-O (2002).
  • 47.Goubin F, et al. Experimental and Theoretical Characterization of the Optical Properties of CeO2, SrCeO3, and Sr2CeO4 Containing Ce4+ (f0) Ions. Chem. Mater. 2004;16:662–669. doi: 10.1021/cm034618u. [DOI] [Google Scholar]
  • 48.Forster CM, White WB. Optical absorption edge in rare earth sesquisulfides. Mater. Res. Bull. 2006;41:448–454. doi: 10.1016/j.materresbull.2005.07.035. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. Kim S, 2020. A band-gap database for semiconducting inorganic materials calculated with hybrid functional. figshare. [DOI] [PMC free article] [PubMed]

Data Availability Statement

The AMP2 package used for constructing the present database is available at https://github.com/MDIL-SNU/AMP2 and was released under a GPLv3 (GNU General Public License). The package requires pre-installation of numpy, scipy, spglib, and PyYAML modules. Detailed guidelines and examples can be found in the manual (https://amp2.readthedocs.io/en/latest/).


Articles from Scientific Data are provided here courtesy of Nature Publishing Group

RESOURCES