Skip to main content
Data in Brief logoLink to Data in Brief
. 2021 Sep 16;38:107346. doi: 10.1016/j.dib.2021.107346

Machine learning-based prediction of phases in high-entropy alloys: A data article

Ronald Machaka a,b,, Glenda T Motsi a, Lerato M Raganya a,b, Precious M Radingoana a, Silethelwe Chikosha a
PMCID: PMC8477136  PMID: 34611534

Abstract

A systematic framework for choosing the most determinant combination of predictor features and solving the multiclass phase classification problem associated with high-entropy alloy (HEA) was recently proposed [1]. The data associated with that research paper, titled “Machine learning-based prediction of phases in high-entropy alloys”, is presented in this data article. This dataset is a systematic documentation and comprehensive survey of experimentally reported HEA microstructures. It contains microstructural phase experimental observations and metallurgy-specific features as introduced and reported in peer-reviewed research articles. The dataset is provided with this article as a supplementary file. Since the dataset was collected from experimental peer-reviewed articles, these data can provide insights into the microstructural characteristics of HEAs, can be used to improve the optimization HEA phases, and have an important role in machine learning, material informatics, as well as in other fields.

Keywords: High entropy alloys, HEA microstructures, Phases, Machine learning, Deep learning, Material informatics

Graphical abstract

Image, graphical abstract

Specifications Table

Subject Materials Science
Specific subject area High Entropy Alloys (HEA)
Type of data Text file, tables and figures
How data were acquired The dataset was gathered from experimental observations reported in peer-reviewed research articles. Each entry of the 1362 dataset entries is therefore an experimental observation as reported in the literature.
Data format Raw, tabulated and plotted
Parameters for data collection The following wide range of parameters were collected:
  • i.

    Identification of the multi-component alloy,

  • ii.

    the number of elements making up the alloy system,

  • iii.

    the valence electron concentration,

  • iv.

    the atomic size difference,

  • v.

    the difference in Pauling negativities,

  • vi.

    the enthalpy of mixing for a alloy system,

  • vii.

    the entropy of mixing of a multi-component alloy system,

  • viii.

    how the alloy was synthesis and processing conditions (post-process working and heat treatment),

  • ix.

    the experimentally observed and reported, and

  • x.

    the bibliographical references against each entry.

Description of data collection Secondary data (i.e. composition-specific features, alloy processing and post-processing parameters, and the resulting phases) were collected. Some typical empirical HEA design parameters were calculated using well-known approaches. Data was processed using Excel and R, a language and environment for statistical computing, for purposes of visualization and data analysis.
Data source location Unit: Advanced Materials Engineering, Manufacturing Cluster
Institution: Council for Scientific and Industrial Research (CSIR)
Address: Scientia Campus, 627 Meiring Naude Rd, Brummeria, Pretoria 0185
Country: South Africa
GPS: 25°44’35.2”S 28°16’52.3”E
Data accessibility Data is provided with this article
Related research articles Research articles associated with research data presented are.

Value of the Data

  • This dataset documents synthesis routes, processing conditions (post-process working and heat treatment), and the resulting microstructural observations which can be valuable for researchers in the field of Materials Science in the development of experiments.

  • This dataset specifically contains experimentally reported HEA microstructures which provides enough observations to train and test machine learning and deep learning algorithms.

  • Different machine learning and material informatics computational methods can be applied to this dataset inorder to extract insights and trends not immediately available from individual studies thereby advancing the real-world applications of these alloys.

1. Data Description

The data presented in this article are related to the research articles [1,2]. The data is presented in the supplementary data file.

1.1. The dataset

This dataset is a systematic documentation and comprehensive survey of experimentally reported HEA microstructures. The dataset was constructed from microstructural observations reported in peer-reviewed experimental HEA research articles; it is built upon datasets prior published by Miracle et al. [3], Couzinié et al. [4], and Ye et al. [5]. The dataset presents metallurgy-specific features and microstructural phases experimentally observed.

The dataset, provided with this article as supplementary material, has seventeen columns and 1422 entries.

  • Columns 1 and 2 correspond to the identification of the dataset entry and the multi-component alloy system – Alloy_ID and Alloy, respectively. Composition-specific features can further be developed from the Alloy specification

  • Columns 3 to 27 correspond to the elemental compositions of the multi-component alloy while Column 28 corresponds to the number of elements making up the multi-component alloy system (Num_of_Elem).

  • Columns 29 to 37 correspond to some typical empirical HEA design parameters [6,7] such as the density estimate (Density_calc), the enthalpy of mixing for a multi-component alloy system (dHmix), entropy of mixing of a multi-component alloy system (dSmix), melting temperature estimate (Tm), valence electron concentration (VEC), atomic size difference (δ), and difference in Pauling negativities (χ),. While this article is limited to these typical parameters for succinctness, others can also be developed from the Alloy specifications [7], [8], [9], [10], [11], [12], [13].

  • Columns 38 to 45 correspond to metallurgy-informed alloy processing and post-processing parameters indicating that: how the alloy was synthesised (Sythesis_Route); hot- or cold-worked (Hot-Cold_Working); undergone homogenization processing at temperature (Homogenization_Temp) and time (Homogenization_Time); undergone annealing processing at temperature (Annealing_Temp) and time (Annealing_Time); and/or undergone some quenching processing (Quenching).

  • The values tabulated in Column 50 respresent the microstructure or phases corresponding to the multi-component alloy as reported in the literature. Each dataset entry clearly documents the microstructural observations reported for the respective alloy as either single solid solution phases (face-centered cubic – designated FCC and body-centered cubic - designated BCC), dual-phase solid solutions (designated FCC+BCC), other intermetallic, laves, and martensitic phases (designated Im).

  • The last column, Column 52 provides the bibliographical references against the entries, for transparency.

Table 1 gives a summarised description of these features.

Table 1.

Descriptions of the empirical and metallurgy-specific features cites ML-based studies that attempted predicting HEA phases therefrom.

Symbol Description of Feature References
Num_of_Elem Number of elements in a multi-component alloy system
δ A parameter describing the atomic size mismatch or difference in a multi-component alloy system
δ=i=1nci·(1rir¯)2
where ci and ri is the atomic percentage and atomic radius of the ith component and r¯ is the average atomic radius of the components of the alloy, respectively.
[7], [8], [9], [10], [11], [12], [13]
Χ Pauling negativities mismatch for multi-component alloy system
χ=i=1nci·(χiχ¯)2
where ci and χi is the atomic percentage and Pauling electronegativity of the ith component and χ¯ is the mean value of electronegativity for a multi-component alloy system, respectively.
[7], [8], [9], [10], [11], [12], [13]
VEC The valence electronic concentration of a multi-component alloy system calculated on the basis of the rule of mixtures approach
VEC=i=1nci·VECi
where ci and VECi are the atomic percentage and the valence electron concentration of the ith component, respectively.
[7], [8], [9], [10], [11], [12], [13]
ΔSmix The entropy of mixing of a multi-component alloy system calculated as follows
ΔSmix=R·i=1nci·lnci
where R (= 8.314 JK−1mol−1) is the universal gas constant and ci is the atomic percentage of the ith, component.
[8], [9], [10], [11], [12]
ΔHmix Enthalpy of mixing for a multi-component alloy system
ΔHmix=i=1,ijn4cicjΔHmixij
where ci and cj is the atomic percentages of the ith and jth components, respectively. ΔHmixij is mixing enthalpy of binary liquid alloys accessible from conventional tables prepared by Takeuchi and Inoue [15] based on Miedema's atomistic model [16].
[8], [9], [10], [11], [12], [13]
Synthesis_Route A categorical feature indicating that the alloy was synthesised via vacuum melted (AC), powder metallurgy (PM), or otherwise
Hot-Cold_Working A categorical feature indicating that the alloy has been subjected to a cold/hot working treatment.
(nan, CW – cold worked, HW – hot worked, HIP – hot-isostatically pressed)
Homog_Temp A feature indicating the temperature at which the alloy was subjected to a homogenization treatment (in °C)
Homog_Time A feature indicating the duration for which the alloy was subjected to a homogenization treatment (in minutes).
Annealing_Temp A feature indicating the temperature at which the alloy was subjected to a annealing treatment (in °C)
Annealing_Time A feature indicating the duration for which the alloy was subjected to a annealing treatment (in minutes).
Quench_Proc Categorical feature indicating that the alloy has been subjected to an quenching heat treatment
Microstructure Experimentally observed microstructure(s), namely BCC_SS (body-centered cubic solid solutions), FCC_SS (face-centered cubic single solid solutions) FCC+BCC_SS (dual-phase solid solutions) and Im (intermetallic, intermetallic, laves, martensitic, and other phases but excluding glassy or amorphous and hexagonal close-packed – HCP)

2. Experimental Design and Methods

The dataset is built upon datasets prior published by Miracle et al. [3], Couzinié et al. [4], and Ye et al. [5]. It is constructed from HEA microstructural observations reported in experimental peer-reviewed research articles.

Selected HEAs reported in the literature through the end of August 2020 make up the dataset presented in this article. The as-constructed dataset has at least 1362 multicomponent alloys. After removing entries missing some data and eliminating glassy, amourphous, and hexagonal close-packed phases, the as-constructed dataset is reduced to 1362 multicomponent alloys. Supplementary materials documents accompanying this article contain the full list of alloys and references.

The equations used to estimate each of the selected empirical design parameters are summarised in Table 1. Fig. 1 shows the distribution of phase classes in the dataset (the atomic size difference, δ as a function of VEC) after Refs [7,14]. The visualization also shows that solid solution phases are particularly more sensitive to empirical parameters than IM phases [1,2].

Fig. 1.

Fig. 1

Distribution visualization of the (a) HEA elements and (b) the full dataset phase distribution in the VEC- δ space.

Ethics Statement

None.

Data Availability

Dataset for High-Entropy Alloys Phases (Reference data) (Mendeley Data).

CRediT authorship contribution statement

Ronald Machaka: Conceptualization, Funding acquisition, Resources, Methodology, Visualization, Supervision, Writing – original draft, Writing – review & editing. Glenda T. Motsi: Data curation, Investigation, Writing – original draft, Writing – review & editing. Lerato M. Raganya: Data curation, Investigation, Writing – original draft, Writing – review & editing. Precious M. Radingoana: Data curation, Investigation, Writing – original draft, Writing – review & editing. Silethelwe Chikosha: Data curation, Supervision, Investigation, Writing – original draft, Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.

Acknowledgments

The athours greatly appreciates the financial support received from the National Research Foundation through the Y-Rated Researcher Support Grant [Unique Grant No.: 120-413] and the NRF-DST Professional Development Programme (PDP) for Postdoctoral Research Fellowships Block Grant.

Footnotes

Supplementary material associated with this article can be found in the online version at doi:10.1016/j.dib.2021.107346.

Appendix. Supplementary materials

mmc1.csv (348.2KB, csv)

References

  • 1.Machaka R. Machine learning-based prediction of phases in high-entropy alloys. Comput. Mater. Sci. 2020 doi: 10.1016/j.dib.2021.107346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Machaka R. Machine learning based prediction of phases in high-entropy alloys. SSRN Electron. J. 2020 doi: 10.2139/ssrn.3530328. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Gorsse S., Nguyen M.H., Senkov O.N., Miracle D.B. Database on the mechanical properties of high entropy alloys and complex concentrated alloys. Data Br. 2018;21:2664–2678. doi: 10.1016/j.dib.2018.11.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Couzinié J.-P., Senkov O.N., Miracle D.B., Dirras G. Comprehensive data compilation on the mechanical properties of refractory high-entropy alloys. Data Br. 2018;21:1622–1641. doi: 10.1016/j.dib.2018.10.071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ye Y.F., Wang Q., Lu J., Liu C.T., Yang Y. High-entropy alloy: challenges and prospects. Mater. Today. 2016;19:349–362. doi: 10.1016/j.mattod.2015.11.026. [DOI] [Google Scholar]
  • 6.King D.J.M., Middleburgh S.C., McGregor A.G., Cortie M.B. Predicting the formation and stability of single phase high-entropy alloys. Acta Mater. 2016;104:172–179. doi: 10.1016/j.actamat.2015.11.040. [DOI] [Google Scholar]
  • 7.Tang Z., Zhang S., Cai R., Zhou Q., Wang H. Designing high entropy alloys with dual fcc and bcc solid-solution phases: structures and mechanical properties. Metall. Mater. Trans. A. 2019;50:1888–1901. doi: 10.1007/s11661-019-05131-1. [DOI] [Google Scholar]
  • 8.Zhang Y., Wen C., Wang C., Antonov S., Xue D., Bai Y., Su Y. Phase prediction in high entropy alloys with a rational selection of materials descriptors and machine learning models. Acta Mater. 2020;185:528–539. doi: 10.1016/j.actamat.2019.11.067. [DOI] [Google Scholar]
  • 9.Islam N., Huang W., Zhuang H.L. Machine learning for phase selection in multi-principal element alloys. Comput. Mater. Sci. 2018;150:230–235. doi: 10.1016/j.commatsci.2018.04.003. [DOI] [Google Scholar]
  • 10.Zhou Z., Zhou Y., He Q., Ding Z., Li F., Yang Y. Machine learning guided appraisal and exploration of phase design for high entropy alloys. Npj Comput. Mater. 2019;5:128. doi: 10.1038/s41524-019-0265-1. [DOI] [Google Scholar]
  • 11.Zhang H., Zhou G., Li S., Fan X., Guo Z., Xu T., Xu Y., Chen X., Dai D., Qian Q. Application of fuzzy learning in the research of binary alloys: revisit and validation. Comput. Mater. Sci. 2020;172 doi: 10.1016/j.commatsci.2019.109350. [DOI] [Google Scholar]
  • 12.Li S., Zhang H., Dai D., Ding G., Wei X., Guo Y. Study on the factors affecting solid solubility in binary alloys: an exploration by Machine Learning. J. Alloys Compd. 2019;782:110–118. doi: 10.1016/j.jallcom.2018.12.136. [DOI] [Google Scholar]
  • 13.Qu N., Chen Y., Lai Z., Liu Y., Zhu J. The phase selection via machine learning in high entropy alloys. Procedia Manuf. 2019;37:299–305. doi: 10.1016/j.promfg.2019.12.051. [DOI] [Google Scholar]
  • 14.Huang W., Martin P., Zhuang H.L. Machine-learning phase prediction of high-entropy alloys. Acta Mater. 2019;169:225–236. doi: 10.1016/j.actamat.2019.03.012. [DOI] [Google Scholar]
  • 15.Takeuchi A., Inoue A. Classification of bulk metallic glasses by atomic size difference, heat of mixing and period of constituent elements and its application to characterization of the main alloying element. Mater. Trans. 2005;46:2817–2829. doi: 10.2320/matertrans.46.2817. [DOI] [Google Scholar]
  • 16.F.R. De Boer, R. Boom, W.C.M. Mattens, A.R. Miedema, Cohesion in Metals, North-Holland, Netherlands, 1988.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.csv (348.2KB, csv)

Data Availability Statement

Dataset for High-Entropy Alloys Phases (Reference data) (Mendeley Data).


Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES