Abstract
This article presents theoretical data on geometric and energetic features of halobenzenes and xylenes. Data were obtained from ab initio geometry optimization and frequency calculations at HF, B3LYP, MP2 and CCSD levels of theory on 6–311++G(d,p) basis set. In total, 1504 structures of halobenzenes, three structures of xylenes and one structure of benzene were generated and processed by custom-made codes in Mathematica. The quantum chemical calculation was completed in Q-Chem software package. Geometric and energetic data of the compounds are presented in this paper as supplementary tables. Raw output files as well as codes and scripts associated with production and extraction of data are also provided.
Keywords: Halobenzene, Xylene, Relative stability, Steric effect
Specifications Table
| Subject | Chemistry |
| Specific subject area | Physical and Theoretical Chemistry/Spectroscopy |
| Type of data | Tables and Q-Chem output files |
| How data were acquired | Quantum chemical computation on Q-Chem 5.2.1, Developer Version |
| Data format | Raw and analysed |
| Parameters for data collection | Hartree-Fock (HF)/6–311++G(d,p), Becke, 3-parameter, Lee–Yang–Parr (B3LYP)/6–311++G(d,p), Second order Møller–Plesset perturbation theory (MP2)/6–311++G(d,p) Coupled Cluster Singles and Doubles (CCSD)/6–311++G(d,p) |
| Description of data collection | Geometric and energetic data from quantum chemical calculations of halobenzenes, xylenes and benzene were generated by quantum chemical computation and processed by custom-made codes |
| Data source location | Mahidol University, Salaya, Thailand Latitude and longitude: 13.792790, 100.325707 |
| Data accessibility | With the article |
Value of the data
-
•
All 1505 possible halobenzenes and three xylenes are explicitly shown in this paper with numbering, IUPAC name, PubChem CID and SMILES. These can be used as a reference for both theoretical and experimental work involving this class of compounds.
-
•
Geometric and energetic data can be used for further analysis to understand relative stability of isomers. In particular, the unexpected trend in relative stability of isomers are of particular interest to scientists in a similar manner to cis and gauche effect. The data set includes many examples where steric hindrance alone fails to account for the behaviour observed in halobenzenes and xylenes.
-
•
Raw data as well as associated scripts and codes are provided so that interested researchers can reproduce our data and perform calculation at other levels of theory or for other relevant classes of compounds. Vibrational spectrum and other detailed information can be extracted from output files as needed. There are many potential uses of the spectral information, for example, detection of xylene for food safety application [1] and understanding formation of polychlorinated biphenyls (PCBs) [2]. The data can also be a test set for molecular modelling software packages.
1. Data description
A total of 1505 unique compounds of benzene, including all degrees of substitution with F, Cl, Br and I atoms, and three isomers of xylene were investigated. Classification and counting of the 1505 compounds are exhaustively shown in Tables 1 and 2 with specific examples in Fig. 1, Fig. 2, Fig. 3. The main difference between Tables 1 and 2 is the treatment of hydrogen atom. In Table 1, hydrogen is treated in the same way as halogen and this leads to the binomial coefficients for five kinds of elements. In Table 2, hydrogen is treated in a special way and this leads to binomial coefficients for four kinds of halogen atoms. Table 3 summarizes the total number of Q-Chem 5.2.1 [3] output files for different classes of compounds, types of calculation (geometry optimization/frequency calculation) and levels of theory (HF, B3LYP, MP2, and CCSD)
Table 1.
List of all compounds by the number of elements bonded to carbon atoms (In total, there are 1505 benzene and halobenzene compounds with 210 possible empirical formulas.).
| Number of elements | Distribution of elements | Number of empirical formulas | Position of elements | Number of isomers per formula | Number of structures |
|---|---|---|---|---|---|
| 1 | C6α6 (6) | 5 | n/a | 1 | 5 |
| 2 | C6α5β (1–5) | 20 | 1- | 1 | 20 |
| C6α2β4 (2–4) | 20 | 1,2- | 1 | 20 | |
| 1,3- | 1 | 20 | |||
| 1,4- | 1 | 20 | |||
| C6α3β3 (3–3) | 10 | 1,2,3- | 1 | 10 | |
| 1,2,4- | 1 | 10 | |||
| 1,3,5- | 1 | 10 | |||
| 3 | C6αβγ4 (1–1–4) |
30 | 1,2- | 1 | 30 |
| 1,3- | 1 | 30 | |||
| 1,4- | 1 | 30 | |||
| C6αβ2γ3 (1–2–3) |
60 | 1,2,3- | 2 | 120 | |
| 1,2,4- | 3 | 180 | |||
| 1,3,5- | 1 | 60 | |||
| C6α2β2γ2 (2–2–2) |
10 | 1,2,3,4- | 4 | 40 | |
| 1,2,3,5- | 4 | 40 | |||
| 1,2,4,5- | 3 | 30 | |||
| 4 | C6αβγδ3 (1–1–1–3) |
20 | 1,2,3- | 3 | 60 |
| 1,2,4- | 6 | 120 | |||
| 1,3,5- | 1 | 20 | |||
| C6αβγ2δ2 (1–1–2–2) |
30 | 1,2,3,4- | 6a | 180 | |
| 1,2,3,5- | 7a | 210 | |||
| 1,2,4,5- | 3a | 90 | |||
| 5 | C6αβγδε2 (1–1–1–1–2) |
5 | 1,2,3,4- | 12b | 60 |
| 1,2,3,5- | 12 | 60 | |||
| 1,2,4,5- | 6 | 30 | |||
Table 2.
List of all compounds by different degrees of substitution to benzene (In total, the number of compounds and empirical formulas is the same as in Table 1).
| Group of compounds | Number of halogen substituents | Distribution of substituents | Number of empirical formulas | Position of substituent | Number of isomers per formula | Number of structures |
|---|---|---|---|---|---|---|
| Benzene | 0 | C6H6 | 1 | n/a | 1 | 1 |
| Monohalobenzene | 1 | C6H5α (1) | 1 | 1 | 4 | |
| Dihalobenzene | 1 | C6H4α2 (2) |
4 | 1,2- | 1 | 4 |
| 1,3- | 1 | 4 | ||||
| 1,4- | 1 | 4 | ||||
| 2 | C6H4αβ (1–1) |
6 | 1,2- | 1 | 6 | |
| 1,3- | 1 | 6 | ||||
| 1,4- | 1 | 6 | ||||
| Trihalobenzene | 1 | C6H3α3 (3) |
4 | 1,2,3- | 1 | 4 |
| 1,2,4- | 1 | 4 | ||||
| 1,3,5- | 1 | 4 | ||||
| 2 | C6H3αβ2 (1–2) |
= 12 | 1,2,3- | 2 | 24 | |
| 1,2,4- | 3 | 36 | ||||
| 1,3,5- | 1 | 12 | ||||
| 3 | C6H3αβγ (1–1–1) |
4 | 1,2,3- | 3 | 12 | |
| 1,2,4- | 6 | 24 | ||||
| 1,3,5- | 1 | 4 | ||||
| Tetrahalobenzene | 1 | C6H2α4 (4) |
4 | 1,2,3,4- | 1 | 4 |
| 1,2,3,5- | 1 | 4 | ||||
| 1,2,4,5- | 1 | 4 | ||||
| 2 | C6H2αβ3 (1–3) |
12 | 1,2,3,4- | 2 | 24 | |
| 1,2,3,5- | 3 | 36 | ||||
| 1,2,4,5- | 1 | 12 | ||||
| C6H2α2β2 (2–2) |
6 | 1,2,3,4- | 4 | 24 | ||
| 1,2,3,5- | 4 | 24 | ||||
| 1,2,4,5- | 3 | 18 | ||||
| 3 | C6H2αβγ2 (1–1–2) |
12 | 1,2,3,4- | 6 | 72 | |
| 1,2,3,5- | 7 | 84 | ||||
| 1,2,4,5- | 3 | 36 | ||||
| 4 | C6H2αβγδ (1–1–1–1) |
1 | 1,2,3,4- | 12 | 12 | |
| 1,2,3,5- | 12 | 12 | ||||
| 1,2,4,5- | 6 | 6 | ||||
| Pentahalobenzene | 1 | C6Hα5 (5) | 4 | 1,2,3,4,5- | 1 | 4 |
| 2 | C6Hαβ4 (1–4) | 12 | 1,2,3,4,5- | 3 | 36 | |
| C6Hα2β3 (2–3) | 12 | 1,2,3,4,5- | 6 | 72 | ||
| 3 | C6Hαβγ3 (1–1–3) |
12 | 1,2,3,4,5- | 10 | 120 | |
| C6Hαβ2γ2 (1–2–2) |
12 | 1,2,3,4,5- | 16 | 192 | ||
| 4 | C6Hαβγδ2 (1–1–1–2) |
4 | 1,2,3,4,5- | 30a | 120 | |
| Hexahalobenzene | 1 | C6α6 (6) | 4 | 1,2,3,4,5,6- | 1 | 4 |
| 2 | C6αβ5 (1–5) | 12 | 1,2,3,4,5,6- | 1 | 12 | |
| C6α2β4 (2–4) | 12 | 1,2,3,4,5,6- | 3 | 36 | ||
| C6α3β3 (3–3) | 6 | 1,2,3,4,5,6- | 3 | 18 | ||
| 3 | C6αβγ4 (1–1–4) |
12 | 1,2,3,4,5,6- | 3 | 36 | |
| C6αβ2γ3 (1–2–3) |
24 | 1,2,3,4,5,6- | 6 | 144 | ||
| C6α2β2γ2 (2–2–2) |
4 | 1,2,3,4,5,6- | 11 | 44 | ||
| 4 | C6αβγδ3 (1–1–1–3) |
4 | 1,2,3,4,5,6- | 10 | 40 | |
| C6αβγ2δ2 (1–1–2–2) |
6 | 1,2,3,4,5,6- | 16b | 96 | ||
Fig. 1.
List of 6 + 7 + 3 = 16 structures of halobenzene with empirical formula C6αβγ2δ2 (distribution of elements 1-1-2-2). For simplicity, the two δ are omitted and structures are organised into groups by which from left to right, the first four substituents are in positions 1,2,3,4-, 1,2,3,5- and 1,2,4,5-, respectively. If switching the red letters of a structure leads to a different isomer, then that single depiction represents two different structures as shown with the notation “×2”. Letters α, β, γ, and δ represent different substituents of F, Cl, Br and I. (For Table 1, one of the letters may represent a hydrogen atom.).
Fig. 2.
List of halobenzenes with the formula C6αβγδε2 where permutation of α, β, γ, δ at four adjacent positions (1,2,3,4-) leads to possible structures. The division by two arises due to the symmetry of the structure.
Fig. 3.
Possible structures of pentahalobenzene C6Hαβγδ2 with 4 different halogens acting as substituents (distribution of elements: 1-1-1-2). Structures are divided into three groups with 12, 12 and 6 structures due to permutation for δ atoms (any halogen listed but not H) in ortho-, meta-, and para- positions, respectively. A full list of structures of the ortho group is shown in Fig. 2. (Reassignment of letters is needed.).
Table 3.
Summary of investigated compounds, levels of theory (HF, B3LYP, MP2, and CCSD) on 6–311++G(d,p) basis set and types of calculation (opt for geometry optimization and freq for frequency calculation).
| Group of compounds | Number of tuples | Number of structures | HF |
B3LYP |
MP2 |
CCSD |
||||
|---|---|---|---|---|---|---|---|---|---|---|
| opt | freq | opt | freq | opt | freq | opt | freq | |||
| Benzene | 1 | 1 | all | all | all | all | all | – | all | – |
| Monohalobenzene | 24 | 4 | all | all | all | all | all | – | all | – |
| Dihalobenzene | 240 | 30 | all | all | all | all | all | – | all | – |
| Trihalobenzene | 1280 | 124 | all | all | all | all | all | – | – | – |
| Tetrahalobenzene | 3840 | 372 | all | all | all | all | all | – | – | – |
| Pentahalobenzene | 6144 | 544 | all | all | all | all | all | – | – | – |
| Hexahalobenzene | 4096 | 430 | all | all | all | all | all | – | – | – |
| Xylene | 15 | 3 | all | all | all | all | all | – | all | – |
| Total | 15,640 | 1508 | 1508 | 1508 | 1508 | 1508 | 1508 | – | 38 | – |
In supplementary information, summary table files (.csv) are provided per level of theory.
-
•
Geometric data of 12 bond lengths, 12 bond angles and 12 torsional angles in a single csv file
-
•
Energetic data, in separate files, include electronic energy (Eelec) in Hartree, thermal correction to enthalpy (Hcorr) in kcal mol−1, zero-point vibrational energy (EZPE) in kcal mol−1 and entropy (S) in cal mol−1 K−1.
The following associated files are also provided.
-
•
Raw Q-Chem output files (.out) for all compounds.
-
•
Geometry in Z-matrix and Cartesian coordinate format (.xyz) for all compounds.
-
•
Wolfram Mathematica notebook (benzene.nb) and associated script (script.txt).
2. Experimental design, materials, and methods
Due to prohibitive computational cost, frequency calculations at MP2 and CCSD levels of theory were excluded and only benzene to dihalobenzenes and xylenes were selected for CCSD optimization jobs. The output files were processed by custom-made scripts and Wolfram Mathematica 12.0 [4] codes to extract geometric and energetic data of all halobenzene compounds in a similar manner to our previous work [5]. Data from the three xylene compounds are provided for reference purpose and were read from IQmol 2.13 manually [6].
Acknowledgments
Acknowledgments
We are grateful for materials and software purchased previously by MUIC and IPST grants.
Conflict of Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Footnotes
Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.dib.2020.105386.
Appendix. Supplementary materials
References
- 1.Shi J., Miao X., Liu Y., Tan Y., Zhang M., Cai H. 2014 International Conference on Manipulation, Manufacturing and Measurement on the Nanoscale (3M-NANO) 2014. Raman spectrum calculation and analysis of p-xylene; pp. 295–298. [Google Scholar]
- 2.Cioslowski J., Liu G., Moncrieff D. Energetics of the Homolytic C−H and C−Cl Bond Cleavages in Polychlorobenzenes: The Role of Electronic and Steric Effects. J. Phys. Chem. A. 1997;101:957–960. [Google Scholar]
- 3.Shao Y., Gan Z., Epifanovsky E., Gilbert A.T., Wormit M., Kussmann J., Lange A.W., Behn A., Deng J., Feng X. Advances in molecular quantum chemistry contained in the Q-Chem 4 program package. Mol. Phys. 2015;113:184–215. [Google Scholar]
- 4.Wolfram Research Inc, Mathematica, Champaign, Illinois, 2019.
- 5.Chinsukserm K., Lorpaiboon W., Teeraniramitr P., Limpanuparb T. Geometric and energetic data from ab initio calculations of haloethene, haloimine, halomethylenephosphine, haloiminophosphine, halodiazene, halodiphosphene and halocyclopropane. Data Brief. 2019;27 doi: 10.1016/j.dib.2019.104738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.A.T.B. Gilbert, IQmol, http://iqmol.org, 2019.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.



