Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Dec 28;15(12):e0243887. doi: 10.1371/journal.pone.0243887

ACE2 polymorphisms as potential players in COVID-19 outcome

André Salim Khayat 1,2,*, Paulo Pimentel de Assumpção 2, Bruna Claudia Meireles Khayat 1, Taíssa Maíra Thomaz Araújo 2, Jéssica Almeida Batista-Gomes 1, Luciana Carvalho Imbiriba 2, Geraldo Ishak 2, Paula Baraúna de Assumpção 2, Fabiano Cordeiro Moreira 2, Rommel Rodriguez Burbano 2,3, André Ribeiro-dos-Santos 1, Ândrea Kelly Ribeiro-dos-Santos 1,2, Ney Pereira Carneiro dos Santos 1,2, Sidney Emmanuel Batista dos Santos 1,2
Editor: Mark Allen Pershouse4
PMCID: PMC7769452  PMID: 33370311

Abstract

The clinical condition COVID-19, caused by SARS-CoV-2, was declared a pandemic by the WHO in March 2020. Currently, there are more than 5 million cases worldwide, and the pandemic has increased exponentially in many countries, with different incidences and death rates among regions/ethnicities and, intriguingly, between sexes. In addition to the many factors that can influence these discrepancies, we suggest a biological aspect, the genetic variation at the viral S protein receptor in human cells, ACE2 (angiotensin I-converting enzyme 2), which may contribute to the worse clinical outcome in males and in some regions worldwide. We performed exomics analysis in native and admixed South American populations, and we also conducted in silico genomics databank investigations in populations from other continents. Interestingly, at least ten polymorphisms in coding, noncoding and regulatory sites were found that can shed light on this issue and offer a plausible biological explanation for these epidemiological differences. In conclusion, there are ACE2 polymorphisms that could influence epidemiological discrepancies observed among ancestry and, moreover, between sexes.

Introduction

At the end of 2019, a new outbreak caused by SARS-CoV-2 (a coronavirus) started in Hubei Province, China. The clinical condition, COVID-19, probably arose from natural selection in bat reservoirs [1]. There are currently more than five million cases worldwide, and the pandemic has been increasing exponentially in many countries since the disease was deemed a pandemic by WHO in March 2020 [2].

The lethality rate is influenced by the speed of contagion, idiosyncrasies of the affected populations according to the containment policies adopted, socioeconomic conditions and the absorption limit of the health system [3]. Due to the high rate of transmission of the virus by air and the novelty of the infection to humans, the disease has become a global emergency problem, forcing periods of social confinement to contain the pandemic, in addition to hygiene habits [4].

Although mortality in men is significantly higher [5], representing an increase from 20% to 70% in European countries, approximately 65% in some Asian countries, and, even more peculiarly, Dominican Republic citizens showed three times more deaths in men than women [2,6,7]. It is unknown whether this is due to biological differences between genders, differences in behavioral habits, or comorbidities rates [8].

Studies indicate that cell-virus interaction is mediated by the connection of the transmembrane glycoprotein spike (S), present in the form of trimers on the viral surface, to angiotensin I-converting enzyme 2 (ACE2, also called hACE2) [9].

The level and expression pattern of ACE2 in different tissues and cells can be critical to the susceptibility and symptoms resulting from SARS-CoV-2 infection [10]. Zhou and collaborators [11], using scRNA-seq datasets, classified organs vulnerable to infection as high and low risk based on their expression levels of ACE2. At the clinical level, the symptoms of COVID-19 may be related to the entry and affinity of the virus in these organs, as observed in heart failure disease and increased ACE2 expression, in which viral infection is related to a higher risk of heart attack and worse ill condition [12]. For Li and collaborators [13], ACE2 genetic variations could be crucial to the susceptibility in different cohorts and to clinical outcomes of COVID-19.

Currently, investigations of potential genetic variations that may favor or hinder interactions between the virus and the host have been conducted [1416], showing a high number of codons that can, if altered, interfere with the complexity of the virus-cell interaction, as experimentally demonstrated [17]. It is noteworthy that the ACE2 is located on the X chromosome, causing the impossibility of heterozygosity in men. Therefore, polymorphisms in their single copy could be related to the worst outcomes observed in males [18].

Considering the above, we sought explanations for an intrinsic factor that differed between sexes and populations that may justify the differences observed in the incidence and lethality of SARS-CoV-2 infection among the different regions of the world, as well as between sexes. We analyzed global data in the 1000 Genomes Database and, in addition, we conducted studies of exomes in two population groups in the Brazilian Amazon (Indians and miscegenated), without description in public genomic banks, and we compared this information with a public databank from a population in southeastern Brazil. These comparisons are important because Brazil has a continental size and an admixed population in the North (more Amerindians among all regions), Northeast (more Africans), and South and Southeast (more Europeans) [19].

Materials and methods

Analyses in the 1000 Genomes project

The analysis was performed on data from the 1000 Genomes Phase 3 database (1000G), which comprises 84,4 million variants in 2,504 individuals from 26 different populations [20]. These populations were concentrated in five large groups: African (AFR), Ad Mixed American (AMR), East Asian (EAS), European (EUR), and South Asian (SAS).

Through the complete sequence of the X chromosome, a region between nucleotides 15620281 and 15512643 was selected in revision GRCh37.p13 (15602158 and 15494520 in GRCh38.p13) since the angiotensin I-converting enzyme 2 gene (ACE2, Gene ID: 59272, updated on 22-Mar-2020) is located on the complementary strand, covering 107639 bp. Additionally, a region of ten thousand base pairs upstream to the gene was included in the analysis so that the initial search site became 15630281 (GRCh37.p13) [20,21], aiming to search for modifications in noncanonical sites to locate minor allele frequency (MAF) and allelic differences that could be relevant. The index considered was the difference in allele frequency among all polymorphisms (SNP, INDEL and SV) contained in the study region, regardless of their global or populational frequency.

Analyses in Amazon natives and admixed population

For the allelic comparison between the populations cataloged in the 1000 Genomes database and the population not described in the respective project, we investigated a population composed of 64 Amerindians and 82 admixed individuals from the Amazon region of northern Brazil (data available at S4 Table). This study was approved by the National Committee for Ethics in Research (CONEP) and the Research Ethics Committee of the UFPA Tropical Medicine Center under CAAE number 20654313.6.0000.5172. The participants signed an informed consent form. The Amerindians represent 10 different Amazonian ethnic groups that were grouped together as the Native American (NAM) group. Tribe names and geographic coordinates of the Brazilian Amazon Indian populations are presented in S5 Table.

Compared with other Native Americans (from different countries and even different regions of Brazil), the Amazonian indigenous groups are similar to each other. Commonly, there are quite differences between distinct ethnic groups of native Americans. However, in comparison to populations of other ancestry backgrounds (Europeans or Africans), the Native Americans of the Brazilian Amazon may be grouped as a homogeneous population [2224].

The 82 admixed individuals (Brazilian Admixed Population—BAP) live in Belém city, located in northern Brazil, where, due to the colonization process, are characterized by three ancestral genetic components: European, Native American and African. This sample group is also enrolled in a broad project. Furthermore, we also compared our findings to a database of variants analyzed in a Southeast Brazilian population, the Online Archive of Brazilian Mutations (ABraOM, we represent here as ABM) [25].

The Brazilian population is one of the populations with the broadest genetic diversity in the world due to the high degree of miscegenation in its formation [2628]. Different regions of Brazil have significant genetic variations in the contribution of their ancestry. The population of southeastern Brazil, which forms the ABM database, has a high contribution from European ancestors [29]. Our data (BAP) were generated based on the population of Northern Brazil of formation with high Amerindian ancestry, resulting in relevant genetic differences between regions [30].

DNA extraction and Exome library

The DNA was extracted as described by Sambrook and collaborators [31]. The genetic material was quantified using a Nanodrop spectrophotometer (Thermo Fisher Scientific Inc., USA). The libraries were prepared using the Nextera Rapid Capture Exome (Illumina) and SureSelect Human All Exon V6 (Agilent) Kits. Sequencing reactions were run using the NextSeq 500 High-output v2 300 Cycle Kit (Illumina®, USA) on the NextSeq 500® platform (Illumina®, USA).

Exomic bioinformatics

The quality of the FASTQ reads was analyzed (FastQC v.0.11- http://www.bioinformatics.babraham.ac.uk/projects/fastqc/), and the samples were filtered to eliminate low-quality readings (fastx_tools v.0.13 - http://hannonlab.cshl.edu/fastx_toolkit/). The sequences were aligned with the reference genome (GRCh37) using the BWA v.0.7 tool (http://bio-bwa.sourceforge.net/). The file was indexed and sorted (SAMtools v.1.2 - http://sourceforge.net/projects/samtools/). Subsequently, the alignment was processed (duplicate PCR removal) (Picard Tools v.1.129 - http://broadinstitute.github.io/picard/), and mapping quality recalibration and local realignment (GATK v.3.2 - https://www.broadinstitute.org/gatk/) were performed. The results were processed to determine the variants (GATK v.3.2) from the reference genome. SnpEff v.4.3t, Ensembl Variant Effect Predictor (Ensembl release 99) and ClinVar (v.2018-10) were used for variant annotations.

Databank analysis

Information from the databank and exomes were analyzed using descriptive statistics, considering the values of allele frequencies of populations and subpopulations as the data explored comparatively. Genotypic differences between sexes in the homo/hemizygous state were calculated based on the premise that populations are in Hardy-Weinberg equilibrium.

Results

Analyzing the polymorphisms contained in the ACE2 locus, in addition to ten thousand base pairs upstream, we found 2266 polymorphisms, of which 199 were contained in the region 5´ upstream of the gene, 85 were located in exonic regions, and the others were located in the introns.

Polymorphisms in exonic regions that might influence ACE2 protein structure

In the exonic region, 15 SNPs of the 85 polymorphisms found in 1000G present differences greater than 1% between some of the populations (all between exons 17 to 21). Another three polymorphisms (rs889263894, rs1027571965, rs147464721) appear mainly in the Brazilian population. Nine of these exonic polymorphisms are present in the most common isomorphs (v1 and v2) and show important differences in populational frequency (Fig 1, additional data on S1 Table).

Fig 1. MAF of the main polymorphisms in exons present in the most common isoforms (v1 and v2) of the ACE2 gene.

Fig 1

RS: Reference SNP; Ref: reference allele; Alt: alternative allele; EUR: European; EAS: East Asian; SAS: South Asia; AMR: Ad Mixed American; AFR: African; NAM: Native American; BAP: Brazilian Admixed Population; ABM: Online Archive of Brazilian Mutations.

As noted, some of these polymorphisms have very significant differences between MAFs from different populations. rs35803318 is absent in Asians (virtually absent in AFR) and has an average MAF of 0.05 in EUR and ABM, increased to 0.074 in BAP, and has the highest allele frequency in the indigenous population among all populations (MAF = 0.121).

On the other hand, rs4646179 is absent in indigenous, miscegenated people from the Amazon, Asians and Europeans and is found in the population of southeastern Brazil and Americans, with a MAF = 0.023 and an even greater frequency in Africans (MAF = 0.074).

Interestingly, rs1027571965 and rs889263894 presented allele frequencies exclusively in indigenous people and are not being found in any other world population of 1000G, neither in BAP nor in the Brazilian ABM database, with MAF = 0.095 and 0.034, respectively.

rs147464721 has MAF = 0.014 in the miscegenated population of the Amazon (BAP); however, they did not present any allelic frequency among the indigenous population, as well as in Asians and Europeans, presenting a slightly lower MAF difference when compared to AFR and AMR.

Polymorphisms in upstream regions that might influence in ACE2 expression

Differences greater than 1% of MAF in the region 10,000 base pairs 5´ upstream to the gene were observed in 57 polymorphisms (Fig 2).

Fig 2. MAF of polymorphisms of the 10k 5´ upstream region of ACE 2, with MAF differences greater than 1%.

Fig 2

Coordinate represents the frequency of the allele (MAF).

Among the highlights, rs5934250, with a change from G to T at approximately 5700 bp upstream to the gene, presented a difference of up to 0.47 in the AFR (0.10), AMR (0.29), EAS (0.01), EUR (0.47) and SAS (0.22) populations for the T allele. This means that the T allele is almost zero in the East Asian population, while it has a MAF in almost half of Europeans.

It is also worth mentioning that the rs2097723 SNP presents a very heterogeneous distribution, oscillating between 7% in Africans, 32% in Americans, 42% in East Asians, 28% in Europeans and 22% in South Asians.

Polymorphisms in intronic regions that might modify ACE2 regulation

In intronic polymorphisms, many of them present a very relevant MAF interpopulational difference (up to 0.46). Two of them deserve to be highlighted, rs2285666 and rs4646140, because they are near exons.

It is important to mention that rs2285666 has the highest frequency of the rarest allele (MAF = 0.71) in the indigenous population, with very important MAF differences of 0.17 (EAS), 0.23 (SAS), 0.36 (BAP), and 0.37 (AMR) and an average difference of 0.48 for the others (EUR, AFR and ABM). In contrast, rs4646140 has a MAF ranging from zero in Indians to 0.13 in Africans through EUR, AMR, BAP, EAS, ABM and SAS (Fig 3). Furthermore, considering the possibility of influence in determining isoforms v2, it is worth noting rs190614788 on intron 1 (with a difference of more than 0.11 between EUR and EAS).

Fig 3. Minor allele frequency of the SNP intronic rs2285666 and rs4646140.

Fig 3

The main findings of our study are concatenated in Table 1 (additional information in S3 Table), and they are discussed below.

Table 1. ACE2 polymorphisms with the potential to influence gene structure or gene expression.

Interpopulation and inter-sex differences.

Population* RS Site MAF* Others (MAF) Modification (score) Men risk Women risk M/W
BAP rs147464721 codon 351 0.014 absent Synonymous 1.4% 0.0% 71.4
ABM, AMR, AFR rs4646179 codon 690 0.074 absent Synonymous 7.4% 0.5% 13.5
NAM rs35803318 codon 749 0.121 EAS, SAS, AFR (null) Synonymous 12.1% 1.5% 8.3
EUR rs41303171 codon 720 0.018 absent Asn > Asp (0.02) 1.8% 0.0% 55.6
AFR rs147311723 codon 731 0.017 EAS, SAS, AMR, ABM (null) Leu > Phe (0.941) 1.7% 0.0% 58.8
NAM rs1027571965 codon 673 0.095 absent Ala > Gly (0.045) 9.5% 0.9% 10.5
NAM rs889263894 codon 541 0.034 absent Lys > Ile (0.958) 3.4% 0.1% 29.4
EAS rs182366225 3´UTR 0.018 absent Upregulation 1.8% 0.0% 55.6
AFR rs142017934 3´UTR 0.013 absent Upregulation 1.3% 0.0% 76.9
NAM rs2285666 intron 0.71 EAS (0.54) to AFR (0.17) Upregulation (Brain, Nerve) 71.0% 50.4% 1.4
AFR rs4646140 intron 0.13 SAS (0.085) to NAM (0) No evidence 13.0% 1.7% 7.7
EAS rs2097723 upstream 0.42 AMR (0.32) to AFR (0.07) Upregulation (Brain/Nerve) 42.0% 17.6% 2.4
EUR rs5934250 upstream 0.47 AMR (0.29) to EAS (0.01) Downregulation (Brain, Nerve, Artery, Pituitary, Prostate) 47.0% 22.1% 2.1

Population*: Population with greater MAF; RS: Reference SNP; Site: Genic region; MAF*: Minor allele frequency of population*; Others (MAF): MAF of the other populations (or populations without MAF = null); Modification (score): Type of molecular consequence and PolyPhen Score [32]; Men risk: Based on hemizygous genotypic frequency (q); Women risk: Based at a homozygous genotypic frequency (q2); M/W: Risk ratio between sexes to be a carrier of the minor allele only.

Discussion

Polymorphisms of the ACE2 gene related to the protein binding region to the viral particle

Considering the regions where the virus commonly binds to ACE2 [15,16], our results point to the absence of relevant polymorphisms at these sites because many of those located in coding regions have MAFs close to or less than 0.001, as well as a very low possibility of conferring any global (or population) impact on the destination of the disease. Therefore, there does not seem to be any direct mechanism (at the sites of interaction) that could confer some form of resistance or greater propensity to contagion in humans. In view of this finding, other molecular modifications with potential functional repercussions were investigated, including 1) changes in sites distant to the viral binding locus, but which may bring about some structural protein change with the potential to influence the cell-virus interaction; 2) changes in translation regulation regions at 3´ UTR sites, at points of interaction with miRNA; and 3) modifications in transcription regulation zones in 5´ upstream regions and intragenic promoters.

Polymorphisms with the potential to cause changes in the protein structure of ACE2 that may impact virus-cell interactions

The ACE2 gene is mainly composed of two isoforms with 18 or 19 exons (v1 and v2) that encode the same protein (805 amino acids) and three other smaller variants: x1, x2 and x3, which have rarely been studied [21,33]. Thus, we searched for genotypic information in these exons that could allow us to infer a disruption with the potential to culminate in impacts on the disease process.

Fifteen SNPs showed MAF differences greater than 1% among the studied populations, mainly belonging to exons 17, 18, 19, 20 and 21; the last two are terminal exons belonging only to rare isoforms [21].

Among these, rs41303171 is a missense SNP, causing the replacement of an asparagine (neutral amino acid) with aspartic acid (electronegative amino acid) at codon 720, which can culminate in a conformational disorder of this protein that, directly or indirectly, can change viral interactions. This polymorphic variant C is practically exclusive to Europeans (MAF = 0.018), a fact corroborated by Cao and collaborators [10], mainly in British individuals (MAF = 0.03). Across Asia, this allele is not found, and is only present in a single Asian resident of the UK [20]. In Brazil, ABraOM data point to MAF<0.01. Thus, the possible biological implications of this change may have some clinical-epidemiological consequences in a small niche of European patients with COVID-19 when compared to other regions of the world.

The change from leucine to phenylalanine in codon 731 (rs147311723) results in the exchange of two nonpolar amino acids that are structurally different (with the presence of an aromatic ring in this last), which may culminate in functional modifications in the ACE2 protein, with the prediction that this will occur equally to 0.941, according to the PolyPhen algorithm [32]. This polymorphism is absent in Asians and Europeans, with low frequency in Americans and Southeast Brazilians, and it has MAF = 0.017 in Africans, mainly in Nigerians, with MAF = 0.043 of allele A. In a study by Cao and colleagues [10], this polymorphism was described as a low frequency SNP in the 1000G database, but without any mention in the China Metabolic Analysis Project (ChinaMAP) database, reinforcing its absence in this population group.

In a context focused on the Brazilian Amazon, unprecedented data showed the presence of two SNPs absent in all populations of the 1000G. One of them is rs1027571965, which is characterized by an exchange of G>C in exon 16, leading to a substitution of alanine for glycine at codon 673 (MAF = 0.095), and the other, rs889263894, an exchange of T>A in exon 13 (MAF = 0.034), causing an alteration of lysine to isoleucine in codon 541; thus, this last SNP should cause structural differences by the exchange of a polar and electropositive amino acid with a hydrophobic amino acid, possibly resulting in functional changes in ACE2, with a probability of 0.958 that this event will occur [32]. Both SNPs had uncertain significance until now.

It is also noteworthy that in codons 351 (rs147464721), 690 (rs4646179) and 749 (rs35803318), there are just synonymous alterations, without any study of modifications of this enzyme in a genotype-dependent manner.

Polymorphisms with the potential to accentuate ACE2 gene expression or translation that may impact virus-cell interactions

Our data point to a series of polymorphisms in the region upstream of the ACE2 gene, which oscillate markedly among populations and, therefore, in individuals, although some of them may not have relevant frequency at a global level. Among them, there are rs2097723 and rs5934250, with population allele differences of up to 0.35 and 0.47, respectively.

The lower frequency allele (C) of rs2097723 has a normalized effect size (NES) of up to 0.36 [34] for increased expression of the ACE gene in brain tissue. The T allele of rs5934250 has an NES of 0.64 for lower ACE2 expression in this tissue, among others [34]. Thus, when looking at population data, it can be inferred that, according to these two variants, populations in East Asia would have the most related scenario regarding increased gene expression. Europeans or Africans have a lower frequency of ACE2 higher-expression-related SNPs in the pre-genic region.

Another interesting observation involves the exon 19 polymorphisms, which are contained in the 3´ UTR region of the two most important isoforms, the canonical miRNA binding site for translation control dependent on this epigenetic mechanism [21,35]. In this regard, observing the regions of interaction between the miRNA and nucleotide exchange sites, it can be noted that rs182366225 is included in the site of attachment to the seed region of miR-140-3p.1 and miR-483-3p.2, both with a match of 7mer-1A and Context ++ score percentile of 90 and 74, respectively. Thus, the proportion of translation regulation that depends on these miRNAs will be upregulated in the East Asian population, especially in Vietnamese and Chinese individuals, who have an average MAF of 0.032 of the C allele, which is not observed in any other population in the world [36]. A reporter assay for miR-483-3p predicted targets by Kemp and collaborators [37] showed that the regulation of ACE2 is dependent on this microRNA.

The rs142017934 site includes the connection site for miR-610 and miR-3646, both with 8mer of match in ACE2 (context ++ score percentile of 98 and 77, respectively), in addition to the seed regions of miR-3609 and miR-548ah-5p, both with a match of 7mer-m8 and scores of 84 and 74, respectively [35]. This polymorphism occurs exclusively in the population of African origin (MAF = 0.013), mainly in Nigerians and individuals from Barbados with African ethnicity, with an average of MAF = 0.026. This higher frequency among people of African ethnicity residing on different continents is probably due to the strong population ancestry of Barbados being from West Africa, a region containing Nigeria [38].

Among the intronic polymorphisms, rs2285666 draws much attention because it presents the highest frequency of the rarest allele (MAF = 0.71) in the indigenous population, with differences ranging from 0.48 to 0.17 for the other populations, with the Asians being the most similar to the Indians, mainly the Chinese. The high MAF observed in the Chinese population in the present study corroborates the data of Cao and collaborators [10] using the ChinaMAP database. The substitution of C for T in intron 4, to only four nucleotides of exon 3 (located in the splicing region), influences the gene expression in brain tissues and tibial nerve in some way so that the T allele is related to a significant increase in the expression of ACE2 [34], thus may be a determinant in clinical differences in this naturally more vulnerable population.

Another intronic polymorphism, rs4646140, has no MAF in the indigenous population, reaching 0.13 in Africans, mainly in Nigerians (0.17). Some studies show the influence of these two intronic SNPs with hypertension [39,40], since there is a possibility that they will interfere in the ACE2 protein product.

In conclusion, considering this genetic aspect involving ACE2 in the complex relationship between SARS-CoV-2 and humans, we emphasize the following:

There are genetic markers that influence in ACE2 alterations and potentially to the unequal rates of aggravation and death observed between men and women. If considering some of the polymorphisms as harmful when in homozygosity (genotypic frequency equal to allelic frequency squared) or hemizygosity (genotypic frequency equal to allelic frequency), men would have this conditions from 1·4 to 77 times more (uniallelic) than women (Table 1). Thus, a relevant contribution to the understanding of higher mortality in males is presented, as reported in the various populations affected by COVID-19.

The rates of contagion and death fluctuate greatly; in this sense, ACE2 polymorphisms could contribute to these differences. The rs182366225 and rs2097723 polymorphisms that potentially may increase the expression of the ACE2 are more frequent in the East Asian population. These allele frequencies are even higher in Chinese and Vietnamese populations. Such markers are on the order of 30% to 180% more frequently in East Asians than in other populations.

Indigenous populations from Amazon have exclusive genetic polymorphisms (rs1027571965 and rs889263894) or with higher frequencies (rs2285666 and rs35803318) than other populations. These polymorphisms are related to increased expression of the ACE2 gene in brain tissues, among others. As supported previously [1012], this is a relevant finding because they can input some influence on the outcome in these populations, whose involvement by COVID-19 was recently reported. This population group, due to its genetic peculiarity and less previous exposure to viral infections, represents a major challenge in understanding and handling this pandemic.

Africans have higher rates of three relevant polymorphisms (rs147311723, rs142017934 and rs4646140). Polymorphism in rs142017934 is exclusive to this population and can influence the translation regulation of the ACE2 gene, thus enhancing the expression of this gene in individuals who carry this variant. However, Europeans and some Africans have a higher frequency of an allele (rs5934250) that seems to reduce the expression of ACE2 in some tissues.

Therefore, the study highlights the importance of considering this genetic factor as facilitating or restricting infection and, especially, in potential clinical manifestations and outcomes. Investigation of these polymorphisms in patients affected by COVID-19, with different clinical conditions and outcomes, could potentially advance our understanding of this pandemic disease.

Supporting information

S1 Table. The minor allele frequency (MAF) of SNPs in exonic regions of ACE2 that showed differences between the investigated populations.

RS: Reference SNP; Ref: Reference allele; Alt: Alternative allele; EUR: European; EAS: East Asian; SAS: South Asian; AMR: Ad Mixed American; AFR: African; NAM: Native American; BAP: Brazilian Admixed Population; ABM: Online Archive of Brazilian Mutations; GLO: Global (1000 Genomes). Bold exons are contained in common isoforms.

(DOCX)

S2 Table. Chromosome position, reference (RS) and minor allele of SNPs (MAF>0.01) in ten thousand base pairs 5´ upstream of ACE2.

*Insertion of an Alu mobile element relative to the reference.

(DOCX)

S3 Table. ACE2 polymorphisms with the potential to influence COVID-19, and their population and sex differences.

Population*: Population with greater MAF; RS: Reference SNP; Site: genic region; MAF*: minor allele frequency of population*; Subpopulation alert (MAF): Subpopulation with greater MAF; Others (MAF): MAF of the other populations (or populations without MAF = null); Modification (score): Type of molecular consequence and PolyPhen Score [32]; Hypothetical influence: based on the most probable hypothetical biological influence in disease; Men risk: based on hemizygous genotypic frequency (q); Women risk: based on homozygous genotypic frequency (q2); M/W: risk ratio between sexes to be a carrier of the minor allele only.

(DOCX)

S4 Table. Genotype frequencies data of NAM and BAP.

(XLSX)

S5 Table. Tribe names and geographic coordinates of the Brazilian Amazon indigenous populations enrolled in this study.

(DOCX)

Acknowledgments

The authors are grateful to the Federal University of Pará for lab and bioinformatics support to this research; to the 1000 Genomes Database and ABraOM for making their data available to researchers; and to all the people who, with their genetics or dedication, collaborated until here to understand an important aspect of this pandemic.

Data Availability

All 1000G files are available from the The International Genome Sample Resource database (number 20140730, reference GRCh37.p13; https://www.internationalgenome.org). All ABraOM files are available from the Brazilian genomic variants database (number 1.0.2, reference hg19; http://abraom.ib.usp.br/index.php). NAM and BAP data are available at S4 Table.

Funding Statement

S.E.B.S. received from The Brazilian National Council for Scientific and Technological Development (CNPq, www.cnpq.br) financial support (number 431488/2016-9) to perform exome sequencing assays. J.A.B.G. received a Doctoral fellowship from Coordination for the Improvement of Higher Education Personnel (CAPES, www.gov.br/capes). A.S.K., P.P.A, R.M.R.R., A.K.R.S., S.E.B.S. received Researcher Fellowship (PQ), and L.C.I. received Master fellowship from CNPq. the Federal University of Pará (UFPA, www.ufpa.br) financial support part of this work. The funders had no role in study design, data collection, analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Andersen K, Rambaut A, Lipkin W, Holmes E, Garry R. The proximal origin of SARS-CoV-2. Nature Medicine. 2020, 26: 450–452. 10.1038/s41591-020-0820-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.WHO. World Health Organization, member, states and regions. (accessed April 2020). Available from: https://who.maps.arcgis.com.
  • 3.Boccia S, Ricciardi W, Ioannidis J. What Other Countries Can Learn From Italy During the COVID-19 Pandemic. JAMA Inter Med 2020. (accessed Abril 2020). Available from: https://jamanetwork.com/journals/jamainternalmedicine/fullarticle/2764369 10.1001/jamainternmed.2020.1447 [DOI] [PubMed] [Google Scholar]
  • 4.Wang Y, Wang Y, Chen Y, Qin Q. Unique epidemiological and clinical features of the emerging2019 novel coronavirus pneumonia (COVID‐19) implicate special control measures. J Med Virol. 2020, (accessed March 2020). Available from: https://onlinelibrary.wiley.com/doi/full/10.1002/jmv.25748 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Wenham C, Smith J, Morgan R. On behalf of the Gender and COVID-19 Working Group COVID-19: the gendered impacts of the outbreak. The Lancet. 2020, 395 (10227): 846–848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Chinese Center for Disease Control and Prevention. The epidemiological characteristics of an outbreak of 2019 novel coronavirus diseases (COVID-19) in China. Zhonghua Liu Xing Bing Xue Za Zhi. 2020, 17;41(2):145–151. 10.3760/cma.j.issn.0254-6450.2020.02.003 [DOI] [PubMed] [Google Scholar]
  • 7.GlobalHealth50/50. Global Health 50/50. (accessed April 2020). Available from: http://globalhealth5050.org/covid19/.
  • 8.Rozenberg S, Vandromme J, Martin C. Are we equal in adversity? Does Covid-19 affect women and men differently?. Maturitas. 2020,138:62–68. 10.1016/j.maturitas.2020.05.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Yan R, Zhang Y, Li Y, Xia L, Guo Y, Zhou Q.Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science. 2020, March 27;367(6485):1444–1448. 10.1126/science.abb2762 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cao Y, Li L, Feng Z, et al. Comparative genetic analysis of the novel coronavirus (2019-nCoV/SARS-CoV-2) receptor ACE2 in different populations. Cell Discov. 2020, 24: 6:11 10.1038/s41421-020-0147-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Zhou P, Yang X, Wang X, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020, 579: 270–273. 10.1038/s41586-020-2012-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chen L, Li X, Chen M, Feng Y, Xiong C. The ACE2 expression in human heart indicates new potential mechanism of heart injury among patients infected with SARS-CoV-2. Cardiovasc Res. 2020, March 30 pii: cvaa078 10.1093/cvr/cvaa078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Li Y, Zhou W, Yang L, You R. Physiological and pathological regulation of ACE2, the SARS-CoV-2 receptor. Pharmacol Res. 2020, April 14:104833 10.1016/j.phrs.2020.104833 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Othman H, Bouslama Z, Brandenburg J, et al. Interaction of the spike protein RBD from SARS-CoV-2 with ACE2: similarity with SARS-CoV, hot-spot analysis and effect of the receptor polymorphism. (accessed March 2020). Available from: https://www.biorxiv.org/content/10.1101/2020.03.04.976027v3. 10.1101/2020.03.04.976027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ortega J, Serrano M, Pujol F, Rangel H. Role of changes in SARS-coV-2 spike protein in the interaction with the human ACE2 receptor: an in silico analysis. EXCLI Journal. 2020, 19: 410–417. 10.17179/excli2020-1167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Yan R, Zhang Y, Li Y, Xia L, Guo Y, Zhou Q. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science. 2020, 367 (6485): 1444–1448. 10.1126/science.abb2762 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chan KK, Dorosky D, Sharma P, Abbasi SA, Dye JM, Kranz DM, et al. Engineering human ACE2 to optimize binding to the spike protein of SARS coronavirus 2. Science. 2020, September 4;369(6508):1261–1265. 10.1126/science.abc0870 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Walls A, Park Y, Tortorici M, Wall A, McGuire A, Veesler D. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell. 2020, pii, S0092-8674(20)30262-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Santos N, Ribeiro-Rodrigues E, Ribeiro-Dos-Santos A, Pereira R, Gusmão L, Amorim Guerreiro J, et al. Assessing individual interethnic admixture and population substructure using a 48-insertion-deletion (INSEL) ancestry-informative marker (AIM) painel. Hum Mutat. 2010, 31(2): 184–90. 10.1002/humu.21159 [DOI] [PubMed] [Google Scholar]
  • 20.IGSR. The International Genome Sample Resource. IGRS; (accessed April 2020). Available from: https://www.internationalgenome.org/. [Google Scholar]
  • 21.NCBI. ACE2 angiotensin I converting enzyme 2 [Homo sapiens (human)]. PUBMED. (accessed April 2020). Available from: https://www.ncbi.nlm.nih.gov/gene/59272.
  • 22.Lindenau JD, Guimarães LS, Hurtado AM, Hill KR, Tsuneto LT, Salzano FM, et al. Association between HLA-DR4 haplotypes and tuberculin skin test response in the Aché population. Tissue Antigens. 2014, November;84(5):479–83. 10.1111/tan.12451 [DOI] [PubMed] [Google Scholar]
  • 23.Salzano FM, Sans M. Interethnic admixture and the evolution of Latin American populations. Genet Mol Biol 2014, March;37(1 Suppl):151–70. 10.1590/s1415-47572014000200003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lindenau JD, Salzano FM, Hurtado AM, Hill KR, Petzl-Erler ML, Tsuneto LT, et al. Variability of innate immune system genes in Native American populations-relationship with history and epidemiology. Am J Phys Anthropol. 2016, April;1594:722–8. 10.1002/ajpa.22917 [DOI] [PubMed] [Google Scholar]
  • 25.ABraOM Online Archive of Brazilian Mutations. (accessed April 2020). Available from: http://abraom.ib.usp.br/.
  • 26.Carvalho DC, Wanderley AV, Amador MAT, Fernandes MR, Cavalcante GC, Pantoja KBCC, et al. Amerindian genetic ancestry and INDEL polymorphisms associated with susceptibility of childhood B-cell Leukemia in an admixed population from the Brazilian Amazon. Leukemia Research. 2015, 39(11), 1239–1245. [DOI] [PubMed] [Google Scholar]
  • 27.Rodrigues JCG, Fernandes MR, Guerreiro JF, et al. Polymorphisms of ADME-related genes and their implications for drug safety and efficacy in Amazonian Amerindians. Sci Rep. 2019, 9(1):7201 10.1038/s41598-019-43610-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Leitão LPC, Souza TP, Rodrigues JCG, Fernandes MR, Santos S, Santos NPC. The Metabolization Profile of the CYP2D6 Gene in Amerindian Populations: A Review. Genes (Basel). 2020, 11(3):262 10.3390/genes11030262 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Mingrone Netto RC, Angeli CB, Auricchio MTBM, Mesquita ERL, AKC Ribeiro-dos-Santos, Ferrari I, et al. Distribution of CGG repeats and FRAXAC1/DXS548 alleles in South American populations. American Journal of Medical Genetics. 2002, Part A, v. 111, p. 243–252. 10.1002/ajmg.10572 [DOI] [PubMed] [Google Scholar]
  • 30.Santos NPC, Ribeiro-Rodrigues EM, Ribeiro-dos-Santos ÂKC, Pereira R, Gusmão L, Amorim A, et al. Assessing individual interethnic admixture and population substructure using a 48-insertion-deletion (INSEL) ancestry-informative marker (AIM) panel. Human Mutation. 2010, 31(2) 184–190. 10.1002/humu.21159 [DOI] [PubMed] [Google Scholar]
  • 31.Sambrook J, Fritsch E, Maniatis T. Molecular cloning: a laboratory manual. New York: Cold Spring Harbor Laboratory. Press; 1989. [Google Scholar]
  • 32.Adzhubei I, Jordan D, Sunyaev S. Predicting functional effect of human missense mutations using PolyPhen-2. Curr Protoc Hum Genet. 2013; 76: 7.20.1–7.20.41. 10.1002/0471142905.hg0720s76 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Chen J, Jiang Q, Xia X, Liu K, Yu Z, Tao W, et al. Individual Variation of the SARS-CoV2 Receptor ACE2 Gene Expression and Regulation. Preprints 2020. (accessed March 2020). Available from: https://www.preprints.org/manuscript/202003.0191/v1. 10.1111/acel.13168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.GTEx Portal. The Genotype-Tissue Expression (GTEx). GTEx Portal; (accessed March 2020). Available from: https://www.gtexportal.org/. [Google Scholar]
  • 35.Agarwal V, Bell G, Nam J, Bartel D. Predicting effective microRNA target sites in mammalian mRNAs. Elife. 2015, 4: e05005 10.7554/eLife.05005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Cunningham F, Achuthan P, Akanni W, et al. Ensembl 2019. Nucleic Acids Res. 2019, 47D, D745–D751. 10.1093/nar/gky1113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kemp J, Unal H, Desnoyer R, Yue H, Bhatnagar A, Karnik S. Angiotensin II-regulated microRNA 483-3p directly targets multiple components of the renin-angiotensin system. J Mol Cell Cardiol. 2014, 75, 25–39. 10.1016/j.yjmcc.2014.06.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Gouveia M, Borda V, Leal T, et al. Origins, admixture dynamics and homogenization of the African gene pool in the Americas. Mol Biol Evol. 2020, pii, msaa033. 10.1093/molbev/msaa033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Zhao Q, Gu D, Kelly T, et al. Association of Genetic Variants in the Apelin–APJ System and ACE2 With Blood Pressure Responses to Potassium Supplementation: The GenSalt Study. Am J Hypertens. 2010, 23(6), 606–613. 10.1038/ajh.2010.36 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Luo Y, Liu C, Guan T, et al. Association of ACE2 genetic polymorphisms with hypertension-related target organ damages in south Xinjiang. Hypertens Res. 2019, 42(5), 681–689. 10.1038/s41440-018-0166-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Table. The minor allele frequency (MAF) of SNPs in exonic regions of ACE2 that showed differences between the investigated populations.

RS: Reference SNP; Ref: Reference allele; Alt: Alternative allele; EUR: European; EAS: East Asian; SAS: South Asian; AMR: Ad Mixed American; AFR: African; NAM: Native American; BAP: Brazilian Admixed Population; ABM: Online Archive of Brazilian Mutations; GLO: Global (1000 Genomes). Bold exons are contained in common isoforms.

(DOCX)

S2 Table. Chromosome position, reference (RS) and minor allele of SNPs (MAF>0.01) in ten thousand base pairs 5´ upstream of ACE2.

*Insertion of an Alu mobile element relative to the reference.

(DOCX)

S3 Table. ACE2 polymorphisms with the potential to influence COVID-19, and their population and sex differences.

Population*: Population with greater MAF; RS: Reference SNP; Site: genic region; MAF*: minor allele frequency of population*; Subpopulation alert (MAF): Subpopulation with greater MAF; Others (MAF): MAF of the other populations (or populations without MAF = null); Modification (score): Type of molecular consequence and PolyPhen Score [32]; Hypothetical influence: based on the most probable hypothetical biological influence in disease; Men risk: based on hemizygous genotypic frequency (q); Women risk: based on homozygous genotypic frequency (q2); M/W: risk ratio between sexes to be a carrier of the minor allele only.

(DOCX)

S4 Table. Genotype frequencies data of NAM and BAP.

(XLSX)

S5 Table. Tribe names and geographic coordinates of the Brazilian Amazon indigenous populations enrolled in this study.

(DOCX)

Data Availability Statement

All 1000G files are available from the The International Genome Sample Resource database (number 20140730, reference GRCh37.p13; https://www.internationalgenome.org). All ABraOM files are available from the Brazilian genomic variants database (number 1.0.2, reference hg19; http://abraom.ib.usp.br/index.php). NAM and BAP data are available at S4 Table.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES