Skip to main content
PLOS One logoLink to PLOS One
. 2020 Sep 16;15(9):e0238255. doi: 10.1371/journal.pone.0238255

Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations

Anshika Srivastava 1, Rudra Kumar Pandey 1, Prajjval Pratap Singh 1, Pramod Kumar 2, Avinash Arvind Rasalkar 3, Rakesh Tamang 4, George van Driem 5, Pankaj Shrivastava 6, Gyaneshwer Chaubey 1,*
Editor: Alessandro Achilli7
PMCID: PMC7494073  PMID: 32936832

Abstract

It was shown that the human Angiotensin-converting enzyme 2 (ACE2) is the receptor of recent coronavirus SARS-CoV-2, and variation in this gene may affect the susceptibility of a population. Therefore, we have analysed the sequence data of ACE2 among 393 samples worldwide, focusing on South Asia. Genetically, South Asians are more related to West Eurasian populations rather than to East Eurasians. In the present analyses of ACE2, we observed that the majority of South Asian haplotypes are closer to East Eurasians rather than to West Eurasians. The phylogenetic analysis suggested that the South Asian haplotypes shared with East Eurasians involved two unique event polymorphisms (rs4646120 and rs2285666). In contrast with the European/American populations, both of the SNPs have largely similar frequencies for East Eurasians and South Asians, Therefore, it is likely that among the South Asians, host susceptibility to the novel coronavirus SARS-CoV-2 will be more similar to that of East Eurasians rather than to that of Europeans.

Introduction

The novel coronavirus SARS-CoV-2, the causative agent of the ongoing pandemic of COVID-19, today presents one of the major challenges to humanity [1]. Recent studies have effectively demonstrated that the Angiotensin-converting enzyme 2 (ACE2) encoded by a gene located on the X-chromosome is the host receptor for the virus [1, 2]. A decreased level of ACE2 expression mitigates the severity of the disease. The over-expression or a unique genetic polymorphism of the receptor among Asians have been ruled out in a recent study [3, 4]. ACE2 also maintains cardiovascular homeostasis and electrolyte balance and protects against lung injury by acid aspiration [5]. A comprehensive understanding of ACE2 variations among various ethnic groups has hitherto been largely unknown.

The South Asia subcontinent harbours diverse and endogamous ethnic groups [6]. Most of the genomes of South Asia are autochthonous but show a considerable amount of sharing with East and West Eurasia [7]. However, when we compare overall genome sharing with East vs. West Eurasia, South Asians show greater genetic affinity with West Eurasia [810]. The only exception is Tibeto-Burman speaking populations, who share a large amount of ancestry with East Eurasia [11]. The genetic structure of ACE2 haplotypes among South Asian populations is not known. Therefore, we have analysed the whole genome data of South Asians with respect to various world populations for ACE2 published elsewhere [12, 13] (S1 Table).

Materials and methods

The research has been approved by the Institutional Ethical Committee of Banaras Hindu University, Varanasi, India. To analyse the ACE2 among various populations, we have extracted the sequences from the published datasets [12, 13], by using PLINK 1.9 [14]. It has been shown that the 1000 genome dataset for South Asia does not capture the complete South Asian variation, mainly due to unsampled Austroasiatic populations [15]. Hence, we analysed Pagani et al. [12] by way of primary data and further confirmed the results with the 1000 genome data [13]. We extracted 447 samples designated as a diversity set panel in the Pagani et al. data [12]. After excluding samples from Africa, Sahul and relatives up to the second degree, we used 393 samples in all our analyses (S1 Table). A total of 248 polymorphisms were observed in the Pagani et al. data [12] (S2 Table). LD maps for each of the groups were analysed from Haploview [16] (S1 Fig). For both of the datasets, we converted plink file to fasta file (ped to IUPAC) from customised script. Phasing of the data, the calculation of population-wise genetic distances, and Arlequin and Network input files were generated by DnaSP v 6 [17]. The neighbour joining (NJ) tree was constructed by MEGA-X [18] (Fig 1A). Nei’s genetic distances and pairwise differences were calculated from Arlequin 3.5 [19] and plotted by R v 3.1 [20] (Fig 1B and S2 Fig). Network v5 [21] and Network publisher were used to construct the median joining (MJ) networks (Fig 2 and S3 Fig). The spatial map of rs4646120 and rs2285666 were drawn from PGG toolkit (S4 Fig) [22].

Fig 1.

Fig 1

a) The Neighbour-Joining (NJ) tree showing the genetic relationship of the studied populations. The figure was drawn from the Fst distances obtained from the haplotype analysis of Eurasian populations. b) Heat map showing the intra- and inter-population variation measured by average pairwise sequence differences of the ACE2 gene. The average pairwise differences between populations are shown in the upper triangle of the matrix (green). The average number of pairwise differences within each population group are shown along the diagonal (orange). The differences between populations based on Nei's genetic distances are depicted in lower triangle of the matrix (blue). The obtained values of various parameters have been shown at the color scales.

Fig 2. The median joining (MJ) network of 142 haplotypes belonging to gene ACE2.

Fig 2

Circle sizes are proportional to the number of samples with that haplotype. The three most common haplotypes are marked. The three important SNPs studied in details have been marked in the figure. We used median joining method implemented in the NETWORK programme ver. 5.

Result and discussion

Our pooled data have yielded 248 high quality polymorphisms (S2 Table). In the LD (linkage disequilibrium) plot analysis, significant LD blocks of different sizes were present among Caucasus, Central Asians, South Asians, mainland Southeast Asians, insular Southeast Asians and Siberians (S1 Fig). Europeans showed the lowest level of LD. We have used a haplotype based approach for the comparison. In contrast with the genome-wide analysis [810], the NJ (Neighbour Joining) tree based on Fst distances clustered South Asians together with insular and mainland Southeast Asian populations (Fig 1A). This unexpected result suggested closer a genetic affinity of South Asians with East Eurasians for ACE2. The pairwise difference analysis suggested lower diversity for South Asian, Southeast Asian and Siberian populations (Fig 1B). Similarly, the 1000 genome populations showed the lowest diversity for East Asian populations (S2 Fig).

The phylogenetic analysis of various haplotypes among studied populations helped to identify the SNPs responsible for the affinity of South Asians with East Eurasians (Fig 2 and S3 Fig). Three major distinct haplotypes were observed. Haplotype 1 (ht1) was more common in West Eurasians, including Central Asian populations, whereas haplotype 2 (ht2) was frequent among East Eurasians, South Asians and Americans (Fig 2 and S3 Fig). Haplotype 3 (ht3) was harboured mainly by East Eurasians and South Asians. The haplotype 2 (ht2) originated from SNP rs4646120, whereas ht3 was derived from SNP rs2285666. Phylogenetically both of these SNPs play a key role in the distinction between East and West Eurasian populations (Fig 2 and S3 and S4 Figs). Interestingly, the most frequent haplotypes of South Asia involve these SNPs. A recent study has also highlighted the highest frequency of this SNP (rs 2285666) among Chinese populations (0.5) as well as significant frequency differences among 1000 genome populations (S4 Fig) [4]. In our study, we also found high frequency (0.6) of this SNP among South Asians (S2 Table and S4 Fig). Moreover, we also found that a synonymous coding region variant rs35803318 was most frequent among Americans (0.15), followed by Europeans (0.055), Caucasians (0.051) and Central Asians (0.021), whilst this site was not polymorphic for West Asians, South Asians, Southeast Asians and Siberians (S2 Table).

Phylogenetic analysis has suggested that the majority of South Asian samples share with East Eurasians the monophyletic haplotypes 2 and 3 by the unique polymorphism events (rs4646120) and (rs2285666). Recent studies have suggested that the reference allele has a reduced ACE2 expression of up to 50%, resulting in greater severity of a SARS-CoV-2 infection [2325]. Additionally, a synonymous coding region variant rs35803318 was also significantly more polymorphic among Americans and Europeans than among South Asians. Hence, it is likely that among South Asians, the host susceptibility to the novel coronavirus SARS-CoV-2 more closely resembles that of East/Southeast Asians rather than that of Europeans or Americans.

Supporting information

S1 Fig. The LD (linkage disequilibrium) plots of ACE2 gene of various studied populations.

Shading from white to red indicates the intensity of r2 from 0 to 1. Strong LD is represented by a high percentage (>80) and a darker red square.

(TIF)

S2 Fig. Heat map showing the intra- and inter-population variation measured by average pairwise sequence differences of the ACE2 gene among 1000 genome populations.

The average pairwise differences between populations are shown in the upper triangle of the matrix (green). The average number of pairwise differences within each population group are shown along the diagonal (orange). The differences between populations based on Nei's genetic distances are depicted in lower triangle of the matrix (blue). The populations are grouped in to superpopulations e.g. European, South Asian, East Asian and American.

(TIF)

S3 Fig. The median joining (MJ) network of 491 haplotypes belonging to gene ACE2 among 1000 genome populations.

Circle sizes are proportional to the number of samples with that haplotype. The three most common haplotypes are marked. All three SNPs studied in detail have been marked by arrow. We used median joining method implemented in the NETWORK programme ver. 5.

(TIF)

S4 Fig. The spatial distribution of alleles of rs4646120 and rs2285666 among 1000 genome populations.

The map was obtained from the PGG toolkit implemented in the https://www.pggsnv.org/.

(TIF)

S1 Table. The number of samples from each of the region used in the analysis.

The number of South Asian groups shown with their linguistic affiliations.

(PDF)

S2 Table. The details of 248 polymorphic loci extracted from analysed data.

The frequencies of alternate alleles for each loci and group have been mentioned in the table. The three important SNPs studied in details have been highlighted.

(XLSX)

Acknowledgments

We thank to both of the reviewers and the Editor for their constructive suggestions.

Data Availability

The used data is available at https://evolbio.ut.ee/CGgenomes.html.

Funding Statement

This work is supported by the National Geographic Explorer grant HJ3-182R-18. Redcliffe Life Sciences Pvt Ltd. India provided support in the form of salaries for author AR. The specific roles of this author is articulated in the ‘author contributions’ section. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

References

  • 1.Lu R, Zhao X, Li J, Niu P, Yang B, Wu H, et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. The Lancet. 2020;395:565–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Zhou P, Yang X-L, Wang X-G, Hu B, Zhang L, Zhang W, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–3. 10.1038/s41586-020-2012-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Chen Y, Shan K, Qian W, Qian W. Asians do not exhibit elevated expression or unique genetic polymorphisms for ACE2, the cell-entry receptor of SARS-CoV-2. Preprints 2020. 2020020258 10.20944/preprints202002.0258.v2 [DOI] [Google Scholar]
  • 4.Cao Y, Li L, Feng Z, Wan S, Huang P, Sun X, et al. Comparative genetic analysis of the novel coronavirus (2019-nCoV/SARS-CoV-2) receptor ACE2 in different populations. Cell Discov. 2020;6:1–4. 10.1038/s41421-019-0132-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Yang J, Zheng Y, Gou X, Pu K, Chen Z, Guo Q, et al. Prevalence of comorbidities in the novel Wuhan coronavirus (COVID-19) infection: a systematic review and meta-analysis. Int J Infect Dis. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Nakatsuka N, Moorjani P, Rai N, Sarkar B, Tandon A, Patterson N, et al. The promise of discovering population-specific disease-associated genes in South Asia. Nat Genet. 2017;49:1403 10.1038/ng.3917 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Chaubey G, Metspalu M, Kivisild T, Villems R. Peopling of South Asia: investigating the caste-tribe continuum in India. BioEssays News Rev Mol Cell Dev Biol. 2007;29:91–100. [DOI] [PubMed] [Google Scholar]
  • 8.Xing J, Watkins WS, Hu Y, Huff CD, Sabo A, Muzny DM, et al. Genetic diversity in India and the inference of Eurasian population expansion. Genome Biol. 2010;11:R113 10.1186/gb-2010-11-11-r113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Metspalu M, Romero IG, Yunusbayev B, Chaubey G, Mallick CB, Hudjashov G, et al. Shared and unique components of human population structure and genome-wide signals of positive selection in South Asia. Am J Hum Genet. 2011;89:731–44. 10.1016/j.ajhg.2011.11.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Reich D, Thangaraj K, Patterson N, Price AL, Singh L. Reconstructing Indian population history. Nature. 2009;461:489–94. 10.1038/nature08365 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Chaubey G, Metspalu M, Choi Y, Mägi R, Romero IG, Soares P, et al. Population Genetic Structure in Indian Austroasiatic speakers: The Role of Landscape Barriers and Sex-specific Admixture. Mol Biol Evol. 2011;28:1013–24. 10.1093/molbev/msq288 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Pagani L, Lawson DJ, Jagoda E, Mörseburg A, Eriksson A, Mitt M. Genomic analyses inform on migration events during the peopling of Eurasia. Nature. 2016;538 10.1038/nature19792 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.1000 Genomes Project Consortium, Durbin RM, Abecasis GR, Altshuler DL, Auton A, Brooks LD, et al. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–73. 10.1038/nature09534 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 2015;4:1–16. 10.1186/2047-217X-4-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Sengupta D, Choudhury A, Basu A, Ramsay M. Population Stratification and Underrepresentation of Indian Subcontinent Genetic Diversity in the 1000 Genomes Project Dataset. Genome Biol Evol. 2016;8:3460–70. 10.1093/gbe/evw244 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinforma Oxf Engl. 2005;21:263–5. [DOI] [PubMed] [Google Scholar]
  • 17.Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017;34:3299–302. 10.1093/molbev/msx248 [DOI] [PubMed] [Google Scholar]
  • 18.Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol. 2018;35:1547–9. 10.1093/molbev/msy096 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Excoffier L, Lischer HEL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10:564–7. 10.1111/j.1755-0998.2010.02847.x [DOI] [PubMed] [Google Scholar]
  • 20.R-Core-Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2012. http://www.R-project.org/. [Google Scholar]
  • 21.Bandelt H-J, Forster P, Röhl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999;16:37–48. 10.1093/oxfordjournals.molbev.a026036 [DOI] [PubMed] [Google Scholar]
  • 22.Zhang C, Gao Y, Ning Z, Lu Y, Zhang X, Liu J, et al. PGG.SNV: understanding the evolutionary and medical implications of human single nucleotide variations in diverse populations. Genome Biol. 2019;20:215 10.1186/s13059-019-1838-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Asselta R, Paraboschi EM, Mantovani A, Duga S. ACE2 and TMPRSS2 variants and expression as candidates to sex and country differences in COVID-19 severity in Italy. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wu Y, Li J, Wang C, Zhang L, Qiao H. The ACE 2 G8790A Polymorphism: Involvement in Type 2 Diabetes Mellitus Combined with Cerebral Stroke. J Clin Lab Anal. 2017;31:e22033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Singh KK, Chaubey G, Chen JY, Suravajhala P. Decoding SARS-CoV-2 Hijacking of Host Mitochondria in Pathogenesis of COVID-19. Am J Physiol-Cell Physiol. 2020. 10.1152/ajpcell.00224.2020 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Alessandro Achilli

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

2 Jun 2020

PONE-D-20-12219

Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations

PLOS ONE

Dear Dr. Chaubey,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

In particular, the Reviewer 1 raised major points about dataset and analyses, pointing out even problems of reproducibility and suggesting a "comparison to 1000G data", and about english writing, "typos but also sentences, issues with references and figures", which should be revised by a native speaker.

Please address also the other issues raised by both Reviewers and submit your revised manuscript by the 30th June 30, 2020. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Alessandro Achilli, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following in the Acknowledgments Section of your manuscript:

This work is supported by the National Geographic Explorer grant HJ3-182R-18. The

funders had no role in study design, data collection and analysis, decision to publish or

preparation of the manuscript.

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

The authors received no specific funding for this work.

3. Thank you for stating the following in the Competing Interests section:

The authors have declared that no competing interests exist.

We note that one or more of the authors are employed by a commercial company: Redcliffe Life Sciences Pvt Ltd.

a. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.

Please also include the following statement within your amended Funding Statement.

“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”

If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.

b. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc. 

Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to  PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and  there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests

4. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

5. Please upload a copy of Supporting Information Table 2 which you refer to in your text on page 10.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Srivastava et al present “Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations”. While the basic concept is interesting and timely given the current SARS-CoV-2 pandemic, the manuscript unfortunately can not be published in its current form:

- Unfortunately important information is missing, reproducibility is not entirely given

- the manuscript was written quickly and includes several errors, which make it hard to read but also trust the results, e.g. wrong SNP name.

- Unfortunately the images in Figure1 where hardly readable, with the figure legends not very informative and helpful

Major Issues:

The main issue I see with this manuscript is that it was written in a hustle, which a reader can easily tell, as there is information missing, one would typically expect, besides several errors (typos but also sentences, issues with references and figures). Comparison to 1000G data is very limited; as the main focus is on South Asia, with only 25 samples included, whereas 1000G includes 662 samples. Therefore one of my main concerns is, how representative are the 25 (mostly males, as ACE2 on X?) samples for South Asia. How do the results differ to the one reported by Cao et al 2020, what is the frequency of the two main SNPs in South-Asians for 1000G data?

The file in the provided data includes 402 samples, Pagani et al describe 483 samples. This work according the haplotypes in Suppl.Materials a total of 393 samples. Can you describe the discordance?

Unfortunately (technical issue here) the figures can not be read well – can you please elaborate and include a larger font? Also the figure legends are not very informative.

Please check the manuscript with help of a native english speaker

Minor Issues:

Abstract:

- include sample size of your data set, including for super populations once mentioned (EAS, SAS, EUR).

- the human Angiotensin-converting enzyme 2 (ACE2)

- more related TO West Eurasians

-focusing ON South Asia

-The second SNP – without talking about the first SNP? Why only the second one, what is with the first one?

Introduction:

- Please recheck sentence: ...has been one of the serious threat to humanity now

- encoded by A gene located ON

- expression of ACE2 receptor? relate to the severity OF THE disease

- Confusing sentence: Growing evidence are accumulating, however, still the expression of ACE2 variations affecting host susceptibility among various world population is not known.

- South Asia is a country ...really? It is a region consisting of countries Afghanistan, Bangladesh, Bhutan, Maldives, Nepal, India, Pakistan and Sri Lanka 

- Please recheck, very confusing: For the ACE2, it is not known that how the polymorphisms present in South Asia is shared with the East and West Eurasian populations?

-Reference 10 and 11 are identical, Reference 3 missing journal,

Material and Methods:

- Reference 10 vs 11. a [11] (Supplementary Table 1) here I was searching the data in Paganis Supplementary table – please make clearer

- Supplementary Figure 3: one can clearly see that the frequency of unique polymorphisms is strongly correletad to the sample size with one exception (SEA_M, this gets obvious when you order by sample size) – what is the purpose of the barplot? Same with Figure 1, how useful is this plot, can the sample size difference also be biasing the main finding?

- What is the purpose of supplemental table 1 and figure 3 – as figure 3 reports the same data?

- Please check sentence: … were constructed from the Haploview.

ResultS and discussion

- Please check: Europeans and Siberians had THE highest number of private polymorphisms which ARE likely due to their large number of samples from these groups (Supplementary Figure 1, Figure 3, Table 1)

- what are the snps defining ht1?

- LD patters → LD patterns

- this unexpected result – wasn’t it reported by Cao et al 2020 yet?

- LD Plots – why are different amount of SNPs used for the subfigures on population level? However no chance to read. Can you please provide a better description of how the LD

- consistent naming of haplotypes ht1 … either with space or none

- rs446120 is on chr8 – should be rs4646120 according figure 1c. 2X wrong in text.

- was more common in West Eurasians

- What leads to the conclusion that rs2285666 has lower severity for SARS-CoV-2?

Reviewer #2: Comments on the manuscript entitled “Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations”, by Srivastava et al.

Srivastava et al. have analysed whole genome data of the populations around the world, focusing South Asia, and extracted 248 ACE2 variations. Phylogenetic analysis of the above data suggested that the majority of South Asian ACE2 haplotypes are closer to the East Eurasians in the background of two unique event polymorphisms. Hence, the authors have suggested that it is highly likely that among the South Asians, the host susceptibility to the novel coronavirus SARS-CoV-2 will be more similar to East/Southeast Asians rather than the Europeans.

Considering the ongoing pandemic, this manuscript is appropriate now. However, I suggest the authors to take care of the following points.

1. No author has affiliation No. 5!

2. No mention about the total number of samples, analysed; although supplementary Table 1 has the information. I suggest the authors to mention the numbers in the methods.

3. It may be useful to discuss in the manuscript about the linguistic background of South Asian populations that were included in the study.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Sep 16;15(9):e0238255. doi: 10.1371/journal.pone.0238255.r002

Author response to Decision Letter 0


30 Jun 2020

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

________________________________________

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

________________________________________

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

________________________________________

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

________________________________________

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Srivastava et al present “Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations”. While the basic concept is interesting and timely given the current SARS-CoV-2 pandemic, the manuscript unfortunately can not be published in its current form:

We have substantially revised the manuscript by addressing all the concerns of the reviewers.

- Unfortunately important information is missing, reproducibility is not entirely given

- the manuscript was written quickly and includes several errors, which make it hard to read but also trust the results, e.g. wrong SNP name.

- Unfortunately the images in Figure1 where hardly readable, with the figure legends not very informative and helpful

We have separated the figures now in the revised version.

Major Issues:

The main issue I see with this manuscript is that it was written in a hustle, which a reader can easily tell, as there is information missing, one would typically expect, besides several errors (typos but also sentences, issues with references and figures). Comparison to 1000G data is very limited; as the main focus is on South Asia, with only 25 samples included, whereas 1000G includes 662 samples. Therefore one of my main concerns is, how representative are the 25 (mostly males, as ACE2 on X?) samples for South Asia. How do the results differ to the one reported by Cao et al 2020, what is the frequency of the two main SNPs in South-Asians for 1000G data?

We have replicated our analysis with the 1000G data phase 3 data now in the revised version (Supplementary Figs 2-4). Results from both of the data has yielded similar results and conclusions drawn by us is supported by the analysis of independent datasets (Pagani et al. 2016 and 1000G).

We differ from Cao et al 2020 paper substantially. Cao et al have just looked the frequency of coding region substitutions among 1000G as well as Chinese dataset. They also didn’t add any analysis by the ancestry sharing point of view. Whereas, we used more robust haplotype based approach and moved a step ahead than Cao et al.

To have a better understanding of these SNPs worldwide, in the revised version, we have added the spatial distribution of two main SNPs in 1000G data (Supplementary Fig 4).

The file in the provided data includes 402 samples, Pagani et al describe 483 samples. This work according the haplotypes in Suppl.Materials a total of 393 samples. Can you describe the discordance?

According to the Supplementary Table 1. Pagani used 447 samples for their diversity panel. We have used the same data. After excluding regions e.g. Africa and Sahul as well as close relatives, we landed to 393 samples. We have explicitly mentioned this in the material and method section.

Unfortunately (technical issue here) the figures can not be read well – can you please elaborate and include a larger font? Also the figure legends are not very informative.

We have made separate figures now and also improved the legends.

Please check the manuscript with help of a native english speaker

We have included a co-author who has improved the quality of writing as well as edited the manuscript for the language.

Minor Issues:

Abstract:

- include sample size of your data set, including for super populations once mentioned (EAS, SAS, EUR).

- the human Angiotensin-converting enzyme 2 (ACE2)

- more related TO West Eurasians

-focusing ON South Asia

We have revised the manuscript as per the suggestions of the reviewer.

-The second SNP – without talking about the first SNP? Why only the second one, what is with the first one?

We have expanded the discussion about both of the SNPs in the main text.

Introduction:

- Please recheck sentence: ...has been one of the serious threat to humanity now

- encoded by A gene located ON

- expression of ACE2 receptor? relate to the severity OF THE disease

We have revised the sentences accordingly.

- Confusing sentence: Growing evidence are accumulating, however, still the expression of ACE2 variations affecting host susceptibility among various world population is not known.

We have rephrased the sentence.

- South Asia is a country ...really? It is a region consisting of countries Afghanistan, Bangladesh, Bhutan, Maldives, Nepal, India, Pakistan and Sri Lanka

We are sorry for this error! We have corrected it in the revised text.

- Please recheck, very confusing: For the ACE2, it is not known that how the polymorphisms present in South Asia is shared with the East and West Eurasian populations?

The sentence is revised now.

-Reference 10 and 11 are identical, Reference 3 missing journal,

We have rechecked all the references for the consistency.

Material and Methods:

- Reference 10 vs 11. a [11] (Supplementary Table 1) here I was searching the data in Paganis Supplementary table – please make clearer

These errors have been corrected.

- Supplementary Figure 3: one can clearly see that the frequency of unique polymorphisms is strongly correletad to the sample size with one exception (SEA_M, this gets obvious when you order by sample size) – what is the purpose of the barplot? Same with Figure 1, how useful is this plot, can the sample size difference also be biasing the main finding?

We agree with the reviewer and excluded these figures. To verify the main finding we used 1000G data as well as 26 random chromosomes for each of the groups. We didn’t get any deviation from the main finding in these analyses.

- What is the purpose of supplemental table 1 and figure 3 – as figure 3 reports the same data?

We agree with the reviewer. Since in the revised version we excluded Supp Fig. 3, we retained the Supplementary Table 1.

- Please check sentence: … were constructed from the Haploview.

It has been corrected now.

ResultS and discussion

- Please check: Europeans and Siberians had THE highest number of private polymorphisms which ARE likely due to their large number of samples from these groups (Supplementary Figure 1, Figure 3, Table 1)

We have excluded the figures as well as the sentence.

- what are the snps defining ht1?

We have updated Supp Table 1 SNPs with their location in different haplotypes.

- LD patters → LD patterns

It has been removed.

- this unexpected result – wasn’t it reported by Cao et al 2020 yet?

Cao et al. have looked the coding SNPs and didn’t analyse ancestry sharing, whereas we looked haplotypes and their sharing.

- LD Plots – why are different amount of SNPs used for the subfigures on population level? However no chance to read. Can you please provide a better description of how the LD

The program Haploview only consider the polymorphic SNPs for LD reconstruction, the polymorphic SNPs vary in different regions, therefore, the different amount of SNPs.

- consistent naming of haplotypes ht1 … either with space or none

It has been consistent now.

- rs446120 is on chr8 – should be rs4646120 according figure 1c. 2X wrong in text.

We are sorry for this typo, it is corrected in the manuscript.

- was more common in West Eurasians

Revised accordingly.

- What leads to the conclusion that rs2285666 has lower severity for SARS-CoV-2?

Recent studies have suggested that the Reference allele has reduced the ACE2 expression upto 50%, resulting the higher severity for SARS-CoV-2 (Singh et al. 2020; Wu et al., 2017; Asselta et al., 2020)

Reviewer #2: Comments on the manuscript entitled “Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations”, by Srivastava et al.

Srivastava et al. have analysed whole genome data of the populations around the world, focusing South Asia, and extracted 248 ACE2 variations. Phylogenetic analysis of the above data suggested that the majority of South Asian ACE2 haplotypes are closer to the East Eurasians in the background of two unique event polymorphisms. Hence, the authors have suggested that it is highly likely that among the South Asians, the host susceptibility to the novel coronavirus SARS-CoV-2 will be more similar to East/Southeast Asians rather than the Europeans.

Considering the ongoing pandemic, this manuscript is appropriate now. However, I suggest the authors to take care of the following points.

Thanks for the good words.

1. No author has affiliation No. 5!

We are sorry for this error. We have corrected it in the revised manuscript.

2. No mention about the total number of samples, analysed; although supplementary Table 1 has the information. I suggest the authors to mention the numbers in the methods.

We have revised the manuscript to reflect these informations.

3. It may be useful to discuss in the manuscript about the linguistic background of South Asian populations that were included in the study.

Thanks. We have added a supplementary table (table S1) with the linguistic information of South Asian samples.

________________________________________

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

We hope that in the revised version you will find everything in order.

Sincerely,

G Chaubey (on behalf of coauthors)

Attachment

Submitted filename: Reply to Reviewers.docx

Decision Letter 1

Alessandro Achilli

4 Aug 2020

PONE-D-20-12219R1

Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations

PLOS ONE

Dear Dr. Chaubey,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the few minor points raised by the Reviewer 1.

Please submit your revised manuscript by August 18th, 2020. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Alessandro Achilli, Ph.D.

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I thank the authors for addressing my concerns and for the additional work carried out. The manuscript is now almost ready for publication - only few minor issue:

In the abstract there are 383 samples while you mentioned 393 in material and methods (which I can replicate now)

S1 Table - as we are on Chromosome X - Chromosomes (2n) describes the data incorrectly, as there are 258 males and only 135 females - therefore suggest to write number of individuals

S2 Table - highlight the three SNPs described in the main text - would prefer an Excel file over the PDF file here.

Figure S3: can you please indicate the SNP rs3503318 in Figure 2 and S3?

Figure 1b and S2: can you please specify the axis values on the legends for diversity and describe in the figure legend respectively?

Reviewer #2: I have gone through the revised manuscript and the pint-by-point reply to my comments. I am satisfied with all the reply to my previous questions. I have no more question.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Sep 16;15(9):e0238255. doi: 10.1371/journal.pone.0238255.r004

Author response to Decision Letter 1


5 Aug 2020

To,

Dr. Alessandro Achilli,

Academic Editor

PLOS ONE

Date: 05/08/2020

Dear Dr Achilli,

We have revised our manuscript considering the comments from reviewer1. Our pointwise answers are given below (in bold letters).

///////////////////////////////////////////////////////////////////////////////////////////////////////////////

Reviewer #1: I thank the authors for addressing my concerns and for the additional work carried out.

We are grateful to reviewers for their constructive suggestions which has helped us to improve our manuscript substantially.

The manuscript is now almost ready for publication - only few minor issue:

In the abstract there are 383 samples while you mentioned 393 in material and methods (which I can replicate now)

We have updated the number accordingly.

S1 Table - as we are on Chromosome X - Chromosomes (2n) describes the data incorrectly, as there are 258 males and only 135 females - therefore suggest to write number of individuals

Thanks for the great suggestion. We have corrected the table accordingly.

S2 Table - highlight the three SNPs described in the main text - would prefer an Excel file over the PDF file here.

We have followed the suggestions and updated the manuscript accordingly.

Figure S3: can you please indicate the SNP rs3503318 in Figure 2 and S3?

We have indicated the SNP in both of the figures.

Figure 1b and S2: can you please specify the axis values on the legends for diversity and describe in the figure legend respectively?

The values have been already mentioned in the each of the color scale.

We hope that you will find everything in order.

Sincerely,

G. Chaubey (on behalf of co-authors)

Decision Letter 2

Alessandro Achilli

13 Aug 2020

Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations

PONE-D-20-12219R2

Dear Dr. Chaubey,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Alessandro Achilli, Ph.D.

Academic Editor

PLOS ONE

Acceptance letter

Alessandro Achilli

19 Aug 2020

PONE-D-20-12219R2

Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations

Dear Dr. Chaubey:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Prof. Alessandro Achilli

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. The LD (linkage disequilibrium) plots of ACE2 gene of various studied populations.

    Shading from white to red indicates the intensity of r2 from 0 to 1. Strong LD is represented by a high percentage (>80) and a darker red square.

    (TIF)

    S2 Fig. Heat map showing the intra- and inter-population variation measured by average pairwise sequence differences of the ACE2 gene among 1000 genome populations.

    The average pairwise differences between populations are shown in the upper triangle of the matrix (green). The average number of pairwise differences within each population group are shown along the diagonal (orange). The differences between populations based on Nei's genetic distances are depicted in lower triangle of the matrix (blue). The populations are grouped in to superpopulations e.g. European, South Asian, East Asian and American.

    (TIF)

    S3 Fig. The median joining (MJ) network of 491 haplotypes belonging to gene ACE2 among 1000 genome populations.

    Circle sizes are proportional to the number of samples with that haplotype. The three most common haplotypes are marked. All three SNPs studied in detail have been marked by arrow. We used median joining method implemented in the NETWORK programme ver. 5.

    (TIF)

    S4 Fig. The spatial distribution of alleles of rs4646120 and rs2285666 among 1000 genome populations.

    The map was obtained from the PGG toolkit implemented in the https://www.pggsnv.org/.

    (TIF)

    S1 Table. The number of samples from each of the region used in the analysis.

    The number of South Asian groups shown with their linguistic affiliations.

    (PDF)

    S2 Table. The details of 248 polymorphic loci extracted from analysed data.

    The frequencies of alternate alleles for each loci and group have been mentioned in the table. The three important SNPs studied in details have been highlighted.

    (XLSX)

    Attachment

    Submitted filename: Reply to Reviewers.docx

    Data Availability Statement

    The used data is available at https://evolbio.ut.ee/CGgenomes.html.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES