Abstract
The Himalayan Arc is recognized as a global biodiversity hotspot. Among its numerous cryptic and undiscovered organisms, this composite high-mountain ecosystem harbors many taxa with adaptations to life in high elevations. However, evolutionary patterns and genomic features have been relatively rarely studied in Himalayan vertebrates. Here, we provide the first well-annotated transcriptome of a Greater Himalayan reptile species, the Ladakh Ground skink Asymblepharus ladacensis (Squamata: Scincidae). Based on tissues from the brain, an embryonic disc, and pooled organ material, using pair-end Illumina NextSeq 500 RNAseq, we assembled ~77,000 transcripts, which were annotated using seven functional databases. We tested ~1600 genes, known to be under positive selection in anurans and reptiles adapted to high elevations, and potentially detected positive selection for 114 of these genes in Asymblepharus. Even though the strength of these results is limited due to the single-animal approach, our transcriptome resource may be valuable data for further studies on squamate reptile evolution in the Himalayas as a hotspot of biodiversity.
Keywords: adaptation, evolution, genomic, high elevation, Himalayas, Scincidae
1. Introduction
The Himalayan arc represents one of the world’s most important faunal and floral hotspots with high species diversity and endemism [1], which result from the Tertiary orogeny of this mountain chain, its complex topography as well as its great climatic heterogeneity and isolation. The genesis of the Tibetan highlands and the Himalayas since the Paleogene, with the Greater Himalayas starting to rise presumably the earliest in the post-Eocene (for a review, see the supplementary in Hofmann et al. [2]), triggered the evolution of unique biodiversity under gradual high-altitude adaptation, as already shown for anurans [3,4,5,6,7]. Besides amphibians, there are also several reptiles that can cope with life at high altitude in those regions, e.g., Thermophis [8], Phrynocephalus [9], and some Laudakia species [10]. Potential constraints to upslope migration of reptiles (and amphibians) to high-elevation environments are the substantial UV-radiation, the thermal extremes, and especially the oxidative stress, referred to as high-altitude hypoxia, which interacts with temperature in a context-dependent manner to influence thermal performance and limits in terrestrial ectotherms [11,12]. Recent advances in high-throughput sequencing technologies have led to a growing number of genomic studies that address the molecular basis of high-altitude adaptation, some of them focused also on reptiles [13,14,15]. However, such data have been scarce in non-model species of the Greater Himalayas (but see [16]). This results from the general understudied biodiversity of this high-mountain range, presuming a relatively large number of cryptic and undiscovered species [17], even among vertebrates. Molecular data from Himalayan organisms can contribute to understanding of the taxonomic and functional diversity spectra across this species-rich, fragile ecosystem. These data resources are even more important because Himalayan biodiversity is threatened at the very core; rapid warming due to climate change, especially at higher elevations, as well as higher rates of forest degradation and deforestation, infrastructural development, trade routes, and hydropower dams are driving species loss at a very alarming speed [18,19]. To allow future studies in evolutionary biology at a genomic level and to generally provide sufficient and relevant data for Himalayan reptiles, in the present study, we have generated a new genomics data set based on RNAseq for a scincid species from the Greater Himalayas. Using these data, we specifically aimed to identify genes known to play roles in adaptation of terrestrial ectothermic vertebrates to high elevations. Since exposure to oxidative stress can particularly affect the physiology during early development [20] and in oxygen-sensitive organs [21,22], such as the nerve system, we focused on embryonic and brain tissue samples.
Our target species is a scincid lizard in the genus Asymblepharus, the Ladakh Ground Skink, A. ladacensis (GÜNTHER, 1864), which is endemic to the western part of the Himalayas. The genus further contains the following species (Figure 1): A. alaicus (ELPATJEVSKY, 1901), A. eremchenkoi PANFILOV, 1999, A. himalayanus (GÜNTHER, 1864), A. mahabharatus EREMCHENKO, SHAH & PANFILOV, 1998, A. nepalensis EREMCHENKO & HELFENBERGER, 1998, and A. tragbulensis (ALCOCK, 1898). Another two species, Asymblepharus medogensis JIANG, WU, GUO, LI & CHE, 2020 and A. nyingchiensis JIANG, WU, WANG, DING & CHE, 2020, have been described very recently from Mêdog, Nyingchi in SE Tibet, China. According to a large-scale phylogenetic study of squamates [23], the sampled specimen of A. sikimmensis (BLYTH, 1854) is nested within Scincella and was therefore suggested to be transferred to this genus. However, it remains unclear whether this single specimen had been misidentified as A. sikimmensis since originally it was labeled in the museum collection as Scincella potanini (voucher catalogue number CAS:HERP:194923, see http://portal.vertnet.org/o/cas/herp?id=urn-catalog-cas-herp-194923 (accessed on 29 July 2021) [24].
Figure 1.
Map of Asymblepharus species based on GBIF (www.gbif.org; accessed on 20 July 2021) records of preserved specimens and georeferenced localities in the taxonomic reptile database (https://reptile-database.reptarium.cz/; accessed on 20 July 2021). The location of our RNA sample of the female A. ladacensis (photo) is indicated by a green circle with a dot in the middle and an arrow. * Note, according to a large-scale phylogeny of squamates, A. sikimmensis is nested within Scincella; however, it remains unclear whether this single “A. sikimmensis” specimen, on which the sequence data are based, had been taxonomically correctly identified. Therefore, we also show the GBIF records of specimens collected as A. sikimmensis. Records of A. eremchenkoi in the databases could not be georeferenced due to insufficient information on the collection site. Photo credit: Sylvia Hofmann.
In general, Asymblepharus is a genus with a still poorly known endemic distribution, origin, and evolutionary history. No studies of its population genetic structure and genetic diversity exist to date for any Asymblepharus taxon, and the current taxonomic relationships of its lineages are in flux [23,24,25]. The three Himalayan species A. himalayanus, A. ladacensis, and A. tragbulensis have frequently been assigned to the genus Himalblepharus Eremchenko, 1987. According to literature data ([26] and references therein) and personal observation, they show a remarkably wide vertical distribution from the foothills (~150 m a.s.l.) to the high alpine zone and even up to the snow line (~5500 m a.s.l.), making them an excellent model to study the genetic basis of adaptations to high altitudes in ectotherms and the evolutionary processes accounting for them.
With this paper, we characterize the first transcriptome data set of a high-altitude reptile species from the Greater Himalayas and report genes known to play roles in adaptation of ectotherms to high elevations of this non-model reptile. Such data provide the necessary sources for future molecular studies in Himalayan reptiles and high-altitude vertebrates.
2. Materials and Methods
2.1. Sample Collection and Ethics Statement
A single gravid female Asymblepharus ladacensis was collected in Central Nepal, in the Dhaulagiri range, west of the Kali Gandaki valley (28.68° N, 83.59° E; 2714 m a.s.l.; Figure 1). Samples were collected in accordance with regulations for the protection of terrestrial wild animals under the permits of the Nepal expeditions of the Natural History Museum of Erfurt, Germany [27,28]. All treatments were carried out in accordance with approved guidelines and according to the permit as well as the local animal welfare committee’s instructions (VNME 17, 15–30). Tissues were transferred into RNAlater (Thermo Fisher), kept at ambient temperature during the time of the fieldwork, and later stored at −30 °C.
2.2. RNA Isolation, Library Preparation, and Sequencing
We followed the same procedure as previously described [16]. In brief, total RNA was isolated from the brain, an embryonic disc, and from pooled tissues (including lung, muscle, and heart) using TRIzol Reagent (Thermo Fisher Scientific, Waltham, MA, USA) according to the supplier’s recommendation, in combination with the RNeasy Mini Kit (Qiagen, Hilden, Germany) and adjusted to equal concentrations. RNA quality was assessed by RNA concentration, RIN (RNA Integrity Number) value, 28S/18S and fragment length distribution using an Agilent Bioanalyzer (Agilent Technologies, Inc., Santa Clara, CA, USA).
Complementary DNA (cDNA) library preparation and paired-end sequencing were carried out by BGI (BGI-Hongkong Co., Ltd., Tai Po District, Hong Kong), using Illumina NextSeq500 sequencing system (Illumina, San Diego, CA, USA). The raw reads quality was examined using FastQC v0.11.9 [29].
2.3. Assembly and Assessment of Transcriptome Quality and Completeness
First, we controlled our data for rRNA quantity using SortMeRNA 4 [30]. The three de novo assemblies were then created following the Oyster River Protocol (ORP; Docker image 2.2.8) best practices [31]. This protocol implements both pre-assembly procedures and a number of different kmer lengths and assemblers, finally merging these assemblies into a single, comprehensive transcriptome. The rationale behind it is that assembling RNAseq reads with different assembly tools increases assembly quality and mapping rate and, in turn, the ability to draw conclusions from that fraction of the sample [31]. Thus, merging the contigs resulting from several assemblers and parameter configurations to combine the advantages of certain assembly mechanisms and to overcome their different disadvantages seems to be the best way to obtain a comprehensive de novo transcriptome assembly [32,33].
Illumina sequencing adapters and nucleotides with quality Phred ≤ 2 were removed using Trimmomatic v0.36 [34], then the reads were error corrected by Rcorrector version 1.0.2 [35]. These reads were then assembled using Trinity release 2.8.4 [36] with default settings (k = 25), two independent runs of SPAdes assembler version 3.11 with kmer lengths of 55 and 75 [37], and the assembler Shannon version 0.0.2 with a kmer length of 75 [38]. The resulting four distinct transcriptome assemblies were then merged to a single, comprehensive transcriptome using Ortho-Fuser [31]. This final transcriptome was evaluated with TransRate version 1.0.3 [39], which is modified for and packaged with the ORP, and with BUSCO (Benchmarking Universal Single-Copy Orthologs) version 4.1.4 [40,41]. Searching the assembly for conserved single-copy orthologs found in orthologous sets of genes constructed from genomes representing eukaryotes (70 species: 255 BUSCOs), vertebrates (67 species: 3354 BUSCOs), and tetrapods (38 species: 5310 BUSCOs).
A detailed quality assessment of the assembly with respect to known genes was further obtained with rnaQUAST version 2.2.0 [42] using the reference genomes of Anolis carolinensis, Gekko japonicus, and Python bivittatus.
2.4. Functional Annotation
Transcripts were functionally annotated as previously described [16]. Briefly, sequence homology searches were conducted against seven databases (Gene Onthology, GO; Kyoto Encyclopedia of Genes and Genomes, KEGG; EuKaryotic Orthologous Groups, KOG; InterPro; the non-redundant nucleotide database, NT; the non-redundant protein database, NR; SwissProt). To align our data to KEGG, KOG, NR, and NT and SwissProt, we used Diamond v0.8.31 [43] or the BLASTx [44] algorithm; matched transcripts were filtered by using a cut-off e-value of 1 × 10−25. Transcripts that aligned to the NR database were transferred to the GO database with Blast2GO v2.5.0 [45] and assigned into the following three groups: biological process, cellular components, and molecular functions. InterProScan5 v5.11-51.0 tool [46] was used to annotate against the InterPro databases. Blast v2.2.23 was used to search in SwissProt and hmmscan v3.0 [47] for search against Pfam database (for each sample individually). Candidate coding areas within the transcript sequences were predicted by TransDecoder v.3.0.1 [36]. For a coding sequence with multiple open reading frames (ORF), the longest one was selected. We also used the getorf program of the EMBOSS v6.5.70 package [48] to find the ORF of each transcript and mapped them to the Animal Transcription Factor DataBase (AnimalTFDB2.0). The threshold of transcript lengths used for annotation and downstream analyses was ≥200 bp.
2.5. Positively Selected Genes Related to Mechanisms of High-Altitude Adaptation
We selected transcripts of genes reported to exhibit molecular adaptation to high elevations in lizards and anurans as provided in the literature [7,15]. This comprised a total of 143 genes identified to be under positive selection (PSG) in high-elevational lineages of lizards (Phrynocephalus vlangalii) [15]. We included further 1481 PSGs of these lizards (P. erythrurus, P. putjatia, P. vlangalii) and of dicroglossid frogs (Nanorana, Quasipaa) [7], genes that were identified across an elevational gradient (~1000 m to 4500 m). These additional, individual genes were grouped according to the phylogenetic tree branches across the elevational gradient as presented in Sun et al. (2018) [7]: PSGs attributed to branches that represent (i) lowland species (~1000 m), (ii) species distributed in colline zones up to about 2000 m, (iii) submontane and montane species (2000 and 3500 m), and (iv) subalpine and alpine species (>3500 and 4500 m). Given the vertical distribution of Asymblepharus ladacensis in the Himalayas between ~2500 m and 4500 m [49], we expected to find primarily genes under positive selection reported as PSGs for the submontane and montane, as well as the subalpine and alpine species.
To identify scincid orthologs to the genes under positive selection (high-altitude adaptations) in reptiles and amphibians, we used the corresponding coding sequences from Anolis carolinensis (AnoCar2.0, gene build from ensemble 104.2; we used Anolis gene numbers that were mentioned in the publications cited in the last paragraph) as a reference for blast searches. We performed blast searches with the newly sequenced and assembled transcriptome of A. ladacensis, as well as with publicly available transcriptome data from the following scincid lizards: Scincella lateralis (NCBI-SRA SRR629642), Lampropholis guichenoti (NCBI-SRA SRR4293354), Lepidothyris fernandi (NCBI-SRA SRR10360868), and Pseudemoia entrecasteauxii (NCBI-SRA SRR3099521). Only the best reciprocal hit between Anolis and each scincid species (and only if the e-value was below 10−25) were used for subsequent analyses.
Alignments of orthologs to the Anolis reference were done with mafft v.7.455 [50] making use of the “adjustdirection” and “keeplength” options to get alignments that are in the same direction and keep reading frames intact. Alignments were inspected for good representation of all species under study. When one or more species had substantially shorter, incomplete contigs as best reciprocal hits, we omitted those from the alignment. A phylogenetic tree for each alignment was produced using FastTree v. 2.1.10 [51] with default settings, except for the nucleotide option.
Alignments and trees were analyzed with the HyPhy (Hypothesis Testing using Phylogenies) package v. 2.5 [52,53] using the following methods (for details see [54]): (i) The BUSTED (Branch-Site Unrestricted Statistical Test for Episodic Diversification) [55] model, to test whether a given gene has been subject to positive, diversifying selection at any site, at any time (we tested all lineages for positive selection); (ii) FUBAR (Fast, Unconstrained Bayesian AppRoximation) [56], a Bayesian approach to infer which site(s) in a gene are subject to pervasive, i.e., consistently across the entire phylogeny, diversifying selection (we considered a posterior probability of at least 0.90 as significant); and (iii) the branch-site model aBSREL (adaptive Branch-Site Random Effects Likelihood) [57,58], to test whether codon sites and individual branches are subject to positive selection across the phylogeny. The threshold for significance in BUSTED and aBSREL was set at a p-value lower than 0.05.
GO term and metabolic pathway enrichment analysis was done using pantherdb (pantherdb.org; accessed on 2 August 2021, [59]), using the genome of Anolis carolinensis as reference.
2.6. Data Availability
Data generated in this study are publicly available from the NCBI GenBank database under the Bioproject ID PRJNA750278, BioSamples SAMN20458631, SAMN20458632, and SAMN20458631. All sequence data were deposited in the NCBI Sequence Read Archive (SRA, http://www.ncbi.nlm.nih.gov/Traces/sra/; accessed on 2 August 2021) under the accession numbers SRR15283177, SRR15283178, and SRR15283179’; assembled sequences were transmitted to NCBI Transcriptome Shotgun Assembly Sequence Database (TSA, http://www.ncbi.nlm.nih.gov/genbank/tsa; accessed on 2 August 2021).
3. Results
3.1. Sequencing and Transcriptome Assembly
All RNA was reasonably high quality; A260/280 ratios ranged between 1.81 (brain), 1.76 (embryonic disc), and 2.09 (pooled tissues); RIN values were between 7.90, 8.20, and 9.10, respectively. RNA-seq libraries of the two tissues yielded a total of 74.78, 73.56, and 64.20 million raw sequence reads (Table 1). Pre-processing of reads via read trimming and read error correction removed approximately 2–4% of the raw data, resulting in 73.14, 71.89, and 61.40 million clean reads for the brain, embryonic disc, and pooled tissues (Table 1). GC content of these clean reads was 48%.
Table 1.
Summary of sequencing data used to obtain the de novo transcriptome assemblies of Asymblepharus ladacensis based on paired-end Illumina sequencing. Final assemblies based on four unique assemblies per sample generated by ORP using different assemblers and k-mers.
| Brain Tissue | Embryonic Disc Tissue | Pooled Tissue | |
|---|---|---|---|
| Number of paired-end raw reads | 74,780,628 | 73,557,120 | 64,195,486 |
| Number of cleaned reads | 73,139,294 | 71,887,892 | 61,397,610 |
| Number of base pairs in final assembly | 102,605,079 | 98,917,807 | 47,613,446 |
| Number of transcripts in final assembly | 151,718 | 105,133 | 66,696 |
| Average transcript length (bp) | 676 | 940 | 712 |
| Minimum transcript length (bp) | 131 | 131 | 131 |
| Maximum transcript length (bp) | 17,543 | 18,168 | 15,866 |
| N50 | 1215 | 2052 | 1194 |
| N90 | 257 | 311 | 278 |
| GC% content of the final ORP assembly | 0.48 | 0.48 | 0.48 |
The final de novo assemblies generated from ORP resulted in 151,718 (brain), 105,133 (embryonic disc), and 66,696 (pooled tissues) transcripts with a total length of ~102.61, ~98.92, and ~47.61 million bp, respectively. Transcripts had an average length of 676 bp (brain), 940 bp (embryonic disc), and 712 bp (pooled tissues), and an N50 of 1215 bp, 2052 bp, and 1194 bp (Table 1, Supplementary Table S1 and Figure S1). A total of 48,884 (32.22%; brain), 44,358 (42.19%; embryonic disc), and 25,772 (38.64%; pooled tissues) transcripts were longer than 500 bp (Supplementary Table S2).
3.2. Assembly Completeness
TransRate’s optimal assembly score (min 0.0, max 1.0) is considered to be a good parameter of the quality of an assembly [32]; it captures the confidence and completeness of the assembly. The TransRate scores of our final assemblies were high, ranging between 0.44 (optimized score 0.49) for the brain sample, 0.45 (optimized score 0.57) for the embryonic disc sample, and 0.44 (optimized score 0.52) for the pooled tissues. A transRate score >0.22 is generally thought to be acceptable [31,39]. More than 90% of the reads were used to assemble the transcriptomes, and 87% (brain, pooled tissues), as well as 90% (embryonic disc) of the fragments, were considered as good mappings, while only 1.6% (brain), 3.7% (embryonic disc), and 2.7% (pooled tissues) of assembled contigs had no coverage (Supplementary Table S1).
The assessment of completeness of our assemblies by the BUSCO pipeline resulted in a moderate to high percentage of complete eukaryotic orthologues (from 69.8 to 98.0% of 255 BUSCOs) but also a significantly higher percentage of putative paralogues; in the vertebrate (3354 BUSCOs) and tetrapod (5310 BUSCOs) databases, more than half of the markers were recovered completely in the brain and embryonic tissue (Table 2). The fraction of missing BUSCOs ranged between 0.8% (Eukaryota; embryonic disc) and 54.9% (Tetrapoda; pooled tissues). These BUSCO values are comparable to recent de novo transcriptome studies in many vertebrates [60,61,62]. BUSCO recovery rate tends to be highest when full organism and/or multiple developmental stages were used to generate the transcriptomes, compared to those assembled from specific organs or tissues [63].
Table 2.
Benchmarking Universal Single-Copy Orthologs (BUSCO) results based on the eukaryotic (EU, eukaryota_odb10; 255 BUSCOs), vertebrates (VB, vertebrata_odb10; 3354 BUSCOs), and tetrapod databases (TP, tetrapoda_odb10, 5310 BUSCOs) searched. BUSCO searches for completed, single-copy, duplicated, fragmented, and missing orthologs within given genomes.
| BUSCO Statistics | Brain | Embryonic Disc | Pooled Tissues | ||||||
|---|---|---|---|---|---|---|---|---|---|
| EU | VB | TP | EU | VB | TP | EU | VB | TP | |
| Complete | 220/255 (86.3%) |
2052/3354 (61.1%) |
2696/5310 (50.8%) |
250/255 (98.0%) |
2743/3354 (81.8%) |
3778/5310 (71.1%) |
178/255 (69.9%) |
1405/3354 (41.9%) |
1750/5310 (33.0%) |
| Single-copy | 189/255 (74.1%) |
1759/3354 (52.4%) |
2305/5310 (43.4%) |
185/255 (72.5%) |
1871/3354 (55.8%) |
2555/5310 (48.1%) |
148/255 (58.0%) |
1150/3354 (34.3%) |
1421/5310 (26.8%) |
| Duplicated | 31/255 (12.2%) |
293/3354 (8.7%) |
391/5310 (7.4%) |
65/255 (25.5%) |
872/3354 (26.0%) |
1223/5310 (23.0%) |
30/255 (11.8%) |
255/3354 (7.6%) |
329/5310 (6.2%) |
| Fragmented | 23/255 (9.0%) |
589/3354 (17.6%) |
709/5310 (13.4%) |
3/255 (1.2%) |
221/3354 (6.6%) |
323/5310 (6.1%) |
47/255 (18.4%) |
654/3354 (19.5%) |
645/5310 (12.1%) |
| Missing | 12/255 (4.7%) |
713/3354 (21.3%) |
1905/5310 (35.8%) |
2/255 (0.8%) |
390/3354 (11.6%) |
1209/5310 (22.8%) |
30/255 (11.8%) |
1295/3354 (38.6%) |
2915/5310 (54.9%) |
Coverage of specific gene databases (Anolis carolinensis, Gekko japonicus, Python bivittatus) ranged between 7% and 29%, being highest for the Gekko reference genome (Supplementary Table S2). Up to 40,000 transcripts (22–39%) could be aligned to one of the three databases. The duplication ratio varied between 1.3 and 1.5 and was in the range reported for vertebrate transcriptomes [32,64]. Although the proportion of misassembled contigs was low (<2%), the assemblies showed only a small number of 95%-assembled genes and isoforms (Supplementary Table S2). We assume that this might reflect biological novelty in the study species rather than fragmentation of the assemblies [41]. For example, the low proportion of completeness against the three reptile gene databases contrasts with a >40% completeness score for the vertebrate gene set and could be the result of an overrepresentation of gene sets from more intensively studied lineages [65].
Alternatively, suboptimal sample quality and a resulting higher proportion of fragmented genes due to potential RNA degradation could be a reason for the relatively low number of assembled genes and lower Eukaryota BUSCO scores, especially in the pooled tissues sample.
3.3. Functional Annotation
Annotation of the complete set of transcriptomes from all three samples resulted in 39,975 (51.94%) transcripts annotated in at least one of the seven databases used for functional annotation; 7292 sequences generated hits in all of these databases (Table 3). Approximately 70% of the top hits matched to genes from Gekko japonicus (7716; 23.07%), Pogona vitticeps (7239; 21.65%), Anolis carolinensis (5399; 16.14%), and Python bivittatus (3176; 9.5%), Figure 2.
Table 3.
Number (N) of transcripts identified in Asymblepharus ladacensis that are shared (=Intersection) and unique among seven annotation database resources. GO—Gene Onthology; InterPro—integrative protein signature database; KEGG—Kyoto Encyclopedia of Genes and Genomes; KOG— EuKaryotic Orthologous Groups; NR—non-redundant protein database; NT—non-redundant nucleotide database; SwissProt—Swiss Protein Sequence Database.
| Total | NR | NT | SwissProt | KEGG | KOG | InterPro | GO | Intersection | Overall | |
|---|---|---|---|---|---|---|---|---|---|---|
| N | 76,968 | 33,444 | 34,114 | 30,994 | 28,961 | 26,608 | 27,013 | 11,010 | 7292 | 39,975 |
| % | 100 | 43.45 | 44.32 | 40.27 | 37.63 | 34.57 | 35.10 | 14.30 | 9.47 | 51.94 |
Figure 2.
Annotation of the Asymblepharus ladacensis transcriptome. (a) Species distribution of the top BLASTx hit performed against NR database; (b) GO (Gene Onthology) assignments as predicted by Blast2GO; (c) functional distribution of KOG (EuKaryotic Orthologous Groups) annotation and (d) KEGG (Kyoto Encyclopedia of Genes and Genomes) classifications of assembled transcripts.
In terms of the biological process ontology (GO database categories), the most common categories were cellular processes (6935), metabolic processes (4884), and biological regulation (4549). The most frequent classifications for the cellular component ontology were cellular (7485), cell compartments (7446), and organelles (5922). Regarding the molecular function, major categories involved binding (6397), catalytic activity (3899), and molecular function regulator (787) (Figure 2). The KOG functional classification revealed genes of signal transduction mechanisms (6714), general function (6592), unknown function (2599), posttranslational modification (2580), and transcription (2339) as top five categories (Figure 2).
We found 44,956 transcripts in the KEGG database that aligned to entries associated with pathways of cellular processes (5650), environmental information processing (5752), genetic information processing (3409), human diseases (12,499), metabolism (7697), and organismal systems (9949). Predominantly, the genes were enriched in “signal transduction” (4256), followed by “global and overview maps” (2901), “cancers: overview” (2665), “immune system” (2324), and “infectious diseases: viral” (2299) categories (Figure 2).
3.4. Positive Selection
Among the 143 PSGs previously reported for the high-elevation lineage of the toad-headed agama Phrynocephalus vlangalii [15], we found ten genes to be likewise under positive selection in Asymblepharus based on all three tests that we performed. Two of these genes (IL1RAP, GRK6) were specific to the Asymblepharus branch of the gene tree (Table 4 and Supplementary Table S3).
Table 4.
Summary of the positive selection analysis for high-altitude candidate genes of a toad-headed agama (Phrynocephalus vlangalii) [15] tested likewise positive in Asymblepharus (A) using BUSTED (B); p-value > 0.05) [55], FUBAR (FB; number of sites under positive selection) [56] and aBSREL (aB) [57,58] methods. PSGs are represented by the last six digits of the anole lizard’s (Anolis carolinensis) ENSEMBL gene and transcript identifiers (starting with ENSACAG00000, or ENSACAT00000, respectively).
| GeneID | Gene | p-Value [15] | Gene Description | Transcript | B | FB | aB |
|---|---|---|---|---|---|---|---|
| 000773 | IL1RAP | 3.58 × 10−2 | Interleukin 1 receptor accessory protein | 000813 | <0.00 × 10−5 | 2 | A. |
| 000907 | MICU1 | 1.79 × 10−2 | Mitochondrial calcium uptake 1 | 000909 | 1.30 × 10−3 | 1 | yes |
| 001142 | TARBP1 | 1.39 × 10−6 | TAR RNA binding protein 1 | 001104 | <0.00 × 10−5 | 1 | yes |
| 002254 | MIA3 | 1.83 × 10−2 | Melanoma inhibitory activity family member 3 | 002276 | 1.93 × 10−2 | 1 | yes |
| 002549 | RPS2 | 1.99 × 10−2 | Ribosomal protein S2 | 002541 | 5.00 × 10−4 | 2 | yes |
| 002995 | RNF10 | 4.38 × 10−2 | Ring finger protein 10 | 003046 | 2.63 × 10−2 | 5 | yes |
| 003987 | NUP107 | 1.54 × 10−4 | Nucleoporin 107kDa | 004158 | <0.00 × 10−5 | 1 | yes |
| 006133 | GRK6 | 1.07 × 10−2 | G protein-coupled receptor kinase 6 | 006252 | 1.16 × 10−2 | 2 | A. |
| 007074 | SMC4 | 1.56 × 10−3 | Structural maintenance of chromosomes 4 | 007191 | 1.00 × 10−4 | 1 | yes |
| 015860 | SH3RF1 | 4.55 × 10−3 | SH3 domain containing ring finger 1 | 015968 | 8.90 × 10−3 | 1 | yes |
Out of the 410 transcripts reported to be under positive selection in lowland frog and lizard species [7], 321 could be identified in Asymblepharus. Of these, 23 (7.2%) transcripts were found to be under positive selection in the Ladakh Ground Skink in all three tests (Table 5). Similarly, a total of 32 (7.1% of 449 transcripts) PSGs of colline, 24 (7.6% of 314 transcripts) of submontane and montane, and 24 (6.9% of 350 transcripts) of subalpine and alpine frog and lizard species were tested to be under positive selection in Asymblepharus. Moreover, out of the 32 parallel PSGs that were identified in both high-elevation frog and lizard species [7], one gene (PGS1) showed positive selection in the Ladakh Ground Skink (Table 5). Analyzing these genes under positive selection for GO terms and pathways reveals an overrepresentation of genes involved in the following processes: low-density lipoprotein receptors and catabolic processes; mitochondrial citrate transmembrane transport; glycolysis and fructose/galactose metabolism; nucleoside phosphate binding; p53 pathway feedback loop (involved in DNA repair), platelet-derived growth (PDGF) factor binding (involved in blood-vessel formation).
Table 5.
Summary of the positive selection analysis for candidate genes of lineages of dicroglossid frogs and toad-headed agamas [7] identified across an elevational gradient, tested likewise positive in Asymblepharus (A) using BUSTED (B); p-value > 0.05) [55], FUBAR (FB; number of sites under positive selection) [56] and aBSREL (aB, number of branches with positive selection) [57,58] methods. Ensembl gene and transcript identifier (ENSACAG00000, ENSACAT00000) refers to Anolis carolinensis.
| Lowland | Gene Description | Transcript | B | FB | aB | |
|---|---|---|---|---|---|---|
| GeneID | Gene | |||||
| 000146 | PCSK9 | Proprotein convertase subtilisin/kexin type 9 | 000163 | 4.40 × 10−3 | 1 | 2 |
| 000201 | NPC1 | NPC intracellular cholesterol transporter 1 | 000261 | 5.10 × 10−3 | 2 | 1 |
| 000264 | LAMP1 | Lysosomal associated membrane protein 1 | 000247 | 2.90 × 10−3 | 8 | 2 |
| 000531 | SEPT12 | Septin 12 | 000602 | 1.29 × 10−2 | 1 | 1 |
| 000768 | SYK | Spleen associated tyrosine kinase | 000802 | <0.00 × 10−5 | 1 | 3 |
| 000798 | WBP4 | WW domain binding protein 4 | 000804 | 2.00 × 10−2 | 3 | 1 |
| 000955 | QSOX1 | Quiescin sulfhydryl oxidase 1 | 000962 | 9.20 × 10−3 | 3 | 1 |
| 001070 | CADM1 | Cell adhesion molecule 1 | 001188 | 1.00 × 10−4 | 1 | 1 |
| 002270 | KANK1 | KN motif and ankyrin repeat domains 1 | 002294 | 8.00 × 10−4 | 2 | 2 |
| 003015 | VLDLR | Very low-density lipoprotein receptor | 003096 | 1.66 × 10−2 | 3 | 1 |
| 003460 | ANKRD12 | Ankyrin repeat domain 12 | 003484 | 1.80 × 10−3 | 1 | 1 |
| 003908 | MDM1 | Mdm1 nuclear protein | 003923 | 2.30 × 10−3 | 2 | 2 |
| 005884 | CCDC66 | Coiled-coil domain containing 66 | 025744 | <0.00 × 10−5 | 1 | 1 |
| 006279 | GLYR1 | Glyoxylate reductase 1 homolog | 006325 | <0.00 × 10−5 | 1 | 3 |
| 007694 | ACOX1 | Acyl-CoA oxidase 1 | 007823 | <0.00 × 10−5 | 8 | 1 |
| 008005 | PHACTR2 | Phosphatase and actin regulator 2 | 008047 | 4.30 × 10−3 | 5 | 1 |
| 008420 | VTA1 | Vesicle trafficking 1 | 008463 | 3.36 × 10−2 | 1 | 1 |
| 009938 | 010074 | <0.00 × 10−5 | 6 | 3 | ||
| 013301 | COL1A2 | Collagen type I α 2 chain | 013614 | 1.10 × 10−3 | 17 | 3 |
| 013917 | MSH2 | MutS homolog 2 | 014076 | 1.11 × 10−2 | 1 | 1 |
| 013984 | LRRCC1 | Leucine-rich repeat and coiled-coil centrosomal protein 1 | 014100 | 2.50 × 10−2 | 1 | 2 |
| 016683 | TENT2 | Terminal nucleotidyltransferase 2 | 016777 | <0.00 × 10−5 | 2 | 2 |
| 017936 | ATP6V0A1 | ATPase H+ transporting V0 subunit a1 | 018008 | 3.50 × 10−3 | 1 | 1 |
| Up to 2000 m | ||||||
| Gene ID | Gene | Gene description | Transcript | B | FB | aB |
| 000608 | FBXL3 | F-box and leucine-rich repeat protein 3 | 000548 | 3.47 × 10−2 | 2 | 1 |
| 000768 | SYK | Spleen associated tyrosine kinas | 000802 | <0.00 × 100 | 1 | 3 |
| 000837 | KATNB1 | Katanin p80 (WD repeat containing) subunit B 1 | 000896 | 4.96 × 10−2 | 1 | 1 |
| 000955 | QSOX1 | Quiescin sulfhydryl oxidase 1 | 000962 | 8.90 × 10−3 | 3 | 1 |
| 002090 | KIAA0232 | KIAA0232 | 002069 | <0.00 × 100 | 5 | 2 |
| 002091 | RANBP2 | RAN binding protein 2 | 002100 | 7.00 × 10−3 | 2 | 1 |
| 002556 | SLC25A1 | Solute carrier family 25 member 1 | 002543 | 5.00 × 10−3 | 2 | 1 |
| 002948 | RSPH1 | Radial spoke head component 1 | 002963 | 1.59 × 10−2 | 2 | 1 |
| 003975 | TOGARAM1 | TOG array regulator of axonemal microtubules 1 | 004004 | <0.00 × 10−5 | 1 | 3 |
| 003987 | NUP107 | Nucleoporin 107 | 004158 | <0.00 × 10−5 | 1 | 2 |
| 004938 | RARS1 | Arginyl-tRNA synthetase 1 | 005003 | <0.00 × 10−5 | 3 | 2 |
| 005569 | CCT4 | Chaperonin containing TCP1 subunit 4 | 005683 | 2.00 × 10−4 | 2 | 1 |
| 006189 | PARP1 | Poly(ADP-ribose) polymerase 1 | 006356 | 4.11 × 10−2 | 4 | 1 |
| 006926 | CTNND1 | Catenin delta 1 | 007006 | 3.00 × 10−4 | 1 | 1 |
| 007100 | ZNF277 | Zinc finger protein 277 | 007173 | 1.17 × 10−2 | 3 | 3 |
| 007489 | PRDX4 | Peroxiredoxin 4 | 007498 | 2.20 × 10−3 | 1 | 1 |
| 007985 | POLR3A | RNA polymerase III subunit A | 008077 | 1.37 × 10−2 | 1 | 1 |
| 008005 | PHACTR2 | Phosphatase and actin regulator 2 | 008047 | 3.90 × 10−3 | 5 | 1 |
| 008475 | SNTA1 | Syntrophin α 1 | 008544 | 5.20 × 10−3 | 5 | 1 |
| 008991 | KEAP1 | Kelch-like ECH-associated protein 1 | 009010 | 2.00 × 10−4 | 3 | 1 |
| 009213 | SWAP70 | Switching B cell complex subunit SWAP70 | 009242 | 8.70 × 10−3 | 1 | 1 |
| 013059 | JPH1 | Junctophilin 1 | 013093 | 1.09 × 10−2 | 4 | 1 |
| 013301 | COL1A2 | Collagen type I α 2 chain | 013614 | 1.10 × 10−3 | 17 | 3 |
| 013326 | ADGRF5 | Adhesion G protein-coupled receptor F5 | 013405 | 4.00 × 10−4 | 1 | 1 |
| 015062 | COL3A1 | Collagen type III α 1 chain | 015539 | 3.00 × 10−2 | 11 | 4 |
| 015374 | RANBP17 | RAN binding protein 17 | 015630 | 7.90 × 10−3 | 3 | 2 |
| 015422 | BAIAP2 | BAR/IMD-domain-containing adaptor protein 2 | 015540 | 4.96 × 10−2 | 2 | 2 |
| 016662 | PGS1 | Phosphatidylglycerophosphate synthase 1 | 016740 | <0.00 × 10−5 | 1 | 1 |
| 017208 | FLOT1 | Flotillin 1 | 017291 | <0.00 × 10−5 | 4 | 2 |
| 017228 | SLC4A1 | Solute carrier family 4 member 1 (Diego blood group) | 017345 | 4.36 × 10−2 | 2 | 1 |
| 017316 | PTBP3 | Polypyrimidine tract binding protein 3 | 017407 | 6.00 × 10−4 | 2 | 3 |
| 018003 | FGFR1 | Fibroblast growth factor receptor 1 | 018080 | 6.00 × 10−4 | 2 | 1 |
| 2000–3500 m | ||||||
| Gene ID | Gene | Gene description | Transcript | B | FB | aB |
| 000306 | ZNF622 | Zinc finger protein 622 | 000291 | 8.50 × 10−3 | 5 | 1 |
| 000907 | MICU1 | Mitochondrial calcium uptake 1 | 000909 | 1.30 × 10−3 | 1 | 1 |
| 001396 | ABCC3 | ATP-binding cassette subfamily C member 3 | 001480 | <0.00 × 10−5 | 1 | 1 |
| 002254 | MIA3 | Melanoma inhibitory activity family member 3 | 002276 | 2.22 × 10−2 | 1 | 2 |
| 002779 | CDH1 | Cadherin 1 | 003031 | 1.50 × 10−3 | 1 | 1 |
| 004137 | STARD13 | StAR-related lipid transfer domain containing 13 | 004235 | 2.00 × 10−4 | 3 | 1 |
| 005084 | COL1A1 | Collagen type I α 1 chain | 005298 | 5.00 × 10−4 | 18 | 2 |
| 006739 | RALGAPB | Ral GTPase-activating protein, β subunit | 006803 | 9.40 × 10−3 | 1 | 1 |
| 006920 | NEO1 | Neogenin 1 | 007074 | <0.00 × 10−5 | 1 | 1 |
| 006926 | CTNND1 | Catenin delta 1 | 007006 | 3.00 × 10−4 | 1 | 1 |
| 007694 | ACOX1 | Acyl-CoA oxidase 1 | 007823 | 0.00 × 100 | 8 | 1 |
| 007907 | FLOT2 | Flotillin 2 | 008015 | 0.00 × 100 | 1 | 1 |
| 008206 | ADGRG6 | Adhesion G protein-coupled receptor G6 | 008405 | 1.21 × 10−2 | 4 | 1 |
| 009366 | PLEKHG3 | Pleckstrin homology and RhoGEF domain containing G3 | 026649 | 1.43 × 10−2 | 1 | 2 |
| 010640 | MYLK | Myosin light chain kinase | 010735 | 3.00 × 10−4 | 2 | 1 |
| 011707 | FYN | FYN proto-oncogene, Src family tyrosine kinase | 011760 | 5.00 × 10−3 | 1 | 1 |
| 013938 | SPEG | SPEG complex locus | 023342 | 3.99 × 10−2 | 2 | 2 |
| 014232 | RBBP5 | RB binding protein 5 | 014319 | <0.00 × 10−5 | 5 | 2 |
| 014373 | CD82 | Tetraspanin | 014458 | 2.92 × 10−2 | 5 | 1 |
| 015062 | COL3A1 | Collagen type III α 1 chain | 015539 | 2.92 × 10−2 | 11 | 4 |
| 015121 | SLC26A4 | Solute carrier family 26 member 4 | 015204 | 3.02 × 10−2 | 2 | 1 |
| 015894 | PNN | Pinin, desmosome associated protein | 015973 | 1.56 × 10−2 | 1 | 1 |
| 016077 | 016133 | 1.00 × 10−3 | 4 | 3 | ||
| 017036 | NIF3L1 | NGG1 interacting factor 3 like 1 | 017110 | 8.50 × 10−3 | 4 | 1 |
| 3500–4500 m | ||||||
| GeneID | Gene | Gene description | Transcript | B | FB | aB |
| 000773 | IL1RAP | Interleukin 1 receptor accessory protein | 000813 | <0.00 × 10−5 | 2 | 5 |
| 001545 | ZCCHC8 | Zinc finger CCHC-type containing 8 | 001573 | 2.78 × 10−2 | 1 | 2 |
| 002034 | CCDC138 | Coiled-coil domain containing 138 | 002022 | 4.00 × 10−4 | 1 | 1 |
| 003612 | NR3C2 | Nuclear receptor subfamily 3 group C member 2 | 003696 | 4.97 × 10−2 | 7 | 1 |
| 004231 | COL6A3 | Collagen type VI α 3 chain | 004512 | <0.00 × 10−5 | 6 | 2 |
| 005058 | RPL11 | Ribosomal protein L11 | 005076 | 1.67 × 10−2 | 1 | 1 |
| 005562 | PFKM | Phosphofructokinase, muscle | 005957 | 4.83 × 10−2 | 3 | 2 |
| 006776 | FUS | FUS RNA-binding protein | 006895 | 6.70 × 10−3 | 1 | 1 |
| 006926 | CTNND1 | Catenin delta 1 | 007006 | 3.00 × 10−4 | 1 | 1 |
| 007250 | ATP11B | ATPase phospholipid transporting 11B (putative) | 007392 | 1.40 × 10−3 | 12 | 1 |
| 007887 | ABHD3 | Abhydrolase domain containing 3, phospholipase | 007896 | 4.00 × 10−3 | 2 | 2 |
| 009164 | NADK2 | NAD kinase 2, mitochondrial | 009211 | 1.18 × 10−2 | 1 | 1 |
| 009700 | 009686 | 5.50 × 10−3 | 1 | 1 | ||
| 009800 | FLNA | Filamin A | 010200 | 0.00 × 100 | 2 | 2 |
| 011843 | SENP7 | SUMO specific peptidase 7 | 011850 | 1.43 × 10−2 | 1 | 1 |
| 013301 | COL1A2 | Collagen type I α 2 chain | 013614 | 1.20 × 10−3 | 17 | 3 |
| 013313 | AGAP2 | ArfGAP with GTPase domain, ankyrin repeat, PH domain 2 | 013455 | 3.40 × 10−3 | 1 | 1 |
| 014695 | RFWD2 | COP1 E3 ubiquitin ligase | 014789 | 1.00 × 10−3 | 2 | 2 |
| 014919 | ALDOA | Aldolase, fructose-bisphosphate A | 014984 | 2.99 × 10−2 | 2 | 1 |
| 015785 | CDK14 | Cyclin-dependent kinase 14 | 015872 | 6.10 × 10−3 | 1 | 1 |
| 016662 | PGS1 | Phosphatidylglycerophosphate synthase 1 | 016740 | <0.00 × 10−5 | 1 | 1 |
| 017054 | NFIX | Nuclear factor I X | 017136 | 9.50 × 10−3 | 1 | 1 |
| 017166 | FARSA | Phenylalanyl-tRNA synthetase subunit α | 017239 | 2.58 × 10−2 | 1 | 2 |
| 017936 | ATP6V0A1 | ATPase H + transporting V0 subunit a1 | 018008 | 3.50 × 10−3 | 1 | 1 |
| Frogs and lizards, common genes at similar elevation | ||||||
| Gene ID | Gene | Gene description | Transcript | B | FB | aB |
| 016662 | PGS1 | Phosphatidylglycerophosphate synthase 1 | 016740 | <0.00 × 10−5 | 1 | 1 |
4. Discussion
This study presents the first transcriptome sequences from different tissues of the Ladakh Ground Skink Asymblepharus ladacensis, a high-altitude reptile species endemic to the Greater Himalayas. We provide high-quality de novo transcript assemblies and well-annotated results, enabling comparisons with transcriptomes of related scincid or higher lizards available at public databases. Although squamate reptiles (lizards and snakes) represent one of the most diverse vertebrate groups with over 10,000 species spanning more than 200 million years of evolution [66], genomic data of squamates are limited and still poorly studied [67]. To our knowledge, no annotated transcriptome has been published for the genus Asymblepharus so far. Although our study is mostly descriptive, it has yielded discoveries with respect to genes known to play roles in the adaptation of vertebrates to high elevations and adds data resources for genomic studies in Himalayan herpetofauna.
Yang et al. [15] used comparative transcriptomic analyses of two closely related lizards, Phrynocephalus przewalskii from low elevations (500–1500 m a.s.l.) and P. vlangalii from high elevations (2000–4600 m a.s.l.), to identify candidate genes that are potentially linked to adaptation to high elevation environments. In addition, Sun et al. [7] tested amphibian and reptile populations at various altitudes in Tibet, which show parallel evolution. These studies demonstrated convergent and continuous adaptation to high elevations in Anura (Ranidae) and Sauropsida (Agamidae). Genes with related functions, especially DNA repair and energy metabolism, exhibit featured rapid changes and are positively correlated to elevation. These data let us assume that a similar genomic high-elevation selection syndrome might be detectable in Asymblepharus, sampled in 2714 m a.s.l. (Methods), and with a vertical distribution between ~2500 and 5500 m ([26] and references therein).
Indeed, we identified a total of 10 out of 143 and 104 out of ~1500 key genes [7,15] under positive selection in the Asymblepharus transcriptome. Interestingly, several of the 10 genes (Table 4) have been reported to be under positive selection or significantly enriched or differentially methylated for pathways consistent with physiological compensation for limited oxygen in high elevation dwellers, e.g., IL1RAP [68] (human), MIA3 [69] (pika), and MICU1 [70] (Ladakhi cow). Several genes we identified to be under positive selection have GO terms that suggest their involvement in, e.g., energy metabolism and DNA repair. It is well-known that high UV radiation and hypoxia are major challenges for organisms in high-altitude habitats. The extreme environments in the Greater Himalayas necessitate high energy metabolism, strong resistance to UV by an efficient DNA repair, and adaptation to hypoxia in species endemic to these mountains. However, the proportion of genes we found in Asymblepharus per ‘altitude-specific’ gene group does not appear to reflect a convergently evolved gene set as previously reported [7]. In other words, we found a comparable number of genes under positive selection from the group of genes identified for lowland species and those identified for colline, montane, or alpine species. Given the idea of convergent evolutionary changes and, thus, a gradual accumulation of high-elevation genetic adaptations, a higher number of candidate genes reported for montane and alpine species would have been expected in Asymblepharus compared to those genes reported in species distributed in lower elevations. The potential reasons for this supposed lack of confirmation of the suggested pattern are complex. A major limitation is that our analyses were restricted so far to a single Asymblepharus transcriptome. Due to limitations of sampling, data were unobtainable from radiation of species or an altitudinal gradient as available for frogs and other lizards [7,15], preventing us from intraspecific comparisons. Moreover, data of a single specimen cannot reflect the breadth of allelic diversity in the selected genes, putatively associated with adaptations to high altitude, especially in the view of wide vertical distribution. Another deficiency results from the fact that only one female but no male could be sequenced. Indeed, many genes that might be sex-specifically expressed might not have been sequenced or characterized with our approach. Therefore, future research with multiple high-elevation species and populations across a larger scale of altitudinal variation should validate genes known to contribute to high elevation adaptation in scincid reptiles and, thus, yield additional evidence for potential convergent evolution.
A promising future genomic approach might be to include populations of Asymblepharus between the species‘ lower and upper distributional periphery, sampling three populations at each of four elevation levels (e.g., <2000 m, 3000 m, 4000 m, and 5000 m a.s.l.), to investigate the expression of genes presumably related to adaptions to high altitude. It would also be desirable to reveal potential tissue-specific expression patterns across altitude-associated genes, samples of organs sensitive to UV radiation, and for oxygen, e.g., heart, lung, skin, and embryonic structures, which are particularly of interest. Moreover, to address adaptive convergence, additional comparative transcriptomic analyses on Japalura or Laudakia species might be promising since these taxa have a similarly broad vertical distribution as Asymblepharus and often co-occur with sympatric ground skinks. Asymblepharus might even become an excellent model to study local adaptation by reciprocal transplantation experiments between high and low altitude populations. Such studies could enhance our understanding of how organisms might cope with rapid environmental changes in fluctuating demographic contexts. However, such intensive field studies require adequate access to suitable habitats in the Himalayas and, thus, a much higher logistic and financial effort than available in our pilot study.
5. Conclusions
In summary, our present study provides the first transcriptomic data for a Himalayan reptile of the genus Asymblepharus and evidence for genes under positive selection for high-altitude adaptation of the Ladakh Ground Skink. Further research is encouraged to validate the key genes confirmed in this study by population genetic and functional genomic approaches. Comparative sequencing analyses for other Asymblepharus species may enable further insights into the adaptive basis of reptiles to different altitude environments in the Himalayas. Our data are available for future investigations on the evolution and environmental adaptation in Himalayan high-altitude vertebrates.
Acknowledgments
We are grateful to Markus Sandner and Santos Tamang for their assistance during fieldwork.
Supplementary Materials
The following are available online at https://www.mdpi.com/article/10.3390/genes12091423/s1, Figure S1: Basic statistics for the data derived from brain, embryonic disc, and pooled tissues based on the rnaQUAST report, Table S1: TransRate results for Asymblepharus ladacensis ORP assemblies, Table S2: Results of the quality assessment of the transcriptome assemblies from brain tissue, an embryonic disc, and pooled tissues of Asymblepharus ladacensis using rnaQUAST and the reference database of (a) Anolis carolinensis, (b) Gekko japonicus, and (c) Python bivittatus, Table S3: List of positively selected genes in Phrynocephalus species (according to [2]) and Asymblepharus ladacensis, their functional categories and summary of tests for positive selection.
Author Contributions
Conceptualization, S.H., C.B.B., and M.S.; methodology, S.H., C.B.B., M.S., and L.P.; formal analysis, S.H. and L.P.; investigation, S.H.; resources, S.H., C.B.B., M.S., and L.P.; data curation, S.H.; writing—original draft preparation, S.H.; writing—review and editing, all authors; visualization, S.H.; project administration, S.H.; funding acquisition, S.H. and M.S. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the German Research Foundation, grant number HO 3792/8-1, to S.H., and by COFASP/ERANET (STURGEoNOMICS) by the German Federal Ministry of Food and Agriculture through the Federal Office for Agriculture and Food, grant no. 2816ERA04G, to MS.
Institutional Review Board Statement
All treatments were carried out in accordance with approved guidelines and according to the permit as well as the local animal welfare committee’s in-structions (VNME 17, 15–30).
Informed Consent Statement
Not applicable.
Data Availability Statement
Data generated in this study are publicly available from the NCBI GenBank database under the Bioproject ID PRJNA750278, BioSamples SAMN20458631, SAMN20458632, and SAMN20458631. All sequence data were deposited in the NCBI Sequence Read Archive (SRA, http://www.ncbi.nlm.nih.gov/Traces/sra/; accessed on 2 August 2021) under the accession numbers SRR15283177, SRR15283178, and SRR15283179; assembled sequences were transmitted to NCBI Transcriptome Shotgun Assembly Sequence Database (TSA, http://www.ncbi.nlm.nih.gov/genbank/tsa (accessed on 2 August 2021).
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Mittermeier R.A., Robles-Gil P., Homan M., Pilgrim J., Brooks T., Mittermeier C.G., Lamoreux J., da Fonseca G.A.B. Hotspots Revisited. CEMEX; Mexico City, Mexico: 2004. p. 392. [Google Scholar]
- 2.Hofmann S., Stoeck M., Zheng Y., Ficetola F.G., Li J.T., Scheidt U., Schmidt J. Molecular Phylogenies Indicate a Paleo-Tibetan Origin of Himalayan Lazy Toads (Scutiger) Sci. Rep. 2017;7:3308. doi: 10.1038/s41598-017-03395-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Yang W., Qi Y., Fu J. Genetic signals of high-altitude adaptation in amphibians: A comparative transcriptome analysis. BMC Genet. 2016;17:134. doi: 10.1186/s12863-016-0440-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wang G.-D., Zhang B.-L., Zhou W.-W., Li Y.-X., Jin J.-Q., Shao Y., Yang H.-C., Liu Y.-H., Yan F., Chen H.-M., et al. Selection and Environmental Adaptation Along a Path to Speciation in the Tibetan Frog Nanorana parkeri. Proc. Natl. Acad. Sci. USA. 2018;115:E5056–E5065. doi: 10.1073/pnas.1716257115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Che J., Zhou W.-W., Hu J.-S., Yan F., Papenfuss T.J., Wake D.B., Zhang Y.-P. Spiny frogs (Paini) illuminate the history of the Himalayan region and Southeast Asia. Proc. Natl. Acad. Sci. USA. 2010;107:13765–13770. doi: 10.1073/pnas.1008415107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lu B., Jin H., Fu J. Molecular convergent and parallel evolution among four high-elevation anuran species from the Tibetan region. BMC Genom. 2020;21:839. doi: 10.1186/s12864-020-07269-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Sun Y.B., Fu T.T., Jin J.Q., Murphy R.W., Hillis D.M., Zhang Y.P., Che J. Species Groups Distributed across Elevational Gradients Reveal Convergent and Continuous Genetic Adaptation to High Elevations. Proc. Natl. Acad. Sci. USA. 2018;115:E10634–E10641. doi: 10.1073/pnas.1813593115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dorge T., Hofmann S., Wangdwei M., Duoje L., Solhøy T., Miehe G. The ecological specialist, Thermophis baileyi (Wall, 1907)—New records, distribution, and biogeographic conclusions. Herpetol. Bull. 2007;101:8–12. [Google Scholar]
- 9.Jin Y.T., Brandt D.Y.C., Li J., Wo Y., Tong H., Shchur V. Elevation as a selective force on mitochondrial respiratory chain complexes of the Phrynocephalus lizards in the Tibetan plateau. Curr. Zool. 2020;67:191–199. doi: 10.1093/cz/zoaa056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Baig K.J., Wagner P., Ananjeva N.B., Böhme W. A morphology-based taxonomic revision of Laudakia Gray, 1845 (Squamata: Agamidae) Vertebr. Zool. 2012;62:213–260. [Google Scholar]
- 11.Gangloff E.J., Telemeco R.S. High Temperature, Oxygen, and Performance: Insights from Reptiles and Amphibians. Integr. Comp. Biol. 2018;58:9–24. doi: 10.1093/icb/icy005. [DOI] [PubMed] [Google Scholar]
- 12.Jackson D.C. Temperature and hypoxia in ectothermic tetrapods. J. Thermal. Biol. 2007;32:125–133. doi: 10.1016/j.jtherbio.2007.01.007. [DOI] [Google Scholar]
- 13.Yang W.J., Qi Y., Bi K., Fu J. Toward Understanding the Genetic Basis of Adaption to High-Elevation Life in Poikilothermic Species: A Comparative Transcriptomic Analysis of Two Ranid Frogs, Rana chensinensis and R. kukunoris. BMC Genom. 2012;13:588. doi: 10.1186/1471-2164-13-588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Li J.T., Gao Y.-D., Xie L., Deng C., Shi P., Guan M.-L., Huang S., Ren J.-L., Wu D.-D., Ding L., et al. Comparative genomic investigation of high-elevation adaptation in ectothermic snakes. Proc. Natl. Acad. Sci. USA. 2018;115:8406–8411. doi: 10.1073/pnas.1805348115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Yang W., Qi Y., Fu J. Exploring the genetic basis of adaptation to high elevations in reptiles: A comparative transcriptome analysis of two toad-headed agamas (genus Phrynocephalus) PLoS ONE. 2014;9:e112218. doi: 10.1371/journal.pone.0112218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hofmann S., Kuhl H., Baniya C.B., Stock M. Multi-Tissue Transcriptomes Yield Information on High-Altitude Adaptation and Sex-Determination in Scutiger cf. sikimmensis. Genes. 2019;10:873. doi: 10.3390/genes10110873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mosbrugger V., Favre L., Müllner-Riehl A., Päckert M., Mulch A. Cenozoic Evolution of Geo-Biodiversity in the Tibeto-Himalayan Region. In: Hoorn C., Perrigo A., Antonelli A., editors. Mountains, Climate and Biodiversity. Wiley-Blackwell; Oxford, UK: 2018. pp. 429–449. [Google Scholar]
- 18.Pacifici M., Foden W.B., Visconti P., Watson J.E.M., Butchart S.H.M., Kovacs K.M., Scheffers B.R., Hole D.G., Martin T.G., Akçakaya H.R., et al. Assessing species vulnerability to climate change. Nat. Clim. Chang. 2015;5:215–225. doi: 10.1038/nclimate2448. [DOI] [Google Scholar]
- 19.Xu J., Badola R., Chettri N., Chaudhary R.P., Zomer R., Pokhrel B., Hussain S.A., Pradhan S., Pradhan R. Sustaining Biodiversity and Ecosystem Services in the Hindu Kush Himalaya. In: Wester P., Mishra A., Mukherji A., Shrestha A., editors. The Hindu Kush Himalaya Assessment. Springer; Berlin/Heidelberg, Germany: 2019. pp. 127–165. [Google Scholar]
- 20.Kouyoumdjian L., Gangloff E., Souchet J., Cordero G.A., Dupoué A., Aubret F. Transplanting gravid lizards to high elevation alters maternal and embryonic oxygen physiology, but not reproductive success or hatchling phenotype. J. Exp. Biol. 2019;222:jeb206839. doi: 10.1242/jeb.206839. [DOI] [PubMed] [Google Scholar]
- 21.Banasiak K.J., Xia Y., Haddad G.G. Mechanisms underlying hypoxia-induced neuronal apoptosis. Prog. Neurobiol. 2000;62:215–249. doi: 10.1016/S0301-0082(00)00011-3. [DOI] [PubMed] [Google Scholar]
- 22.Bickler P.E., Donohoe P.H. Adaptive responses of vertebrate neurons to hypoxia. J. Exp. Biol. 2002;205:3579–3586. doi: 10.1242/jeb.205.23.3579. [DOI] [PubMed] [Google Scholar]
- 23.Pyron R.A., Burbrink F.T., Wiens J.J. A phylogeny and revised classification of Squamata, including 4161 species of lizards and snakes. BMC Evol. Biol. 2013;13:93. doi: 10.1186/1471-2148-13-93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Linkem C.W., Diesmos A.C., Brown R.M. Molecular systematics of the Philippine forest skinks (Squamata: Scincidae: Sphenomorphus): Testing morphological hypotheses of interspecific relationships. Zool. J. Linn. Soc. Lond. 2011;163:1217–1243. doi: 10.1111/j.1096-3642.2011.00747.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Shea G.M., Greer A.E. From Sphenomorphus to Lipinia: Generic reassignment of two poorly known New Guinea skinks. J. Herpetol. 2002;36:148–156. doi: 10.1670/0022-1511(2002)036[0148:FSTLGR]2.0.CO;2. [DOI] [Google Scholar]
- 26.Borkin L.J., Litvinchuk S.N., Melnikov D.A., Skorinov D.V. Altitudinal distribution of skinks of the genus Asymblepharus in the Western Himalaya, India (Reptilia: Sauria: Scincidae) In: Hartmann R., Weipert J., Barclay M., editors. Biodiversität und Naturausstattung im Himalaya VI. Verein der Freunde & Förderer des Naturkundemuseums Erfurt e.V.; Erfurt, Germany: 2018. pp. 163–167. [Google Scholar]
- 27.Hartmann M., Weipert J., Weigel A. Die zoologischen Nepal-Expeditionen des Naturkundemuseums Erfurt. Veröffentlichungen des Nat. Erf. 1998;17:15–30. [Google Scholar]
- 28.Hartmann M., Weipert J. Biodiversität und Naturausstattung im Himalaya V. Verein der Freunde und Förderer des Naturkundemuseums Erfurt e.V.; Erfurt, Germany: 2015. p. 580. [Google Scholar]
- 29.Andrews S. FastQC: A Quality Control Tool for High Throughput Sequence Data. 2010. [(accessed on 2 July 2021)]. Available online: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
- 30.Kopylova E., Noe L., Touzet H. SortMeRNA: Fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28:3211–3217. doi: 10.1093/bioinformatics/bts611. [DOI] [PubMed] [Google Scholar]
- 31.Mac Manes M.D. The Oyster River Protocol: A multi assembler and Kmer approach for de novo transcriptome assembly. PEERJ. 2018;6:e5428. doi: 10.7717/peerj.5428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hoelzer M., Marz M. De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers. Gigascience. 2019;8:1–16. doi: 10.1093/gigascience/giz039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lu B., Zeng Z., Shi T. Comparative study of de novo assembly and genome-guided assembly strategies for transcriptome reconstruction based on RNA-Seq. Sci. China Life Sci. 2013;56:143–155. doi: 10.1007/s11427-013-4442-z. [DOI] [PubMed] [Google Scholar]
- 34.Bolger A.M., Lohse M., Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Song L., Florea L. Rcorrector: Efficient and accurate error correction for Illumina RNA-seq reads. Gigascience. 2015;4:48. doi: 10.1186/s13742-015-0089-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Haas B., Papanicolaou A., Yassour M., Grabherr M., Blood P., Bowden J., Couger M., Eccles D., Li B., Lieber M., et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 2013;8:1494–1512. doi: 10.1038/nprot.2013.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Chikhi R., Medvedev P. Informed and automated k-mer size selection for genome assembly. Bioinformatics. 2014;30:31–37. doi: 10.1093/bioinformatics/btt310. [DOI] [PubMed] [Google Scholar]
- 38.Kannan S., Hui J., Mazooji K., Pachter L., Tse D. Shannon: An Information-Optimal de novo RNA-Seq Assembler. BioRxiv. 2016 doi: 10.1101/039230. [DOI] [Google Scholar]
- 39.Smith-Unna R., Boursnell C., Patro R., Hibberd J.M., Kelly S. TransRate: Reference-free quality assessment of de novo transcriptome assemblies. Genome Res. 2016;26:1134–1144. doi: 10.1101/gr.196469.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Seppey M., Manni M., Zdobnov E.M. BUSCO: Assessing Genome Assembly and Annotation Completeness. In: Kollmar M., editor. Gene Prediction. Methods in Molecular Biology. Volume 1962. Humana; Louisville, KY, USA: 2019. [DOI] [PubMed] [Google Scholar]
- 41.Simao F.A., Waterhouse R.M., Ioannidis P., Kriventseva E.V., Zdobnov E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- 42.Bushmanova E., Antipov D., Lapidus A., Suvorov V., Prjibelski A.D. rnaQUAST: A quality assessment tool for de novo transcriptome assemblies. Bioinformatics. 2016;32:2210–2212. doi: 10.1093/bioinformatics/btw218. [DOI] [PubMed] [Google Scholar]
- 43.Buchfink B., Xie C., Huson D.H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods. 2015;12:59–60. doi: 10.1038/nmeth.3176. [DOI] [PubMed] [Google Scholar]
- 44.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic Local Aligment Search Tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 45.Conesa A., Gotz S. Blast2GO: A comprehensive suite for functional analysis in plant genomics. Int. J. Plant Genom. 2008;2008:619832. doi: 10.1155/2008/619832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Quevillon E., Silventoinen V., Pillai S., Harte N., Mulder N., Apweiler R., Lopez R. InterProScan: Protein domains identifier. Nucleic Acids Res. 2005;33:W116–W120. doi: 10.1093/nar/gki442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Mistry J., Finn R.D., Eddy S.R., Bateman A., Punta M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 2013;41:e121. doi: 10.1093/nar/gkt263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Rice P., Longden I., Bleasby A. EMBOSS: The European Molecular Biology Open Software Suite. Trends Genet. 2000;16:276–277. doi: 10.1016/S0168-9525(00)02024-2. [DOI] [PubMed] [Google Scholar]
- 49.Litvinchuk S.N., Borkin L.J., Hofmann S. Rediscovery of the high-altitude Lazy Toad, Scutiger occidentalis Dubois, 1978, in India. Russ. J. Herpetol. 2019;26:17–22. doi: 10.30906/1026-2296-2019-26-1-17-22. [DOI] [Google Scholar]
- 50.Katoh K., Standley D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Price M.N., Dehal P.S., Arkin A.P. FastTree 2—Approximately maximum-likelihood trees for large alignments. PLoS ONE. 2010;5:e9490. doi: 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Kosakovsky Pond S.L., Poon A.F.Y., Velazquez R., Weaver S., Hepler N.L., Murrell B., Shank S.D., Magalis B.R., Bouvier D., Nekrutenko A., et al. HyPhy 2.5—A Customizable Platform for Evolutionary Hypothesis Testing Using Phylogenies. Mol. Biol. Evol. 2020;37:295–299. doi: 10.1093/molbev/msz197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Pond S.L., Frost S.D., Muse S.V. HyPhy: Hypothesis testing using phylogenies. Bioinformatics. 2005;21:676–679. doi: 10.1093/bioinformatics/bti079. [DOI] [PubMed] [Google Scholar]
- 54.Spielman S.J., Weaver S., Shank S.D., Magalis B.R., Li M., Kosakovsky Pond S.L. Evolution of Viral Genomes: Interplay Between Selection, Recombination, and Other Forces. In: Anisimova M., editor. Evolutionary Genomics. Methods in Molecular Biology. Volume 1910. Humana; Louisville, KY, USA: 2019. [DOI] [PubMed] [Google Scholar]
- 55.Murrell B., Weaver S., Smith M.D., Wertheim J.O., Murrell S., Aylward A., Eren K., Pollner T., Martin D.P., Smith D.M., et al. Gene-wide identification of episodic selection. Mol. Biol. Evol. 2015;32:1365–1371. doi: 10.1093/molbev/msv035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Murrell B., Moola S., Mabona A., Weighill T., Sheward D., Kosakovsky Pond S.L., Scheffler K. FUBAR: A fast, unconstrained Bayesian approximation for inferring selection. Mol. Biol. Evol. 2013;30:1196–1205. doi: 10.1093/molbev/mst030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Smith M.D., Wertheim J.O., Weaver S., Murrell B., Scheffler K., Pond S.L.K. Less is more: An adaptive branch-site random effects model for efficient detection of episodic diversifying selection. Mol. Biol. Evol. 2015;32:1342–1353. doi: 10.1093/molbev/msv022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kosakovsky P.S.L., Murrell B., Fourment M., Frost S.D., Delport W., Scheffler K. 2011 A random effects branch-site model for detecting episodic diversifying selection. Mol. Biol. Evol. 2011;28:3033–3043. doi: 10.1093/molbev/msr125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Mi H., Muruganujan A., Thomas P.D. PANTHER in 2013: Modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res. 2013;41:D377–D386. doi: 10.1093/nar/gks1118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Moreno-Santillan D.D., Machain-Williams C., Hernandez-Montes G., Ortega J. De Novo Transcriptome Assembly and Functional Annotation in Five Species of Bats. Sci. Rep. 2019;9:6222. doi: 10.1038/s41598-019-42560-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Theissinger K., Falckenhayn C., Blande D., Toljamoc A., Gutekunstb J., Makkonenc J., Jussilac J., Lykob F., Schrimpfa A., Schulza R., et al. De novo assembly and annotation of the freshwater crayfish Astacus astacus transcriptome. Mar. Genom. 2016;28:7–10. doi: 10.1016/j.margen.2016.02.006. [DOI] [PubMed] [Google Scholar]
- 62.Waits D.S., Simpson D.Y., Sparkman A.M., Bronikowski A.M., Schwartz T.S. The utility of reptile blood transcriptomes in molecular ecology. Mol. Ecol. Resour. 2020;20:308–317. doi: 10.1111/1755-0998.13110. [DOI] [PubMed] [Google Scholar]
- 63.Carruthers M., Yurchenko A.A., Augley J.J., Adams C.E., Herzyk P., Elmer K.R. De novo transcriptome assembly, annotation and comparison of four ecological and evolutionary model salmonid fish species. BMC Genom. 2018;19:32. doi: 10.1186/s12864-017-4379-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Bushmanova E., Antipov D., Lapidus A., Prjibelski A.D. rnaSPAdes: A de novo transcriptome assembler and its application to RNA-Seq data. Gigascience. 2019;8:giz100. doi: 10.1093/gigascience/giz100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Gates K., Sandoval-Castillo J., Bernatchez L., Beheregaray L.B. De novo transcriptome assembly and annotation for the desert rainbowfish (Melanotaenia splendida tatei) with comparison with candidate genes for future climates. Mar. Genom. 2017;35:63–68. doi: 10.1016/j.margen.2017.05.008. [DOI] [PubMed] [Google Scholar]
- 66.Murphy W.J., Pringle T.H., Crider T.A., Springer M.S., Miller W. Using genomic data to unravel the root of the placental mammal phylogeny. Genome Res. 2007;17:413–421. doi: 10.1101/gr.5918807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Pasquesi G.I.M., Adams R.H., Card D.C., Schield D.R., Corbin A.B., Perry B.W., Reyes-Velasco J., Ruggiero R.P., Vandewege M.W., Shortt J.A., et al. Squamate reptiles challenge paradigms of genomic repeat element evolution set by birds and mammals. Nat. Commun. 2018;9:2774. doi: 10.1038/s41467-018-05279-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Childebayeva A., Goodrich J.M., Léon-Velarde F., Rivera-Chira M., Kiyamu M., Brutsaert T., Dolinoy D., Bigham A. Genome-wide DNA methylation changes associated with high-altitude acclimatization during an Everest base camp trek. Front. Physiol. 2021;12:660906. doi: 10.3389/fphys.2021.660906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Solari K.A., Ramakrishnan U., Hadly E.A. Gene expression is implicated in the ability of pikas to occupy Himalayan elevational gradient. PLoS ONE. 2018;13:e0207936. doi: 10.1371/journal.pone.0207936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Verma P., Sharma A., Sodhi M., Thakur K., Kataria R., Niranjan S.K., Bharti V., Kumar P., Giri A., Kalia S., et al. Transcriptome analysis of circulating PBMCs to understand mechanism of high-altitude adaptation in native cattle of Ladakh region. Sci. Rep. 2018;8:7681. doi: 10.1038/s41598-018-25736-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data generated in this study are publicly available from the NCBI GenBank database under the Bioproject ID PRJNA750278, BioSamples SAMN20458631, SAMN20458632, and SAMN20458631. All sequence data were deposited in the NCBI Sequence Read Archive (SRA, http://www.ncbi.nlm.nih.gov/Traces/sra/; accessed on 2 August 2021) under the accession numbers SRR15283177, SRR15283178, and SRR15283179; assembled sequences were transmitted to NCBI Transcriptome Shotgun Assembly Sequence Database (TSA, http://www.ncbi.nlm.nih.gov/genbank/tsa (accessed on 2 August 2021).


