Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2017 Mar 24;7:423. doi: 10.1038/s41598-017-00439-7

Variants in the host genome may inhibit tumour growth in devil facial tumours: evidence from genome-wide association

Belinda Wright 1, Cali E Willet 1,2, Rodrigo Hamede 3, Menna Jones 3, Katherine Belov 1, Claire M Wade 1,
PMCID: PMC5428454  PMID: 28341828

Abstract

Devil facial tumour disease (DFTD) has decimated wild populations of Tasmanian devils (Sarcophilus harrisii) due to its ability to avoid immune detection and pass from host to host by biting. A small number of devils have been observed to spontaneously recover from the disease which is otherwise fatal. We have sequenced the genomes of these rare cases and compared them to the genomes of devils who succumbed to the disease. Genome-wide association, based on this limited sampling, highlighted two key genomic regions potentially associated with ability to survive DFTD. Following targeted genotyping in additional samples, both of these loci remain significantly different between cases and controls, with the PAX3 locus retaining significance at the 0.001 level, though genome-wide significance was not achieved. We propose that PAX3 may be involved in a regulatory pathway that influences the slowing of tumour growth and may allow more time for an immune response to be mounted in animals with regressed tumours. This provides an intriguing hypothesis for further research and could provide a novel route of treatment for this devastating disease.

Introduction

The Tasmanian devil (Sarcophilus harrisii), a carnivorous marsupial native to Australia, is threatened with extinction in the wild due to devil facial tumour disease (DFTD). DFTD is a rare form of transmissible cancer that was first officially observed in 1996 in the north-east of Tasmania1. The tumour is transmitted as an allograft by biting during social interactions2, 3. As DFTD arose once in a north-eastern devil and then spread throughout the population, it is unusual in that the tumour genome has a different origin to that of its host. Since its emergence, DFTD has rapidly spread throughout all but the far-western populations of devils with infection rates of up to 95%4. This high infection rate is in part due to the ability of the tumour to evade immune detection by down-regulating the major histocompatibility complex (MHC)5. Tumours usually develop on the face and metastasise to distal organs, resulting in organ failure and death6, 7. Examination of the tumour transcriptome has indicated a Schwann cell origin – a cell of the peripheral nerve with a neural crest precursor8. There is now evidence of evolution in response to DFTD with allele frequency changes over time following disease-mediated sweeps in wild devil populations9.

In one wild north-western Tasmanian population, West Pencil Pine (WPP)10, it has been observed that some devils have spontaneously cleared the cancer. These individuals were confirmed to have DFTD when first sampled, yet tumours were recorded to have shrunk or disappeared on subsequent sampling11. There are no apparent differences in MHC profiles within this population relating to DFTD infection12. Some devils from this population have demonstrated serum antibodies against DFTD11. The underlying cause of this rare immune response remains unclear. Our hypothesis is that variants within the genomes of these particular devils may confer some resistance to DFTD progression or metastasis.

To investigate this we compared the genomes of seven devils that survived DFTD infection or displayed evidence of tumour regression to six young control devils that succumbed to the disease. Young control devils were used to account for potential effects of immunological senescence increasing susceptibility to infection. Through comparisons with the Tasmanian devil reference genome13, we discovered variable sites across the genomes of these samples. We next employed a genome-wide association to identify genomic regions that segregate different haplotypes between young, DFTD-infected devils and devils with regressed tumours from the WPP population. Three candidate loci were further targeted in additional controls from the same population. We present two key regions of the genome that may contribute to the observed tumour regression in these devils.

Results/Discussion

Genome-wide association (GWAS) analysis revealed a large region spanning 1.6 megabases (Mb) on chromosome 3. The region contains 38 single nucleotide polymorphisms (SNPs) that fall within the top 100 loci from association after filtering on quality, seven of which map within introns of the PAX3 gene (best p value = 0.00007; Table 1, also Table S1). A second associated region on chromosome 6 spans 830 kilobases (kb). This region contains 17 SNPs within the top 100 associated loci after filtering and eight of these map to intronic sequence of the TLL1 gene (best p value = 0.00022; Table 1, also Table S1). No SNPs affecting coding sequence were found within the associated regions in the genomes of the sampled devils.

Table 1.

Summary of loci segregating with DFTD regression identified by GWAS. Key candidate genes for DFTD regression are in bold.

Scaffold Scaffold Size (Mb) No. of SNPs (In top 100) Best p-value INDELs Nearest Candidate Genes
Chr1_0002 1.08 97 (2) 0.00039 189 MKI67
Chr1_0216 3.85 203 (1) 0.00094 443 RBL1
Chr1_0248 0.97 102 (1) 0.00049 217 CHRNA4
Chr1_0256 4.38 410 (1) 0.00094 1047 DOK5
Chr1_0386 0.71 136 (1) 0.00006 173 NBAS, DDX1
Chr2_0233 4.02 222 (2) 0.00045 562 YWHAZ
Chr2_0238 0.81 274 (1) 0.00094 411 EBAG9
Chr2_0398 3.39 419 (4) 0.00045 945 UNC13B
Chr3_0006 0.94 14 (1) 0.00067 74 SEPT2
Chr3_0048 2.23 96 (1) 0.00131 328 CD96
Chr3_0369 3.06 146 (2) 0.00028 451 STK11IP, INHA, OBSL1
Chr3_0388 4.28 407 (38) 0.00007 1496 PAX3, EPHA4
Chr3_0519 0.60 23 (1) 0.00028 55 PAX3 (pseudogene)
Chr4_0035 2.57 132 (2) 0.00028 342 RASAL2, BRINP2
Chr4_0110 2.19 106 (2) 0.00094 338 NGFR, PHB
Chr5_0005 1.75 41 (1) 0.00049 146 CERK
Chr5_0006 2.48 292 (1) 0.00067 636 FAM19A5
Chr5_0069 4.66 224 (2) 0.00131 971 BTBD11
Chr6_0050 4.00 131 (1) 0.00094 655 MAPK10, ARHGAP24
Chr6_0079/80 0.12 12 (0)* 0.00165 69 SPOCK3
Chr6_0081 1.17 161 (17) 0.00022 480 TLL1

*Included as scaffolds contains annotations for neighbouring genes of TLL1 (identified by GWAS) and corresponding to a syntenic block as identified by comparison with human, mouse and opossum.

A single SNP in the region of the NBAS gene on chromosome 1 obtained the lowest p value from the GWAS (p = 0.00006, Table 1). This SNP along with 2 SNPs from each segregating haplotype at the PAX3 and TLL1 loci were targeted in an additional 13 control samples from WPP and one additional regressed devil from this population. The significant association at the NBAS locus disappeared following addition of further samples (Table 2). The significant associations at both TLL1 and PAX3 were retained following addition of further samples though was somewhat diminished especially for the TLL1 locus (Table 2).

Table 2.

Results from association analysis in targeted SNPs (N = 27) as compared to results obtained from genome-wide association (N = 13).

Locus SNP GWAS Targeted loci
Chi-square p-value Chi-square p-value
NBAS 377874_T 16.34 0.00005*** 0.9441 0.33120
PAX3 2290941_C 13.48 0.00024** 10.94 0.00094**
PAX3 2253019_T 16.34 0.00006*** 15.48 0.00008***
TLL1 476140_C 11.8 0.00059** 5.44 0.01968*
TLL1 635881_C 11.8 0.00059** 5.347 0.02076*

*Significant at 0.05; **Significant at 0.001; ***Significant at 0.0001.

See Supplementary Material, Table S1 for a full distribution of genotypes for cases and controls.

PAX3 and TLL1 share a common physiological pathway. Both are involved in angiogenesis through regulation of intermediate targets, known as bone morphogenic proteins (BMPs). BMPs stimulate angiogenesis through activation of vascular endothelial growth factor (VEGF) in both normal development and cancer metastases1416. In our regressed samples we have investigated the host genomes rather than the tumour genome, so variants identified here in PAX3 and TLL1 would be expected to play a role in the tumour microenvironment rather than the tumour itself. PAX3 expression has been implicated in a number of cancers including neuroblastoma and may enhance oncogenic potential through regulating the expression of a suite of other genes17.

It is possible that variants of PAX3 and TLL1 affect angiogenesis in DFTD. PAX3 is also believed to directly regulate FLT1, a VEGF receptor18, 19. If the process of angiogenesis is disrupted by variants of PAX3 and/or TLL1, then tumour metastasis and growth may be hindered as it has no access to the blood supply and nutrients required for growth. This may allow the infected individual more time to mount an immune response to the disease. This serves as a potential mode of action for the regression of tumours observed in devils in this study.

There may be alternative modes of action whereby PAX3 and TLL1 influence the tumour microenvironment and impact tumour progression. It remains unclear whether PAX3 and TLL1 activity is triggered by DFTD infection in regressed cases. PAX3’s involvement in a number of other cancers makes it a particularly intriguing target for further research. We cannot rule out the possibility that other candidate genes identified in the GWAS play a role in tumour regression, or that tumour lineage may be an influencing factor.

Confirmation of these results is not possible from the current dataset, however results from targeted genotyping increase confidence in the involvement of the PAX3 locus in tumour regression. Assessment of the expression of BMP and FLT (VEGF) in tissues surrounding tumours during tumour progression would enable us to understand their role in angiogenesis in DFTD. However, such samples are currently unavailable and would be very difficult to source in the field as regressing tumours are rare and sequential access to tumour and host biopsies over the time points needed for analysis is not feasible. We cannot rule out the possibility that tumour regression is not under genetic control. Taken together with recent research identifying a genetic response to DFTD infection at a population level9, along with identification of an immune response to DFTD in some devils in the WPP population11, there is mounting evidence that genetics plays a role in resistance to DFTD. The current study is preliminary in nature and we have here identified two regions worthy of further investigation and proposed a potential mode of action whereby these loci could influence tumour progression.

Conclusion

Disruption of angiogenesis to developing tumours that slows growth and metastasis serves as a plausible model for DFTD regression in seven Tasmanian devils. Slowing of tumour growth rate may allow more time for an infected individual to mount an immune response. Tumour regression is expected to be a complex trait and many other factors and genes may be involved in this rare ability. Future work should aim to identify the frequency of this phenotype in the wild. Given the small number of animals that have recovered from the disease, it is likely to be very rare. However, the frequency may increase over time due to strong selection pressure from the disease. Confirmation of the involvement of PAX3 in DFTD regression may lead to novel treatments for this devastating disease.

Materials and Methods

Samples

We have sequenced the genomes of 12 devils from WPP – six of these (five females, one male) were individuals with regressed tumours that survived in the wild disease-free for some time (Table 3). Five devils (two females, three males) who had succumbed to DFTD at a young age were selected as controls. One female sample was discarded from the genome-wide association analysis as infection status was ambiguous, although genomic data from this sample was utilised for identification of variants. Devil samples for whole genome sequencing were collected in the field from WPP with approval from the University of Tasmania’s Animal Ethics Committee (A0010296 and A0013326) and from the Tasmanian Department of Primary Industries and Water (TFA15214, TFA08211, TFA14228, TFA13924). All methods were performed according to the relevant guidelines and regulations. In addition, we have utilised genome sequences from a previous study (GenBank: GCA_000219685.1, ref. 20). These two animals consist of a south-eastern male devil who succumbed to DFTD at a young age and a north-western male devil who survived a vaccination trial though eventually succumbed when re-infected in a laboratory trial.

Table 3.

Details of 11 West Pencil Pine samples used in the genome-wide association study.

Infection status Sex Y.O.B. Age 1st infected Diagnostic criteria Comments
Regressed F 2006 5 Visual inspection Survived in the wild until age 5 DFTD free
Regressed F 2006 5 Visual inspection Survived in the wild until age 6 DFTD free
Regressed F 2008 3 Histopathology/cytology Tumour regressed for ~6 months
Regressed F 2009 3 Visual inspection Tumour regressed for ~2 years
Regressed F 2006 3 Histopathology/cytology Tumour regressed for ~6 months
Regressed M 2009 3 Cytology Survived in wild until age 7 DFTD free
DFTD-infected M 2011 18 mths Histopathology/cytology Succumbed
DFTD-infected F 2011 15 mths Histopathology/cytology Succumbed
DFTD-infected M 2010 18 mths Histopathology/cytology Succumbed
DFTD-infected F 2011 18 mths Histopathology/cytology Succumbed
DFTD-infected F 2010 15 mths Histopathology/cytology Succumbed

Nb. When a tumour is detected within the mouth on sampling then no biopsy or fine needle aspiration is possible therefore visual inspection scores are used to assign infection status. As sampling took place at 3 month intervals, periods of tumour regression and survival are approximate. Y.O.B. – year of birth. Ages at first infection are given in years except for those given in months (mths).

Sequencing and variant discovery

Tissue (ear biopsy) was obtained and DNA extracted using a phenol-chloroform extraction method21. The 12 Tasmanian devil genomes were sequenced by the Ramaciotti Centre for Genomics at the University of New South Wales, Kensington, with 100 base pair (bp) paired-end libraries using Truseq library preparation kits on an Illumina HiSeq 2000. An initial seven samples were sequenced in a separate lane each with another five samples sequenced across three lanes. The average read depth coverage was 10–15 times. Genome sequences were deposited in the European Nucleotide Archive (accession numbers: ERS682204-ERS682210; ERS1202857-ERS1202861). Re-sequenced genomes were aligned to the Tasmanian devil reference genome assembly version 7.0, (GenBank: GCA_000189315.1, ref. 13) using Burrows-Wheeler Aligner version 0.6.222 with default parameters for paired-end data.

SNPs and INDELs

Polymorphic sites across the entire genome were identified using SAMtools mpileup version 0.1.1923, including only bases and reads with minimum base and mapping quality scores of 20, respectively. Pileup was generated at all sites to allow homozygous reference genotypes to be included in the dataset if at least one sample displayed a non-reference allele after quality filtering. PCR duplicates were removed using SAMtools and local realignment around insertions and deletions was performed using GATK IndelRealigner version 2.724. Insertions and deletions (INDELs) were detected using SAMtools23 with a coefficient of 50 based on a phred-scaled probability score used to scale down mapping quality of reads containing multiple mismatches. Variant call format files (.vcf) for individual animals were merged and filtered to retain individual locus calls with quality scores greater than 20.

Genome-wide association

Using custom perl scripts, the SNP vcf files were filtered and re-formatted into transposed PED (tped) format to be compatible with association analysis with PLINK version 1.0725 and Haploview version 4.226. SNPs where the root mean squared mapping quality at the site was less than 20 were excluded. For inclusion in the tped, genotypes were required to be covered by at least four high quality reads, with at least two reads or 20% of covering reads (whichever was greater) supporting the non-reference allele in order to call a heterozygous SNP. To minimize calling SNPs in duplicated loci, genotype calls were only accepted where read depth was less than 2.5 times the average mapped read depth. Variant sites were filtered for a minor allele frequency of 12% (to require an allele that is seen in at least two separate samples – i.e. 26 potential alleles from 13 samples, 3/26 = 11.5) and genotyping fail rate of <1%.

An adjusted standard genome-wide association (GWAS) in PLINK25 was applied to variant SNPs across the genome. The −log10 transformed probabilities arising from the individual locus Chi-squared tests were plotted across chromosomes using Haploview26 to visualise regional significance of allele frequency differences between cases (regressed devils) and controls (non-regressed devils). Due to the large number of loci in the analysis relative to the numbers of cases and controls, a second association analysis pruned the variant sites for local linkage disequilibrium (LD) using PLINK25. This removed SNPs in 50 bp windows where loci had LD of greater than 0.8.

Mutation detection

Within regions of association, the available genome annotation information was combined with SNP and INDEL variants detected in our sample set. Annotated genes in devil as well as the predicted locations of genes in devil based on the human and mouse genomes were obtained from the University of California Santa Cruz Genome Browser (http://genome.ucsc.edu/; refs 27 and 28) and corresponding protein sequences were downloaded from Ensembl release 82 (www.ensembl.org; ref. 29). With these, predictions on the functional consequences of variants were made. The concordances of genomic locations of SNPs and INDELS with coding regions or untranslated regions were recorded. Next, their potential to change protein sequence (SNPs only) was assessed by translating the sequence with and without the variant in relation to the known reading frame of the protein. SNPs in conserved regions were identified from comparisons with 33 placental mammals using phastCons and phylop30, 31 and converted to devil coordinates using liftOver27.

Associated reference scaffolds arising from the GWAS were investigated further. At this time the Tasmanian devil reference genome v7.013 does not have scaffolds that have been fully ordered and oriented. Instead, the reference genome comprises 35,974 scaffold sequences presumptively mapped to six autosomes and an X chromosome. We cannot conclusively identify all of the candidate genes in an associated region. To compensate for the lack of reference contiguity, significant SNPs occurring on smaller scaffolds with annotated genes were further examined by assessing synteny within more complete reference genomes including human (hg38)32, mouse (GRCm38)33 and opossum (monDom5)34 for potentially neighbouring genes.

Stratification and relatedness

Relatedness among samples was assessed by calculating the pair-wise identity by descent using PLINK25 to identify any underlying stratification that may influence the analysis. We employed multi-dimensional scaling to visualise the two main principal components explaining the genotypic variation among our samples to test for obvious stratification between the case and control groups. A small sample size and high relatedness estimates amongst our samples impacted the power to detect a genomic signal. For this reason we chose to investigate those 100 most significant SNPs from the GWAS as this method has previously been used to successfully map a causative mutation in a genome-wide association study35.

Targeted genotyping in additional samples

Primers were designed to target SNPs in the three key regions identified from the GWAS (Table S2). An additional 13 control devils from WPP who succumbed to DFTD with no apparent host response to infection were genotyped at these SNPs to identify whether the significance of the association between cases and controls was sustained for these loci (Table S3). One additional devil with confirmed regression of DFTD was also included (Table S3). Targeted SNPs were amplified by PCR using a HotStarTaq Plus DNA Polymerase kit (Qiagen) under the following conditions: 2 uL dNTPs (10 mM), 10 uL PCR buffer (10x), 2 uL forward and reverse primers (10 mM), 0.5 uL taq polymerase (5 U) per reaction using PCR cycling times of 95°C for 5 min initial denaturation, followed by 35 cycles of 94°C for 1 min, 58°C for 1 min, 72°C for 1 min and a final extension of 72°C for 10 min. Resulting PCR product was subjected to Sanger sequencing at the Australian Genome Research Facility, Westmead, NSW, Australia. Sequences were analysed using MEGA v6.0636 to determine individual genotypes. A second association test was conducted in PLINK on a locus by locus basis for the five targeted SNPs including both the new samples and the original 13 samples used in the GWAS.

Electronic supplementary material

Acknowledgements

We are grateful to Mark and Claire Walsh from Discovery Holiday Parks Cradle Mountain for providing accommodation and logistic support during fieldwork. Forico Pty Ltd. facilitated land access during our study. This work was funded by an ARC Discovery Grant (DP110102656) to M.J., K.B. and C.W. K.B. (FT0992212) and M.J. (FT100100250) were supported by ARC Future Fellowships, R.H. is supported by a National Science Foundation grant (NSF DEB-1316549). The authors wish to thank all volunteers who participated in field sampling and the Save the Tasmanian devil Program of the Tasmanian Department of Primary Industries, Parks, Water and Environment who provided help with DFTD diagnostics. The authors acknowledge the Sydney Informatics Core Research Facility and the University of Sydney HPC service at The University of Sydney for providing resources that have contributed to the research results reported within this paper.

Author Contributions

B.W. wrote the manuscript and analysed the GWAS data, C.E.W. wrote the custom perl scripts used in data transformation and predicted functional consequences of variants, R.H. performed trapping of wild devils and along with M.J. provided guidance on DFTD diagnoses and study design, C.M.W. and K.B. conceived of the study and supervised the project, C.M.W. provided guidance on interpretation of GWAS data. All authors have revised and contributed to the manuscript.

Competing Interests

The authors declare that they have no competing interests.

Footnotes

Electronic supplementary material

Supplementary information accompanies this paper at doi:10.1038/s41598-017-00439-7

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Hawkins CE, et al. Emerging disease and population decline of an island endemic, the Tasmanian devil Sarcophilus harrisii. Biological Conservation. 2006;131:307–324. doi: 10.1016/j.biocon.2006.04.010. [DOI] [Google Scholar]
  • 2.Pearse AM, Swift K. Allograft theory: Transmission of devil facial-tumour disease. Nature. 2006;439:549–549. doi: 10.1038/439549a. [DOI] [PubMed] [Google Scholar]
  • 3.Hamede RK, McCallum H, Jones M. Biting injuries and transmission of Tasmanian devil facial tumour disease. Journal of Animal Ecology. 2013;82:182–190. doi: 10.1111/j.1365-2656.2012.02025.x. [DOI] [PubMed] [Google Scholar]
  • 4.McCallum H, et al. Distribution and impacts of Tasmanian devil facial tumor disease. EcoHealth. 2007;4:318–325. doi: 10.1007/s10393-007-0118-0. [DOI] [Google Scholar]
  • 5.Siddle HV, et al. Reversible epigenetic down-regulation of MHC molecules by devil facial tumour disease illustrates immune escape by a contagious cancer. Proceedings of the National Academy of Sciences. 2013;110:5103–5108. doi: 10.1073/pnas.1219920110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Loh R, et al. The pathology of devil facial tumor disease (DFTD) in Tasmanian devils (Sarcophilus harrisii) Veterinary Pathology Online. 2006;43:890–895. doi: 10.1354/vp.43-6-890. [DOI] [PubMed] [Google Scholar]
  • 7.Pyecroft SB, et al. Towards a case definition for devil facial tumour disease: what is it? EcoHealth. 2007;4:346–351. doi: 10.1007/s10393-007-0126-0. [DOI] [Google Scholar]
  • 8.Murchison EP, et al. The Tasmanian devil transcriptome reveals Schwann cell origins of a clonally transmissible cancer. Science. 2010;327:84–87. doi: 10.1126/science.1180616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Epstein, B. et al. Rapid evolutionary response to a transmissible cancer in Tasmanian devils. Nature Communications7 (2016). [DOI] [PMC free article] [PubMed]
  • 10.Hamede RK, et al. Transmissible cancer in Tasmanian devils: localized lineage replacement and host population response. Proc. R. Soc. B. 2015;282:1814. doi: 10.1098/rspb.2015.1468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Pye, R. et al. Demonstration of immune responses against devil facial tumour disease in wild Tasmanian devils. Biology Letters12, doi:10.1098/rsbl.2016.0553 (2016). [DOI] [PMC free article] [PubMed]
  • 12.Lane A, et al. New insights into the role of MHC diversity in devil facial tumour disease. PloS One. 2012;7:e36955. doi: 10.1371/journal.pone.0036955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Murchison EP, et al. Genome sequencing and analysis of the Tasmanian devil and its transmissible cancer. Cell. 2012;148:780–791. doi: 10.1016/j.cell.2011.11.065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Deckers MM, et al. Bone morphogenetic proteins stimulate angiogenesis through osteoblast-derived vascular endothelial growth factor A. Endocrinology. 2002;143:1545–1553. doi: 10.1210/endo.143.4.8719. [DOI] [PubMed] [Google Scholar]
  • 15.Dai J, et al. Vascular endothelial growth factor contributes to the prostate cancer-induced osteoblast differentiation mediated by bone morphogenetic protein. Cancer research. 2004;64:994–999. doi: 10.1158/0008-5472.CAN-03-1382. [DOI] [PubMed] [Google Scholar]
  • 16.Langenfeld EM, Langenfeld J. Bone morphogenetic protein-2 stimulates angiogenesis in developing tumors. Molecular Cancer Research. 2004;2:141–149. [PubMed] [Google Scholar]
  • 17.Fang WH, et al. PAX3 in neuroblastoma: oncogenic potential, chemosensitivity and signalling pathways. Journal of cellular and molecular medicine. 2014;18:38–48. doi: 10.1111/jcmm.12155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Linardic CM. PAX3–FOXO1 fusion gene in rhabdomyosarcoma. Cancer letters. 2008;270:10–18. doi: 10.1016/j.canlet.2008.03.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Barber TD, et al. Identification of target genes regulated by PAX3 and PAX3–FKHR in embryogenesis and alveolar rhabdomyosarcoma. Genomics. 2002;79:278–284. doi: 10.1006/geno.2002.6703. [DOI] [PubMed] [Google Scholar]
  • 20.Miller W, et al. Genetic diversity and population structure of the endangered marsupial Sarcophilus harrisii (Tasmanian devil) Proceedings of the National Academy of Sciences. 2011;108:12348–12353. doi: 10.1073/pnas.1102838108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Sanbrook, J., Fritsch, E. & Maniatis, T. Molecular cloning: a laboratory manual. Cold Spring Harbor Laboratory Cold Spring Harbor, NY (1989).
  • 22.Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Li H, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Purcell S, et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. American journal of human genetics. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21:263–265. doi: 10.1093/bioinformatics/bth457. [DOI] [PubMed] [Google Scholar]
  • 27.Kent WJ, et al. The human genome browser at UCSC. Genome research. 2002;12:996–1006. doi: 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Karolchik D, et al. The UCSC Table Browser data retrieval tool. Nucleic acids research. 2004;32:D493–D496. doi: 10.1093/nar/gkh103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Flicek P, et al. Ensembl 2014. Nucleic Acids Research. 2014;42:D749–D755. doi: 10.1093/nar/gkt1196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Pollard KS, Hubisz MJ, Rosenbloom KR, Siepel A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome research. 2010;20:110–121. doi: 10.1101/gr.097857.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Siepel A, et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome research. 2005;15:1034–1050. doi: 10.1101/gr.3715005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Consortium IHGS. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
  • 33.Consortium MGS. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. doi: 10.1038/nature01262. [DOI] [PubMed] [Google Scholar]
  • 34.Mikkelsen TS, et al. Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences. Nature. 2007;447:167–177. doi: 10.1038/nature05805. [DOI] [PubMed] [Google Scholar]
  • 35.Willet CE, et al. Canine disorder mirrors human disease: exonic deletion in hes7 causes autosomal recessive spondylocostal dysostosis in miniature schnauzer dogs. PloS One. 2015;10:e0117055. doi: 10.1371/journal.pone.0117055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Tamura, K. et al. MEGA6: Molecular evolutionary genetics analysis version 6.0. Molecular Biology and Evolution30, 2725–2729 (2013). [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES