ABSTRACT
Dogs are the most phenotypically diverse mammalian species, and they possess more known heritable disorders than any other non-human mammal. Efforts to catalog and characterize genetic variation across well-chosen populations of canines are necessary to advance our understanding of their evolutionary history and genetic architecture. To date, no organized effort has been undertaken to sequence the world's canid populations. The Dog10K Consortium (http://www.dog10kgenomes.org) is an international collaboration of researchers from across the globe who will generate 20× whole genomes from 10 000 canids in 5 years. This effort will capture the genetic diversity that underlies the phenotypic and geographical variability of modern canids worldwide. Breeds, village dogs, niche populations and extended pedigrees are currently being sequenced, and de novo assemblies of multiple canids are being constructed. This unprecedented dataset will address the genetic underpinnings of domestication, breed formation, aging, behavior and morphological variation. More generally, this effort will advance our understanding of human and canine health.
Keywords: genomics, genome-wide association studies (GWAS), breed, selection, variation, evolution
INTRODUCTION
Domestic dogs (Canis lupus familiaris) are the most variable mammalian species on Earth [1–3] (Fig. 1). Strong artificial selection has produced approximately 450 globally recognized breeds with distinct traits related to morphology [4] including, but not limited to, body size [4,5], tail phenotype [6], fur type [7,8], skull shape [6,9–11] and pigmentation [12–15]. Strong breed variation also exists in behavioral traits including herding, guarding and hunting [16], as well as personality traits (e.g. hypersocial behavior) [17] including boldness [18] and aggression [19]. The adoption of the ‘breed barrier rule’, i.e. that no dog may become a registered member of a breed unless both its dam and sire are registered members, has led to the establishment of breeds with highly restricted gene pools [20–22]. As a result, there is strong phenotypic homogeneity within all breeds [23]. Most breeds were established within the last 200 years [23,24] and were derived from small numbers of founders [24,25]. Consequently, the extraordinary phenotypic variation across dog breeds is accessible through analysis of only a modest number of genetic markers [3,26–29].
To create dogs with specific phenotypes, breeders often cross closely related individuals, and this particularly took place during the early formative years of many breeds. One consequence of this strategy has been an increased incidence of breed-specific genetic disease. A growing community has taken advantage of these observations to identify genes for canine maladies that recapitulate human disorders, many of which lack suitable mouse models. Indeed, with few exceptions, dogs experience the same common disorders as humans including cancer, heart disease, neurological disorders and diabetes (reviewed in: [20,21,30]). The underlying disease pathology is often similar to humans, as is the response to treatment and final outcomes (e.g. [31]). One additional consequence of the restrictive breeding programs that produced many modern breeds is the observed excess of recessive diseases, many of which have the potential to significantly advance our understanding of human orphan disorders, a benefit of no other medical model [32].
One of the primary goals of this initiative is therefore to advance dogs as a model genetic species. Dogs were the first domesticated species and the only animal domesticated prior to the advent of agriculture [33]. In order to understand the range of genetic variability in dogs, it is crucial to investigate the entirety of canine evolution and domestication history (Fig. 2), which are tightly linked to that of humans [34–36]. As a result, strong selection on relatively few genes underlies many modern domestic phenotypes and may, in some cases, have led to genetic hitchhiking of deleterious alleles that contribute to disease risk [37]. In order to maximize the power of dogs as a genetic system for the study of human health and biology, and to comprehend the genetic basis for the myriad stages of domestication, it is crucial to ascertain the timing, geographical location and number of wolf populations that were involved during domestication [38,39].
AIMS, SCOPE AND METHODS
Aim 1: define the consequences of domestication on existing dog genomic diversity
Domestication
Numerous aspects of dog domestication are well accepted. There is a clear evidence, for example, that dogs were derived from gray wolves [40] and that no other canine species were involved during the initial phases of domestication [40–42]. In addition, the wolf population(s) that were involved in the early phases of domestication are likely extinct [42]. Despite the recent publication of numerous genetic studies of both modern and ancient dogs, there is as yet no firm consensus regarding either the timing [38,43,44] or location(s) [42,45–49] of domestication, the long-term effect of domestication on modern dog genomes [37,50], or even the number of independent wolf populations that were involved in the process [33,39]. The interpretation of the archaeological record has also been contentious. Though canid remains potentially derived from dogs have been excavated from Mesolithic contexts dated to 15 000 years before present in Europe and East Asia, the status of these remains as dogs or wolves is not easily resolved [33,51]. The characterization of genomic sequences from additional ancient and modern dog populations has the potential to resolve these controversies, and establish the early history of human–dog relationships.
Since their domestication, dogs have both adapted to novel environments as they dispersed across continents with their human companions [52–59], and been subjected to human selection for a diversity of occupations and aesthetic preferences. As a result, dogs are now globally dispersed, and they are the most abundant carnivore species in the world with a cosmopolitan distribution. Whole-genome sequencing (WGS) studies of numerous populations, breeds and wild canids are therefore crucial to advance our basic knowledge regarding the genetic outcomes of canine evolution (Fig. 2). Generating genome sequences on this scale will enable us to identify and characterize signals of selection related to domestication and dog breed formation at an unprecedented level of detail. Such studies will include not only the analysis of single nucleotide variants (SNVs), insertion deletions (indels) and copy number variants (CNVs), but also large structural variants, which are known for their key roles in aiding our understanding of the evolution of humans [28,60].
Admixture
Following domestication, dogs have traveled alongside humans across continents, often hybridizing with local wild canids. As a result, patterns of recent and ancient admixture among wild and domestic canid species are complex [38,42,60–63].
Dog10K will take advantage of this introgression history to explore the nature of selection, specifically in admixed populations. There are several examples of adaptive alleles in canids that have entered the population through admixture with other canid populations [54,57,62–65]. We will study adaptive introgression more generally and search for genomic regions enriched for ancestry from other populations. Furthermore, we will search for regions depleted of admixed ancestry. Such loci are less tolerant to admixture, and may contain genes that are important for speciation and domestication [38].
Selection
Using Dog10K data, we will expand maps of positive selection across the dog genome. More specifically, we will identify genes that differentiate modern dog and wolf populations [66]. Through integration of the sequence data derived from radiocarbon-dated individuals, we will use modeling techniques (e.g. [67]) to assess the strength and timing of selection over at least the last 15 000 years. Additionally, we will identify positive selection on genes within dog breeds through comparisons of haplotypes and linked variation among different breeds. The Dog10K data will also serve to tie signatures of selection with specific traits, local adaptation processes in wild canid populations, and provide a format for investigating the potential role of polygenic selection in canines, all of which are understudied.
The Dog10K effort to provide deep and comprehensive genome sequence data will further enable an improved inference of evolutionary rates in canids, a parameter that has been challenging to estimate [38,42,43]. This effort will reduce the confidence intervals surrounding inferred divergence times, effective population sizes and other demographic parameters. We will estimate mutation rates for a diverse array of molecular features, including SNVs, CNVs and mobile element insertions, and test whether mutational events cluster along the genome. These efforts will improve our understanding of the mutational process in dogs and wolves, and how mutations are affected by sequence composition, tandem repeats, CpG sites, chromatin accessibility and the unique nature of canine recombination hotspots [68]. To make these inferences, we will leverage patterns of linkage disequilibrium (LD) along with existing pedigree-based genetic maps. Lastly, the Dog10K data will provide a comprehensive catalog of rates of gene gain/loss in distinct canid populations and dog breeds relative to the canid phylogeny.
Aim 2: dissect modern dog breed structure and morphological diversity
Modern breed structure
Having been subjected to centuries of strong human-mediated selection, dogs have evolved an extraordinary level of morphological and behavioral diversity (Fig. 1). By studying breeds, we can decipher the genetic basis of phenotype diversity and the consequences for numerous diseases. Our research will expand upon previous studies that have defined genes and/or specific variants for body size, skull shape, leg length, fur texture and pattern, and other traits (reviewed in [30]). An understanding of modern breed structure is critical when studying any locus in dogs, particularly those associated with disease, as closely related breeds are likely to share common susceptibility alleles due to shared, extensive tracks of homozygosity [63]. The closed structure of breeds yields disease phenotypes where few susceptibility alleles are likely to be causative, including complex diseases like cancer (reviewed in: [20,22,69,70]).
Each dog breed possesses a dense population structure and recognizable patterns of haplotype inheritance [3,63,71,72], which can both help and hinder attempts to map causal genetic variants in dogs. During breed development, where individual dogs were selectively bred to propagate desirable traits, the resultant population bottleneck led to large chromosomal regions that were concentrated within the new smaller populations (Fig. 3) [3,50,61]. This process has produced LD blocks that are 50-fold longer than those observed in human populations [3,73]. Initial measurements predicted that within-breed LD blocks could reach 2–5 megabases, but it is now established that LD blocks range from tens of kilobases across breeds to many megabases within breeds [74]. This substantial increase in LD block size permits the use of fewer genetic markers to capture patterns of selection and divergence than would be possible in species (such as humans) with smaller average LD blocks. The most recently developed canine array has approximately 650 000 single nucleotide polymorphisms (SNPs) (Affymetrix Infinium HD Ultra), which is a significant increase over the Illumina 170 000 SNP chip.
While canine genomics has evolved significantly with respect to family and association studies, the identification of functional variants remains difficult. Not unexpectedly, recessively acting alleles, gene–gene interactions and CNVs all contribute to complex phenotypes in dogs, and are not easily found through either segregation or association studies [75]. Single-breed studies often contain large blocks of LD, making the transition from locus to gene and, finally, to causative variant especially challenging (reviewed in [20–22]). In addition, statistical associations that include only a single breed may result in false-positive signals due to breed-specific population structure. The solution is to employ multiple breeds in a well-balanced association study, which can negate the impact of a single breed's demographic history and allow for greater cohort sizes [15,76]. Importantly, the use of SNPs or other markers derived from sequencing large numbers of diverse breeds can permit the identification of not just loci, but also genes and associative variants [4]. Greater numbers of individual dogs and breeds improve the resolution of association studies, since each additional breed reduces the inevitable skewing of results due to inherent phenotypic or phylogenetic imbalances between cases and controls.
To take the greatest advantage of the dog model, it is critical to determine the relationships between dog breeds and the traits they share. Early studies of breed structure and relatedness failed to explain the mechanisms through which distinct breeds have developed, such as geographical separation and immigration, the role of hybridization and the timeline of breed formation. Building on previous studies [61,77], a bootstrapped cladogram using a neighbor-joining tree algorithm that defined 23 supported multi-breed clades representing geographical and occupational groupings was recently developed (Fig. 4) [63]. Identical-by-decent haplotype sharing was calculated using 1359 dogs from 161 breeds to assess introgression during breed development [63]. Individual instances of haplotype sharing between breeds with diverse phylogenetic backgrounds suggest that inter-clade crosses were carried out intentionally and often for specific reasons, such as the introduction of a new trait. By establishing a linear relationship between the total length of haplotype sharing and the age of known introgression events, undocumented crosses or divisions from older breeds that occurred within the last 200 years can be accurately estimated. These studies provide guidance regarding foundational breeds to select for the Dog10K sequencing efforts, particularly as it pertains to disease gene mapping.
Dog10K design recommendations also consider the population structure for each breed that will be sequenced. Dreger et al. [71] used a panel of 170 000 SNPs to genotype 80 breeds in order to define single-breed patterns of homozygosity, shared homozygosity over 10 same-breed dogs and the rate at which any one dog will reduce the calculated shared homozygosity for its breed (Fig. 5). These efforts revealed that the breed-specific rate of homozygosity decay ranges over 3-fold, which can be used to estimate the number of dogs required to theoretically represent the entire amount of genetic variation within that breed. For example, a breed with a homozygosity decay value of 0.2 would require 18 dogs to reflect 99% of the within-breed variation. In general, the use of three unrelated dogs captures ∼85% of the variation in a registered breed [71]. These results provide guidance on the number of dogs of a given breed undergoing sequencing through Dog10K. We will also collect metadata on each dog whenever possible, including birth dates, registration numbers, standard morphological measures, disease histories, pedigree data and other data (Table 1).
Table 1.
Phase 1 | |
Registered breeds (300 breeds, five per breed) | 1500 |
Geographic distribution breeds (10 breeds, 50 per breed) | 500 |
Most popular breeds (25 breeds, 50 per breed) | 1250 |
Niche populations (20 populations, five per population) | 100 |
Mixed breeds (30 populations, 10 per population) | 300 |
Village dogs (100 populations, five per population) | 500 |
Phase 2 | |
Ancient breeds (10 breeds, 100 dogs per breed) | 1000 |
Phase 3 | |
Dog pedigrees (15 pedigrees, six per pedigree) | 90 |
Wolf pedigrees (six lineages, six per lineage) | 36 |
Phase 4 | |
Non-pedigreed wolves (20 populations, five per population) | 100 |
Coyotes (three populations, five per population) | 15 |
Golden jackals (three populations, five per population) | 15 |
Wild Canids (six populations, five canids per population) | 30 |
De novo assemblies (14 individuals) | 14 |
Total | 5450 |
Morphological phenotypes
The first genome-wide SNP array-based association studies of morphology were published in 2010, linking over 60 traits to large genomic regions [26,27]. These results have been expanded several times [28,29], most recently using over 20 million SNPs and small indels culled from WGS of 722 canids [4]. Fine-scale studies have also identified specific
genes and/or variants associated with many morphological features, often by relying on strict breed standard measures, which have been shown to accurately reflect breed variance [18,78]. Individual measures are therefore not necessary to map breed-associated morphological traits.
Among the most-studied traits in dog breeds is body size, which is a composite of many features (Fig. 6). Large- and small-sized dog breeds differ in weight and some measures of size by nearly 40× (e.g. Great Dane and Chihuahua), a claim no other land mammal can make. A number of genes have been found to be major contributors to breed standard body weight (BSW), including GHR, HMGA2, SMAD2, IGF1, IGF1R and STC2 [79–81], which account for 46%–52% of overall variance across modest and small breeds <41 kg (90 pounds). Variants in three genes on the X chromosome contribute to breeds with a BSW >41 kg (90 pounds) [5]. Among these are IRS4 and IGSF1, both of which are involved in the thyroid hormone pathway and are associated with IGF1R signaling, obesity and body mass index in humans [82–84]. Also important in large body mass are variants in the ACSL4 gene, which is associated with insulin resistance in humans. A similar role in dogs likely explains why the derived ACSL4 variant is homozygous only in large ‘bulky’ dogs (e.g. English Mastiff and St. Bernard), while the ancestral allele is homozygous in large lean breeds (e.g. Irish Wolfhound and Greyhound). Overall, just 14 genes account for >90% of BSW in purebred dogs, with IGF1 and LCORL being the largest contributors [4]. These studies highlight a recurring theme in dog genetics whereby a small number of genes of large effect control complex phenotypes, as opposed to many genes of small effect exerting similar levels of control, which is typical for human traits (e.g. >180 human body size loci [85,86]).
While latter studies of BSW in dogs were done using WGS data, initial studies were conducted using SNP arrays containing about 170 000 markers. Array-based studies often have the limitation of querying SNPs at low density across the genome with loci unrepresented by SNPs on the array. Furthermore, only a small fraction of the total SNP count on arrays is informative for every mapping study. Some investigators still rely on candidate gene analysis for such studies. One successful example is that of a 14 bp deletion in the proopiomelanocortin (POMC) gene, which has been shown to be important in obesity phenotypes in Labradors and Flat Coated Retrievers [87–89]. At least three other genes are associated with obesity in dogs [90], but it remains unknown how these influence BSW measures. These studies are valuable since they demonstrate that at least some of these genes (e.g. G protein coupled receptor 120) are associated with obesity in humans, and they show an ongoing role for candidate gene studies in dogs that can draw links to human health.
Aim 3: explore the recurring theme of shared diseases between dogs and humans
Multiple studies have established the dog as a viable model for studies of disease susceptibility, progression, treatment response and outcomes (reviewed in [20,22,30,31,91]). Dog10K will expand that applicability by developing an exhaustive catalog of genetic variation to enable the generation of an accurate imputation panel for dog genome-wide association studies (GWAS), in much the same way that the 1000 Genomes Project has improved the study of low-frequency variants in human association studies. In addition, recent SNP array studies have indicated that significant allele frequency differences exist between sample collections of the same breeds from distinct geographical locations, calling for sample sequencing from different geographical regions to create the most accurate catalog possible.
Dog10K’s emphasis on providing WGS from aged, healthy individuals will also benefit disease studies. These data are often missing from current studies as a result of cost constraints, which have led investigators to preferentially sequence diseased individuals and rely on publicly available sequence databases to determine breed allele frequencies. However, the populations sampled in public databases are not representative of all breeds and data are lacking even for breeds where disease frequency is high. Dog10K will address this information gap by sequencing aged healthy individuals representing breeds whose relative risk for a heritable disease is high, such as a particular type of cancer [92–94], e.g. Bernese Mountain Dogs and histiocytic sarcoma [95–97], Scottish Terriers and bladder cancer [98], Golden Retrievers and lymphoma [99], and Irish Wolfhounds and osteosarcoma [100]. Equally important are breeds with a predisposition to cardiac issues, neurological, neuromuscular and autoimmune disorders, deafness and ocular disease, etc., and breeds that experience multiple distinct diseases at a high frequency. Reference sequence data on all breeds will provide the scientific community with an ability to perform matched WGS-based case-control analysis, either directly or through imputation. This aim is important, as it is the link between canine and human health that has generated the most interest on the part of the biomedical community.
WGS data can also be used to determine the genes that contain the lowest levels of polymorphism across canids. Such genes are likely to be under strong purifying selection in canids. We can intersect the resultant list with a similar database derived from humans. These comparisons will reveal the extent to which purifying selection has shifted across the mammalian phylogeny. The same dataset will be used to screen mutations assigned ‘variant of unknown significance’ status in human disease association studies. Variants found in the Dog10K catalog in breeds that are not at increased risk for the disease in question, or that are found frequently across breeds, are unlikely to be disease-associated, whereas those that are present in at-risk breeds not only provide useful data for human genetic studies, but also suggest an animal model for the development of therapeutics. The challenge will be in setting thresholds, as both neutral and disease alleles of varying types are nearing fixation in a variety of breeds. Despite that difficulty, Dog10K may benefit the human medical community by providing foundational data and resources for diseases of interest, for both human and veterinary disorders.
Behavior
Behavioral traits have come under recent scrutiny in dogs. It has become a recurring theme in dog genetics that studying dog breed phenotypes can reveal genes that, when heavily altered, cause human anomalous behaviors [101]. For instance, vonHoldt and colleagues recently showed that structural variants in genes associated with human Williams–Beuren syndrome, particularly GTF2I and GTF2IRD1, may contribute to behavioral differences between dogs and wolves [17]. In humans, a hemizygous deletion of this region causes delayed development, cognitive impairment, behavioral abnormalities and, most importantly, hypersociability. Increased levels of hypersociability may have been important during the domestication process that led to the emergence of dogs as companion animals. Indeed, a key phase in domestication appears to have been changes in social behavior and the corresponding genes (e.g. oxytocin receptor genes [102] and neural-related genes [103–105]).
The mapping of anomalous behavior will be a further goal of Dog10K and may be one situation where ‘affected’ dogs can be included without compromising other goals, as the resulting sequence is useful for mapping not only behavioral anomalies but also all other studies of demography, Mendelian disease and morphology. Obvious breeds for inclusion are those with obsessive compulsive disorders (OCD) [106] including the Bull Terrier, which is well known for its compulsive tail chasing [107,108], and Doberman Pinschers with their blanket- and flank-sucking behavior [109]. Genetic analyses suggest that the CDH2, CTNNA2, ATXN1, and PGCP genes are involved in OCD [110] and have led to the detection of four genes in related pathways in humans [111]. The first three genes mentioned above are themselves excellent candidates for the identification of additional human OCD genes, since they play a role in both brain development and synaptic plasticity. Further exploration of canine OCD by Dog10K may prove crucial for future studies that address human and animal mental health.
Since Dog10K will focus on mapping breed-specific behaviors, we face challenges in accurate phenotypic dissection of complex behavioral traits, especially those that are breed-related [112]. Incorporating experts in behavioral assessment and data storage will be key in disentangling the genetics of breed behaviors, and understanding how identical genes and pathways function in humans. Dog10K will therefore employ a behavioral scientist to collect and quantitate data. Finally, we are mindful that epigenetic variation also plays a highly relevant and underappreciated role in behavior, and focused studies will be needed to tackle related questions [113,114].
Sampling scheme
The Dog10K sampling scheme encompasses multiple goals and numerous populations, ranging from the most strictly controlled breeds to lineages with loose associations to human settlements [51]. At the most intensive end of the spectrum of artificial selection are registered breed dogs. These have been developed through closed breeding lines and strong selection for appearance and function, a process that began during the Victorian era [115], have well-documented long-term pedigrees and clearly defined aesthetic breed standards by which individuals are judged [23]. Registered breeds capture most canine morphological variation and are key for identifying genotype–phenotype relationships.
A second sampling focus is on niche or non-breed populations of dogs that exist in communities throughout the world, each with a unique history that often mimics the settlement of humans in the region. These are in contrast to traditional or purebred breed dogs, and have been primarily and selectively bred to have specific occupations (e.g. herding, pointing, guarding and retrieving) that often constrain morphological or physical traits (e.g. long legs, particular coat color and muzzle shape). Many such breeds are often supposed to possess origins in ‘antiquity’ and were foundational for developing the modern breeds, although it is well established that many so-called ‘ancient’ breeds are in fact modern constructions with established histories [51,63].
The majority of the global population of dogs are not selectively bred, but nevertheless live in some degree of association with humans (e.g. guarding or companion versus scavengers on the margin of human society) [49]. Referred to as free-breeding dogs, the reproductive success and survival of these indigenous regional populations of dogs is not strictly determined by humans. We distinguished between two major types of free-breeding dogs: village dogs that live in rural regions and are often, but not always, relatively unaffected by admixture from other regions, and free-breeding dogs that inhabit major metropolitan areas termed street dogs [116]. Further, we define an additional category of feral dogs, which are those that are largely under natural selection and have little human interaction. While the ancestors of feral dogs were domestic, these populations exist today largely as wild animals (e.g. Australian Dingoes).
Our studies on the genomic impact of canine domestication will rely on extensive sampling across wild canid lineages (Fig. 2). There is substantial phenotypic variation across wolf populations driven by local adaptation and, in some cases, introgression with dogs (e.g. coat color) [57,64]. The inclusion of diverse wild canids, particularly gray wolves, is crucial for understanding both their demographic history and trait evolution [42,103]. Dog10K will utilize a phylogenetic outgroup species (e.g. cat [117]) to improve the detection of accelerated evolution within branches of Canis and identify derived alleles, a crucial consideration for many methods of inference in evolutionary genomics. An appropriate evolutionary outgroup would share little, if any, segregating variation with dogs.
Sequencing effort
Sequencing will be carried out in four phases. The selection of the first 5450 canids to be sequenced is described below (Table 2). Overall, we will generate moderate-coverage genomes (≥20-fold average) to curate the breadth of contemporary genomic variation in dogs and to investigate how this variation leads to phenotypic variation.
Table 2.
Documentation of approval by an ethics committee (if applicable) |
Copy of pedigree certificate for registered individuals |
Date of birth |
Date of sample collection |
Sex |
Country of birth |
Photo(s) to document phenotype |
Laboratory or institute of sample origin |
Sample type (e.g. blood, tissue type, hair) and storage buffer (e.g. ethylenediaminetetraacetic acid) |
Optional: disease phenotypes; standardized X-rays; morphometrics; weight at time of sample collections |
Phase 1
Phase 1 will include the sampling of at least five unrelated dogs across 300 distinct registered breeds that represent the spectrum of modern breed dog populations worldwide (nbreeds1 = 1500). Inclusion of a specific breed will be determined by the demographic relationship to other breeds, the hypothesized strength of artificial selection for derived phenotypes (example: body size in Great Danes, brachycephaly in Pugs and olfaction in Bloodhounds) and disease susceptibility. In cases where individual breeds possess strong population structure, e.g. divergence between populations in US and in other continents, as in the Shar-Pei, we will include at least 10 individuals in order to capture the genetic diversity in each subpopulation. We predict this to be the case for an additional 50 breeds (nbreeds2 = 500). In addition, we will select 25 popular breeds (e.g. German Shepherd Dog, Golden Retriever, Bulldog and Poodle) for which we will thoroughly explore intra-breed variation by sequencing a minimum of 50 dogs in each (nbreeds3 = 1250). We will emphasize individual selection to maximize phenotypic variation (i.e. disease susceptibility and morphology) and minimize relatedness. This has already been done for the Yorkshire Terrier, and is available in the public 722 WGS data described below.
Though most of canine phenotypic variation is found within the registered breed dogs, many specialized traits are uniquely found among isolated indigenous populations. Thus, we will additionally sample five unrelated individuals from 20 niche or non-breed populations (nniche = 100). Further, since 47% of all dogs in the US are categorized as either mixed-breed or designer-breed dogs [118], we will sequence 10 unrelated ‘mixed’ breed individuals from 30 different locations (nmixed = 300). Free-breeding indigenous, village and street dogs possess more genetic variability than both breed and ‘mixed’ dogs, so we will include five unrelated individuals from 100 such dog populations across six continents, with an emphasis on indigenous village dog populations showing geographical signatures consistent with historical diversity instead of modern admixture (nvillage = 500). We have already sequenced a total of 298 village dogs and the total for phase 1 will include 4150 samples.
Phase 2
Phase 2 (Table 2) will generate data from an additional 1000 samples, focusing on non-breed dogs from geographical regions that have not been systematically sampled. These include: East/Central Asia, the High Arctic, the Middle East, Eastern Europe, Africa and Island Southeast Asia. We will generate ≥20× genomes of 100 unrelated individuals of 10 specified breeds, which are yet to be determined (total nancient = 1000), and this number will likely increase.
Phase 3
In phase 3, we will sequence extended pedigrees of canids (including at least three generations) to ≥50× per individual. We estimate that this will encompass six dogs each for 15 pedigrees (npedigree_dog = 90). In addition to dogs, we will sequence gray wolves from Yellowstone with genetically verified pedigrees [119], including six wolves per founder lineage, with approximately six lineages available (npedigree_wolf = 36) for an initial total of 126 wolves. These are distinct from those mentioned above. This study design will reduce false-positive mutational events due to sequencing errors. Further, it will limit the false-negative rate by ensuring that we can reliably call heterozygous genotypes. Because pedigree data are so extensive on these individuals, the data will also allow the estimation of fine-scale recombination and mutation rates across the genome in different canid populations.
Phase 4
In phase 4, we will capture the genomic variation among wild relatives of dogs and their evolutionary relationships. We will sequence ≥20× genomes of five unrelated gray wolves from 20 populations (nwolf = 100), five unrelated coyotes and golden jackals from three populations each (ncoyote = 15 and ngolden_jackal = 15), and five unrelated wild canids including each of the following species: golden wolves, Ethiopian wolf, dhole, black-backed jackal, African wild dog and side-striped jackal (nwild_canid = 30). All will be sequenced to 20× and we expect this phase to expand significantly as the project progresses to include, for instance, gray and red foxes.
Prior to sample collection, all dog owners must sign standard Animal Care and Use Consent forms providing signed permission for the collection of a blood sample, pedigree data and registration number (if available), demographic history (if relevant), owner contact information, consent to re-contact owner, medical history and pedigree data for privately owned dogs as listed (Table 1). While blood samples are desirable, samples from some dogs may only be available using buccal swabs.
The numbers presented here for phases 1 and 2 are in addition to the 722 WGS samples already released and cataloged by members of the community, producing over 91 million variants (https://research.nhgri.nih.gov/dog_genome/data_release/index.shtml [4]; see also ncbi.nlm.nih.gov/sra). All of the sequence data produced and made public by the general canine genome community will be integrated into the total dataset. The initial 5450 samples selected here (Table 2) represent about one-half of the proposed 10 000 samples. Subsequent expansion will focus on rare breeds, comprehensive village dog sampling, disease phenotypes, robust within-breed sampling and geographical representation to highlight subtle phenotypic variation. Members of the scientific or lay community can nominate breeds based on these criteria.
Dog10K also plans to establish high-quality reference genomes that will be de novo-assembled using a variety of technologies including Pacific Biosciences long-reads (100×), Bacterial Artificial Chromosome end sequencing, optical-mapping (Bionano Saphyr), phased haplotypes (10X genomics 60×) and chromosome conformation (Hi-C), etc. We will begin with a de novo assembly of the original Boxer from which the 2005 sequencing study was carried out [3]. The production of this assembly, combined with a realignment of currently available genomes, will improve the reference dog genome by utilizing gap-filling and sequence error correction to facilitate the identification of critical regions constituting genic and regulatory elements. Furthermore, we will generate a single de novo-assembled genome for each of the following: outgroup species, gray wolf, coyote, golden wolf, African village dog, South Chinese village dog, Siberian or Alaskan dog, Middle East, Indian village and a Golden Retriever. We will begin with this initial set, but expect that more will be added. Ongoing de novo assembly efforts in the community also include the Labrador Retriever, German Shepherd Dog, Great Dane, Rottweiler, Collie and others. For all phases, the final sample numbers will fluctuate depending on sample availability and the advancing priorities of the community.
Online resources and data sharing
Raw sequence data along with sample metadata, including sample name, breed, sampling location and any disease phenotypes, will be deposited in the Genome Sequence Archive (GSA) (http://gsa.big.ac.cn/), as well the International Nucleotide Sequence Database Collaboration (INSDC, a collaboration that includes DNA Data Bank of Japan (DDBJ), European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) and National Center for Biotechnology Information (NCBI)), which will enable automatic synchronization across the three repositories. Variant call files, as well information on coverage and genome quality control, will be made available through the project website and individually submitted to the Ensembl Variation database. Revised genome assemblies and associated annotations will be deposited in the GSA and INSDC databases, along with required files for exploring the data through existing genome browsers. The web sites that will be maintained as part of this project include: the Dog10K website (http://www.dog10kgenomes.org), the iDog database [120] (http://bigd.big.ac.cn/idog), the DogSD database [121] (http://dogsd.big.ac.cn/), https://research.nhgri.nih.gov/dog_genome/data_release/index.shtml, https://ncbi.nlm.nih.gov/sra, https://midgard.nhgri.nih.gov/dog_genome/index.shtml and the European Consortium; NCBI and European Nucleotide Archive (ENA) for FASTQ files.
CONCLUSION
The development of the Dog10K resource will significantly advance studies of human-relevant disease in domestic dogs. Not only will dense sequencing of key breeds enhance the ability to find susceptibility loci, genes and variants, but the ensuing results are more likely to mimic those of humans than those observed in more common model systems such as mice, where disease states are often induced rather than naturally occurring. The ultimate legacy of the dog as a biomedical model for human disease lies in its translational potential.
The evolutionary history of dogs is controversial and requires complete genomes to provide sufficient resolving power to test hypotheses related to demographic history, admixture and selection during dog domestication. Furthermore, a population genomics approach to understanding the constraints on variation and the effects of deleterious mutations is in its infancy, and will benefit from wider population sampling. Full genomes from diverse populations will provide new data to test ideas about the distribution of the effects of selection, which are integral to evolutionary models of demography and adaptation. Finally, the landscape of recombination is a critical issue in canines who lack a functional version of the nearly universal protein PRDM9, which is associated with recombination hotspots. Therefore, the study of canine genomes will allow unique insights into the alternative mechanisms by which recombination is maintained.
Acknowledgements
Dog10K gratefully acknowledges our many colleagues and trainees who have provided samples including: Guo-Dong Wang (Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China), Hannes Lohi (University of Helsinki, Helsinki, Finland), Tosso Leeb (University of Bern, Bern, Switzerland), Elaine A. Ostrander (National Institutes of Health, Bethesda, MD, USA), Christophe Hitte (University of Rennes, Rennes, France), Peter Savolainen (Royal Institute of Technology, Stockholm, Sweden), Greger Larson (University of Oxford, Oxford, UK) and Bridgett vonHoldt (Princeton University, Princeton, NJ, USA). We also thank those who have provided expertise or participated in sequencing efforts that have contributed to Dog10K. We thank many members of our laboratories who provided figures or data for figures. Finally, we thank dozens of breed clubs, private collectors, health club representatives, museums and curators, government and non-government entities, and scientists from many disciplines with a passion for canines for their invaluable contributions regarding samples, information and historical data regarding their breeds.
Contributor Information
Dog10K Consortium:
Catherine André, Erik Axelsson, Adam Boyko, Brian W Davis, Oliver Forman, Laurent Frantz, Christophe Hitte, Vidhya Jagannathan, Elinor Karlsson, Jeffrey Kidd, Greger Larson, Tosso Leeb, Kerstin Lindblad-Toh, Hannes Lohi, Kirk E Lohmueller, Tomas Marques-Bonet, Catherine Mellersh, *Elaine A Ostrander, Peter Savolainen, Robert Schnabel, Bridgett M vonHoldt, *Guo-Dong Wang, *Robert K Wayne, Ziheng Yang, Weiwei Zhai, and *Ya-Ping Zhang
FUNDING
This work was supported by a number of entities including grants from the Breakthrough Project of the Strategic Priority Program of the Chinese Academy of Sciences (CAS) (XDB13000000 to G-DW); the National Natural Science Foundation of China (91731304, 31621062); the Intramural Program of the National Human Genome Research Institute of the National Institutes of Health, USA (to EAO); a CAS international collaborating grant proposal (152453KYSB20150002 to G-DW); the Morris Animal Foundation (to RKW); the Swedish Research Council (to KLT); the European Research Council (ERC-2013-StG-337574-UNDEAD to GL and LF); the Natural Environmental Research Council (NE/K005243/1 and NE/K003259/1 to GL and LF); grant ‘Cani-DNA_CRB’ (ANR-11-INBS-0003) for the French Tissue Bank (to CA and CH); and a Young Innovation Promotion Association Award (to G-DW).
Alphabetical listing of Dog10K Consortium members (* indicates primary organizers)
Catherine André: University of Rennes, CNRS, IGDR, UMR6290 - F-3500 Rennes, France, christophe.hitte@univ-rennes1.fr
Erik Axelsson: Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala biomedicinska centrum BMC, Husarg. 3 75237 Uppsala, Sweden, erik.axelsson@imbim.uu.se
Adam Boyko: Department of Biomedical Sciences, Cornell University College of Veterinary Medicine, Veterinary Research Tower T8016A, Box 17, Ithaca, NY 14853, USA, boyko@cornell.edu
Brian W. Davis: College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, TX, USA, bdavis@cvm.tamu.edu
Oliver Forman: Genetics Research Unit, Mars Inc., Cambridge, UK.
Laurent Frantz: School of Biological and Chemical Sciences, Queen Mary University of London, London, UK, Laurent.frantz@qmul.ac.uk
Christophe Hitte: University of Rennes, CNRS, IGDR - UMR6290, F-3500 Rennes, France, hitte@univ-rennes1.fr
Vidhya Jagannathan: Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, Switzerland, vidhya.jagannathan.@vetsuisse.unibe.ch
Elinor Karlsson: Bioinformatics and Integrative Biology at University of Massachusetts Medical School, and Vertebrate Genomics and Genetics Group at the Broad Institute of the Massachusetts Institute of Technology and Harvard, Boston, MA, USA, elinor.broadinstitute.org.
Jeffrey Kidd: Departments of Human Genetics and Computational Medicine and Bioinformatics, University of Michigan Medical School, 3726A Med Sci II, Ann Arbor, MI 48109, USA, jmkidd@umich.edu
Greger Larson: Research Laboratory for Archaeology and the History of Art, Dysons Perrins Building, South Parks Road, University of Oxford, Oxford, OX1 3QY, UK, greger.larson@arch.ox.ac.uk
Tosso Leeb: Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, Switzerland, tosso.leeb@vetsuisse.unibe.ch
Kerstin Lindblad-Toh: Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden and Broad Institute of the Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA, kersli@broadinstitute.org.
Hannes Lohi: Research Programs Unit, Molecular Neurology and Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Institute of Genetics, PO Box 63, Helsinki, Finland.
Kirk E. Lohmueller: Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA 90095, USA klohmueller@ucla.edu
Tomas Marques-Bonet: Institut Biologia Evolutiva (Universitat Pompeu Fabra/CSIC) ICREA, Barcelona, Spain, tomas.marques.@upf.edu
Catherine Mellersh: Animal Health Trust, Lanwades Park, Kentford, Newmarket, Suffolk, CB8 7UU, UK, cathryn.mellersh@aht.org.uk
*Elaine A. Ostrander: Room 5351, Building 50, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Bethesda, MD 20892, USA, eostrand@mail.nih.gov
Peter Savolainen: School of Biotechnology, Science for Life Laboratory, Royal Institute of Technology in Stockholm, PO Box 1031, SE-171 21, Solna, Sweden, savo@biotech.kth.se
Robert Schnabel: Division of Animal Sciences, Informatics Institute, University of Missouri-Columbia, Columbia, MO 65211, USA
Bridgett M. vonHoldt: Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA, vonHoldt@princeton.edu
*Guo-Dong Wang: State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China, wanggd@mail.kiz.ac.cn
*Robert K. Wayne: Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA 90095, USA, rwayne@ucla.edu
Ziheng Yang, Department of Genetics, University College London, London, WC1E 6BT, UK, z.yang@ucl.ad.uk
Weiwei Zhai: Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, P.R. China, wwzhai@ntu.edu.sg
*Ya-Ping Zhang: State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China, zhangyp@mail.kiz.ac.cn
Conflict of interest statement . None declared.
REFERENCES
- 1. Vila C, Maldonado JE, Wayne RK. Phylogenetic relationships, evolution, and genetic diversity of the domestic dog. J Hered 1999; 90: 71–7. [DOI] [PubMed] [Google Scholar]
- 2. Wayne RK, Ostrander EA. Origin, genetic diversity, and genome structure of the domestic dog. Bioessays 1999; 21: 247–57. [DOI] [PubMed] [Google Scholar]
- 3. Lindblad-Toh K, Wade CM, Mikkelsen TSet al.. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 2005; 438: 803–19. [DOI] [PubMed] [Google Scholar]
- 4. Plassais J, Kim J, Davis BWet al.. Whole genome sequencing of canids reveals genomic regions under selection and variants influencing morphology. Nat Commun 2019; 10: 1489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Plassais J, Rimbault M, Williams FJet al.. Analysis of large versus small dogs reveals three genes on the canine X chromosome associated with body weight, muscling and back fat thickness. PLoS Genet 2017; 13: e1006661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Mansour TA, Lucot K, Konopelski SEet al.. Whole genome variant association across 100 dogs identifies a frame shift mutation in DISHEVELLED 2 which contributes to Robinow-like syndrome in Bulldogs and related screw tail dog breeds. PLoS Genet 2018; 14: e1007850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Drögemüller C, Karlsson EK, Hytönen MKet al.. A mutation in hairless dogs implicates FOXI3 in ectodermal development. Science 2008; 321: 1462. [DOI] [PubMed] [Google Scholar]
- 8. Cadieu E, Neff MW, Quignon Pet al.. Coat variation in the domestic dog is governed by variants in three genes. Science 2009; 326: 150–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Bannasch D, Young A, Myers Jet al.. Localization of canine brachycephaly using an across breed mapping approach. PLoS One 2010; 5: e9632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Marchant TW, Johnson EJ, McTeir Let al.. Canine brachycephaly is associated with a retrotransposon-mediated missplicing of SMOC2. Curr Biol 2017; 27: 1573–84. e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Schoenebeck JJ, Hutchinson SA, Byers Aet al.. Variation of BMP3 contributes to dog breed skull diversity. PLoS Genet 2012; 8: e1002849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Berryere TG, Kerns JA, Barsh GSet al.. Association of an agouti allele with fawn or sable coat color in domestic dogs. Mamm Genome 2005; 16: 262–72. [DOI] [PubMed] [Google Scholar]
- 13. Candille SI, Kaelin CB, Cattanach BMet al.. A -defensin mutation causes black coat color in domestic dogs. Science 2007; 318: 1418–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Drögemüller C, Philipp U, Haase Bet al.. A noncoding melanophilin gene (MLPH) SNP at the splice donor of exon 1 represents a candidate causal mutation for coat color dilution in dogs. J Hered 2007; 98: 468–73. [DOI] [PubMed] [Google Scholar]
- 15. Karlsson EK, Baranowska I, Wade CMet al.. Efficient mapping of mendelian traits in dogs through genome-wide association. Nat Genet 2007; 39: 1321–8. [DOI] [PubMed] [Google Scholar]
- 16. Kim J, Williams FJ, Dreger DLet al.. Genetic selection of athletic success in sport-hunting dogs. Proc Natl Acad Sci USA 2018; 115: E7212–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. vonHoldt BM, Shuldiner E, Koch IJet al.. Structural variants in genes associated with human Williams-Beuren syndrome underlie stereotypical hypersociability in domestic dogs. Sci Adv 2017; 3: e1700398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Jones P, Chase K, Martin Aet al.. Single-nucleotide-polymorphism-based association mapping of dog stereotypes. Genetics 2008; 179: 1033–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Våge J, Bønsdorff TB, Arnet Eet al.. Differential gene expression in brain tissues of aggressive and non-aggressive dogs. BMC Vet Res 2010; 6: 34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Karlsson EK, Lindblad-Toh K. Leader of the pack: gene mapping in dogs and other model organisms. Nat Rev Genet 2008; 9: 713–25. [DOI] [PubMed] [Google Scholar]
- 21. Boyko AR. The domestic dog: man's best friend in the genomic era. Genome Biol 2011; 12: 216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Davis BW, Ostrander EA. Domestic dogs and cancer research: a breed-based genomics approach. ILAR J 2014; 55: 59–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Club AK. The New Complete Dog Book. Mount Joy: Fox Chapel Publishing, 2017. [Google Scholar]
- 24. Rogers CA, Brace AH. The International Encyclopedia of Dogs. New York: Howell Book House, 1995. [Google Scholar]
- 25. Wilcox B, Walkowicz C. Atlas of Dog Breeds of the World. Neptune City, NJ: T.F.H. Publications, 1995. [Google Scholar]
- 26. Akey JM, Ruhe AL, Akey DTet al.. Tracking footprints of artificial selection in the dog genome. Proc Natl Acad Sci USA 2010; 107: 1160–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Boyko A, Quignon P, Li Let al.. A simple genetic architecture underlies morphological variation in dogs. PLoS Biol 2010; 8: e1000451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Vaysse A, Ratnakumar A, Derrien Tet al.. Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping. PLoS Genet 2011; 7: e1002316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Hayward JJ, Castelhano MG, Oliveira KCet al.. Complex disease and phenotype mapping in the domestic dog. Nat Commun 2016; 7: 10460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Ostrander EA, Wayne RK, Freedman AHet al.. Demographic history, selection and functional diversity of the canine genome. Nat Rev Genet 2017; 18: 705–20. [DOI] [PubMed] [Google Scholar]
- 31. Schiffman JD, Breen M. Comparative oncology: what dogs and other species can teach us about humans with cancer. Philos Trans R Soc Lond B Biol Sci 2015; 370: 20140231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Sams AJ, Boyko AR. Fine-scale resolution of runs of homozygosity reveal patterns of inbreeding and substantial overlap with recessive disease genotypes in domestic dogs. G3 (Bethesda) 2019; 9: 117–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Perri A, Thalmann O. Paleogenomic Inferences of Dog Domestication. Cham: Springer, 2018. [Google Scholar]
- 34. Ollivier M, Tresset A, Frantz LAFet al.. Dogs accompanied humans during the Neolithic expansion into Europe. Biol Lett 2018; 14: 20180286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Wang GD, Fan RX, Zhai Wet al.. Genetic convergence in the adaptation of dogs and humans to the high-altitude environment of the Tibetan plateau. Genome Biol Evol 2014; 6: 2122–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Axelsson E, Ratnakumar A, Arendt MLet al.. The genomic signature of dog domestication reveals adaptation to a starch-rich diet. Nature 2013; 495: 360–4. [DOI] [PubMed] [Google Scholar]
- 37. Marsden CD, Ortega-Del Vecchyo D, O’Brien DPet al.. Bottlenecks and selective sweeps during domestication have increased deleterious genetic variation in dogs. Proc Natl Acad Sci USA 2016; 113: 152–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Frantz LA, Mullin VE, Pionnier-Capitan Met al.. Genomic and archaeological evidence suggest a dual origin of domestic dogs. Science 2016; 352: 1228–31. [DOI] [PubMed] [Google Scholar]
- 39. Botigue LR, Song S, Scheu Aet al.. Ancient European dog genomes reveal continuity since the Early Neolithic. Nat Commun 2017; 8: 16082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Fan Z, Silva P, Gronau Iet al.. Worldwide patterns of genomic variation and admixture in gray wolves. Genome Res 2016; 26: 163–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Vilà C, Savolainen P, Maldonado JEet al.. Multiple and ancient origins of the domestic dog. Science 1997; 276: 1687–9. [DOI] [PubMed] [Google Scholar]
- 42. Freedman AH, Gronau I, Schweizer RMet al.. Genome sequencing highlights the dynamic early history of dogs. PLoS Genet 2014; 10: e1004016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Larson G, Bradley DG. How much is that in dog years? The advent of canine population genomics. PLoS Genet 2014; 10: e1004093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Skoglund P, Ersmark E, Palkopoulou Eet al.. Ancient wolf genome reveals an early divergence of domestic dog ancestors and admixture into high-latitude breeds. Curr Biol 2015; 25: 1515–9. [DOI] [PubMed] [Google Scholar]
- 45. Thalmann O, Shapiro B, Cui Pet al.. Complete mitochondrial genomes of ancient canids suggest a European origin of domestic dogs. Science 2013; 342: 871–4. [DOI] [PubMed] [Google Scholar]
- 46. Pang JF, Kluetsch C, Zou XJet al.. mtDNA data indicate a single origin for dogs south of Yangtze River, less than 16,300 years ago, from numerous wolves. Mol Biol Evol 2009; 26: 2849–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Wang GD, Zhai W, Yang HCet al.. The genomics of selection in dogs and the parallel evolution between dogs and humans. Nat Commun 2013; 4: 1860. [DOI] [PubMed] [Google Scholar]
- 48. Shannon LM, Boyko RH, Castelhano Met al.. Genetic structure in village dogs reveals a Central Asian domestication origin. Proc Natl Acad Sci USA 2015; 112: 13639–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Wang GD, Zhai W, Yang HCet al.. Out of southern East Asia: the natural history of domestic dogs across the world. Cell Res 2016; 26: 21–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Cruz F, Vila C, Webster MT. The legacy of domestication: accumulation of deleterious mutations in the dog genome. Mol Biol Evol 2008; 25: 2331–6. [DOI] [PubMed] [Google Scholar]
- 51. Larson G, Karlsson EK, Perri Aet al.. Rethinking dog domestication by integrating genetics, archeology, and biogeography. Proc Natl Acad Sci USA 2012; 109: 8878–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Huerta-Sánchez E, Jin X, Asan BZet al.. Altitude adaptation in Tibetans caused by introgression of Denisovan-like DNA. Nature 2014; 512: 194–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Miao B, Wang Z, Li Y. Genomic analysis reveals hypoxia adaptation in the Tibetan mastiff by introgression of the grey wolf from the Tibetan Plateau. Mol Biol Evol 2016; 34: 734–43. [DOI] [PubMed] [Google Scholar]
- 54. Li Y, Wu DD, Boyko ARet al.. Population variation revealed high-altitude adaptation of Tibetan mastiffs. Mol Biol Evol 2014; 31: 1200–5. [DOI] [PubMed] [Google Scholar]
- 55. Caspermeyer J. A tale of dog and man: Tibetan Mastiff found to gain high-altitude adaptation after domestication by interbreeding with the Tibet Gray Wolf. Mol Biol Evol 2017; 34: 775–6. [DOI] [PubMed] [Google Scholar]
- 56. Wu H, Liu YH, Wang GDet al.. Identifying molecular signatures of hypoxia adaptation from sex chromosomes: a case for Tibetan Mastiff based on analyses of X chromosome. Sci Rep 2016; 6: 35004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Schweizer RM, Durvasula A, Smith Jet al.. Natural selection and origin of a melanistic allele in North American gray wolves. Mol Biol Evol 2018; 35: 1190–209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Dreger DL, Davis BW, Cocco Ret al.. Commonalities in development of pure breeds and population isolates revealed in the genome of the Sardinian Fonni's Dog. Genetics 2016; 204: 737–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Ollivier M, Tresset A, Bastian Fet al.. Amy2B copy number variation reveals starch diet adaptations in ancient European dogs. R Soc Open Sci 2016; 3: 160449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Wayne RK, vonHoldt BM. Evolutionary genomics of dog domestication. Mamm Genome 2012; 23: 3–18. [DOI] [PubMed] [Google Scholar]
- 61. vonHoldt BM, Pollinger JP, Lohmueller KEet al.. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication. Nature 2010; 464: 898–902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. vonHoldt BM, Kays R, Pollinger JPet al.. Admixture mapping identifies introgressed genomic regions in North American canids. Mol Ecol 2016; 25: 2443–53. [DOI] [PubMed] [Google Scholar]
- 63. Parker HG, Dreger DL, Rimbault Met al.. Genomic analyses reveal the influence of geographic origin, migration, and hybridization on modern dog breed development. Cell Rep 2017; 19: 697–708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Anderson TM, vonHoldt BM, Candille SIet al.. Molecular and evolutionary history of melanism in North American gray wolves. Science 2009; 323: 1339–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Gou X, Wang Z, Li Net al.. Whole-genome sequencing of six dog breeds from continuous altitudes reveals adaptation to high-altitude hypoxia. Genome Res 2014; 24: 1308–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Wang G-D, Shao X-J, Bai Bet al.. Structural variation during dog domestication: insights from gray wolf and dhole genomes. Natl Sci Rev 2019; 6: 110–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Loog L, Thomas MG, Barnett Ret al.. Inferring allele frequency trajectories from ancient DNA indicates that selection on a chicken gene coincided with changes in medieval husbandry practices. Mol Biol Evol 2017; 34:1981–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Axelsson E, Webster MT, Ratnakumar Aet al.. Death of PRDM9 coincides with stabilization of the recombination landscape in the dog genome. Genome Res 2012; 22: 51–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Khanna C, Gordon I. Catching cancer by the tail: new perspectives on the use of kinase inhibitors. Clin Cancer Res 2009; 15: 3645–7. [DOI] [PubMed] [Google Scholar]
- 70. Merlo DF, Rossi L, Pellegrino Cet al.. Cancer incidence in pet dogs: findings of the Animal Tumor Registry of Genoa, Italy. J Vet Intern Med 2008; 22: 976–84. [DOI] [PubMed] [Google Scholar]
- 71. Dreger DL, Rimbault M, Davis BWet al.. Whole-genome sequence, SNP chips and pedigree structure: building demographic profiles in domestic dog breeds to optimize genetic-trait mapping. Dis Model Mech 2016; 9: 1445–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Lewis TW, Abhayaratne BM, Blott SC. Trends in genetic diversity for all Kennel Club registered pedigree dog breeds. Canine Genet Epidemiol 2015; 2: 13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Sutter NB, Eberle MA, Parker HGet al.. Extensive and breed-specific linkage disequilibrium in Canis familiaris. Genome Res 2004; 14: 2388–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Schlamp F, van der Made J, Stambler Ret al.. Evaluating the performance of selection scans to detect selective sweeps in domestic dogs. Mol Ecol 2016; 25: 342–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Broeckx BJG, Derrien T, Mottier Set al.. An exome sequencing based approach for genome-wide association studies in the dog. Sci Rep 2017; 7: 15680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Ostrander EA, Kruglyak L. Unleashing the canine genome. Genome Res 2000; 10: 1271–4. [DOI] [PubMed] [Google Scholar]
- 77. Parker H, Kukekova AV, Akey DTet al.. Breed relationships facilitate fine-mapping studies: a 7.8-kb deletion cosegregates with collie eye anomaly across multiple dog breeds. Genome Res 2007; 17: 1562–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Sutter NB, Mosher DS, Gray MMet al.. Morphometrics within dog breeds are highly reproducible and dispute Rensch's rule. Mamm Genome 2008; 19: 713–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Rimbault M, Beale HC, Schoenebeck JJet al.. Derived variants at six genes explain nearly half of size reduction in dog breeds. Genome Res 2013; 23: 1985–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80. Sutter NB, Bustamante CD, Chase Ket al.. A single IGF1 allele is a major determinant of small size in dogs. Science 2007; 316: 112–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81. Hoopes BC, Rimbault M, Liebers Det al.. The insulin-like growth factor 1 receptor (IGF1R) contributes to reduced size in dogs. Mamm Genome 2012; 23: 780–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82. Melkersson K, Persson B. Association between body mass index and insulin receptor substrate-4 (IRS-4) gene polymorphisms in patients with schizophrenia. Neuro Endocrinol Lett 2011; 32: 634–40. [PubMed] [Google Scholar]
- 83. Sun Y, Bak B, Schoenmakers Net al.. Loss-of-function mutations in IGSF1 cause an X-linked syndrome of central hypothyroidism and testicular enlargement. Nat Genet 2012; 44: 1375–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84. Asakura Y, Abe K, Muroya Ket al.. Combined growth hormone and thyroid-stimulating hormone deficiency in a Japanese patient with a novel frameshift mutation in IGSF1. Horm Res Paediatr 2015; 84: 349–54. [DOI] [PubMed] [Google Scholar]
- 85. Lango AH, Estrada K, Lettre Get al.. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 2010; 467: 832–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86. N’Diaye A, Chen GK, Palmer CDet al.. Identification, replication, and fine-mapping of loci associated with adult height in individuals of African ancestry. PLoS Genet 2011; 7: e1002298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87. Mankowska M, Krzeminska P, Graczyk Met al.. Confirmation that a deletion in the POMC gene is associated with body weight of Labrador Retriever dogs. Res Vet Sci 2017; 112: 116–8. [DOI] [PubMed] [Google Scholar]
- 88. Davison LJ, Holder A, Catchpole Bet al.. The canine POMC gene, obesity in Labrador Retrievers and susceptibility to diabetes mellitus. J Vet Intern Med 2017; 31: 343–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89. Raffan E, Dennis RJ, O’Donovan CJet al.. A deletion in the canine POMC gene is associated with weight and appetite in obesity-prone Labrador Retriever dogs. Cell Metab 2016; 23: 893–900. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Mankowska M, Stachowiak M, Graczyk Aet al.. Sequence analysis of three canine adipokine genes revealed an association between TNF polymorphisms and obesity in Labrador dogs. Anim Genet 2016; 47: 245–9. [DOI] [PubMed] [Google Scholar]
- 91. van Steenbeek FG, Hytonen MK, Leegwater PAet al.. The canine era: the rise of a biomedical model. Anim Genet 2016; 47: 519–27. [DOI] [PubMed] [Google Scholar]
- 92. Dobson JM. Breed-predispositions to cancer in pedigree dogs. ISRN Vet Sci 2013; 2013: 1–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93. Rankin KS, Starkey M, Lunec Jet al.. Of dogs and men: comparative biology as a tool for the discovery of novel biomarkers and drug development targets in osteosarcoma. Pediatr Blood Cancer 2012; 58: 327–33. [DOI] [PubMed] [Google Scholar]
- 94. Gardner H, Fenger J, London C. Dogs as a model for cancer. Annu Rev Anim Biosci 2016; 4: 199–222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95. Abadie J, Hédan B, Cadieu Eet al.. Epidemiology, pathology, and genetics of histiocytic sarcoma in the Bernese Mountain Dog breed. J Hered 2009; 100: S19–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96. Moore PF, Affolter VK, Vernau W. Canine hemophagocytic histiocytic sarcoma: a proliferative disorder of CD11d+ macrophages. Vet Pathol 2006; 43: 632–45. [DOI] [PubMed] [Google Scholar]
- 97. Erich SA, Constantino-Casas F, Dobson JMet al.. Morphological distinction of histiocytic sarcoma from other tumor types in Bernese Mountain dogs and Flatcoated Retrievers. In Vivo 2018; 32: 7–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98. Sommer BC, Dhawan D, Ratliff TLet al.. Naturally-occurring canine invasive urothelial carcinoma: a model for emerging therapies. Bladder Cancer 2018; 4: 149–59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99. Tonomura N, Elvers I, Thomas Ret al.. Genome-wide association study identifies shared risk loci common to two malignancies in Golden Retrievers. PLoS Genet 2015; 11: e1004922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100. Simpson S, Dunning MD, de Brot Set al.. Comparative review of human and canine osteosarcoma: morphology, epidemiology, prognosis, treatment and genetics. Acta Vet Scand 2017; 59: 71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101. Danek M, Danek J, Araszkiewicz A. Large animal as a potential models of humans mental and behavioral disorders. Psychiatr Pol 2017; 51: 1009–27. [DOI] [PubMed] [Google Scholar]
- 102. Bence M, Marx P, Szantai Eet al.. Lessons from the canine Oxtr gene: populations, variants and functional aspects. Genes Brain Behav 2017; 16: 427–38. [DOI] [PubMed] [Google Scholar]
- 103. Freedman AH, Schweizer RM, Ortega-Del Vecchyo Det al.. Demographically-based evaluation of genomic regions under selection in domestic dogs. PLoS Genet 2016; 12: e1005851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104. Pendleton AL, Shen F, Taravella AMet al.. Comparison of village dog and wolf genomes highlights the role of the neural crest in dog domestication. BMC Biol 2018; 16: 64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105. Kukekova AV, Johnson JL, Xiang Xet al.. Red fox genome assembly identifies genomic regions associated with tame and aggressive behaviours. Nat Ecol Evol 2018; 2: 1479–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106. Grados M, Wilcox HC.. Genetics of obsessive-compulsive disorder: a research update. Expert Rev Neurother 2007; 7: 967–80. [DOI] [PubMed] [Google Scholar]
- 107. Dodman NH, Bronson R, Gliatto J. Tail chasing in a bull terrier. J Am Vet Med Assoc 1993; 202: 758–60. [PubMed] [Google Scholar]
- 108. Moon-Fanelli AA, Dodman NH, Famula TRet al.. Characteristics of compulsive tail chasing and associated risk factors in Bull Terriers. J Am Vet Med Assoc 2011; 238: 883–9. [DOI] [PubMed] [Google Scholar]
- 109. Moon-Fanelli AA, Dodman NH, Cottam N. Blanket and flank sucking in Doberman Pinschers. J Am Vet Med Assoc 2007; 231: 907–12. [DOI] [PubMed] [Google Scholar]
- 110. Tang R, Noh HJ, Wang Det al.. Candidate genes and functional noncoding variants identified in a canine model of obsessive-compulsive disorder. Genome Biol 2014; 15: R25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111. Noh HJ, Tang R, Flannick Jet al.. Integrating evolutionary and regulatory information with a multispecies approach implicates genes and pathways in obsessive-compulsive disorder. Nat Commun 2017; 17: 774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112. Ilska J, Haskell MJ, Blott SCet al.. Genetic characterization of dog personality traits. Genetics 2017; 206: 1101–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113. Janowitz Koch I, Clark MM, Thompson MJet al.. The concerted impact of domestication and transposon insertions on methylation patterns between dogs and grey wolves. Mol Ecol 2016; 25: 1838–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114. Banlaki Z, Cimarelli G, Viranyi Zet al.. DNA methylation patterns of behavior-related gene promoter regions dissect the gray wolf from domestic dog breeds. Mol Genet Genomics 2017; 292: 685–97. [DOI] [PubMed] [Google Scholar]
- 115. Howell P. At Home and Astray: The Domestic dog in Victorian Britain. Charlottesville: University of Virginia Press, 2015. [Google Scholar]
- 116. Boyko AR, Boyko RH, Boyko CMet al.. Complex population structure in African village dogs and its implications for inferring dog domestication history. Proc Natl Acad Sci USA 2009; 106: 13903–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117. Montague M, Li G, Gandolfi Bet al.. Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication. Proc Natl Acad Sci USA 2014; 111: 17230–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118. American Veterinary Medical Association. U.S. Pet Ownership & Demographics Sourcebook. 2012 edition; https://www.avma.org/kb/resources/statistics/pages/market-research-statistics-us-pet-ownership.aspx [Google Scholar]
- 119. VonHoldt BM, Stahler DR, Smith DWet al.. The genealogy and genetic viability of reintroduced Yellowstone grey wolves. Mol Ecol 2008; 17: 252–74. [DOI] [PubMed] [Google Scholar]
- 120. Tang B, Zhou Q, Dong Let al.. iDog: an integrated resource for domestic dogs and wild canids. Nucleic Acids Res 2019; 47: D793–800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121. Bai B, Zhao WM, Tang BXet al.. DoGSD: the dog and wolf genome SNP database. Nucleic Acids Res 2015; 43: D777–83. [DOI] [PMC free article] [PubMed] [Google Scholar]