Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2010 Feb;95(2):105–110. doi: 10.1016/j.ygeno.2009.10.004

Genome-wide end-sequenced BAC resources for the NOD/MrkTac and NOD/ShiLtJ☆☆ mouse genomes

Charles A Steward a,, Sean Humphray a, Bob Plumb a, Matthew C Jones a, Michael A Quail a, Stephen Rice a, Tony Cox a, Rob Davies a, James Bonfield a, Thomas M Keane a, Michael Nefedov c, Pieter J de Jong c, Paul Lyons d, Linda Wicker d, John Todd d, Yoshihide Hayashizaki e, Omid Gulban f, Jayne Danska f, Jen Harrow a, Tim Hubbard a, Jane Rogers b, David J Adams a
PMCID: PMC2824108  PMID: 19909804

Abstract

Non-obese diabetic (NOD) mice spontaneously develop type 1 diabetes (T1D) due to the progressive loss of insulin-secreting β-cells by an autoimmune driven process. NOD mice represent a valuable tool for studying the genetics of T1D and for evaluating therapeutic interventions. Here we describe the development and characterization by end-sequencing of bacterial artificial chromosome (BAC) libraries derived from NOD/MrkTac (DIL NOD) and NOD/ShiLtJ (CHORI-29), two commonly used NOD substrains. The DIL NOD library is composed of 196,032 BACs and the CHORI-29 library is composed of 110,976 BACs. The average depth of genome coverage of the DIL NOD library, estimated from mapping the BAC end-sequences to the reference mouse genome sequence, was 7.1-fold across the autosomes and 6.6-fold across the X chromosome. Clones from this library have an average insert size of 150 kb and map to over 95.6% of the reference mouse genome assembly (NCBIm37), covering 98.8% of Ensembl mouse genes. By the same metric, the CHORI-29 library has an average depth over the autosomes of 5.0-fold and 2.8-fold coverage of the X chromosome, the reduced X chromosome coverage being due to the use of a male donor for this library. Clones from this library have an average insert size of 205 kb and map to 93.9% of the reference mouse genome assembly, covering 95.7% of Ensembl genes. We have identified and validated 191,841 single nucleotide polymorphisms (SNPs) for DIL NOD and 114,380 SNPs for CHORI-29. In total we generated 229,736,133 bp of sequence for the DIL NOD and 121,963,211 bp for the CHORI-29. These BAC libraries represent a powerful resource for functional studies, such as gene targeting in NOD embryonic stem (ES) cell lines, and for sequencing and mapping experiments.

Keywords: Bacterial artificial chromosome, NOD/MrkTac, NOD/ShiLtJ, Mouse genome, Non-obese diabetic (NOD), Type 1 diabetes, T1D, Insulin-dependent diabetes, IDD

Introduction

Type 1 diabetes (T1D) or insulin-dependent diabetes (IDD) is a polygenic disorder, characterized by hyperglycaemia that results from the autoimmune T cell-mediated destruction of the insulin-producing β-cells of the islets of Langerhans of the pancreas [1–3]. T1D is triggered by different environmental and genetic factors and has variable penetrance, suggesting that susceptibility to this syndrome is inherited and polygenic [1–6].

The non-obese diabetic (NOD) mouse is an experimental model for human T1D, developed in Japan by Makino et al. [7]. NOD mice spontaneously develop T1D following an autoimmune mediated process that progressively destroys their insulin-secreting β-cells [2,3]. T1D is typically associated with allelic variants of the Human Leukocyte Antigen (HLA) class II immune response genes within the Major Histocompatability Complex (MHC) [3,8]. Genetic analysis of the NOD mouse has further established that the inheritance of diabetes in these mice is controlled polygenically by at least 27 disease-associated loci, distributed over at least 14 different chromosomes [9]. These loci have been designated Idd loci, for insulin-dependent diabetes [9,10].

Understanding how these loci contribute to the development of T1D in the NOD mouse should inform us of the underlying mechanisms of T1D development in humans. It is important, however, to analyse Idd loci in the context of the genome in which they reside so that the effect of the background in which the phenotype is observed, and the role of epistatic genetic interactions can be assessed. Bacterial artificial chromosomes (BACs) represent a useful resource for sequencing, mapping and functional studies [11]. Here we describe the development and end-sequencing of two BAC libraries for the NOD substrains NOD/MrkTac (DIL NOD) and NOD/ShiLtJ (CHORI-29). While NOD/MrkTac and NOD/ShiLtJ mice are derived from the same founding stock of NOD mice developed by intercrossing Jcl:ICR (Institute for Cancer Research) mice for more than 20 generations [7] they have been maintained as isolated colonies for many generations, and as such are likely to have diverged significantly. Indeed these NOD substrains show subtle differences in the timing and presentation of diabetes, and also in their plasma glucose levels. The availability of BAC libraries for both of these NOD substrains will allow us to study the differences between them and to gain a better understanding of the pathogenesis of T1D. In addition, with the recent advent of embryonic stem (ES) cells derived from NOD mice [12,13] these BAC libraries will form the foundation for targeted manipulation of the NOD mouse genome.

Results

End-sequencing

All clones from the DIL NOD and CHORI-29 BAC libraries were end-sequenced and the sequence read data have been submitted to EMBL. These data are also available from the Ensembl trace repository (http://trace.ensembl.org/) and the NCBI Trace Archive (http://www.ncbi.nlm.nih.gov/Traces/trace.cgi). 332,535 DIL NOD BAC clone end-sequences successfully passed post-sequencing quality processing from a total of 196,032 BACs, generating 229,736,133 bp of sequence. Of these passed reads, 318,065 (95.6%) were aligned to the C57BL/6J reference genome (NCBIm37), 170,029 (53.5%) of which were aligned to a single definitive location (Table 1A). Similarly for the CHORI-29 library, 170,159 BAC clone end-sequences passed post-sequencing quality processing from 110,976 BACs, generating 121,963,211 bp of sequence. Of these passed reads, 159,574 (93.8%) were aligned successfully to the reference C57BL/6J genome with 80,710 (50.6%) reads aligned to a single definitive location on NCBIm37 (Table 1B). The majority of the reads that did not map contained repetitive sequences or were of low quality. Both sets of data can be downloaded from the Sanger FTP site (ftp://ftp.sanger.ac.uk/pub/NODmouse/NOD_BACend_alignments). Mapping was performed using SSAHA2 with default parameters [14]. Using read-pair information we could place 41,468 DIL NOD clones and 18,257 CHORI-29 clones unambiguously on the genome since both read-pairs matched uniquely. However, it was also possible to establish the position of certain clones for which only one end mapped uniquely where the other end of the clone mapped to the genome within 3 standard deviations of the mean insert length of clones from the library and on the opposite sequence strand. This allowed us to place a further 83,796 DIL NOD clones and 43,905 CHORI-29 clones on the genome, resulting in a total of 125,266 uniquely placed DIL NOD clones and 62,162 uniquely placed CHORI-29 clones. The different success rates in the unique positioning of the BAC clones from these libraries to the genome was largely due to differences in the quality of end-sequence data produced from both libraries. CHORI-29 clones have larger genomic inserts and as a consequence were harder to prep and sequence compared to clones from the DIL NOD BAC library. Using the mapping data for both libraries it was possible to estimate the average insert size for the DIL NOD library to be 149,809 bp and 205,413 bp for the CHORI-29 library (Fig. 1A), which correlated with the experimentally derived figures (Fig. 1B).

Table 1A.

BAC clones in library 196,032
Attempted BAC clone end-reads 378,896
BAC clone end-reads sequenced successfully 332,535
Passed BAC clones sequenced on both strands 150,878
BAC clone end-reads with an alignment to the genome 318,065
Aligned BAC clone end-reads with a unique match 170,029
Aligned BAC clone end-reads with multiple matches 148,036
BAC clones positioned using unique end-read pairs 41,468
BAC clones positioned uniquely 125,266

Sequencing and alignment summary for the DIL NOD BAC library.

Table 1B.

BAC clones in library 110,976
Attempted BAC clone end-reads 207,321
BAC clone end-reads sequenced successfully 170,159
Passed BAC clones sequenced on both strands 75,046
BAC clone end-reads with an alignment to the genome 159,574
Aligned BAC clone end-reads with a unique match 80,710
Aligned BAC clone end-reads with multiple matches 78,864
BAC clones positioned using unique end-read pairs 18,257
BAC clones positioned uniquely 62,162

Sequencing and alignment summary for the CHORI-29 BAC library.

Fig. 1.

Fig. 1

DIL NOD clones have an average insert size of 149,809 bp and CHORI-29 clones have an average insert size of 205,413 bp. (A) Frequency distribution of the insert size of the 125,266 DIL NOD clones (green) and the 62,162 CHORI-29 clones (blue). (B) CHEF gel of NotI digested clones showing the difference in size between the NOD libraries. The two different vector bands are illustrated with white arrows. Marker (M) is a DNA marker of 24–300 kb size range.

Physical genome coverage and coverage of Ensembl genes

The average depth of genome coverage of the end-sequenced DIL NOD library was calculated to be 7.1-fold across the autosomes and 6.6-fold across the X chromosome (Table 2A). The end-sequenced CHORI-29 library has an average depth of 5.0-fold and 2.8-fold across the autosomes and the X chromosome, respectively (Table 2B). The total number of Ensembl [15] predicted genes that are fully covered by a BAC clone from the DIL NOD library is 31,093 (98.8%) and 30,103 (95.7%) for the CHORI-29 library, based upon Ensembl mouse release 55 (NCBIm37). The total number of Ensembl genes partially covered by DIL NOD and CHORI-29 BACs is 349 (1.1%), and 1,351 (4.3%) respectively, 200 (0.6%) of which are partial in both libraries. The total number of Ensembl genes contained completely in DIL NOD and CHORI-29 BAC gaps is 30 (0.1%) and 18 (0.06%) respectively. There are 5 (0.02%) genes that are present on the reference genome but absent completely from both NOD libraries. This is likely because it was not possible to place one or both BAC end-reads of a pair due to reference genome gaps adjacent to these Ensembl genes. These data are available on the Sanger FTP site (ftp://ftp.sanger.ac.uk/pub/NODmouse/NOD_Ensembl_gene_coverage).

Table 2A.

Chr. name Total chr. length bp Non-redundant clone-length bp Clone-length total bp Passed aligned sequence bp Clone depth % total coverage
1 197,195,432 189,902,383 1,385,443,533 13,593,208 7.03 96.3
2 181,748,087 177,813,774 1,348,761,746 13,129,801 7.42 97.84
3 159,599,783 155,666,906 1,151,145,108 11,396,911 7.21 97.54
4 155,630,120 150,659,365 1,134,909,770 10,944,727 7.29 96.81
5 152,537,259 146,223,145 1,009,782,448 9,773,915 6.62 95.86
6 149,517,037 145,684,306 1,049,822,257 10,281,087 7.02 97.44
7 152,524,553 140,866,414 1,007,473,869 9,637,732 6.61 92.36
8 131,738,871 123,985,369 907,535,329 8,826,036 6.89 94.11
9 124,076,172 120,302,728 916,247,038 8,800,500 7.38 96.96
10 129,993,255 125,971,520 897,618,520 8,764,981 6.91 96.91
11 121,843,856 118,333,935 924,494,065 8,842,955 7.59 97.12
12 121,257,530 117,098,461 854,759,223 8,258,226 7.05 96.57
13 120,284,312 116,142,787 888,982,416 8,609,816 7.39 96.56
14 125,194,864 119,290,632 846,844,974 8,247,811 6.76 95.28
15 103,494,974 100,431,693 735,484,668 7,116,922 7.11 97.04
16 98,319,150 94,516,190 692,930,172 6,811,564 7.05 96.13
17 95,272,651 91,304,604 690,760,336 6,680,800 7.25 95.84
18 90,772,031 86,713,096 655,698,203 6,460,483 7.22 95.53
19 61,342,430 57,670,646 421,571,236 4,064,957 6.87 94.01
X 166,650,296 160,833,108 1,095,695,031 10,991,837 6.57 96.51
Y 15,902,555 0 0 0 0 0

All data in Table 1 and Table 2 are derived from the NCBIm37 mouse assembly. The DIL NOD library covers over 95.6% of the mouse genome at an average depth of 7.1-fold across the autosomes and 6.6-fold across the X chromosome. Column 2 is the C57BL/6J chromosome length. The non-redundant clone-length field (column 3) is the non-redundant BAC clone sequence estimated for that chromosome, using both read-pairs. This may count each base several times, the number of times being the number of BACs overlapping that position. The clone-length total field (column 4) is the total sequence estimated for all the BACs for that chromosome, using both read-pairs. The passed aligned sequence column (5) is the total number of sequenced bases per chromosome that have successfully passed quality control and been mapped. The clone depth field (column 6) is the clone-length total (column 4) divided by the total chromosome length (column 2). The % total coverage (column 7) is the non-redundant clone-length total (counting each base covered only once) as a percentage of the total chromosome length.

Table 2B.

Chr. Name Total chr. length bp Non-redundant clone-length bp Clone-length total bp Passed aligned sequence bp Clone depth % total coverage
1 197,195,432 188,433,001 1,046,993,130 7,724,484 5.31 95.56
2 181,748,087 175,429,538 942,386,790 6,916,061 5.18 96.52
3 159,599,783 154,711,914 875,284,125 6,490,661 5.48 96.94
4 155,630,120 148,498,769 827,737,353 6,127,905 5.32 95.42
5 152,537,259 144,530,268 733,517,218 5,404,820 4.81 94.75
6 149,517,037 143,599,805 775,532,299 5,768,104 5.19 96.04
7 152,524,553 135,008,655 644,735,047 4,788,996 4.23 88.52
8 131,738,871 121,214,563 606,581,664 4,522,109 4.60 92.01
9 124,076,172 118,654,230 598,157,862 4,362,992 4.82 95.63
10 129,993,255 124,749,578 675,601,360 4,987,613 5.20 95.97
11 121,843,856 115,930,403 605,273,345 4,474,962 4.97 95.15
12 121,257,530 115,274,597 604,649,354 4,482,487 4.98 95.07
13 120,284,312 114,926,992 609,116,213 4,517,988 5.06 95.55
14 125,194,864 116,442,268 607,970,623 4,531,884 4.85 93.01
15 103,494,974 99,239,967 543,851,152 3,998,763 5.25 95.89
16 98,319,150 92,840,807 534,474,134 3,928,889 5.44 94.43
17 95,272,651 89,677,039 468,395,835 3,456,144 4.92 94.13
18 90,772,031 85,715,767 456,115,712 3,373,974 5.02 94.43
19 61,342,430 56,286,798 291,811,698 2,148,157 4.75 91.76
X 166,650,296 148,955,095 469,399,660 3,543,869 2.82 89.38
Y 15,902,555 2,427,223 9,762,866 80,060 0.62 15.26

The CHORI-29 library covers 93.9% of the mouse genome at an average depth of 5.0-fold across the autosomes and 2.8-fold across the X chromosome. For further information regarding the data, please see legend for Table 2A.

To make the data accessible to the wider community, we have generated a Distributed Annotation System (DAS) [16] source to display both the DIL NOD clones (http://www.ebi.ac.uk/das-srv/genomicdas/das/nod_clones_m37) and the CHORI-29 clones (http://www.ebi.ac.uk/das-srv/genomicdas/das/chori29_clones_m37) so that they can be visualized in DAS source compliant browsers. For example, in the Ensembl genome browser (http://www.ensembl.org/Mus_musculus/Info/Index) the alignments of both NOD BAC libraries can be accessed through the DAS sources menu and viewed against the reference C57BL/6J genome. DIL NOD clones are displayed as red and black lines depending on the orientation of the insert in the vector, while CHORI-29 clones are displayed as green and blue lines. The BAC end-sequences can also be viewed as traces in the main “Region in Detail” window of the Ensembl genome browser. The method used to display BAC ends in Ensembl shows only those that have corresponding ends that are considered to be within 3 standard deviations of the mean insert size of the library. End-reads provide a link to the Ensembl trace repository (http://trace.ensembl.org), where the end-read sequences for all quality clipped reads have been deposited (Fig. 2). FASTA files of these quality clipped reads have also been generated and deposited on the Sanger FTP site (ftp://ftp.sanger.ac.uk/pub/NODmouse/NOD_BACend_fasta_sequences).

Fig. 2.

Fig. 2

NOD BAC clones are displayed on the Ensembl genome browser under the DAS sources, http://www.ebi.ac.uk/das-srv/genomicdas/das/nod_clones_m37 and http://www.ebi.ac.uk/das-srv/genomicdas/das/chori29_clones_m37. The NOD/MrkTac mouse strain is designated DIL NOD (Sanger/Ensembl prefix bQ) and the NOD/ShiLtJ mouse strain is designated CHORI-29 (Sanger/Ensembl prefix bCN). The DIL NOD clones are displayed as black and red bars and the CHORI-29 clones are displayed as blue and green bars, the colours indicating the orientation of the DNA insert in the vector. BAC clones on the forward strand are drawn above the DNA contig while clones on the reverse strand are drawn below. The clone end-reads are shown as small arrows in the corresponding color to the relevant BAC clone. Links to the end-read sequences in the Ensembl trace archive can be found by clicking on the clone end of interest.

Analysis of nucleotide variation

We used SSAHA-SNP2 [15] to call single nucleotide polymorphisms (SNPs) and deletion insertion polymorphisms (DIPs) from both the DIL NOD and CHORI-29 end-sequence reads by comparing them to the NCBIm37 C57BL/6J assembly. We called 191,841 SNPs and 15,824 DIPs for DIL NOD and 114,380 SNPs and 4,942 DIPs for CHORI-29. These data are available on the Sanger FTP site (ftp://ftp.sanger.ac.uk/pub/NODmouse/NOD_variation_data). The following criteria were used: the identity had to be greater than or equal to 92% match length, greater than or equal to 80% of the read length and a match score or match length greater than 250 bp. These SNPs have been validated against Illumina whole genome shotgun data of the NOD/ShiLtJ genome (http://www.sanger.ac.uk/modelorgs/mousegenomes/) and submitted to dbSNP. However, it is important to note that the DIPs are candidate nucleotide variants and follow-up genotyping is warranted.

Discussion

The genome-wide DIL NOD and CHORI-29 mouse BAC end-sequenced libraries provide a unique way of studying T1D in mouse. The two libraries were aligned against the C57BL/6J mouse genome and are displayed on the Ensembl genome browser, which almost eliminates the need to perform filter hybridizations to isolate clones of interest, except for “non-reference” regions of the genome that are novel in NOD mouse. The distribution of mapped sequence reads is relatively even across the genome, with the exception of the Y chromosome, which is not represented in the DIL NOD mouse library. To date, high-quality BAC libraries exist for several mouse strains, including C57BL/6J, MSM/Ms [17,18], C3H/HeJ, BALB/c, A/J, SPRET/Ei, AKR/J, CAST/Ei (http://bacpac.chori.org/home.htm) and 129S7 [11], with BAC end-sequences existing for C57BL/6J, MSM/Ms and 129S7. Such libraries have been shown to be an invaluable resource for assembling genomes and for in vivo functional studies, such as BAC rescue [19–21].

The high-density and end-sequence quality of these BAC libraries make them useful tools for examining large-scale structural differences between the two substrains of NOD and other mouse strains and will greatly facilitate high-throughput targeted manipulation of the NOD mouse genome. With the recent advent of NOD ES lines these BAC libraries will be a critical resource for targeting vector construction [12,13] where isogenic DNA has been shown to be critically important in obtaining high targeting frequencies [22]. Due to the high coverage of the C57BL/6J reference genome by the NOD BAC ends, regions with poor coverage of the reference genome may represent structural variants in the NOD mouse genome when compared to the reference genome. Importantly we are currently performing targeted sequencing of regions of the NOD mouse genome relevant to T1D, which is crucial for understanding the role genetic susceptibility plays in the pathogenesis of T1D (http://www.sanger.ac.uk/Projects/M_musculus-NOD/). We are also in the process of using the Illumina platform [23] to sequence the entire NOD/ShiLtJ genome, which will greatly improve the utility of these BAC resources, and should help to position unaligned BACs to novel ‘non-reference’ regions of the NOD genome (http://www.sanger.ac.uk/modelorgs/mousegenomes/).

Materials and methods

Construction of the BAC libraries

The DIL NOD BAC library was constructed from NOD/MrkTac female liver genomic DNA at the Genomic Sciences Center, RIKEN in collaboration with the Diabetes and Inflammation Laboratory at Cambridge University. EcoRI was used to partially digest whole genomic DNA and the resulting fragments were cloned in pBACe3.6 as described previously [18]. These clones are available from Dr Jayne Danska jayne.danska@sickkids.on.ca. The CHORI-29 library was constructed in a similar manner at the Children's Hospital, Oakland, California, USA. NOD/ShiLtJ male kidney genomic DNA, obtained from the Jackson Laboratory, was cloned in pTARBAC2.1 and these clones are available from http://bacpac.chori.org/. The average insert size was experimentally verified using clamped hexagonal electric field (CHEF) electrophoresis, a system similar to pulse field gel electrophoresis (PFGE) [24]. The marker (M) (Fig. 1B) was a MidRange II PFG Marker – N3552S from New England BioLabs. For storage purposes, the DIL NOD library was arrayed into 527 384-well plates and the CHORI-29 library was replicated into 672 384-well plates.

End-sequence profiling of the BAC resource

In total 378,896 reads were attempted for the DIL NOD library using T7 and SP6, M13-21 and pUCR, and 3HPPSK and 3HPpur primers on the vector and big dye terminator chemistry. 207,321 reads were attempted for the CHORI-29 library using the T7 and SP6 primers on the vector, and big dye terminator chemistry. Sequence-reads were subjected to processing using Automated Sequence Preprocessing (ASP) [25]. The number of insertless clones was determined to be 3% for DIL NOD and 1% for CHORI-29. Average read-lengths were determined to be 694.08 bp in length for DIL NOD and 718.77 bp in length for CHORI-29.

End-sequence mapping of BAC clones

End-read data were mapped using SSAHA2 with the mapping criteria that more than 100 bp should map with greater than 95% identity to the NCBIm37 assembly. Clone-ends were iteratively aligned against the genome and after each round the average size of the clones was calculated as well as the standard deviation. Clone-ends that were plus or minus three standard deviations away from the mean were rejected. Clones with only one end aligned, ends that were orientated in the same direction, or ends that lie at unrealistic distances were rejected.

Conflict of interest statement

The authors declare that they have no conflict of interest.

Acknowledgments

This research was conducted as part of the Immune Tolerance Network, which is jointly funded by the National Institute of Allergy and Infectious Diseases, the National Institute of Diabetes, Digestive, and Kidney Disorders (both part of the National Institutes of Health) and the Juvenile Diabetes Research Foundation. The project was also supported by the Medical Research Council, UK and the Wellcome Trust Sanger Institute. D. J. Adams is supported by Cancer Research, UK and the Wellcome Trust.

Footnotes

☆ Sequence data from this library have been deposited with the DDBJ/EMBL/GenBank Data libraries under Accession Numbers FR000001–FR332535.

☆☆ Sequence data from this library have been deposited with the DDBJ/EMBL/GenBank Data libraries under Accession Numbers FR332536–FR502694.

References

  • 1.Devendra D., Liu E., Eisenbarth G.S. Type 1 diabetes: recent developments. BMJ. 2004;328:750–754. doi: 10.1136/bmj.328.7442.750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Leiter E.H. The NOD mouse: a model for insulin-dependent diabetes mellitus. Curr. Protoc. Immunol. 2001;Chapter 15 doi: 10.1002/0471142735.im1509s24. Unit 15 9. [DOI] [PubMed] [Google Scholar]
  • 3.Wicker L.S., Todd J.A., Peterson L.B. Genetic control of autoimmune diabetes in the NOD mouse. Annu. Rev. Immunol. 1995;13:179–200. doi: 10.1146/annurev.iy.13.040195.001143. [DOI] [PubMed] [Google Scholar]
  • 4.Anderson M.S., Bluestone J.A. The NOD mouse: a model of immune dysregulation. Annu. Rev. Immunol. 2005;23:447–485. doi: 10.1146/annurev.immunol.23.021704.115643. [DOI] [PubMed] [Google Scholar]
  • 5.Atkinson M.A., Leiter E.H. The NOD mouse model of type 1 diabetes: as good as it gets? Nat. Med. 1999;5:601–604. doi: 10.1038/9442. [DOI] [PubMed] [Google Scholar]
  • 6.Todd J.A. From genome to aetiology in a multifactorial disease, type 1 diabetes. Bioessays. 1999;21:164–174. doi: 10.1002/(SICI)1521-1878(199902)21:2<164::AID-BIES10>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]
  • 7.Makino S., Kunimoto K., Muraoka Y., Mizushima Y., Katagiri K., Tochino Y. Breeding of a non-obese, diabetic strain of mice. Jikken Dobutsu. 1980;29:1–13. doi: 10.1538/expanim1978.29.1_1. [DOI] [PubMed] [Google Scholar]
  • 8.Ivakine E.A., Fox C.J., Paterson A.D., Mortin-Toth S.M., Canty A., Walton D.S., Aleksa K., Ito S., Danska J.S. Sex-specific effect of insulin-dependent diabetes 4 on regulation of diabetes pathogenesis in the nonobese diabetic mouse. J. Immunol. 2005;174:7129–7140. doi: 10.4049/jimmunol.174.11.7129. [DOI] [PubMed] [Google Scholar]
  • 9.Ridgway W.M., Peterson L.B., Todd J.A., Rainbow D.B., Healy B., Burren O.S., Wicker L.S. Gene–gene interactions in the NOD mouse model of type 1 diabetes. Adv. Immunol. 2008;100:151–175. doi: 10.1016/S0065-2776(08)00806-7. [DOI] [PubMed] [Google Scholar]
  • 10.Erlich H., Valdes A.M., Noble J., Carlson J.A., Varney M., Concannon P., Mychaleckyj J.C., Todd J.A., Bonella P., Fear A.L., Lavant E., Louey A., Moonsamy P. HLA DR-DQ haplotypes and genotypes and type 1 diabetes risk: analysis of the type 1 diabetes genetics consortium families. Diabetes. 2008;57:1084–1092. doi: 10.2337/db07-1331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Adams D.J., Quail M.A., Cox T., van der Weyden L., Gorick B.D., Su Q., Chan W.I., Davies R., Bonfield J.K., Law F., Humphray S., Plumb B., Liu P., Rogers J., Bradley A. A genome-wide, end-sequenced 129Sv BAC library resource for targeting vector construction. Genomics. 2005;86:753–758. doi: 10.1016/j.ygeno.2005.08.003. [DOI] [PubMed] [Google Scholar]
  • 12.Hanna J., Markoulaki S., Mitalipova M., Cheng A.W., Cassady J.P., Staerk J., Carey B.W., Lengner C.J., Foreman R., Love J., Gao Q., Kim J., Jaenisch R. Metastable pluripotent states in NOD-mouse-derived ESCs. Cell Stem Cell. 2009;4:513–524. doi: 10.1016/j.stem.2009.04.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Nichols J., Jones K., Phillips J.M., Newland S.A., Roode M., Mansfield W., Smith A., Cooke A. Validated germline-competent embryonic stem cell lines from nonobese diabetic mice. Nat. Med. 2009;15:814–818. doi: 10.1038/nm.1996. [DOI] [PubMed] [Google Scholar]
  • 14.Ning Z., Cox A.J., Mullikin J.C. SSAHA: a fast search method for large DNA databases. Genome Res. 2001;11:1725–1729. doi: 10.1101/gr.194201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Hubbard T.J., Aken B.L., Ayling S., Ballester B., Beal K., Bragin E., Brent S., Chen Y., Clapham P., Clarke L., Coates G., Fairley S., Fitzgerald S., Fernandez-Banet J., Gordon L., Graf S., Haider S., Hammond M., Holland R., Howe K., Jenkinson A., Johnson N., Kahari A., Keefe D., Keenan S., Kinsella R., Kokocinski F., Kulesha E., Lawson D., Longden I., Megy K., Meidl P., Overduin B., Parker A., Pritchard B., Rios D., Schuster M., Slater G., Smedley D., Spooner W., Spudich G., Trevanion S., Vilella A., Vogel J., White S., Wilder S., Zadissa A., Birney E., Cunningham F., Curwen V., Durbin R., Fernandez-Suarez X.M., Herrero J., Kasprzyk A., Proctor G., Smith J., Searle S., Flicek P. Ensembl 2009. Nucleic Acids Res. 2009;37:D690–D697. doi: 10.1093/nar/gkn828. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Dowell R.D., Jokerst R.M., Day A., Eddy S.R., Stein L. The distributed annotation system. BMC Bioinformatics. 2001;2:7. doi: 10.1186/1471-2105-2-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Abe K., Noguchi H., Tagawa K., Yuzuriha M., Toyoda A., Kojima T., Ezawa K., Saitou N., Hattori M., Sakaki Y., Moriwaki K., Shiroishi T. Contribution of Asian mouse subspecies Mus musculus molossinus to genomic constitution of strain C57BL/6J, as defined by BAC-end sequence-SNP analysis. Genome Res. 2004;14:2439–2447. doi: 10.1101/gr.2899304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Osoegawa K., Tateno M., Woon P.Y., Frengen E., Mammoser A.G., Catanese J.J., Hayashizaki Y., de Jong P.J. Bacterial artificial chromosome libraries for mouse sequencing and functional analysis. Genome Res. 2000;10:116–128. [PMC free article] [PubMed] [Google Scholar]
  • 19.Antoch M.P., Song E.J., Chang A.M., Vitaterna M.H., Zhao Y., Wilsbacher L.D., Sangoram A.M., King D.P., Pinto L.H., Takahashi J.S. Functional identification of the mouse circadian clock gene by transgenic BAC rescue. Cell. 1997;89:655–667. doi: 10.1016/s0092-8674(00)80246-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Gregory S.G., Sekhon M., Schein J., Zhao S., Osoegawa K., Scott C.E., Evans R.S., Burridge P.W., Cox T.V., Fox C.A., Hutton R.D., Mullenger I.R., Phillips K.J., Smith J., Stalker J., Threadgold G.J., Birney E., Wylie K., Chinwalla A., Wallis J., Hillier L., Carter J., Gaige T., Jaeger S., Kremitzki C., Layman D., Maas J., McGrane R., Mead K., Walker R., Jones S., Smith M., Asano J., Bosdet I., Chan S., Chittaranjan S., Chiu R., Fjell C., Fuhrmann D., Girn N., Gray C., Guin R., Hsiao L., Krzywinski M., Kutsche R., Lee S.S., Mathewson C., McLeavy C., Messervier S., Ness S., Pandoh P., Prabhu A.L., Saeedi P., Smailus D., Spence L., Stott J., Taylor S., Terpstra W., Tsai M., Vardy J., Wye N., Yang G., Shatsman S., Ayodeji B., Geer K., Tsegaye G., Shvartsbeyn A., Gebregeorgis E., Krol M., Russell D., Overton L., Malek J.A., Holmes M., Heaney M., Shetty J., Feldblyum T., Nierman W.C., Catanese J.J., Hubbard T., Waterston R.H., Rogers J., de Jong P.J., Fraser C.M., Marra M., McPherson J.D., Bentley D.R. A physical map of the mouse genome. Nature. 2002;418:743–750. doi: 10.1038/nature00957. [DOI] [PubMed] [Google Scholar]
  • 21.Zhao S. A comprehensive BAC resource. Nucleic Acids Res. 2001;29:141–143. doi: 10.1093/nar/29.1.141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.te Riele H., Maandag E.R., Berns A. Highly efficient gene targeting in embryonic stem cells through homologous recombination with isogenic DNA constructs. Proc. Natl. Acad. Sci. U. S. A. 1992;89:5128–5132. doi: 10.1073/pnas.89.11.5128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Bentley D.R., Balasubramanian S., Swerdlow H.P., Smith G.P., Milton J., Brown C.G., Hall K.P., Evers D.J., Barnes C.L., Bignell H.R., Boutell J.M., Bryant J., Carter R.J., Keira Cheetham R., Cox A.J., Ellis D.J., Flatbush M.R., Gormley N.A., Humphray S.J., Irving L.J., Karbelashvili M.S., Kirk S.M., Li H., Liu X., Maisinger K.S., Murray L.J., Obradovic B., Ost T., Parkinson M.L., Pratt M.R., Rasolonjatovo I.M., Reed M.T., Rigatti R., Rodighiero C., Ross M.T., Sabot A., Sankar S.V., Scally A., Schroth G.P., Smith M.E., Smith V.P., Spiridou A., Torrance P.E., Tzonev S.S., Vermaas E.H., Walter K., Wu X., Zhang L., Alam M.D., Anastasi C., Aniebo I.C., Bailey D.M., Bancarz I.R., Banerjee S., Barbour S.G., Baybayan P.A., Benoit V.A., Benson K.F., Bevis C., Black P.J., Boodhun A., Brennan J.S., Bridgham J.A., Brown R.C., Brown A.A., Buermann D.H., Bundu A.A., Burrows J.C., Carter N.P., Castillo N., Chiara E.C.M., Chang S., Neil Cooley R., Crake N.R., Dada O.O., Diakoumakos K.D., Dominguez-Fernandez B., Earnshaw D.J., Egbujor U.C., Elmore D.W., Etchin S.S., Ewan M.R., Fedurco M., Fraser L.J., Fuentes Fajardo K.V., Scott Furey W., George D., Gietzen K.J., Goddard C.P., Golda G.S., Granieri P.A., Green D.E., Gustafson D.L., Hansen N.F., Harnish K., Haudenschild C.D., Heyer N.I., Hims M.M., Ho J.T., Horgan A.M. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456:53–59. doi: 10.1038/nature07517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Frengen E., Weichenhan D., Zhao B., Osoegawa K., van Geel M., de Jong P.J. A modular, positive selection bacterial artificial chromosome vector with multiple cloning sites. Genomics. 1999;58:250–253. doi: 10.1006/geno.1998.5693. [DOI] [PubMed] [Google Scholar]
  • 25.Wendl M.C., Dear S., Hodgson D., Hillier L. Automated sequence preprocessing in a large-scale sequencing environment. Genome Res. 1998;8:975–984. doi: 10.1101/gr.8.9.975. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES