Figure 4.
Gene Representation in MGI and LocusLink. (A) Emerging representation of the flexed tail gene (sideroflexin 1, Sfxn1). A gene record for the flexed tail (f) mouse mutation, described by Hunt et al. (1933), is created in MGI. Over time, MGI captures published information about the flexed tail locus; however, no sequence information is available. Clone 2810002O05, a novel mouse cDNA sequence is released with the FANTOM1 data, and a gene record is created in MGI and LocusLink for the sequence. Sequence-based annotations (GO terms, protein, domains, UniGene) are associated with gene 2810002O05Rik, and the MGI/LocusLink coordinated data exchange begins. LocusLink creates a RefSeq for the gene. After release of FANTOM1 data, Fleming et al. (2001) report the cloning of flexed tail and its sequence. Sequence analysis reveals that the flexed tail sequence is identical to the FANTOM1 cDNA. Gene 2810002O05Rik is merged with the flexed tail gene, and based on Fleming et al. (2001), the gene is renamed sideroflexin 1 (Sfxn1), for the siderocytic anemia and flexed tail phenotypes observed in mutant mice (see Fig. 4B). (B) Current representation of the Sfxn1 gene record in MGI and LocusLink, demonstrating the types of information integrated with sequences at the two resources. Wide arrows indicate data types shared between MGI and LocusLink, and the direction of transfer. MGI and LocusLink also exchange gene name synonyms and corresponding gene record identifiers. Hypertext links to various annotations and data are provided at both resources: official mouse gene nomenclature (MGI provides to LocusLink; A), mapping information (reconciled between MGI and LocusLink; B), allele and phenotype information (MGI; C), polymorphisms (LocusLink provides links to dbSNP, data not shown; D), gene ontology (MGI provides to LocusLink; E), homology information (MGI provides curated mammalian orthology data; F), expression (MGI; G), UniGene (H), LocusLink/MGI reciprocal links (I), mouse genome annotations (J), protein domains (also at LocusLink, data not shown; K), Database of Transcribed Sequences (DoTS, MGI; L), TIGR Mouse Gene Index (MGI; M), mRNA-genome alignments (LocusLink; N), references (O), RefSeqs (LocusLink provides to MGI; P), and sequences (exchanged between MGI and LocusLink; Q).