The structure of the 4.4-kb wild-type genomic fragment containing the RIB43a gene. Boxed areas indicate sequence contained in the cDNA clone pBrib43a; shaded boxes designate the largest open reading frame. Comparison of the cDNA length (1612 bp) with the estimated size of the RIB43a transcript (∼1700 bases by Northern blotting; Figure 2) puts the transcription initiation site at ∼1051 bases; the predicted translation start codon is located at base 1244. There are 11 potential tub boxes (at least 70% identical to the consensus sequence GTTCSAAGGC; Davies and Grossman, 1994) located at bases 48, 136, 318, 397, 425, 535, 542, 694, 783, 879, and 953; these sequence elements are thought to be important for regulated expression of flagellar transcripts. Potential TATA boxes are located at 1007 bases (TTTATGA), 1009 bases (TATGATA), 1012 bases (GATAATT), and 1046 bases (TACACAT). The translation stop codon is present at base 3808. A polyadenylation site (TGTAA) appears at base 4202; the start of the poly(A) tail on our cDNA corresponds to base 4220 of the genomic sequence.