Skip to main content
eLife logoLink to eLife
. 2022 Jun 21;11:e77999. doi: 10.7554/eLife.77999

Vision-related convergent gene losses reveal SERPINE3’s unknown role in the eye

Henrike Indrischek 1,2,3,4,5,6, Juliane Hammer 7, Anja Machate 7, Nikolai Hecker 2,3,4,, Bogdan Kirilenko 1,2,3,4,5,6, Juliana Roscito 2,3,4,, Stefan Hans 7, Caren Norden 2,§, Michael Brand 7,, Michael Hiller 1,2,3,4,5,6,
Editors: George H Perry8, George H Perry9
PMCID: PMC9355568  PMID: 35727138

Abstract

Despite decades of research, knowledge about the genes that are important for development and function of the mammalian eye and are involved in human eye disorders remains incomplete. During mammalian evolution, mammals that naturally exhibit poor vision or regressive eye phenotypes have independently lost many eye-related genes. This provides an opportunity to predict novel eye-related genes based on specific evolutionary gene loss signatures. Building on these observations, we performed a genome-wide screen across 49 mammals for functionally uncharacterized genes that are preferentially lost in species exhibiting lower visual acuity values. The screen uncovered several genes, including SERPINE3, a putative serine proteinase inhibitor. A detailed investigation of 381 additional mammals revealed that SERPINE3 is independently lost in 18 lineages that typically do not primarily rely on vision, predicting a vision-related function for this gene. To test this, we show that SERPINE3 has the highest expression in eyes of zebrafish and mouse. In the zebrafish retina, serpine3 is expressed in Müller glia cells, a cell type essential for survival and maintenance of the retina. A CRISPR-mediated knockout of serpine3 in zebrafish resulted in alterations in eye shape and defects in retinal layering. Furthermore, two human polymorphisms that are in linkage with SERPINE3 are associated with eye-related traits. Together, these results suggest that SERPINE3 has a role in vertebrate eyes. More generally, by integrating comparative genomics with experiments in model organisms, we show that screens for specific phenotype-associated gene signatures can predict functions of uncharacterized genes.

Research organism: Mouse, Zebrafish

Introduction

Disorders affecting eyes range from subtle vision impairment to blindness and are among the most prevalent diseases in the human population (Sheffield and Stone, 2011). For example, an estimated 76 million people worldwide suffer from glaucoma (Allison et al., 2020), a disease involving optic nerve damage, and about 1.8 million people are blind due to age-related macular degeneration, a degenerative disease of the central retina affecting retinal pigment epithelium (RPE) and photoreceptor cells (Mitchell et al., 2018; GBD 2019 Blindness and Vision Impairment Collaborators and Vision Loss Expert Group of the Global Burden of Disease Study, 2021).

Extensive research in the past decades identified many protein-coding genes with crucial roles in development and maintenance of different tissues and cell types in the eye as well as numerous genes that are associated with genetic eye disorders (Sheffield and Stone, 2011; Moore et al., 2018). For example, the RetNet database (Daiger et al., 2022) lists 271 genes associated with heritable retinal diseases. Even though the eye probably represents one of the best studied organs, our knowledge of the genes underlying eye diseases and disorders is still incomplete. For example, linkage analysis in patients with cataract, microcornea, microphthalmia, and iris coloboma identified new genomic loci linked to the diseases; however, the disease-causing genes have remained elusive (Wang et al., 2007; Sabir et al., 2010; Abouzeid et al., 2009). Similarly, the Cat-Map database (Shiels et al., 2010) lists several additional cataract-associated loci where the underlying disease-causing gene has not been identified. Furthermore, there are still thousands of genes that have not been experimentally investigated in detail, leaving many genes where potential eye-related functions remain to be discovered. Indeed, systematic knockouts of 4,364 genes in mouse detected ocular phenotypes for 347 genes, with 75% of them not been known as eye-related before (Moore et al., 2018). This indicates that vision-related genes as well as genes associated with genetic eye disorders remain to be identified and characterized.

Interestingly, many genes that are linked to human eye diseases are inactivated (lost) in non-human mammals that naturally exhibit poor vision (Emerling et al., 2017; Sharma and Hiller, 2020). For example, subterranean mammals, such as the blind mole rat, naked mole rat, star-nosed mole, and cape golden mole, exhibit gene-inactivating mutations in genes implicated in cataract, retinitis pigmentosa, color or night blindness, or macular degeneration in human (e.g. ABCA4, BEST1, CRYBA1, EYS, GJA8, GNAT2, PDE6C, ROM1, and SLC24A1) (Sharma and Hiller, 2020; Kim et al., 2011; Fang et al., 2014; Emerling and Springer, 2014; Prudent et al., 2016; Partha et al., 2017; Emerling, 2018). Symptoms that characterize these human eye diseases resemble traits found in these subterranean mammals, such as highly degenerated retinae and lenses and sometimes blindness. Similarly, losses of the short wave sensitive opsin (OPN1SW), which is linked to color blindness in humans, occurred in several mammalian lineages such as cetaceans, bats, sloths, and armadillos that are consequently inferred to have monochromatic vision (Emerling et al., 2017; Meredith et al., 2013; Zhao et al., 2009; Emerling and Springer, 2015; Sadier et al., 2018; Springer et al., 2016). In addition to losses of vision-related genes, subterranean mammals also exhibit widespread sequence and transcription factor binding site divergence in eye-related regulatory elements (Roscito et al., 2018; Langer et al., 2018; Langer and Hiller, 2019). Such mutations can cause mis-expression of target genes in ocular tissues such as the lens (Roscito et al., 2021). Loss and divergence of vision-related genes and regulatory elements in these mammals is likely caused by the lack of natural selection on maintaining functional eyes in a dark environment. Taken together, previous studies established a clear association between naturally occurring poor vision phenotypes and regressive evolution of the genetic machinery required for functional eyes (Emerling et al., 2017; Partha et al., 2017; Roscito et al., 2018; Langer et al., 2018).

Here, we performed a genome-wide screen for genes preferentially lost in independent mammalian lineages with a low visual acuity with the goal of revealing currently uncharacterized genes, where vision-related gene loss patterns would predict vision-related functions. In addition to identifying losses of known vision-related genes, our screen revealed previously unknown losses of several functionally uncharacterized genes, among them SERPINE3, which is lost in 18 mammalian lineages that often do not use vision as the primary sense. We show that SERPINE3 is specifically expressed in eyes of zebrafish and mouse. Furthermore, by knockout of serpine3 in zebrafish, we show that the gene is required for maintenance of a proper retinal lamination and overall eye shape, which confirms the predicted vision-related function. Collectively, our results confirm that SERPINE3 has functions in vertebrate eyes and our discovery-driven study demonstrates how specific evolutionary divergence patterns can reveal novel insights into gene function (Stephan et al., 2022).

Results

A genome-wide screen retrieves genes preferentially lost in mammals with low visual acuity

To uncover potentially unknown vision-related genes, we used the Forward Genomics framework (Hiller et al., 2012) to search for associations between convergent gene losses and poor vision phenotypes in mammalian lineages, where poor vision has independently evolved. Vision is a complex multifaceted trait that may not be easily captured with a single variable. In our study, we decided to select mammals with poor vision based on visual acuity values for two reasons. First, visual acuity describes the ability of an animal to resolve static spatial details, which generally reflects how much an animal relies on vision in comparison to other senses (Caves et al., 2018). Second, visual acuity data is available for 49 placental mammals with sequenced genomes, enabling a comprehensive genomic screen (Figure 1—source data 1). Using visual acuity values, we defined two groups. Low-acuity species have low visual acuity values <1 (log10(va) <0), which comprises ten species (three echolocating bats, three rodents and four subterranean mammals) representing seven lineages (Figure 1A). All other species have visual acuity values >1 and comprise the group with higher visual acuity values.

Figure 1. A comparative genomics screen uncovered known and novel vision-related genes.

(A) Phylogeny of the species included in our screen (left). Visual acuity values on a log10 scale are shown on the right as a bar chart; white boxes indicate the three subterranean mammals that lack acuity measurements but are functionally blind. Low-acuity species (red font) are defined here as species with visual acuity <1 (log10(va) <0). At a false discovery rate (FDR) threshold of 0.05, our screen retrieved 29 genes, which are preferentially lost in low-acuity species. Gene losses in individual species are denoted by red dots. The FDR value for the gene loss – phenotype association is shown at the top as a bar chart. (B) List of 29 genes, together with known vision-related functions. Asterisk marks genes that have no known vision-related function but were mentioned in large-scale gene expression data sets of ocular tissues. Genes in bold are expressed in human eyes according to the eyeIntegration database (Bryan et al., 2018) (see Methods). KO - knockout. (C) Functional enrichments of the 29 genes reveals vision-related functions (Gene Ontology, GO of biological processes), associations with human eye disorders (OMIM), expression in ocular tissues in the mouse gene atlas (Kuleshov et al., 2016) and eye phenotypes in mouse gene KOs (MGI Mammalian Phenotype level 4) among the top five most significant terms. Vision-related terms are shown as blue bars. The dashed line indicates statistical significance in a one-sided Fisher’s exact test after correcting for multiple testing using the Benjamini-Hochberg procedure (FDR 0.05).

Figure 1—source data 1. Genome assemblies used in this study and visual acuity values of mammalian species.
Figure 1—source data 2. List of candidate genes of the forward genomics screen based on visual acuity <1 including a short description.
Figure 1—source data 3. Enrichment of candidate genes in different processes/functions.
Figure 1—source data 4. A control forward genomics screen does not retrieve vision-related genes.
Figure 1—source data 5. List of candidate genes of the forward genomics screen based on a visual acuity threshold <1.5.
Figure 1—source data 6. List of candidate genes of the forward genomics screen based on visual acuity threshold <0.5.
Figure 1—source data 7. List of candidate genes of the forward genomics screen based on the molecular loss signature of known vision genes.
Figure 1—source data 8. Reading frame %intact values for all investigated genes and species.
Figure 1—source data 9. Enrichment of candidate genes in different stages of the filtering process.

Figure 1.

Figure 1—figure supplement 1. Flow diagram of filtering steps applied in the Forward genomics approach.

Figure 1—figure supplement 1.

The filtering procedure results in a list of 31 candidate genes. Two of the 31 genes are no longer contained in the latest Ensembl annotation version (v106) and were therefore dropped, resulting in 29 genes potentially associated with an eye-related function.

Using this classification, we performed a genome-wide screen for genes that exhibit inactivating mutations (frame-shifting insertions or deletions, premature stop codons, splice site mutations, exon, or gene deletions) preferentially in low-acuity species, similar to previous screens (Hecker et al., 2019a; Hecker et al., 2019b; Sharma et al., 2018). Using a false discovery rate (FDR) cutoff of 0.05 and filtering for genes lost in at least three low-acuity lineages, we obtained 29 genes that are each convergently lost in at least three lineages of low-acuity mammals (Figure 1B, Figure 1—source data 2). These genes include several known vision-related genes, such as components of the photoreceptor signal transduction cascade (GUCA1B, ARR3), a factor required for retinal organization (CRB1), lens crystallins (CRYBA1, CRYBB3, CRYGS) and the cornea specific keratin 12 (KRT12). The set of 29 genes is enriched in vision-related functions such as the Gene Ontology term ‘visual perception’ (p=1.8e-7), expression in the mouse lens (p=5.5e-3), and human eye diseases (cataract: p=8.5e-3) (Figure 1C, Figure 1—source data 3). To confirm the specificity of these results, we performed a control screen for genes that are preferentially lost in high-acuity sister species of the low-acuity mammals. This control screen retrieved only two genes, none of which have known functions in the eye (Figure 1—source data 4). Together, this shows that our genome-wide screen for genes preferentially lost in low-acuity species successfully retrieved known vision-related genes.

Interestingly, while 15 of the 29 top-ranked hits have no studied role in the eye (Figure 1B), many of these genes are expressed in human eyes. RSKR, LACTBL1, LBHD1, and ZNF529 even cluster in their expression pattern with other retina phototransduction and visual perception genes (Uhlen et al., 2010; Figure 1—source data 2). The preferential loss of these genes in species that exhibit lower visual acuity values and that have convergently lost other known vision-related genes predicts an uncharacterized vision-related function for some of these 15 genes (Figure 1A).

SERPINE3 is convergently lost in low-acuity mammals

We sought to experimentally test this prediction for a gene that is ranked highly in our screen. Since the first-ranked gene, TSACC (TSSK6 activating cochaperone), encodes a chaperone that is specifically expressed in testis (Uhlen et al., 2010; Jha et al., 2010) and is therefore unlikely to have an eye-related function, we focused on the second-ranked candidate, SERPINE3 (serpin family E member 3). SERPINE3 is largely uncharacterized and classified as lost in 7 of the 10 low-acuity mammals (Figure 1, Figure 1—source data 2). SERPINE3 independently accumulated inactivating mutations in all four subterranean species (cape golden mole, star-nosed mole, naked mole rat, blind mole rat). Within bats, SERPINE3 is inactivated in the two Myotis bats and the big brown bat (Yangochiroptera). These three species use echolocation instead of vision as the primary sense for hunting. In contrast, non-echolocating flying foxes (Pteropodidae) that rely more on vision, possess an intact SERPINE3 gene. To show that SERPINE3 loss is robustly associated with poor vision, we varied the visual acuity thresholds used to classify species as having low-acuity vision and explored an alternative approach for defining poor vision species based on a molecular loss signature of known vision-related genes. All three modified screens consistently retrieved SERPINE3 as one of the top-ranked hits (Figure 1—source data 5 , Figure 1—source data 6, Figure 1—source data 7), showing that this association is robust to the selected thresholds and phenotype definition.

SERPINE3 became dispensable in many mammals that do not primarily rely on vision

To explore the evolution of SERPINE3 in additional mammalian genomes, we made use of an orthology data set generated by the TOGA (Tool to infer Orthologs from Genome Alignments) method (Kirilenko et al., titled 'Integrating gene annotation with orthology inference at scale', under review) that includes 381 additional placental mammalian species that were not part of the genomic screen. Interestingly, this substantially extended data set revealed a number of additional losses of SERPINE3, typically in species that do not rely on vision as their primary sense (Figure 2A). We only considered SERPINE3 losses for which at least one inactivating mutation could be validated (see Materials and methods, Figure 2—source data 1, Figure 2—source data 2). Further supporting SERPINE3 losses, analyses of publicly available RNA-seq data indicates that remnants of inactivated SERPINE3 genes do not show relevant expression (Figure 2—figure supplement 1, Figure 2—source data 3). A detailed analysis of inactivating mutations indicates that SERPINE3 is inactivated in 70 of the 430 analyzed placental mammals and that the gene is convergently lost at least 18 times in placental mammal evolution (Figure 2—figure supplements 28).

Figure 2. SERPINE3 gene loss pattern across 430 mammalian species.

(A) Left: Phylogeny of mammalian species investigated for the loss of SERPINE3 with mapped gene loss events indicated as red crosses. Branches of major clades are colored (Rodentia – brown, Chiroptera – orange, Eulipotyphla – blue, Afroinsectiphilia - red). The number of species investigated per clade is specified in parenthesis. For all loss lineages (red font), visual capability (classified as echolocating, fossorial/subterranean, vision as non-primary and primary sense) is displayed as pictograms at the right. Asterisk marks indicate species, where SERPINE3 evolved under relaxed selection but did not accumulate not more than one inactivating mutation. Right: The Serpin protein domain (Pfam) spans all eight protein-coding exons (boxes) of the intact human SERPINE3 gene (top). The secretion signal and the reactive core region are conserved in species with an intact SERPINE3 (Figure 2—figure supplements 9 and 10). Gene-inactivating mutations are illustrated for three clades, with stop codon mutations shown in black, frame-shifting insertions and deletions shown in red and mutated splice site dinucleotides shown between exons in red. Deleted exons are shown as red boxes. Insets show codon alignments with inactivating mutations in red font. RB - roundleaf bat, HB - horseshoe bat. (B) An ancestral deletion removed large parts of SERPINE3 in the tenrec lineage. UCSC genome browser (Lee et al., 2022) view of the human hg38 assembly showing the SERPINE3 locus and whole genome alignments between human and two tenrec species, visualized as a chain of co-linear local alignments. In these chains, blocks represent aligning sequence and double lines represent sequence between the aligning blocks that do not align between human and tenrec. A large deletion removed the first five protein-coding exons of SERPINE3 in both species. Shared breakpoints (gray box) indicate that the deletion likely represents an ancestral event in Tenrecidae.

Figure 2—source data 1. SERPINE3 intactness in 447 mammalian assemblies.
Figure 2—source data 2. Verification of inactivating mutations in SERPINE3 with raw DNA sequencing reads.
Figure 2—source data 3. Expression of SERPINE3 remnants in public RNA-seq datasets.
Figure 2—source data 4. Analysis of relaxation or intensification of selection in SERPINE3 branches of interest.

Figure 2.

Figure 2—figure supplement 1. The remnants of SERPINE3 are not expressed in the common vampire bat.

Figure 2—figure supplement 1.

(A) UCSC genome browser screenshot of the common vampire bat (Desmodus rotundus) genome (Blumer et al., 2022) showing the four remaining exons of SERPINE3 (red: TOGA annotation, the other exons are deleted in this bat) and three coding exons of the neighboring INTS6 gene. Eye RNA-seq from D. rotundus (Sadier et al., 2018), shown as a wiggle track and read alignments, show that three SERPINE3 exons (blue highlight) are not expressed. The fourth SERPINE3 exon overlaps the 3' UTR of the neighboring INTS6 gene, which is located on the other strand. This explains why reads cover this SERPINE3 exon and its flanks, but there is not a single read that supports splicing of this exon to other SERPINE3 exons. In contrast, INTS6 exons are robustly expressed and show spliced alignments. (B) For comparison, the conserved GLUL gene, which is a key enzyme in glutamate detoxification, is highly expressed in eyes of the common vampire bat.
Figure 2—figure supplement 2. Gene-inactivating mutations support four independent losses of SERPINE3 in Afrotheria and Xenarthra (red dots).

Figure 2—figure supplement 2.

Those shared mutations are marked by boxes, which indicate the loss of SERPINE3 in the common ancestor of related species according to parsimony. For tenrecs (Microgale talazaci, Echinops telfairi) and elephant shrew (Elephantulus edwardii), the putative region of exon 4 was likely deleted after the shared frame-shifting mutations. SERPINE3 in sirenia (branches marked in red) evolve under relaxed selection. Coloring and legend as in Figure 2. Gray exons denote missing information.
Figure 2—figure supplement 3. Gene-inactivating mutations support a single loss of SERPINE3 in Pholidota.

Figure 2—figure supplement 3.

A partial deletion of coding exon 6 and three frame-shifting deletions in exon 7 are shared among all four species, indicating a single loss of SERPINE3 in the Pholidota lineage. Coloring and legend as in Figure 2.
Figure 2—figure supplement 4. Gene-inactivating mutations support three independent losses of SERPINE3 in Eulipotyphla (red dots).

Figure 2—figure supplement 4.

The start codon is not intact in the four moles (a). Star-nosed mole (Condylura cristata) and Hispaniolan solenodon (Solenodon paradoxus) share two inactivating mutations in exon 3, although they are phylogenetically distant (b, c). Coloring and legend as in Figure 2.
Figure 2—figure supplement 5. Gene-inactivating mutations support one loss of SERPINE3 in Yangochiroptera.

Figure 2—figure supplement 5.

The ancestral start codon is mutated in most Yangochiroptera (box). The most parsimonious explanation is a shared start codon mutation (red dot) in the ancestral lineage after split from the sac-winged bat (Saccopteryx bilineata). A back mutation to the regular start codon ATG likely occurred in Molossus molossus and Artibeus jamaicensis. Please see Figure 2—figure supplement 6 for gene-inactivating mutations of 15 more Yangochiroptera assemblies. The sac-winged bat has an intact copy of the gene that evolves under relaxed selection (red branch) in addition to a second copy of SERPINE3, which was independently inactivated. Coloring and legend as in Figure 2. Gray exons denote missing information.
Figure 2—figure supplement 6. Gene-inactivating mutations in additional Yangochiroptera.

Figure 2—figure supplement 6.

This is an extension of Figure 2—figure supplement 5. The start codon mutation shown is shared with most other Yangochiroptera (Figure 2—figure supplement 5). Coloring and legend as in Figure 2. Gray exons denote missing information.
Figure 2—figure supplement 7. Gene-inactivating mutations support six independent losses of SERPINE3 in Yinpterochiroptera.

Figure 2—figure supplement 7.

Six independent inactivation events (red dots) occurred in SERINE3’s coding sequence in Yinpterochiroptera. All Rhinolophidae share frame-shifting mutations in exon 4 (boxed), while Hipposideridea share a frame-shifting insertion in exon 1 and a splice site mutation at exon 2 (boxed). Coloring and legend as in Figure 2.
Figure 2—figure supplement 8. Independent gene-inactivating mutations in other mammals.

Figure 2—figure supplement 8.

A test for shifts in selective pressure revealed that SERPINE3 evolves under relaxed selection in the subterranean mole vole (Ellobius lutescens, marked in red), but not in the other cases of gene-inactivating mutations with unclear consequences (steenbok, okapi). A species-specific duplication with inactivation of one SERPINE3 copy occurred in white-footed mouse (Peromyscus leucopus). Coloring and legend as in Figure 2.
Figure 2—figure supplement 9. Conservation of the signal peptide in mammalian SERPINE3.

Figure 2—figure supplement 9.

The putative N-terminal signal peptide contains many hydrophobic residues in intact mammalian SERPINE3 and is predicted to guide secretion into extracellular space. We show the first 30 alignment columns. The sequence logo was generated with WebLogo (Crooks et al., 2004).
Figure 2—figure supplement 10. The hinge region and reactive core loop are conserved in intact mammalian SERPINE3.

Figure 2—figure supplement 10.

The numbers refer to human SERPINE3, where R369 likely is the scissile bond (P1)within the reactive core loop positions 366–372, (P4-P4’). The hinge region is located at positions 355–361. The logo was generated with WebLogo (Crooks et al., 2004).

For example, SERPINE3 is lost in several mammalian clades with nocturnal representatives that have a partially burrowing lifestyle such as pangolins (Manidae), aardvark and armadillos (Cingulata), which are characterized by proportionally small eyes (Mittermeier and Wilson, 2011; Myers et al., 2022; Figure 2A, Figure 2—figure supplements 2 and 3). All representatives of these clades use smell and hearing as primary senses for perception of environmental clues and have a poor sense of vision (DiPaola et al., 2020). SERPINE3 has also been lost in the stem lineage of Pilosa (sloths and anteaters). Although extant sloths have rather high visual acuity values (10.2 for southern two-toed sloth) (Veilleux and Kirk, 2014), the Pilosa ancestor was likely a digging/burrowing species (Gaudin and Croft, 2015), indicating that SERPINE3 loss originally occurred in a burrowing species. SERPINE3 is also lost in solenodon, European shrew and tenrecs (Tenrecidae), species with small eyes that use echolocation for close-range spatial orientation (Siemers et al., 2009; Gould, 1965; Eisenberg and Gould, 1966; Figure 2B, Figure 2—figure supplements 2 and 4). For bats, our extended data set shows that SERPINE3 is convergently lost in seven echolocating lineages, but remains intact in all 13 analyzed Pteropodid bats, revealing a clear pattern of convergent losses of this gene restricted to bat lineages relying on laryngeal echolocation (Figure 2—figure supplements 57). The sac-winged bat is the only laryngeal echolocating bat with an intact SERPINE3; however, selection rate analysis indicates that the gene evolves under relaxed selection (Figure 2—source data 4). The available genomes support shared gene-inactivating mutations among many bat species, which likely represent ancestral events (Figure 2A right). Expanding upon the two convergent losses of SERPINE3 in subterranean rodents detected in the initial screen, the gene is also convergently lost in the fossorial Damara mole rat and evolves under relaxed selection in the Transcaucasian mole vole (Figure 2—source data 4, Figure 2—figure supplement 8), while being intact in all other 63 rodents. In the expanded data set, we further uncovered the loss of SERPINE3 in another clade of fossorial moles (Talpidae) within the order of Eulipotyphla (Figure 2—figure supplement 4). A splice site mutation shared between dugong and manatee as well as patterns of relaxed selection indicate an ancestral loss of SERPINE3 in the ancestor of sirenians, which mostly use their tactile sense for navigation through murky water (Moore et al., 2021; Figure 2—figure supplement 2, Figure 2—source data 1). Consistent with our definition of poor vision based on visual acuity, the Florida manatee has a reduced ability to resolve spatial detail (va = 1.6, Mass et al., 2012) compared to its closest relatives, the African elephant (va = 13.16, Pettigrew et al., 2010), which has an intact SERPINE3 gene. Finally, SERPINE3 is also lost in the cape elephant shrew, which is a mostly diurnal species with relatively large eyes (Dengler-Crish et al., 2006); however, loss of SERPINE3 already occurred in the ancestral Afroinsectiphilia lineage (Figure 2—figure supplement 2), which was presumably nocturnal (Wu et al., 2017).

Together, many independent gene losses in species that do not rely on vision as their primary sense predicts a vision-related function for SERPINE3 that became dispensable in these mammals, leading to convergent SERPINE3 losses due to relaxed selection.

SERPINE3 encodes a putative secreted proteinase inhibitor

Based on sequence homology, SERPINE3 is classified as a member of the serine proteinase inhibitor (SERPIN) family. Many members of this family are secreted into the extracellular space and inhibit their substrates by covalent binding (Heit et al., 2013; Law et al., 2006). We performed a SERPINE3 sequence analysis, which revealed that key sequence features of inhibitory serpins are well conserved among placental mammals, suggesting that SERPINE3 also functions as a secreted serine proteinase inhibitor (see Material and Methods, Figure 2—figure supplements 9 and 10).

Serpins have roles in coagulation, angiogenesis, neuroprotection and inflammation, and several serpins have been implicated in human diseases (Law et al., 2006; Barnstable and Tombran-Tink, 2004). However, the functional role of SERPINE3 is largely unknown as the gene has never been studied in an animal or cellular model. A thorough literature search revealed that SERPINE3 is listed (often in the supplement) among many other genes, as differentially expressed in large-scale expression analyses. For example, Serpine3 is upregulated in the mouse retina in response to overexpressing neuroprotective factors (Machalińska et al., 2013). Serpine3 was also upregulated in complement component 3 (C3) knockout mice, which represent a model of the aged retina (Rogińska et al., 2017), and downregulated in the eye of knockout mice for Pcare, a causal gene for retinitis pigmentosa (Kevany et al., 2015). Together, while this gene was never studied in greater detail, literature clues and the striking convergent gene loss pattern in mammals with poor vision suggest that SERPINE3 may have a functional role in the eye.

Serpine3 is specifically expressed in Müller glia in the adult zebrafish retina

To test the prediction that SERPINE3 has an eye-related function, we first analyzed its expression pattern in adult zebrafish (Figure 3). Zebrafish has proven as valuable model species for the study of eye genetics as it has a cone-dominated retina (∼60% cones, ∼40% rods) (Fadool, 2003), similar to the central human retina. Furthermore, zebrafish have a single serpine3 ortholog that is located in a context of conserved gene order (Figure 4A), which makes zebrafish a suitable model for investigation of serpine3 expression and function. Reverse transcription quantitative PCR (RT-qPCR) analysis with two normalization genes (actb and rpl13a with n=8 and n=3 biological replicates, respectively) consistently revealed that the highest serpine3 expression is in the eye, followed by expression in the brain (Figure 3A and B, Figure 3—source data 1, Figure 3—source data 2). For actb, serpine3 expression is significantly higher in eye than in other tissues (Figure 3A).

Figure 3. Serpine3 is expressed in zebrafish and mouse eyes.

(A,B) Expression of zebrafish serpine3 mRNA in relation to the normalization genes actb (A) and rpl13a (B) measured with RT-qPCR. Expression levels are consistently the highest in eye. A two-sided unequal variances t-test was used. Boxplots display first quartile, median and third quartile with whiskers extending to the maximum and minimum value. Measurements of technical replicates (n=3–6) are encoded in the same color, different colors represent different biological replicates (n=3–8). Relative rates are displayed as zero, if expression was below the detection limit. (C) Serpine3 mRNA expression in different tissues of the zebrafish eye in relation to the reference gene actb measured with RT-qPCR for three biological replicates. Serpine3 is specifically expressed in the retina but not in other tissues of the eye. The expression level is significantly higher in retina without RPE compared to RPE only (two-sided unequal variances t-test). Although expression in RPE only is low, it is significantly higher than expression in eye tissue after removing retina and RPE (two-sided unequal variances t-test). (D) Serpine3 mRNA expression in mouse in relation to the reference gene Rpl27 measured with RT-qPCR for three biological replicates. Serpine3 is specifically expressed in the eye but not in colon, cortex, heart, liver, spleen and testis. (E–H) Serpine3 mRNA expression pattern in zebrafish retina. (E–F) Chromogenic in situ hybridization (ISH) shows localized expression of serpine3 (purple) in the retina, specifically in the inner nuclear layer (inlet). (G–H) Fluorescence in situ hybridization shows that serpine3 mRNA expression (yellow) is localized to cell bodies of Müller glia cells. Filaments of Müller glia cells are marked by the glial fibrillary acidic protein (ZRF1 antibody, green). Specificity of the serpine3 ISH probe is shown by absence of the signal in homozygous serpine3cbg17 knockout fish (Figure 3—figure supplement 1). Serpine3 mRNA is not expressed in bipolar or amacrine cells (Figure 3—figure supplement 2). Nuclei are stained with DAPI (white). Scale bar is 200 µm in (E) and (G) and 20 µm in (F) and (H). INL – inner nuclear layer, ONL – outer nuclear layer, GCL – ganglion cell layer.

Figure 3—source data 1. qPCR analysis of a zebrafish organ series using the normalizer gene actb.
Figure 3—source data 2. qPCR analysis of a zebrafish organ series using the normalizer gene rpl13a.
Figure 3—source data 3. qPCR analysis of zebrafish eye subtissues.
Figure 3—source data 4. qPCR analysis of mouse organs.
Figure 3—source data 5. Data sets supporting expression of SERPINE3 in tissues of the eye in vertebrates other than mouse and zebrafish.

Figure 3.

Figure 3—figure supplement 1. Serpine3 expression co-localizes with glial fibrillary acidic protein in the retina of wild type fish.

Figure 3—figure supplement 1.

The fluorescence in situ signal for zebrafish serpine3 is specific for wild type (WT, yellow), where it is in proximity to staining of the ZRF1-antibody (green). This indicates expression of serpine3 by Müller glia cells. Serpine3 signal is not present in homozygous serpine3cbg17 siblings. In the overlay, nuclei (DAPI) are shown in white. ONL – outer nuclear layer, INL – inner nuclear layer, GCL – ganglion cell layer. Scale bar = 20 µm.
Figure 3—figure supplement 2. Serpine3 expression does not co-localize with markers for bipolar or amacrine cells in the zebrafish retina.

Figure 3—figure supplement 2.

Fluorescence in situ hybridization of serpine3 mRNA (yellow) does not show co-localization with anti-protein kinase C alpha (PKCa encoded by prkca, green) or choline O-acetyltransferase a (CHAT, encoded by chata, green) proteins. In the overlay, nuclei (DAPI) are shown in white. ONL – outer nuclear layer, INL – inner nuclear layer, GCL – ganglion cell layer. Scale bar overview = 20 µm, details = 20 µm.

Figure 4. Serpine3 knockout in zebrafish causes defects in eye shape and retinal layering.

(A) UCSC genome browser visualization of the SERPINE3 genomic locus in human (hg38 assembly, top box) and zebrafish (danRer11 assembly, bottom box) shows that both species have a 1:1 ortholog with the same number of coding exons in a conserved gene order context. In the human locus, two single nucleotide polymorphisms (SNPs) are in linkage with SERPINE3 and associated with eye phenotypes. In zebrafish, we used CRISPR-Cas9 to generate two independent knockout (KO) lines. The position of guide RNAs is indicated as scissors. In the serpine3cbg17 line, we deleted the promoter and first exon. In the serpine3cbg18 line, we introduced a 92 bp frame-shifting deletion in exon 2 (coding exon 1) that results in three early stop codons in the original reading frame. (B) Relative expression of serpine3 mRNA in wild type (WT) zebrafish and serpine3cbg17 individuals quantified by RT-qPCR relative to the expression of rpl13a. Serpine3 mRNA expression is close to zero in serpine3cbg17 fish and significantly reduced in comparison to wild type fish (p=0.049, two-sided unequal variances t-test). Technical replicates of the qPCR are shown as individual data points; different colors represent different biological replicates. Boxplots display first quartile, median and third quartile with whiskers extending to the maximum and minimum of the three biological replicates. (C) In situ hybridization showing that serpine3 is expressed in the inner nuclear layer (INL) of WT zebrafish but not in the homozygous serpine3cbg17. Scale bar = 25 µm. (D) Serpine3 knockout leads to changes in eye shape in adult, homozygous knockout (KO) fish of serpine3cbg17 and serpine3cbg18 lines in comparison to their WT siblings (18 and 11 months, respectively). In WT, the eye shape almost perfectly corresponds to the concave shape of the iris (overlay of white and red dotted lines). In contrast, many KO individuals have alterations in eye shape, evident by notches (arrow heads) in the white line that follows the iris. Scale bar = 1 mm. (E) Iris solidity (ratio of eye shape/ concave eye shape) significantly differs between WT and KO siblings for both the serpine3cbg17 (16 vs 10 eyes) and the serpine3cbg18 (40 vs 40 eyes) line. A Wilcoxon Rank sum test was used. Boxplots display first quartile, median, and third quartile with whiskers extending to the maximum and minimum within 1.5 times interquartile range. Outliers are shown in black. Iris circularity, another quantification measure for the phenotype, is shown in Figure 4—figure supplement 3. (F) Hematoxylin/eosin histology staining of the eye of serpine3cbg17 fish (22 months) reveals histological differences in comparison to their WT siblings (dorsal top, ventral bottom). In comparison to WT, distance between lens and retina of serpine3cbg17 fish is reduced (distance bars). The WT retina (top) has a distinct lamination with clear separation of the single retinal layers (a, b) as shown in the schematic (RPE – retinal pigment epithelium layer, POS – photoreceptor outer segment, ONL – outer nuclear layer, OPL – outer plexiform layer, INL – inner nuclear layer, IPL – inner plexiform layer, GCL – ganglion cell layer, RNFL – retinal nerve fiber layer). Although all retinal layers are present in serpine3cbg17 fish, the layering appears distorted and the density of cells is reduced (c–f). Specifically, the RPE cells display an altered distribution and even local clusters (empty arrows), and displaced pigmented cells emerge in all retinal layers (yellow arrows). This was confirmed also for the cbg18 allele (Figure 4—figure supplement 5). Scale bar in the overviews = 200 µm, scale bar in the magnifications = 20 µm.

Figure 4—source data 1. qPCR analysis of zebrafish serpine3cbg17 homozygotes (KO) vs WT.
Figure 4—source data 2. Macroscopic eye phenotype in serpine3 knockout lines.
Figure 4—source data 3. Single-nucleotide polymorphism near SERPINE3 (13: 51, 341, 032–51, 362,101) are associated with human eye phenotypes.
Figure 4—source data 4. Raw gel images of genotyping of the serpine3cbg17 line with two primers (labeled image is shown in Figure 4 - figure supplement 2A).
Figure 4—source data 5. Raw gel images of genotyping of the serpine3cbg17 line with a mix of three primers (labeled image is shown in Figure 4 - figure supplement 2B).
Figure 4—source data 6. Raw gel images of genotyping of the serpine3cbg18 line with two primers (labeled image is shown in Figure 4 - figure supplement 2C).

Figure 4.

Figure 4—figure supplement 1. Knockout of serpine3cbg17 and serpine3cbg18 alleles in zebrafish by CRISPR-Cas9 is confirmed by sequencing results.

Figure 4—figure supplement 1.

Sequenced reads are mapped against the danRer11 zebrafish genome assembly with Blat and are visualized in the UCSC genome browser (black) together with the serpine3 RefSeq annotation (blue). (A) PCRs with primers For1 and Rev2 or For1 and Rev2.2 (green) amplify the expected regions around the transcription start site on chromosome 9 in wild type (WT_Rev2, WT_Rev2.2). Injection of CRISPR guides 1, 2, and 3 (orange) results in a deletion of about 400 bp for founder individuals 7 and 12 in serpine3cbg17. Offspring of founder 12 (394 bp deletion) was raised and further crossed. The location of a single serpine3 transcription start site is supported by annotation and activating histone marks H3K4me3 within the respective region (lower gray bar with darkness of color correlating with signal intensity). (B) PCR with primers For3 and Rev4 (green) amplifies a region around coding exon1 of serpine3. Injection of CRISPR guides 4, 5, and 6 (orange) results in a deletion of about 100 bp for founders 1, 9, 18, and 19 in serpine3cbg18. For founder 9, this 92 bp deletion induces a frame-shift in the reading frame (bold) with three early stop codons (two shown as *, third stop located in coding exon 2) and offspring was grown. The deletion is equivalent to deletion of 31 amino acids (red) and +1 nt insertion. The amino acid encoded by a split codon is shown in blue.
Figure 4—figure supplement 2. PCRs confirm expected CRISPR-Cas9 induced deletions in serpine3cbg17 and serpine3cbg18 fish.

Figure 4—figure supplement 2.

(A) Genotyping of the serpine3cbg17 with two primers (F0 generation). Injection of CRISPR guides into the one-cell embryo leads to mosaic embryos (several pooled at 72 hpf) with a strong wild type (WT) band at about 557 bp and a weak mutant band at about 161 bp, while uninjected embryos just have a single WT band (557 bp). The raw image is provided as Figure 4—source data 4. (B) Genotyping of serpine3cbg17 with a mix of three primers (F2 generation). Heterozygous animals (H) have two bands, a WT (expected height: 286 bp) and mutant band (expected height: 161 bp), while homozygous mutants (M) and homozygous WT animals show a single band. NC – negative control (water). The raw image is provided as Figure 4—source data 5. (C) Genotyping of serpine3cbg18 with two primers. Heterozygous animals (H) have two bands, a WT (expected height: 342 bp) and mutant band (expected height: 250 bp), while homozygous mutants (M) and homozygous WT animals show a single band. NC – negative control (water). The raw image is provided as Figure 4—source data 6.
Figure 4—figure supplement 3. Iris circularity differs between serpine3cbg17 and serpine3cbg18 fish and their respective wild type siblings (WT).

Figure 4—figure supplement 3.

Iris circularity is another descriptor for the deviation of eye shape and is defined as 4π × [Area]/[Perimeter]2. A value of 1 indicates a perfect circle. Boxplots show that iris circularity significantly differs between WT (n=14) and serpine3cbg17 eyes (n=10). The same holds for the comparison of WT (n=40) and serpine3cbg18 eyes (n=40). A Wilcoxon Rank sum test was used. Boxplots display first quartile, median and third quartile with whiskers extending to the maximum and minimum within 1.5 times inter-quartile range. Outliers are shown in black.
Figure 4—figure supplement 4. The optic nerve is intact in serpine3cbg17 and serpine3cbg18 fish as shown with a hematoxylin/eosin histology staining.

Figure 4—figure supplement 4.

No obvious difference in optic nerve morphology is observed between serpine3cbg17 (A) and serpine3cbg18 fish (B) and their respective wild type (WT) siblings. Scale bar = 200 µm.
Figure 4—figure supplement 5. Hematoxylin/eosin histology staining of serpine3cbg18 eyes (14 months) reveals histological differences in comparison to their wild type (WT) siblings (dorsal top, ventral bottom).

Figure 4—figure supplement 5.

We show details of representative overview images for one eye of each genotype on the right. In comparison to WT (A), distance between lens and retina of serpine3cbg18 fish is reduced (distance bars). In the serpine3cbg18 eye (B), all retinal layers are present and distinguishable, although they are not as tightly packed and clearly separated as in the WT (c–f). Moreover, we observed displaced pigmented cells that are located mainly in the photoreceptor outer segment and the outer nuclear layer yellow arrows, (c–e). Furthermore, the photoreceptor outer segment and RPE layer are not clearly separated in the serpine3cbg18 retina. Scale bar in the overviews represents 200 µm and in the magnifications 20 µm.

To better characterize serpine3 expression in the eye, we next performed RT-qPCR on samples of RPE, retina and eye without retina and RPE (Figure 3C, Figure 3—source data 3). Serpine3 is expressed significantly higher in the retina compared to RPE only (two-sided unequal variances t-test, p=0.01, Figure 3C). Expression is barely detectable in whole eye without retina and RPE (two-sided unequal variances t-test, eye vs. retina, p=0.009, Figure 3C), suggesting that the expression signal obtained from whole eye RT-qPCR mostly originates from expression in retina.

To specify in which cell types serpine3 is expressed, we finally performed in situ hybridization (ISH) on adult zebrafish retinae. Serpine3 expression is detected in the inner nuclear layer throughout the retina with stronger signals in the ventral region close to the optic nerve (Figure 3E and F). Co-staining with different cell type specific markers reveals expression of serpine3 in a fraction of glial fibrillary acidic protein (gfap)-positive Müller glia (MG) cells (Figure 3G and H, Figure 3—figure supplement 1), whereas expression of serpine3 mRNA was not detected in bipolar or amacrine cells (Figure 3—figure supplement 2).

To also explore SERPINE3 expression in mammals, we first used RT-qPCR to analyze Serpine3 expression in mouse. Similar to zebrafish, mouse Serpine3 expression is highest in whole eye, whereas expression was not detectable in colon, cortex, heart, liver, spleen, and testis (Figure 3D, Figure 3—source data 4). Next, we analyzed SERPINE3 expression in other mammals and vertebrates using publicly available expression data sets. Despite sparser data, we found evidence for expression of SERPINE3 in eyes of human, rat, cat, cow and chicken, which suggests a conserved expression pattern and a role in vertebrate vision (Figure 3—source data 5).

In summary, our experiments in zebrafish and mouse together with available data sets of different vertebrates show that SERPINE3 is expressed in the vertebrate eye, specifically in zebrafish in MG cells.

Knockout of serpine3 in zebrafish leads to morphological defects in the eye including the retina

Next, we tested whether serpine3 inactivation results in an eye phenotype. To this end, we deleted the transcription start site of serpine3 with CRISPR-Cas9, denoted as the serpine3cbg17 allele (Figure 4A, Figure 4—figure supplements 1 and 2). Adult homozygous serpine3cbg17 fish were viable and fertile. Using RT-qPCR and ISH, we confirmed that the deletion of the transcription start site abolished serpine3 expression in retinae of adult homozygotes for serpine3cbg17 compared to wild type (WT) siblings (two-sided unequal variances t-test, p=0.049, Figure 4B and C, Figure 4—source data 1).

We found that adult serpine3cbg17 individuals frequently showed notches in the iris of one of the eyes (4 of 5 individuals, white arrows in Figure 4D, Figure 4—source data 2, source data on Dryad), which affected the eye’s overall shape. To confirm that these notch-caused shape deviations are caused by inactivation of serpine3, we generated an independent line, where we introduced an early frameshift in serpine3’s coding exon 1 with CRISPR-Cas9, denoted as the serpine3cbg18 allele (Figure 4A, Figure 4—figure supplements 1 and 2). Serpine3cbg18 fish also showed a high frequency of notches in 13 of 20 individuals (Figure 4—source data 2, source data on Dryad). To quantify this shape deviation, we calculated iris solidity, which compares the area ratio of the eye’s outer shape (white line) and its concave shape (red dotted line in Figure 4D). Eyes of both serpine3cbg17 and serpine3cbg18 individuals had a significantly reduced solidity in comparison to their WT siblings (Wilcoxon rank sum test, p=0.009 and p=6.35e-9, respectively, Figure 4E). Iris circularity, another descriptor of the eye shape deviation, is also significantly reduced in homozygous individuals of both lines (Figure 4—figure supplement 3). Together this indicates that mutations in serpine3 cause eye shape deviations in zebrafish.

We next investigated the ocular morphology of adult fish of both lines and their WT siblings using hematoxylin/eosin sections (Figure 4F; Figure 4—figure supplements 4 and 5). While ocular structure and optic nerves of all inspected eyes are largely normal (Figure 4—figure supplement 4), the distance between retina and lens is reduced in homozygous fish of both alleles (Figure 4F, Figure 4—figure supplement 5). A detailed inspection of the retinal organization revealed that although all retinal layers were present, the mutant retinae are generally less organized and structured compared to WT siblings. Retinal cells appear reduced in number and less densely packed. Most prominently, we noticed that rod outer segments and the pigmented RPE cells, which were aligned in WT fish, were not clearly separated in homozygous fish of both alleles (Figure 4Fc-e, Figure 4—figure supplement 5). Furthermore, we observed large clusters of pigmented cells in the photoreceptor layer (empty arrows, Figure 4Fc-e) as well as single displaced pigmented cells in all retinal layers (yellow arrows, Figure 4Fc-e). Similar alterations in retinal structure were detected in homozygous serpine3cbg18 fish (Figure 4—figure supplement 5). This shows that KO of serpine3 in zebrafish results in morphological defects in the eye, characterized by differences in eye shape and retinal organization.

Polymorphisms near human SERPINE3 are associated with human eye phenotypes

We next analyzed recently published Genome Wide Association Study data for human single nucleotide polymorphisms (SNPs) associated with eye-related traits. This analysis revealed two such SNPs that are in linkage disequilibrium with SERPINE3 (Figure 4A). rs1028727 is located ~10 kb upstream of the SERPINE3 transcription start site and is associated with a decreased area of the optic nerve head (Bonnemaijer et al., 2019). rs7327381 is located ~97 kb downstream of the SERPINE3 transcription start site and is associated with an increase in corneal curvature (Fan et al., 2020). This suggests that human SNPs linked to SERPINE3 are associated with eye phenotypes, supporting a putative ocular function of human SERPINE3.

Discussion

Our study combines comparative genomics to predict genes having vision-related functions and experiments in zebrafish to confirm this prediction for a top-ranked candidate, the uncharacterized SERPINE3 gene. By conducting a genome-wide screen for genes that are preferentially lost in mammals with low visual acuity values, we uncovered both known vision-related genes as well as several genes that have no known eye-related function. One of the top-ranked candidates is SERPINE3, which we found to be independently lost at least 18 times in mammalian evolution, preferentially in species that do not use vision as the primary sense. For mouse and zebrafish, we show that the highest SERPINE3 expression is in the eye, which is corroborated by available expression data of other vertebrates. By generating the zebrafish serpine3cbg17 and serpine3cbg18 knockout (KO) lines, we show that inactivation of this gene results in abnormal eye shape and retinal lamination, revealing an eye-related function for serpine3. This is further supported by SERPINE3-linked polymorphisms that are associated with eye-related traits in human.

In zebrafish, serpine3 is expressed in a fraction of Müller glia (MG) cells. MG are the major retinal macroglia and perform numerous functions. By removing waste products and secreting (neuro)trophic substances and signaling molecules, they maintain the blood-retinal barrier and regulate vascularization (Reichenbach and Bringmann, 2013). Most importantly, MGs are essential for the long-term viability of photoreceptors and other neuronal cell types (Bringmann et al., 2009). Our serpine3 KO fish exhibit a disorganization of RPE cells, a phenotype that resembles those previously observed in experiments that perturb MG function. For example, selective ablation of MG in adult mice results in eye defects including aggregation of RPE cells and displacement of pigment granules in the ganglion cell layer (Shen et al., 2012). Interestingly, we also observed that serpine3 KO affects eye shape, which cannot be readily explained by a direct MG-mediated effect. However, MG span the entire retina, connecting the extracellular space of retinal neurons, the vitreous and the capillaries at the apical retina (Reichenbach and Bringmann, 2013; Klimczak et al., 2009). It is therefore possible that a secreted protein, as predicted for serpine3 based on its conserved signal peptide, may also affect other eye subtissues. Finally, in zebrafish, MG are able to regenerate retinal neurons upon injury. However, serpine3 does not seem to be involved in this process as during stress-induced regeneration, it is upregulated in resting MG that do not proliferate (Hoang et al., 2020).

In mammals, the SERPINE3 gene loss pattern and expression profile in several species also support an eye-related function; however, the cell type expression pattern may differ between mammals and zebrafish. While our experiments in zebrafish show serpine3 expression in MG, which is in agreement with Hoang et al., 2020, in mouse and human, SERPINE3 seems to be expressed in RPE (Hoang et al., 2020; Hadziahmetovic et al., 2012; Cowan et al., 2020) indicating that cell type specificity may differ between mammals and fish. As SERPINE3 is likely a secreted extracellular protein, it is possible that it has a similar function in mammals and zebrafish, despite secretion by different cell types. Whether this is the case or whether SERPINE3 function and protein expression pattern in the mammalian eye differs remains to be explored in future studies. At the molecular level, this question may be addressed by investigating whether the molecular targets of SERPINE3 are conserved among vertebrates.

Of particular interest is elucidating the functional role of SERPINE3 in human. Several pieces of evidence indicate a potential role of this gene in anti-inflammatory processes and retinal survival. SERPINE3 is upregulated in human patients with age-related macular degeneration (Newman et al., 2012), a progressive eye disease that is linked to chronic inflammation and wound healing. Consistent with a role in retinal survival, mouse Serpine3 is upregulated after experimental overexpression of neurotrophin-4, a neuroprotective factor that promotes retinal survival (Machalińska et al., 2013). Furthermore, SERPINE3 is a hallmark gene of differentiated, healthy human RPE cells (Radeke et al., 2015; Boles et al., 2020; Sripathi et al., 2021) that also have neurotrophic functions in the retina. It is thus conceivable that perturbation of proper SERPINE3 expression or function may influence age-related diseases or human eye phenotypes, as indicated by human polymorphisms that are linked to SERPINE3.

In addition to SERPINE3, our screen also revealed other candidate genes for which unknown eye-related functions are plausible. The LACTBL1 gene encodes a putative serine proteinase (Liobikas et al., 2006) and has an expression pattern similar to retina phototransduction genes (Uhlen et al., 2010). Furthermore, 20 kb upstream of LACTBL1 is a linked SNP (rs10158878) that is associated with refractive error in human (GWAS catalog, Hysi et al., 2020). Another uncharacterized candidate gene with a strong visual acuity associated loss pattern is SAPCD1. A missense mutation (rs6905572) within SAPCD1 is associated with macular degeneration (dbSNP, Chen et al., 2010). The LRIT2 gene is linked to SNPs (rs12217769 and rs745480) that are associated with macular thickness and refractive error in human. Whereas this gene was not functionally characterized at the time our screen was conducted, a recent study showed that lrit2 knockdown in zebrafish led to a reduction in eye size (Chiang et al., 2020). Finally, a high-throughput screen in mouse found that Pkd1l2 KO leads to abnormalities in lens morphology (Dickinson et al., 2016), but no follow-up experiments exist. A potential role of this gene in human disease is also indicated by an intronic SNP (rs873693), which is associated with age onset of myopia (GWAS catalog, Hysi et al., 2020; Tedja et al., 2018). Thus, in addition to SERPINE3, for which we performed an initial functional characterization in zebrafish, our screen uncovered other promising, uncharacterized candidates that may have an eye-related function. Overall, this highlights the potential of comparative genomics to shed light on the functional roles of less characterized genes and to help to further identify human disease-causing genes (Meadows and Lindblad-Toh, 2017).

Materials and methods

Visual acuity values

We used publicly available visual acuity measurements (Veilleux and Kirk, 2014; Kemp and Kirk, 2014; Figure 1—source data 1 lists all primary references) to classify placental mammals into visual high-acuity and low-acuity species. Visual acuity can be measured by either behavioral experiments or calculated from the eye axial diameter, the peak ganglion cell density and a correction factor for diel activity (Veilleux and Kirk, 2014). For species for which both measures were available, we used the behavioral visual acuity as this measure is more accurate (Veilleux and Kirk, 2014). For species that lack visual acuity data, we used available visual acuity measurements of closely related species of the same genus or family (Figure 1—source data 1). Three subterranean mammals lack available visual acuity measurements (cape golden mole, star-nosed mole, blind mole rat) but exhibit highly degenerated eyes. We therefore assumed a visual acuity of zero. In total, visual acuity values were obtained for 49 placental mammals that were included in a previously generated whole genome alignment (Sharma and Hiller, 2017). Using a visual acuity threshold of one, we considered ten mammals (cape golden mole, naked mole rat, star-nosed mole, blind mole rat, big brown bat, little brown bat, David’s myotis bat, mouse, prairie vole, deer mouse) representing seven lineages, as low-acuity vision species. All other 39 species with visual acuity greater than one were considered as high-acuity vision species.

Forward genomics screen

To screen for genes that are preferentially lost in low-acuity placental mammals, we used a previously generated data set of inactivated genes (Sharma et al., 2018). Briefly, gene losses were detected with a pipeline that searches for gene-inactivating mutations and performs a number of filtering steps to distinguish between real mutations and artifacts related to genome assembly or alignment issues and exon-intron structure changes. A genome alignment with human (hg38 assembly) as the reference (Sharma and Hiller, 2017), human genes annotated by Ensembl (version 87) (Aken et al., 2017) and principal isoforms from the APPRIS database (Rodriguez et al., 2018) were used as input. Based on the relative positions of inactivating mutations, a value measuring the maximum percent of the reading frame that remains intact (%intact) was computed for each gene and species (Figure 1—source data 8). A gene was classified as lost if %intact was <60%. A gene was classified as intact if %intact was ≥90%.

Using the human Ensembl version 87 gene annotation, we excluded protein-coding genes that have missing data due to assembly incompleteness for more than 50% of low- or high-acuity species and genes that had identical %intact values across species, since such genes cannot show associations with lower visual acuity. This resulted in 13,172 genes. Figure 1—figure supplement 1 provides a flowchart.

To search for genes that tend to have lower %intact values in the low-acuity group, we applied the Forward Genomics approach (Prudent et al., 2016; Sharma et al., 2018) with a phenotype definition based on visual acuity to these 13,172 genes. We used phylogenetic generalized least squares (Freckleton et al., 2002) to account for phylogenetic relatedness and ranked genes by the Benjamini-Hochberg corrected p-value (FDR <0.05). The list of p-values for all genes is provided in Figure 1—source data 2. The resulting set of 68 genes is enriched in the GO-terms ‘visual perception (GO:0007601)’ and ‘sensory perception of light stimulus (GO:0050953)’ (Figure 1—source data 9A). Finally, to obtain candidates that are convergently lost, we required that a gene was lost in at least three of the seven independent low-acuity lineages. This filtering step removed 37 genes and these genes are not enriched in any GO-term (Figure 1—source data 9B). Of the remaining 31 genes, two genes are no longer contained in the latest Ensembl annotation (version 106) and were therefore dropped. The other 29 genes are presented in Figure 1 and considered in the enrichment analysis (below).

To test whether the identification of SERPINE3, which we experimentally investigated, is robust to the selected visual acuity threshold, we re-ran the screen after increasing or decreasing the threshold while keeping other parameters constant. Using a more inclusive definition of low-acuity species by increasing the visual acuity threshold to 1.5, which additionally considered manatee, the two flying foxes and rats as low-acuity species, identified a total of seven genes at an FDR of 0.05, with SERPINE3 at the first rank (Figure 1—source data 5). Using a more restrictive definition of low-acuity species by decreasing the visual acuity threshold to 0.5 identified 31 genes at an FDR of 0.05 with SERPINE3 at rank 8 (Figure 1—source data 6).

In addition to using phenotype definitions based on visual acuity values, we also tested a phenotype definition based on a molecular signature of losses of known vision-related genes. To this end, we ranked all 49 species based on their number of lost genes annotated with the GO-term ‘visual perception’ (total of 93 genes, Figure 1—source data 1). The set of ten species that have lost five or more visual perception genes is similar to the set of low-acuity species used in the original screen. The differences are that mouse and deer mouse are substituted by manatee and goat. Re-running the screen based on this molecular loss pattern of vision genes, retrieved SERPINE3 at rank 5 (Figure 1—source data 7).

As a control to ensure that a Forward Genomics screen does not always retrieve vision-related genes, we ran a new screen, searching for genes preferentially lost in high-acuity sister species (elephant, rhinoceros, horse, two flying foxes, guinea pig, degu, squirrel) of the low-acuity mammals that we used in the original screen. All other species including the other high-acuity mammals were then treated as background (Figure 1—source data 4).

Enrichment analysis

Enrichment analysis was performed using the Enrichr web service (Kuleshov et al., 2016), which uses a two-sided Fisher’s exact test and corrects for multiple testing with the Benjamini-Hochberg method.

Investigating and validating SERPINE3 losses in additional genomes

To explore conservation and loss of SERPINE3 in additional mammalian genomes that became available since our initial screen, we used the TOGA method (Tool to infer Orthologs from Genome Alignments; Kirilenko et al., titled 'Integrating gene annotation with orthology inference at scale', under review). TOGA uses pairwise alignments between a reference (here human hg38) and a query genome, infers orthologous loci of a gene with a machine learning approach, and uses CESAR 2.0 (Sharma et al., 2017) to align the exons of the reference gene to the orthologous locus in the query. TOGA then classifies each transcript by determining whether the central 80% of the transcript’s coding sequence encodes an intact reading frame (classified as intact) or exhibits at least one gene-inactivating mutation (classified as potentially lost). If less than 50% of the coding sequence is present in the assembly, the transcript is classified as missing.

Focusing on the evolutionarily conserved human SERPINE3 Ensembl (version 104) transcript ENST00000524365 (Howe et al., 2021), we analyzed the TOGA transcript classification for 418 assemblies that were not used in the initial screen (Figure 2—source data 1). These assemblies represent 381 new placental mammal species. For species, where TOGA classified the SERPINE3 transcript as missing, we inspected the orthologous alignment chain to distinguish intact and lost orthologs from truly missing orthologs due to assembly gaps. Since assembly base errors can mimic false gene losses (Hecker et al., 2019b; Sharma et al., 2020), we validated inactivating mutations via two approaches. We considered a mutation as real if it is either shared with an independent assembly of the same or a closely-related species, or if alignments of raw DNA sequencing reads support the mutation. Support by raw DNA reads was assessed by aligning the genomic sequence around the mutation against the NCBI short read archive (SRA queried via NCBI megablast) (Leinonen, 2011; Figure 2—source data 2). With the exception of okapi, steenbok and fox, where SERPINE3 does not evolve under relaxed selection (Figure 2—source data 4), we considered a gene loss as real if there is at least one validated inactivating mutation. We did not validate SERPINE3 mutations in the genomes of cyclops roundleaf bat and Stoliczka’s trident bat, as both species have each eight mutations across several exons and their PacBio HiFi read-based assemblies have very high base accuracies (QV >60, indicating less than one base error per 1 Mb). In order to map SERPINE3 loss events on the phylogenetic tree, we searched for gene-inactivating mutations that are shared among phylogenetically related species, where parsimony indicates that these mutations and thus gene loss likely occurred in their common ancestor. Those mutations are shown in boxes in Figure 2—figure supplements 28, all other mutations are the output of CESAR.

To explore if SERPINE3 remnants are still expressed, we mined expression for eleven SERPINE3-loss species by blasting the TOGA transcript annotation against publicly-available RNA-Seq data of eye or brain tissue. We counted the number of reads that have at least 95% identity (Figure 2—source data 3).

Selection analysis

For species with an unclear SERPINE3 loss status (mole vole, steenbok, okapi, fox, Steller’s sea cow) and species or clades that have an intact SERPINE3 but many close relatives have lost the gene (greater sac-winged bat, European hedgehog, elephants), we tested whether SERPINE3 evolves under relaxed selection. To this end, we used RELAX from the HyPhy suite (Wertheim et al., 2015) to test whether selection pressure was relaxed (selection intensity parameter K<1) or intensified (K>1) in this species or clade, which we labeled as foreground (Figure 2—source data 4). We restricted the analysis to 327 SERPINE3 that are intact and complete (middle 80% of the coding sequence present) and treated those as background. Codon sequences were obtained from TOGA and aligned with MACSE v2.0 (Ranwez et al., 2018). This procedure was repeated using only one foreground species/clade at the time. The species tree used for the analysis is provided as Figure 2—source data 4.

Protein sequence analysis and structure prediction

Signal peptides and the cellular location were predicted with the SignalP 5.0 webserver (Almagro Armenteros et al., 2019) and DeepLoc 1 (Almagro Armenteros et al., 2017), respectively, for all intact and complete SERPINE3 protein sequences as defined above. Protein sequences were aligned with muscle (Edgar, 2004) and visualized with Jalview (Waterhouse et al., 2009; Supplementary file 1).

The three-dimensional structure of human SERPINE3 was retrieved from the AlphaFold2 web server (Jumper et al., 2021; Supplementary file 2). We calculated the root mean square distance (RMSD) to all homologous chains of existing crystal structures of close serpin relatives in native state (Supplementary file 3) after structural alignment in PyMOL (Schrödinger and DeLano, 2020). For each crystal structure, we averaged the RMSD for all chains.

Mammalian SERPINE3 have features of inhibitory, secreted SERPINs

Mammalian SERPINE3 carries an N-terminal signal peptide (Figure 2—figure supplement 9) that is predicted to lead to their secretion into the extracellular space in 96% of all analyzed intact and complete SERPINE3. SignalP 5.0 did not predict the presence of a signal peptide for three species (probability<50%): fat dormouse, black flying fox and puma.

Sequence analysis of intact and complete SERPINE3s revealed that two key features of inhibitory serpins are well conserved among placental mammals (sequence alignment is provided as Supplementary file 1). First, the substrate determining residues P4-P4’ are conserved (positions 366–372 in human SERPINE3), whereby P1 denotes the substrate binding scissile bond, position 369 (Figure 2—figure supplement 10; Khan et al., 2011). This position is occupied by an arginine in SERPINE3 as is the case in the inhibitory SERPINE1 and SERPINE2 proteins. Second, the close-by hinge region (positions 355–361) is mostly occupied by small amino acids without prolines. This may allow the insertion of the hinge region into the A beta sheet, a key feature of serpins’ inhibitory mechanism (Simonovic et al., 2001).

Furthermore, AlphaFold2 (Jumper et al., 2021) predicted a three-dimensional structure of the human SERPINE3 (Supplementary file 2) that is very similar to other native serpins (mean RMSD to native structures 1.6A, Supplementary file 3) and adopts the native fold of serpins with an exposed, disordered reactive core loop for substrate binding that does not seem to adopt an alpha-helical conformation as in the non-inhibitory ovalbumin (Stein et al., 1991). Taken together, this suggests that SERPINE3 functions as a secreted serine protease inhibitor.

Mining gene function, expression, and genetic variation sources

Information on the function of genes discovered in our screen was obtained from GeneCards database (Stelzer et al., 2016), UniProt (UniProtConsortium, 2021), Ensembl (Howe et al., 2021), Proteomics DB (Samaras et al., 2020), the Human protein atlas (Uhlén et al., 2019), the Expression atlas and Single cell expression atlas (Papatheodorou et al., 2020). Expression in human eye for each candidate gene were obtained by averaging expression over all healthy, primary RNA-Seq data sets per tissue (cornea, RPE, retina) provided by the eyeIntegration database (Bryan et al., 2018). Cell lines were not included in the average. A gene was considered to be expressed if the Transcripts Per Million (TPM) value was >100 in cornea, RPE, or retina. Expression of SERPINE3 was further assessed by retrieving primary data sets from FantomCat (Hon et al., 2017), GEO profiles (Barrett et al., 2013), and Bgee (Bastian et al., 2021). Figure 3—source data 5 provides the list of all data sets.

Phenotype associations of SNPs located in loci of interest were investigated based on the GWAS catalog (MacArthur et al., 2017), dbSNP (Sherry et al., 2001) and PheGenI (Ramos et al., 2014). Linkage of SNPs with a candidate gene in 30 human populations was investigated based on the GWAS catalog. To evaluate possible functional consequences of the respective SNPs, we overlapped their (projected) coordinates with regulatory elements from ENCODE for human (hg38) and mouse (mm10) via the web-based server SCREEN v. 2020–10 (Moore et al., 2020). Additionally, we investigated eye- and retina-associated regulatory elements in the Ensembl and UCSC genome browsers (Lee et al., 2022; Howe et al., 2021).

Animal husbandry

Adult zebrafish (Danio rerio, AB line) were maintained at 26.5 °C with a 10/14 hr dark/light cycle (Brand and Granato, 2002). Embryos and larvae were raised at 28.5 °C in the dark until six days old. For phenotyping, we used adult fish of both serpine3 KO lines (serpine3cbg17:19 months, serpine3cbg18: 11 months) generated in this study as well as their WT siblings. WT mice (Mus musculus, C57BL/6JOlaHsd line) were maintained in a barrier system at 20°C–24°C with a 12/12 hr dark/light cycle.

Expression analysis by RT-qPCR

Adult zebrafish >12 months were sacrificed by rapid cooling after anesthesia with MESAB. Adult mice (2 months, male) were sacrificed by cranial dislocation after carbon dioxide anesthesia. Tissues were dissected in ice-cold phosphate buffer (PBS), frozen in liquid nitrogen and stored at –80 °C until further use. RNA was extracted from lysed, homogenized tissue with RNeasy mini or midi kits (Qiagen) according to manufacturer’s instructions and reprecipitated if necessary. The RNA Integrity Number (RIN) was >7 for all tissues except spleen (RIN >6). Intact total RNA was reverse transcribed into cDNA using the ProtoScript II First Strand cDNA Synthesis Kit (NEB) according to manufacturer’s instructions with random primers. RT-qPCR was performed after addition of SybrGreen (Roche). Expression relative to a normalization gene was calculated from Ct values according to the efficiency and delta delta Ct method. Specifically, relative ratios of Serpine3/serpine3 expression (Zf_serp_F: GAGACCCAAAACCTGCCCTT, Zf_serp_R: AGCCGGAAATGACCGATATTGA, Mm_serp_F: TGGAGCTTTCAGAGGAGGGTA, Mm_serp_R: GATACTGAAGACAAACCCTGTGC) were obtained by using rpl13a (Zf_rpl13a_F: TCTGGAGGAACTGTAAGAGGTATGC, Zf_rpl13a_R: AGACGCACAATCTTGAGAGCGA) or actb (Zf_actb_F: CGAGCAGGAGATGGGAACC, Zf_actb_R: CAACGGAAACGCTCATTGC) as the reference gene for zebrafish and Rpl27 as the reference gene for mouse (Mm_Rpl27_F: TTGAGGAGCGATACAAGACAGG, Mm_Rpl27_R: CCCAGTCTCTTCCCACACAAA). At least three biological replicates per sample group were analyzed, which were each represented by the average normalized relative ratio of three to six technical replicates. For zebrafish eye with actb as the normalizer, one biological replicate was an expression outlier for eye and liver, defined as exceeding the third quartile by more than three times the interquartile distance (Q3 +3*interquartile distance). These two data points were excluded from Figure 3A but are contained in Figure 3—source data 1.

ISH and FISH

For ISH and immunostainings, fish were sacrificed, eyes dissected and fixed in 4% paraformaldehyde/ 0.1 M PBS after removal of the lens. Eyes were embedded in gelatin/sucrose and sectioned (14 µm) with a cryostat. The serpine3 in situ probe spans the coding exons 3–8 (transcript: ENSDART00000132915.2). Using primers with restriction enzyme cut sites (forward, Not1: TAAGCA GC GGCCGCGTAAAAGTGCCCATGATGTACCAG, reverse, BamHI: TAAGCA G GATCC ACAACTCGACCTATAAACAGCAAC), serpine3 cDNA was amplified from total cDNA and cloned into the pCRII-topo vector (Invitrogen). The antisense probes were transcribed with SP6 polymerase, using a DIG-labeled NTP mix (Roche diagnostics). The (fluorescent) ISH were conducted as previously described with minor modifications (de Oliveira-Carlos et al., 2013): Hybridization and washing steps were performed at 60 °C. For chromogenic in situs, sections were incubated with anti-digoxigenin-AP (Roche), diluted 1:4000 in DIG-blocking reagent (Roche) at 4 °C overnight and subsequently developed with NBT/BCIP (Roche). For FISH, sections were washed in PBS immediately after quenching. Sections were blocked for 1 hr with 2% blocking reagent in MABT (Perkin-Elmer) and then incubated with anti-digoxigenin-POD (Roche), diluted 1:500. The signal was detected with the TSA Plus Cy3/Cy5 kit (Perkin-Elmer).

Immunohistochemistry

Immunostainings for glial fibrillary acidic protein (ZRF1 from DSHB, 1:200), choline O-acetyltransferase (AB144P from Millipore, 1:500) and protein kinase C alpha (SC-208 from Santa-Cruz, 1:500) were performed after completion of the FISH protocol according to de Oliveira-Carlos et al., 2013. For chat, antigens were retrieved in preheated 10 mM sodium citrate buffer for 6’ at 85 °C. Sections were washed in PBS and 0.3% PBSTx prior to primary antibody incubation. Following the protocol of de Oliveira-Carlos et al., 2013, we washed the sections three times in PBSTx, and incubated in anti-goat or anti-rabbit IgG (H+L) Alexa 488-conjugated secondary antibodies (Invitrogen, 1:750).

Generation of KO lines and genotyping

To generate serpine3 KO lines, deletions were introduced using the CRISPR-Cas9 system. Guides were chosen considering efficiency predictions of the IDT DNA CRISPR-Cas9 guide checker and ChopChopV2 (Labun et al., 2016) on the zebrafish assembly danRer7 and ordered as Alt-R CRISPR-Cas crRNA from IDT DNA. For each line, three guides were simultaneously injected into one-cell stage zebrafish embryos as ribonucleoprotein delivery using the Alt-R CRISPR-Cas system following the manufacturer’s instructions (0.5 fmol crRNA per embryo per guide, 0.68 ng Cas9 protein per embryo). The expected deletions were confirmed by Sanger sequencing in several founder individuals, one of which was chosen as the founder of each line (Figure 4—figure supplement 1). Heterozygous cbg17 and cbg18 zebrafish were further outcrossed to WT fish for several generations and then bred to homozygosity.

More specifically, for serpine3cbg17, we abolished serpine3 transcription by deleting the single transcription start site, which is supported by activating histone marks in zebrafish and is also well conserved in human and mouse (Figure 4—figure supplement 1A), using the following guides: GGTATTTGTACTCTAATGAA (guide1), TGTACTCTAATGAAAGGAAC (guide2), CTCACACAGGACAATCCGGCAGG (guide3). For genotyping, we used primers (For1: 5-GAAATCGCATGTCACGCAGAAAT-3, Rev2: 5-ATATCGGAACTGACATACTGAACG-3, Rev2.2: 5- GTGAGCTTCGTGTTTGTGGT-3) to amplify a region around the transcription start site (Figure 4—figure supplement 2A, B, Figure 4—source data 4 and 5).

Serpine3cbg18 was generated by introducing a frame-shifting deletion in coding exon 1, which presumably results in three early stop codons when the transcript is translated. The following guides were used: TCTTCTGCAACTCGGGGCCA (guide4), TCTCTGTGAGCGTCTGGTAG (guide5), AACACTCTGGTTCAGCTCGA (guide6) (Figure 4—figure supplement 1B). We genotyped fish by amplifying a region around coding exon1 with the following primers: For3: 5-GGCATTGTTGAGATTCAGTAGTCA-3, Rev4: 5-CAGTTTACTCCTACCATTGACATC-3 (Figure 4—figure supplement 2C, Figure 4—source data 6).

Histology

For hematoxylin/eosin stainings, fish were sacrificed and heads were fixed overnight at 4 °C in 4% paraformaldehyde/ 0.1 M PBS and decalcified in 0.5 M EDTA in 0.1 M PBS for 3–4 days. Next, they were processed in a Paraffin-Infiltration-Processor (STP 420, Zeiss) according to the following program: ddH20: 1×1′; 50% ethanol (EtOH) 1×5′; 70% EtOH 1×10′; 96% EtOH 1×25′; 96% EtOH 2×20′; 100% EtOH 2×20′; xylene 2×20′; paraffin 3×40′/60 °C; paraffin 1×60′/60 °C. The heads were embedded in paraffin using the Embedding Center EG1160 (Leica). Semi-thin sections (2 µm) were cut on an Ultracut microtome (Mikrom) and counterstained using hematoxylin/eosin (HE, Sigma).

Microscopy, image processing, and analysis

Imaging was performed using the ZEISS Axio Imager.Z1 provided by the CMCB Light Microscopy Facility. The images were processed in Fiji/ImageJ version 2.1.0 (Schindelin et al., 2012) (macroscopic eye images) and Adobe Illustrator. For macroscopic phenotyping, eyes were imaged with a Leica stereo microscope M165C. To parameterize the eye shape, the eye outline was first approximated by an oval and then manually corrected if necessary. Particle parameters of the final eye object were measured automatically in Fiji. The statistical analysis and visualization were conducted in R version 4.1.0 (2021-05-18) using the packages ggplot2 (Wickham, 2016), tseries (Trapletti and Hornik, 2021) and effsize. Outliers were not excluded. Comparing WT and KO individuals, we tested whether both genotypes have the same iris shape (estimated by iris solidity and circularity) using the Wilcoxon rank sum test after rejecting normality of the variables with a Jarque Bera Test. Both eyes of the same individual were treated as individual biological replicates, since we observed shape deviations often only in one eye of the same individual (Figure 4—source data 2).

Animal licenses

All experiments in mouse and zebrafish were performed in accordance with the German animal welfare legislation. Protocols were approved by the Institutional Animal Welfare Officer (Tierschutzbeauftragter), and licensed by the regional Ethical Commission for Animal Experimentation (Landesdirektion Sachsen, Germany; license no. DD24-5131/354/11, DD24.1-5131/451/8, DD24-5131/346/11, DD24-5131/346/12).

Acknowledgements

We thank the genomics community for sequencing and assembling the genomes and the UCSC genome browser group for providing software and genome annotations. Experimental work would not have been possible (or as pleasant) without supporting hosting labs, especially Nadine Vastenhouw, Elisabeth Knust and Wieland Huttner. We also thank Nadine Vastenhouw and her whole lab, Michael Heide and Mauricio Rocha as well as current and former members of the Hiller lab for helpful scientific discussion and comments on the manuscript. We thank the following facilities of MPI-CBG: Biomedical Services (especially fish unit), Cell technologies (Julia Jarrells), Sequencing and genotyping (Sylke Winkler), Light Microscopy, Scientific Computing, Computer Service Facilities as well as the Computer Service Facilities of MPI-PKS and the CMCB Histology (Susanne Weiche) and CMCB Light Microscopy Facility for their support. Work of HI and MH was supported by an exploration grant from the Boehringer Ingelheim Stiftung, the Max Planck Society and the LOEWE Centre for Translational Biodiversity Genomics (TBG) funded by the Hessen State Ministry of Higher Education, Research and the Arts (HMWK). Work by JH, AM, SH and MB was supported by project grants of the German Research Foundation (Deutsche Forschungsgemeinschaft, project numbers BR 1746/3 and BR 1746/6) and an ERC advanced grant (Zf-BrainReg) to MB. This study was furthermore supported with a PhD scholarship from Studienstiftung des deutschen Volkes to JH.

Appendix 1

Appendix 1—key resources table.

Reagent type (species) or resource Designation Source or reference Identifiers Additional information
gene (Danio rerio) serpine3 Ensembl ENSDART00000132915.2
gene (Homo sapiens) SERPINE3 Ensembl ENST00000524365
strain, strain background (Danio rerio) WT (AB) Zebrafish International Resource Center
strain, strain background (Mus musculus) WT (C57BL/6JOlaHsd) Envigio
genetic reagent (Danio rerio) cbg17 This paper See Materials and Methods “Generation of KO lines and genotyping”
genetic reagent (Danio rerio) cbg18 This paper See Materials and Methods “Generation of KO lines and genotyping”
Antibody Anti-digoxigenin-AP (polyclonal sheep) Sigma-Aldrich Cat. #: 11093274910, RRID: AB_2734716 IF (1:4000)
Antibody Anti-digoxigenin-POD conjugated (polyclonal sheep) Roche Cat. #: 11207733910, RRID: AB_514500 IF (1:500)
Antibody Anti-gfap ZRF-1 (monoclonal mouse) Zebrafish International Resource Center Cat. #: zrf-1, RRID: AB_10013806 IF (1:200)
antibody Anti-choline acetyltransferase (polyclonal goat) Sigma-Aldrich Millipore Cat. #: AB144P, RRID: AB_2079751 IF (1:500)
antibody Anti-protein kinase C alpha (polyclonal rabbit) Santa Cruz Biotechnology Cat. #: sc-208, RRID: AB_2168668 IF (1:500)
antibody Anti-goat lgG (H+L) Alexa 488 (polyclonal chicken) Invitrogen Cat. #: A21467 IF (1:750)
antibody Anti-rabbit lgG (H+L) Alexa 488 (polyclonal goat) Invitrogen Cat. #: A11034 IF (1:750)
recombinant DNA reagent serpine3-pCRII-topo vector This paper See Materials and Methods “Generation of KO lines and genotyping”
sequence-based reagent Zf_serp_F This paper qPCR primer GAGACCCAAAACCTGCCCTT
sequence-based reagent Zf_serp_R This paper qPCR primer AGCCGGAAATGACCGATATTGA
sequence-based reagent Mm_serp_F This paper qPCR primer TGGAGCTTTCAGAGGAGGGTA
sequence-based reagent Mm_serp_R This paper qPCR primer GATACTGAAGACAAACCCTGTGC
sequence-based reagent Zf_rpl13a_F https://doi.org/10.1111/j.1745-7270.2007.00283.x qPCR primer TCTGGAGGAACTGTAAGAGGTATGC
sequence-based reagent Zf_rpl13a_R https://doi.org/10.1111.i.1745-7270.2007.00283.x qPCR primer AGACGCACAATCTTGAGAGCGA
sequence-based reagent Zf_actb_F https://doi.org/10.1007/s00441-020-03318-2 qPCR primer CGAGCAGGAGATGGGAACC
sequence-based reagent Zf_actb_R https://doi.org/10.1007/s00441-020-03318-2 qPCR primer CAACGGAAACGCTCATTGC
sequence-based reagent Mm_Rpl27_F This paper qPCR primer TTGAGGAGCGATACAAGACAGG
sequence-based reagent Mm_Rpl27_R This paper qPCR primer CCCAGTCTCTTCCCACACAAA
sequence-based reagent Guide1 This paper Crispr guide GGTATTTGTACTCTAATGAA
sequence-based reagent Guide2 This paper Crispr guide TGTACTCTAATGAAAGGAAC
sequence-based reagent Guide3 This paper Crispr guide CTCACACAGGACAATCCGGCAGG
sequence-based reagent Guide4 This paper Crispr guide TCTTCTGCAACTCGGGGCCA
sequence-based reagent Guide5 This paper Crispr guide TCTCTGTGAGCGTCTGGTAG
sequence-based reagent Guide6 This paper Crispr guide AACACTCTGGTTCAGCTCGA
sequence-based reagent For1 This paper Genotyping primer GAAATCGCATGTCACGCAGAAAT
sequence-based reagent Rev2 This paper Genotyping primer ATATCGGAACTGACATACTGAACG
sequence-based reagent Rev2.2 This paper Genotyping primer GTGAGCTTCGTGTTTGTGGT
sequence-based reagent For3 This paper Genotyping primer GGCATTGTTGAGATTCAGTAGTCA
sequence-based reagent Rev4 This paper Genotyping primer CAGTTTACTCCTACCATTGACATC
peptide, recombinant protein Alt-R CRISPR-Cas 9 Nuclease IDT Cat. #: 10000735
commercial assay or kit RNeasy mini kit Qiagen Cat. #: 74,106
commercial assay or kit RNeasy midi kit Qiagen Cat. #: 75,144
commercial assay or kit ProtoScript II First Strand Synthesis kit NEB Cat. #: E6560S
commercial assay or kit TSA Plus Cy3/Cy5 kit Perkin-Elmer Cat. #: NEL744001KT
commercial assay or kit Alt-R CRISPR-Cas9 tracr RNA IDT Cat. #: 1072532
software, algorithm Enrichr https://doi.org/10.1093/nar/gkw377 RRID:SCR_001575
software, algorithm TOGA Kirilenko et al., titled 'Integrating gene annotation with orthology inference at scale', under review (https://github.com/hillerlab/TOGA; Kirilenko and Hiller, 2022)
software, algorithm CESAR 2.0 https://doi.org/10.1093/bioinformatics/btx527
software, algorithm RELAX https://doi.org/10.1093/molbev/msu400 RRID:SCR_016162
software, algorithm MACSE v2.0 https://doi.org/10.1093/molbev/msy159
software, algorithm Jalview https://doi.org/10.1093/molbev/msy159 RRID:SCR_006459
software, algorithm Human protein atlas https://doi.org/10.1038/nbt1210-1248 RRID:SCR_006710
software, algorithm Gene Expression Atlas https://doi.org/10.1093/nar/gkz947 RRID:SCR_007989
software, algorithm PyMOL Schrödinger and DeLano, 2020
RRID:SCR_000305
software, algorithm Fiji https://doi.org/10.1038/nmeth.2019 RRID:SCR_002285
software, algorithm R Other RRID:SCR_001905 https://cran.r-project.org/src/base/R-3/
software, algorithm ggplot2 Other RRID:SCR_014601 http://docs.ggplot2.org/current/
software, algorithm tseries Other https://CRAN.R-project.org/package=tseries
software, algorithm effsize Other https://cran.r-project.org/web/packages/effsize/index.html
software, algorithm SignalP5 https://doi.org/10.1038/s41587-019-0036-z
software, algorithm DeepLoc https://doi.org/10.1093/bioinformatics/btx431
software, algorithm ChopChop v2 https://doi.org/10.1093/nar/gkw398 RRID:SCR_015723

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Michael Brand, Email: michael.brand@tu-dresden.de.

Michael Hiller, Email: michael.hiller@senckenberg.de.

George H Perry, Pennsylvania State University, United States.

George H Perry, Pennsylvania State University, United States.

Funding Information

This paper was supported by the following grants:

  • Boehringer Ingelheim to Michael Hiller.

  • Max Planck Society to Michael Hiller.

  • Hessen State Ministry of Higher Education, Research and the Arts to Michael Hiller.

  • German Research Foundation BR 1746/3 and BR 1746/6 to Michael Brand.

  • European Research Council Zf-BrainReg to Michael Brand.

  • Studienstiftung des Deutschen Volkes to Juliane Hammer.

Additional information

Competing interests

No competing interests declared.

Author contributions

Data curation, Formal analysis, Investigation, Methodology, Resources, Visualization, Writing – original draft, Writing – review and editing.

Formal analysis, Investigation, Resources, Visualization, Writing – review and editing.

Formal analysis, Investigation, Resources, Visualization, Writing – review and editing.

Conceptualization, Formal analysis, Investigation, Methodology, Resources, Writing – review and editing.

Investigation, Resources, Software, Writing – review and editing.

Formal analysis, Investigation, Resources, Writing – review and editing.

Formal analysis, Investigation, Methodology, Resources, Writing – review and editing.

Investigation, Methodology, Resources, Writing – review and editing.

Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – review and editing.

Conceptualization, Funding acquisition, Investigation, Project administration, Supervision, Visualization, Writing – original draft, Writing – review and editing.

Ethics

All experiments in mouse and zebrafish were performed in accordance with the German animal welfare legislation. Protocols were approved by the Institutional Animal Welfare Officer (Tierschutzbeauftragter), and licensed by the regional Ethical Commission for Animal Experimentation (Landesdirektion Sachsen, Germany; license no. DD24-5131/354/11, DD24.1-5131/451/8, DD24-5131/346/11, DD24-5131/346/12).

Additional files

Supplementary file 1. Annotated protein alignment of intact and complete mammalian SERPINE3.jvp format is to be opened in Jalview.
elife-77999-supp1.zip (68.7KB, zip)
Supplementary file 2. Predicted structure of human SERPINE3 (AlphaFold 2).
elife-77999-supp2.zip (58.9KB, zip)
Supplementary file 3. Comparison of 3D structures of crystallized SERPINs to the predicted human SERPINE3.
elife-77999-supp3.xls (29KB, xls)
Transparent reporting form

Data availability

All data needed to evaluate the conclusions in the paper are present in the paper and the Supplementary Materials. The annotated protein alignment of intact and complete mammalian SERPINE3 genes are in Supplementary File 1, the predicted structure of human SERPINE3 are in Supplementary File 2 and the raw microscopy images of fish eyes are available on Dryad at DOI 10.5061/dryad.ncjsxksxt. TOGA annotations of SERPINE3 and a visualization of gene-inactivating mutations are available at our UCSC genome browser mirror https://genome.senckenberg.de.

The following dataset was generated:

Indrischek H, Hiller M, Brand M. 2022. Microscopic eye phenotype of SERPINE3 cbg17 and cbg18 Knockout lines. Dryad Digital Repository.

References

  1. Abouzeid H, Meire FM, Osman I, ElShakankiri N, Bolay S, Munier FL, Schorderet DF. A new locus for congenital cataract, microcornea, microphthalmia, and atypical iris coloboma maps to chromosome 2. Ophthalmology. 2009;116:154–162. doi: 10.1016/j.ophtha.2008.08.044. [DOI] [PubMed] [Google Scholar]
  2. Aken BL, Achuthan P, Akanni W, Amode MR, Bernsdorff F, Bhai J, Billis K, Carvalho-Silva D, Cummins C, Clapham P, Gil L, Girón CG, Gordon L, Hourlier T, Hunt SE, Janacek SH, Juettemann T, Keenan S, Laird MR, Lavidas I, Maurel T, McLaren W, Moore B, Murphy DN, Nag R, Newman V, Nuhn M, Ong CK, Parker A, Patricio M, Riat HS, Sheppard D, Sparrow H, Taylor K, Thormann A, Vullo A, Walts B, Wilder SP, Zadissa A, Kostadima M, Martin FJ, Muffato M, Perry E, Ruffier M, Staines DM, Trevanion SJ, Cunningham F, Yates A, Zerbino DR, Flicek P. Ensembl 2017. Nucleic Acids Research. 2017;45:D635–D642. doi: 10.1093/nar/gkw1104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Allison K, Patel D, Alabi O. Epidemiology of glaucoma: the past, present, and predictions for the future. Cureus. 2020;12:e11686. doi: 10.7759/cureus.11686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Almagro Armenteros JJ, Sønderby CK, Sønderby SK, Nielsen H, Winther O. DeepLoc: prediction of protein subcellular localization using deep learning. Bioinformatics. 2017;33:3387–3395. doi: 10.1093/bioinformatics/btx431. [DOI] [PubMed] [Google Scholar]
  5. Almagro Armenteros JJ, Tsirigos KD, Sønderby CK, Petersen TN, Winther O, Brunak S, von Heijne G, Nielsen H. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nature Biotechnology. 2019;37:420–423. doi: 10.1038/s41587-019-0036-z. [DOI] [PubMed] [Google Scholar]
  6. Barnstable CJ, Tombran-Tink J. Neuroprotective and antiangiogenic actions of PEDF in the eye: molecular targets and therapeutic potential. Progress in Retinal and Eye Research. 2004;23:561–577. doi: 10.1016/j.preteyeres.2004.05.002. [DOI] [PubMed] [Google Scholar]
  7. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A. NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Research. 2013;41:D991–D995. doi: 10.1093/nar/gks1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bastian FB, Roux J, Niknejad A, Comte A, Fonseca Costa SS, de Farias TM, Moretti S, Parmentier G, de Laval VR, Rosikiewicz M, Wollbrett J, Echchiki A, Escoriza A, Gharib WH, Gonzales-Porta M, Jarosz Y, Laurenczy B, Moret P, Person E, Roelli P, Sanjeev K, Seppey M, Robinson-Rechavi M. The Bgee suite: integrated curated expression atlas and comparative transcriptomics in animals. Nucleic Acids Research. 2021;49:D831–D847. doi: 10.1093/nar/gkaa793. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Blumer M, Brown T, Freitas MB, Destro AL, Oliveira JA, Morales AE, Schell T, Greve C, Pippel M, Jebb D, Hecker N, Ahmed A-W, Kirilenko BM, Foote M, Janke A, Lim BK, Hiller M. Gene losses in the common vampire bat illuminate molecular adaptations to blood feeding. Science Advances. 2022;8:eabm6494. doi: 10.1126/sciadv.abm6494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Boles NC, Fernandes M, Swigut T, Srinivasan R, Schiff L, Rada-Iglesias A, Wang Q, Saini JS, Kiehl T, Stern JH, Wysocka J, Blenkinsop TA, Temple S. Epigenomic and transcriptomic changes during human RPE EMT in a stem cell model of epiretinal membrane pathogenesis and prevention by nicotinamide. Stem Cell Reports. 2020;14:631–647. doi: 10.1016/j.stemcr.2020.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bonnemaijer PWM, Leeuwen EV, Iglesias AI, Gharahkhani P, Vitart V, Khawaja AP, Simcoe M, Höhn R, Cree AJ, Igo RP, Gerhold-Ay A, Nickels S, Wilson JF, Hayward C, Boutin TS, Polašek O, Aung T, Khor CC, Amin N, Lotery AJ, Wiggs JL, Cheng CY, Hysi PG, Hammond CJ, Thiadens A, MacGregor S, Klaver CCW, Duijn CV, International Glaucoma Genetics Consortium. NEIGHBORHOOD consortium. UK Biobank Eye and Vision Consortium Multi-trait genome-wide association study identifies new loci associated with optic disc parameters. Communications Biology. 2019;2:435. doi: 10.1038/s42003-019-0634-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Brand M, Granato M. Zebrafish: A Practical Approach. Oxford University Press; 2002. Keeping and raising zebrafish. [Google Scholar]
  13. Bringmann A, Iandiev I, Pannicke T, Wurm A, Hollborn M, Wiedemann P, Osborne NN, Reichenbach A. Cellular signaling and factors involved in Müller cell gliosis: neuroprotective and detrimental effects. Progress in Retinal and Eye Research. 2009;28:423–451. doi: 10.1016/j.preteyeres.2009.07.001. [DOI] [PubMed] [Google Scholar]
  14. Bryan JM, Fufa TD, Bharti K, Brooks BP, Hufnagel RB, McGaughey DM. Identifying core biological processes distinguishing human eye tissues with precise systems-level gene expression analyses and weighted correlation networks. Human Molecular Genetics. 2018;27:3325–3339. doi: 10.1093/hmg/ddy239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Caves EM, Brandley NC, Johnsen S. Visual acuity and the evolution of signals. Trends in Ecology & Evolution. 2018;33:358–372. doi: 10.1016/j.tree.2018.03.001. [DOI] [PubMed] [Google Scholar]
  16. Chen W, Stambolian D, Edwards AO, Branham KE, Othman M, Jakobsdottir J, Tosakulwong N, Pericak-Vance MA, Campochiaro PA, Klein ML, Tan PL, Conley YP, Kanda A, Kopplin L, Li Y, Augustaitis KJ, Karoukis AJ, Scott WK, Agarwal A, Kovach JL, Schwartz SG, Postel EA, Brooks M, Baratz KH, Brown WL, Brucker AJ, Orlin A, Brown G, Ho A, Regillo C, Donoso L, Tian L, Kaderli B, Hadley D, Hagstrom SA, Peachey NS, Klein R, Klein BEK, Gotoh N, Yamashiro K, Ferris Iii F, Fagerness JA, Reynolds R, Farrer LA, Kim IK, Miller JW, Cortón M, Carracedo A, Sanchez-Salorio M, Pugh EW, Doheny KF, Brion M, Deangelis MM, Weeks DE, Zack DJ, Chew EY, Heckenlively JR, Yoshimura N, Iyengar SK, Francis PJ, Katsanis N, Seddon JM, Haines JL, Gorin MB, Abecasis GR, Swaroop A. Genetic variants near TIMP3 and high-density lipoprotein-associated loci influence susceptibility to age-related macular degeneration. PNAS. 2010;107:7401–7406. doi: 10.1073/pnas.0912702107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Chiang C-Y, Ching Y-H, Chang T-Y, Hu L-S, Yong YS, Keak PY, Mustika I, Lin M-D, Liao B-Y. Novel eye genes systematically discovered through an integrated analysis of mouse transcriptomes and phenome. Computational and Structural Biotechnology Journal. 2020;18:73–82. doi: 10.1016/j.csbj.2019.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Cowan CS, Renner M, De Gennaro M, Gross-Scherf B, Goldblum D, Hou Y, Munz M, Rodrigues TM, Krol J, Szikra T, Cuttat R, Waldt A, Papasaikas P, Diggelmann R, Patino-Alvarez CP, Galliker P, Spirig SE, Pavlinic D, Gerber-Hollbach N, Schuierer S, Srdanovic A, Balogh M, Panero R, Kusnyerik A, Szabo A, Stadler MB, Orgül S, Picelli S, Hasler PW, Hierlemann A, Scholl HPN, Roma G, Nigsch F, Roska B. Cell types of the human retina and its organoids at single-cell resolution. Cell. 2020;182:1623–1640. doi: 10.1016/j.cell.2020.08.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Crooks GE, Hon G, Chandonia JM, Brenner SE. WebLogo: a sequence logo generator. Genome Research. 2004;14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Daiger S, Sullivan L, Browne S. RetNet. 2022. [June 29, 2022]. https://sph.uth.edu/RetNet/
  21. de Oliveira-Carlos V, Ganz J, Hans S, Kaslin J, Brand M. Notch receptor expression in neurogenic regions of the adult zebrafish brain. PLOS ONE. 2013;8:e73384. doi: 10.1371/journal.pone.0073384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Dengler-Crish CM, Crish SD, O’Riain MJ, Catania KC. Organization of the somatosensory cortex in elephant shrews (E. edwardii. The Anatomical Record Part A. 2006;288A:859–866. doi: 10.1002/ar.a.20357. [DOI] [PubMed] [Google Scholar]
  23. Dickinson ME, Flenniken AM, Ji X, Teboul L, Wong MD, White JK, Meehan TF, Weninger WJ, Westerberg H, Adissu H, Baker CN, Bower L, Brown JM, Caddle LB, Chiani F, Clary D, Cleak J, Daly MJ, Denegre JM, Doe B, Dolan ME, Edie SM, Fuchs H, Gailus-Durner V, Galli A, Gambadoro A, Gallegos J, Guo S, Horner NR, Hsu CW, Johnson SJ, Kalaga S, Keith LC, Lanoue L, Lawson TN, Lek M, Mark M, Marschall S, Mason J, McElwee ML, Newbigging S, Nutter LMJ, Peterson KA, Ramirez-Solis R, Rowland DJ, Ryder E, Samocha KE, Seavitt JR, Selloum M, Szoke-Kovacs Z, Tamura M, Trainor AG, Tudose I, Wakana S, Warren J, Wendling O, West DB, Wong L, Yoshiki A, MacArthur DG, Tocchini-Valentini GP, Gao X, Flicek P, Bradley A, Skarnes WC, Justice MJ, Parkinson HE, Moore M, Wells S, Braun RE, Svenson KL, de Angelis MH, Herault Y, Mohun T, Mallon AM, Henkelman RM, Brown SDM, Adams DJ, Lloyd KCK, McKerlie C, Beaudet AL, Bućan M, Murray SA, International Mouse Phenotyping Consortium. Jackson Laboratory. Infrastructure Nationale PHENOMIN, Institut Clinique de la Souris. Charles River Laboratories. MRC Harwell. Toronto Centre for Phenogenomics. Wellcome Trust Sanger Institute. RIKEN BioResource Center High-throughput discovery of novel developmental phenotypes. Nature. 2016;537:508–514. doi: 10.1038/nature19356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. DiPaola JD, Yindee M, Plotnik JM. Investigating the use of sensory information to detect and track prey by the Sunda pangolin (Manis javanica) with conservation in mind. Scientific Reports. 2020;10:9787. doi: 10.1038/s41598-020-65898-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Eisenberg JF, Gould E. The behavior of Solenodon paradoxus in captivity with comments in the behavior of other insectivora. Zoologica. 1966;51:49–58. doi: 10.5962/p.203281. [DOI] [Google Scholar]
  27. Emerling CA, Springer MS. Eyes underground: regression of visual protein networks in subterranean mammals. Molecular Phylogenetics and Evolution. 2014;78:260–270. doi: 10.1016/j.ympev.2014.05.016. [DOI] [PubMed] [Google Scholar]
  28. Emerling CA, Springer MS. Genomic evidence for rod monochromacy in sloths and armadillos suggests early subterranean history for Xenarthra. Proceedings Biological Sciences. 2015;282:20142192. doi: 10.1098/rspb.2014.2192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Emerling CA, Widjaja AD, Nguyen NN, Springer MS. Their loss is our gain: regressive evolution in vertebrates provides genomic models for uncovering human disease loci. Journal of Medical Genetics. 2017;54:787–794. doi: 10.1136/jmedgenet-2017-104837. [DOI] [PubMed] [Google Scholar]
  30. Emerling CA. Regressed but not gone: patterns of vision gene loss and retention in subterranean mammals. Integrative and Comparative Biology. 2018;58:441–451. doi: 10.1093/icb/icy004. [DOI] [PubMed] [Google Scholar]
  31. Fadool JM. Development of a rod photoreceptor mosaic revealed in transgenic zebrafish. Developmental Biology. 2003;258:277–290. doi: 10.1016/s0012-1606(03)00125-8. [DOI] [PubMed] [Google Scholar]
  32. Fan Q, Pozarickij A, Tan NYQ, Guo X, Verhoeven VJM, Vitart V, Guggenheim JA, Miyake M, Tideman JWL, Khawaja AP, Zhang L, MacGregor S, Höhn R, Chen P, Biino G, Wedenoja J, Saffari SE, Tedja MS, Xie J, Lanca C, Wang YX, Sahebjada S, Mazur J, Mirshahi A, Martin NG, Yazar S, Pennell CE, Yap M, Haarman AEG, Enthoven CA, Polling J, Hewitt AW, Jaddoe VWV, van Duijn CM, Hayward C, Polasek O, Tai ES, Yoshikatsu H, Hysi PG, Young TL, Tsujikawa A, Wang JJ, Mitchell P, Pfeiffer N, Pärssinen O, Foster PJ, Fossarello M, Yip SP, Williams C, Hammond CJ, Jonas JB, He M, Mackey DA, Wong TY, Klaver CCW, Saw SM, Baird PN, Cheng CY, UK Biobank Eye and Vision Consortium Genome-wide association meta-analysis of corneal curvature identifies novel loci and shared genetic influences across axial length and refractive error. Communications Biology. 2020;3:133. doi: 10.1038/s42003-020-0802-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Fang X, Seim I, Huang Z, Gerashchenko MV, Xiong Z, Turanov AA, Zhu Y, Lobanov AV, Fan D, Yim SH, Yao X, Ma S, Yang L, Lee SG, Kim EB, Bronson RT, Šumbera R, Buffenstein R, Zhou X, Krogh A, Park TJ, Zhang G, Wang J, Gladyshev VN. Adaptations to a subterranean environment and longevity revealed by the analysis of mole rat genomes. Cell Reports. 2014;8:1354–1364. doi: 10.1016/j.celrep.2014.07.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Freckleton RP, Harvey PH, Pagel M. Phylogenetic analysis and comparative data: a test and review of evidence. The American Naturalist. 2002;160:712–726. doi: 10.1086/343873. [DOI] [PubMed] [Google Scholar]
  35. Gaudin TJ, Croft DA. Paleogene Xenarthra and the evolution of South American mammals. Journal of Mammalogy. 2015;96:622–634. doi: 10.1093/jmammal/gyv073. [DOI] [Google Scholar]
  36. GBD 2019 Blindness and Vision Impairment Collaborators. Vision Loss Expert Group of the Global Burden of Disease Study Causes of blindness and VISION impairment in 2020 and trends over 30 years, and prevalence of avoidable blindness in relation to VISION 2020: the Right to Sight: an analysis for the Global Burden of Disease Study. The Lancet. Global Health. 2021;9:e144–e160. doi: 10.1016/S2214-109X(20)30489-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Gould E. Evidence for Echolocation in the Tenrecidae of Madagascar. P Am Philos Soc. 1965;109:352–360. [Google Scholar]
  38. Hadziahmetovic M, Kumar U, Song Y, Grieco S, Song D, Li Y, Tobias JW, Dunaief JL. Microarray analysis of murine retinal light damage reveals changes in iron regulatory, complement, and antioxidant genes in the neurosensory retina and isolated RPE. Investigative Ophthalmology & Visual Science. 2012;53:5231–5241. doi: 10.1167/iovs.12-10204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hecker N, Lächele U, Stuckas H, Giere P, Hiller M. Convergent vomeronasal system reduction in mammals coincides with convergent losses of calcium signalling and odorant-degrading genes. Molecular Ecology. 2019a;28:3656–3668. doi: 10.1111/mec.15180. [DOI] [PubMed] [Google Scholar]
  40. Hecker N, Sharma V, Hiller M. Convergent gene losses illuminate metabolic and physiological changes in herbivores and carnivores. PNAS. 2019b;116:3036–3041. doi: 10.1073/pnas.1818504116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Heit C, Jackson BC, McAndrews M, Wright MW, Thompson DC, Silverman GA, Nebert DW, Vasiliou V. Update of the human and mouse SERPIN gene superfamily. Human Genomics. 2013;7:22. doi: 10.1186/1479-7364-7-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Hiller M, Schaar BT, Indjeian VB, Kingsley DM, Hagey LR, Bejerano G. A “forward genomics” approach links genotype to phenotype using independent phenotypic losses among related species. Cell Reports. 2012;2:817–823. doi: 10.1016/j.celrep.2012.08.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Hoang T, Wang J, Boyd P, Wang F, Santiago C, Jiang L, Yoo S, Lahne M, Todd LJ, Jia M, Saez C, Keuthan C, Palazzo I, Squires N, Campbell WA, Rajaii F, Parayil T, Trinh V, Kim DW, Wang G, Campbell LJ, Ash J, Fischer AJ, Hyde DR, Qian J, Blackshaw S. Gene regulatory networks controlling vertebrate retinal regeneration. Science. 2020;370:eabb8598. doi: 10.1126/science.abb8598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Hon C-C, Ramilowski JA, Harshbarger J, Bertin N, Rackham OJL, Gough J, Denisenko E, Schmeier S, Poulsen TM, Severin J, Lizio M, Kawaji H, Kasukawa T, Itoh M, Burroughs AM, Noma S, Djebali S, Alam T, Medvedeva YA, Testa AC, Lipovich L, Yip C-W, Abugessaisa I, Mendez M, Hasegawa A, Tang D, Lassmann T, Heutink P, Babina M, Wells CA, Kojima S, Nakamura Y, Suzuki H, Daub CO, de Hoon MJL, Arner E, Hayashizaki Y, Carninci P, Forrest ARR. An atlas of human long non-coding RNAs with accurate 5’ ends. Nature. 2017;543:199–204. doi: 10.1038/nature21374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Howe KL, Achuthan P, Allen J, Allen J, Alvarez-Jarreta J, Amode MR, Armean IM, Azov AG, Bennett R, Bhai J, Billis K, Boddu S, Charkhchi M, Cummins C, Da Rin Fioretto L, Davidson C, Dodiya K, El Houdaigui B, Fatima R, Gall A, Garcia Giron C, Grego T, Guijarro-Clarke C, Haggerty L, Hemrom A, Hourlier T, Izuogu OG, Juettemann T, Kaikala V, Kay M, Lavidas I, Le T, Lemos D, Gonzalez Martinez J, Marugán JC, Maurel T, McMahon AC, Mohanan S, Moore B, Muffato M, Oheh DN, Paraschas D, Parker A, Parton A, Prosovetskaia I, Sakthivel MP, Salam AIA, Schmitt BM, Schuilenburg H, Sheppard D, Steed E, Szpak M, Szuba M, Taylor K, Thormann A, Threadgold G, Walts B, Winterbottom A, Chakiachvili M, Chaubal A, De Silva N, Flint B, Frankish A, Hunt SE, IIsley GR, Langridge N, Loveland JE, Martin FJ, Mudge JM, Morales J, Perry E, Ruffier M, Tate J, Thybert D, Trevanion SJ, Cunningham F, Yates AD, Zerbino DR, Flicek P. Ensembl 2021. Nucleic Acids Research. 2021;49:D884–D891. doi: 10.1093/nar/gkaa942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Hysi PG, Choquet H, Khawaja AP, Wojciechowski R, Tedja MS, Yin J, Simcoe MJ, Patasova K, Mahroo OA, Thai KK, Cumberland PM, Melles RB, Verhoeven VJM, Vitart V, Segre A, Stone RA, Wareham N, Hewitt AW, Mackey DA, Klaver CCW, MacGregor S, Consortium for Refractive Error and Myopia. Khaw PT, Foster PJ, UK Eye and Vision Consortium. Guggenheim JA, 23andMe Inc. Rahi JS, Jorgenson E, Hammond CJ. Meta-analysis of 542,934 subjects of European ancestry identifies new genes and mechanisms predisposing to refractive error and myopia. Nature Genetics. 2020;52:401–407. doi: 10.1038/s41588-020-0599-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Jha KN, Wong L, Zerfas PM, De Silva RS, Fan Y-X, Spiridonov NA, Johnson GR. Identification of a novel HSP70-binding cochaperone critical to HSP90-mediated activation of small serine/threonine kinase. The Journal of Biological Chemistry. 2010;285:35180–35187. doi: 10.1074/jbc.M110.134767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, Bridgland A, Meyer C, Kohl SAA, Ballard AJ, Cowie A, Romera-Paredes B, Nikolov S, Jain R, Adler J, Back T, Petersen S, Reiman D, Clancy E, Zielinski M, Steinegger M, Pacholska M, Berghammer T, Bodenstein S, Silver D, Vinyals O, Senior AW, Kavukcuoglu K, Kohli P, Hassabis D. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kemp AD, Kirk CE. Eye size and visual acuity influence vestibular anatomy in mammals. Anatomical Record. 2014;297:781–790. doi: 10.1002/ar.22892. [DOI] [PubMed] [Google Scholar]
  50. Kevany BM, Zhang N, Jastrzebska B, Palczewski K. Animals deficient in C2Orf71, an autosomal recessive retinitis pigmentosa-associated locus, develop severe early-onset retinal degeneration. Human Molecular Genetics. 2015;24:2627–2640. doi: 10.1093/hmg/ddv025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Khan MS, Singh P, Azhar A, Naseem A, Rashid Q, Kabir MA, Jairajpuri MA. Serpin inhibition mechanism: a delicate balance between native metastable state and polymerization. Journal of Amino Acids. 2011;2011:1–10. doi: 10.4061/2011/606797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Kim EB, Fang X, Fushan AA, Huang Z, Lobanov AV, Han L, Marino SM, Sun X, Turanov AA, Yang P, Yim SH, Zhao X, Kasaikina MV, Stoletzki N, Peng C, Polak P, Xiong Z, Kiezun A, Zhu Y, Chen Y, Kryukov GV, Zhang Q, Peshkin L, Yang L, Bronson RT, Buffenstein R, Wang B, Han C, Li Q, Chen L, Zhao W, Sunyaev SR, Park TJ, Zhang G, Wang J, Gladyshev VN. Genome sequencing reveals insights into physiology and longevity of the naked mole rat. Nature. 2011;479:223–227. doi: 10.1038/nature10533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Kirilenko B, Hiller A. TOGA. v1.0.0Zenodo. 2022 doi: 10.5281/zenodo.6400670. [DOI]
  54. Klimczak RR, Koerber JT, Dalkara D, Flannery JG, Schaffer DV. A novel adeno-associated viral variant for efficient and selective intravitreal transduction of rat Müller cells. PLOS ONE. 2009;4:e7467. doi: 10.1371/journal.pone.0007467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z, Koplev S, Jenkins SL, Jagodnik KM, Lachmann A, McDermott MG, Monteiro CD, Gundersen GW, Ma’ayan A. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Research. 2016;44:W90–W97. doi: 10.1093/nar/gkw377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Labun K, Montague TG, Gagnon JA, Thyme SB, Valen E. CHOPCHOP v2: a web tool for the next generation of CRISPR genome engineering. Nucleic Acids Research. 2016;44:W272–W276. doi: 10.1093/nar/gkw398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Langer BE, Roscito JG, Hiller M. REforge associates transcription factor binding site divergence in regulatory elements with phenotypic differences between species. Molecular Biology and Evolution. 2018;35:3027–3040. doi: 10.1093/molbev/msy187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Langer BE, Hiller M. TFforge utilizes large-scale binding site divergence to identify transcriptional regulators involved in phenotypic differences. Nucleic Acids Research. 2019;47:e19. doi: 10.1093/nar/gky1200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Law RHP, Zhang Q, McGowan S, Buckle AM, Silverman GA, Wong W, Rosado CJ, Langendorf CG, Pike RN, Bird PI, Whisstock JC. An overview of the serpin superfamily. Genome Biology. 2006;7:216. doi: 10.1186/gb-2006-7-5-216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Lee BT, Barber GP, Benet-Pagès A, Casper J, Clawson H, Diekhans M, Fischer C, Gonzalez JN, Hinrichs AS, Lee CM, Muthuraman P, Nassar LR, Nguy B, Pereira T, Perez G, Raney BJ, Rosenbloom KR, Schmelter D, Speir ML, Wick BD, Zweig AS, Haussler D, Kuhn RM, Haeussler M, Kent WJ. The UCSC genome browser database: 2022 update. Nucleic Acids Research. 2022;50:D1115–D1122. doi: 10.1093/nar/gkab959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Leinonen R. International nucleotide sequence database, the sequence read archive. Nucleic Acids Research. 2011;39:D19–D21. doi: 10.1093/nar/gkq1019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Liobikas J, Polianskyte Z, Speer O, Thompson J, Alakoskela J-M, Peitsaro N, Franck M, Whitehead MA, Kinnunen PJK, Eriksson O. Expression and purification of the mitochondrial serine protease LACTB as an N-terminal GST fusion protein in Escherichia coli. Protein Expression and Purification. 2006;45:335–342. doi: 10.1016/j.pep.2005.08.006. [DOI] [PubMed] [Google Scholar]
  63. MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, Junkins H, McMahon A, Milano A, Morales J, Pendlington ZM, Welter D, Burdett T, Hindorff L, Flicek P, Cunningham F, Parkinson H. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog. Nucleic Acids Research. 2017;45:D896–D901. doi: 10.1093/nar/gkw1133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Machalińska A, Kawa M, Pius-Sadowska E, Stępniewski J, Nowak W, Rogińska D, Kaczyńska K, Baumert B, Wiszniewska B, Józkowicz A, Dulak J, Machaliński B. Long-term neuroprotective effects of NT-4-engineered mesenchymal stem cells injected intravitreally in a mouse model of acute retinal injury. Investigative Ophthalmology & Visual Science. 2013;54:8292–8305. doi: 10.1167/iovs.13-12221. [DOI] [PubMed] [Google Scholar]
  65. Mass AM, Ketten DR, Odell DK, Supin AY. Ganglion cell distribution and retinal resolution in the florida manatee, trichechus manatus latirostris. The Anatomical Record. 2012;295:177–186. doi: 10.1002/ar.21470. [DOI] [PubMed] [Google Scholar]
  66. Meadows JRS, Lindblad-Toh K. Dissecting evolution and disease using comparative vertebrate genomics. Nature Reviews. Genetics. 2017;18:624–636. doi: 10.1038/nrg.2017.51. [DOI] [PubMed] [Google Scholar]
  67. Meredith RW, Gatesy J, Emerling CA, York VM, Springer MS. Rod monochromacy and the coevolution of cetacean retinal opsins. PLOS Genetics. 2013;9:e1003432. doi: 10.1371/journal.pgen.1003432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Mitchell P, Liew G, Gopinath B, Wong TY. Age-related macular degeneration. Lancet. 2018;392:1147–1159. doi: 10.1016/S0140-6736(18)31550-2. [DOI] [PubMed] [Google Scholar]
  69. Mittermeier RA, Wilson DE. Hoofed Animals. Handbook of the Mammals of the World. Barcelona: Hardcover; 2011. [Google Scholar]
  70. Moore BA, Leonard BC, Sebbag L, Edwards SG, Cooper A, Imai DM, Straiton E, Santos L, Reilly C, Griffey SM, Bower L, Clary D, Mason J, Roux MJ, Meziane H, Herault Y, McKerlie C, Flenniken AM, Nutter LMJ, Berberovic Z, Owen C, Newbigging S, Adissu H, Eskandarian M, Hsu CW, Kalaga S, Udensi U, Asomugha C, Bohat R, Gallegos JJ, Seavitt JR, Heaney JD, Beaudet AL, Dickinson ME, Justice MJ, Philip V, Kumar V, Svenson KL, Braun RE, Wells S, Cater H, Stewart M, Clementson-Mobbs S, Joynson R, Gao X, Suzuki T, Wakana S, Smedley D, Seong JK, Tocchini-Valentini G, Moore M, Fletcher C, Karp N, Ramirez-Solis R, White JK, de Angelis MH, Wurst W, Thomasy SM, Flicek P, Parkinson H, Brown SDM, Meehan TF, Nishina PM, Murray SA, Krebs MP, Mallon AM, Lloyd KCK, Murphy CJ, Moshiri A, International Mouse Phenotyping Consortium Identification of genes required for eye development by high-throughput screening of mouse knockouts. Communications Biology. 2018;1:236. doi: 10.1038/s42003-018-0226-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Moore JE, Purcaro MJ, Pratt HE, Epstein CB, Shoresh N, Adrian J, Kawli T, Davis CA, Dobin A, Kaul R, Halow J, Van Nostrand EL, Freese P, Gorkin DU, Shen Y, He Y, Mackiewicz M, Pauli-Behn F, Williams BA, Mortazavi A, Keller CA, Zhang XO, Elhajjajy SI, Huey J, Dickel DE, Snetkova V, Wei X, Wang X, Rivera-Mulia JC, Rozowsky J, Zhang J, Chhetri SB, Zhang J, Victorsen A, White KP, Visel A, Yeo GW, Burge CB, Lécuyer E, Gilbert DM, Dekker J, Rinn J, Mendenhall EM, Ecker JR, Kellis M, Klein RJ, Noble WS, Kundaje A, Guigó R, Farnham PJ, Cherry JM, Myers RM, Ren B, Graveley BR, Gerstein MB, Pennacchio LA, Snyder MP, Bernstein BE, Wold B, Hardison RC, Gingeras TR, Stamatoyannopoulos JA, Weng Z, ENCODE Project Consortium Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature. 2020;583:699–710. doi: 10.1038/s41586-020-2493-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Moore AM, Hartstone‐Rose A, Gonzalez‐Socoloske D. Review of sensory modalities of sirenians and the other extant Paenungulata clade. The Anatomical Record. 2021;305:715–735. doi: 10.1002/ar.24741. [DOI] [PubMed] [Google Scholar]
  73. Myers P, Espinosa R, Parr CS, Jones T, Hammond GS, Dewey TA. Animal Diversity Web. 2022. [June 29, 2022]. https://animaldiversity.org/
  74. Newman AM, Gallo NB, Hancox LS, Miller NJ, Radeke CM, Maloney MA, Cooper JB, Hageman GS, Anderson DH, Johnson LV, Radeke MJ. Systems-level analysis of age-related macular degeneration reveals global biomarkers and phenotype-specific functional networks. Genome Medicine. 2012;4:16. doi: 10.1186/gm315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Papatheodorou I, Moreno P, Manning J, Fuentes AM-P, George N, Fexova S, Fonseca NA, Füllgrabe A, Green M, Huang N, Huerta L, Iqbal H, Jianu M, Mohammed S, Zhao L, Jarnuczak AF, Jupp S, Marioni J, Meyer K, Petryszak R, Prada Medina CA, Talavera-López C, Teichmann S, Vizcaino JA, Brazma A. Expression Atlas update: from tissues to single cells. Nucleic Acids Research. 2020;48:D77–D83. doi: 10.1093/nar/gkz947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Partha R, Chauhan BK, Ferreira Z, Robinson JD, Lathrop K, Nischal KK, Chikina M, Clark NL. Subterranean mammals show convergent regression in ocular genes and enhancers, along with adaptation to tunneling. eLife. 2017;6:e25884. doi: 10.7554/eLife.25884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Pettigrew JD, Bhagwandin A, Haagensen M, Manger PR. Visual acuity and heterogeneities of retinal ganglion cell densities and the tapetum lucidum of the african elephant (loxodonta africana. Brain, Behavior and Evolution. 2010;75:251–261. doi: 10.1159/000314898. [DOI] [PubMed] [Google Scholar]
  78. Prudent X, Parra G, Schwede P, Roscito JG, Hiller M. Controlling for phylogenetic relatedness and evolutionary rates improves the discovery of associations between species’ phenotypic and genomic differences. Molecular Biology and Evolution. 2016;33:2135–2150. doi: 10.1093/molbev/msw098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Radeke MJ, Radeke CM, Shih Y-H, Hu J, Bok D, Johnson LV, Coffey PJ. Restoration of mesenchymal retinal pigmented epithelial cells by TGFβ pathway inhibitors: implications for age-related macular degeneration. Genome Medicine. 2015;7:58. doi: 10.1186/s13073-015-0183-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Ramos EM, Hoffman D, Junkins HA, Maglott D, Phan L, Sherry ST, Feolo M, Hindorff LA. Phenotype-Genotype Integrator (PheGenI): synthesizing genome-wide association study (GWAS) data with existing genomic resources. European Journal of Human Genetics. 2014;22:144–147. doi: 10.1038/ejhg.2013.96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Ranwez V, Douzery EJP, Cambon C, Chantret N, Delsuc F. MACSE v2: toolkit for the alignment of coding sequences accounting for frameshifts and stop codons. Molecular Biology and Evolution. 2018;35:2582–2584. doi: 10.1093/molbev/msy159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Reichenbach A, Bringmann A. New functions of Müller cells. Glia. 2013;61:651–678. doi: 10.1002/glia.22477. [DOI] [PubMed] [Google Scholar]
  83. Rodriguez JM, Rodriguez-Rivas J, Di Domenico T, Vázquez J, Valencia A, Tress ML. APPRIS 2017: principal isoforms for multiple gene sets. Nucleic Acids Research. 2018;46:D213–D217. doi: 10.1093/nar/gkx997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Rogińska D, Kawa MP, Pius-Sadowska E, Lejkowska R, Łuczkowska K, Wiszniewska B, Kaarniranta K, Paterno JJ, Schmidt CA, Machaliński B, Machalińska A. Depletion of the third complement component ameliorates age-dependent oxidative stress and positively modulates autophagic activity in aged retinas in a mouse model. Oxidative Medicine and Cellular Longevity. 2017;2017:5306790. doi: 10.1155/2017/5306790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Roscito JG, Sameith K, Parra G, Langer BE, Petzold A, Moebius C, Bickle M, Rodrigues MT, Hiller M. Phenotype loss is associated with widespread divergence of the gene regulatory landscape in evolution. Nature Communications. 2018;9:4737. doi: 10.1038/s41467-018-07122-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Roscito JG, Subramanian K, Naumann R, Sarov M, Shevchenko A, Bogdanova A, Kurth T, Foerster L, Kreysing M, Hiller M. Recapitulating evolutionary divergence in a single cis-regulatory element is sufficient to cause expression changes of the lens gene Tdrd7. Molecular Biology and Evolution. 2021;38:380–392. doi: 10.1093/molbev/msaa212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Sabir N, Riazuddin SA, Kaul H, Iqbal F, Nasir IA, Zafar AU, Qazi ZA, Butt NH, Khan SN, Husnain T, Hejtmancik JF, Riazuddin S. Mapping of a novel locus associated with autosomal recessive congenital cataract to chromosome 8p. Molecular Vision. 2010;16:2911–2915. [PMC free article] [PubMed] [Google Scholar]
  88. Sadier A, Davies KT, Yohe LR, Yun K, Donat P, Hedrick BP, Dumont ER, Dávalos LM, Rossiter SJ, Sears KE. Multifactorial processes underlie parallel opsin loss in neotropical bats. eLife. 2018;7:e37412. doi: 10.7554/eLife.37412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Samaras P, Schmidt T, Frejno M, Gessulat S, Reinecke M, Jarzab A, Zecha J, Mergner J, Giansanti P, Ehrlich H-C, Aiche S, Rank J, Kienegger H, Krcmar H, Kuster B, Wilhelm M. ProteomicsDB: a multi-omics and multi-organism resource for life science research. Nucleic Acids Research. 2020;48:D1153–D1163. doi: 10.1093/nar/gkz974. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez J-Y, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Schrödinger LLC, DeLano W. The PyMOL Molecular Graphics System. 2.5.2PyMOL. 2020 https://sourceforge.net/projects/pymol/
  92. Sharma V, Hiller M. Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation. Nucleic Acids Research. 2017;45:8369–8377. doi: 10.1093/nar/gkx554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Sharma V, Schwede P, Hiller M. CESAR 2.0 substantially improves speed and accuracy of comparative gene annotation. Bioinformatics. 2017;33:3985–3987. doi: 10.1093/bioinformatics/btx527. [DOI] [PubMed] [Google Scholar]
  94. Sharma V, Hecker N, Roscito JG, Foerster L, Langer BE, Hiller M. A genomics approach reveals insights into the importance of gene losses for mammalian adaptations. Nature Communications. 2018;9:1215. doi: 10.1038/s41467-018-03667-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Sharma V, Hecker N, Walther F, Stuckas H, Hiller M. Convergent losses of TLR5 suggest altered extracellular flagellin detection in four mammalian lineages. Molecular Biology and Evolution. 2020;37:1847–1854. doi: 10.1093/molbev/msaa058. [DOI] [PubMed] [Google Scholar]
  96. Sharma V, Hiller M. Losses of human disease-associated genes in placental mammals. NAR Genomics and Bioinformatics. 2020;2:lqz012. doi: 10.1093/nargab/lqz012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Sheffield VC, Stone EM. Genomics and the eye. The New England Journal of Medicine. 2011;364:1932–1942. doi: 10.1056/NEJMra1012354. [DOI] [PubMed] [Google Scholar]
  98. Shen W, Fruttiger M, Zhu L, Chung SH, Barnett NL, Kirk JK, Lee S, Coorey NJ, Killingsworth M, Sherman LS, Gillies MC. Conditional Müllercell ablation causes independent neuronal and vascular pathologies in a novel transgenic model. The Journal of Neuroscience. 2012;32:15715–15727. doi: 10.1523/JNEUROSCI.2841-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. dbSNP: the NCBI database of genetic variation. Nucleic Acids Research. 2001;29:308–311. doi: 10.1093/nar/29.1.308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Shiels A, Bennett TM, Hejtmancik JF. Cat-Map: putting cataract on the map. Molecular Vision. 2010;16:2007–2015. [PMC free article] [PubMed] [Google Scholar]
  101. Siemers BM, Schauermann G, Turni H, von Merten S. Why do shrews twitter? Communication or simple echo-based orientation. Biology Letters. 2009;5:593–596. doi: 10.1098/rsbl.2009.0378. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Simonovic M, Gettins PG, Volz K. Crystal structure of human PEDF, a potent anti-angiogenic and neurite growth-promoting factor. PNAS. 2001;98:11131–11135. doi: 10.1073/pnas.211268598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Springer MS, Emerling CA, Fugate N, Patel R, Starrett J, Morin PA, Hayashi C, Gatesy J. Inactivation of cone-specific phototransduction genes in rod monochromatic cetaceans. Frontiers in Ecology and Evolution. 2016;4:61. doi: 10.3389/fevo.2016.00061. [DOI] [Google Scholar]
  104. Sripathi SR, Hu MW, Turaga RC, Mertz J, Liu MM, Wan J, Maruotti J, Wahlin KJ, Berlinicke CA, Qian J, Zack DJ. Proteome landscape of epithelial-to-mesenchymal transition (EMT) of retinal pigment epithelium shares commonalities with malignancy-associated EMT. Molecular & Cellular Proteomics. 2021;20:100131. doi: 10.1016/j.mcpro.2021.100131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Stein PE, Leslie AG, Finch JT, Carrell RW. Crystal structure of uncleaved ovalbumin at 1.95 A resolution. Journal of Molecular Biology. 1991;221:941–959. doi: 10.1016/0022-2836(91)80185-w. [DOI] [PubMed] [Google Scholar]
  106. Stelzer G, Rosen N, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, Stein TI, Nudel R, Lieder I, Mazor Y, Kaplan S, Dahary D, Warshawsky D, Guan-Golan Y, Kohn A, Rappaport N, Safran M, Lancet D. The genecards suite: from gene data mining to disease genome sequence analyses. Current Protocols in Bioinformatics. 2016;54:1. doi: 10.1002/cpbi.5. [DOI] [PubMed] [Google Scholar]
  107. Stephan T, Burgess SM, Cheng H, Danko CG, Gill CA, Jarvis ED, Koepfli KP, Koltes JE, Lyons E, Ronald P, Ryder OA, Schriml LM, Soltis P, VandeWoude S, Zhou H, Ostrander EA, Karlsson EK. Darwinian genomics and diversity in the tree of life. PNAS. 2022;119:e2115644119. doi: 10.1073/pnas.2115644119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Tedja MS, Wojciechowski R, Hysi PG, Eriksson N, Furlotte NA, Verhoeven VJM, Iglesias AI, Meester-Smoor MA, Tompson SW, Fan Q, Khawaja AP, Cheng C-Y, Höhn R, Yamashiro K, Wenocur A, Grazal C, Haller T, Metspalu A, Wedenoja J, Jonas JB, Wang YX, Xie J, Mitchell P, Foster PJ, Klein BEK, Klein R, Paterson AD, Hosseini SM, Shah RL, Williams C, Teo YY, Tham YC, Gupta P, Zhao W, Shi Y, Saw W-Y, Tai E-S, Sim XL, Huffman JE, Polašek O, Hayward C, Bencic G, Rudan I, Wilson JF, CREAM Consortium. 23andMe Research Team. UK Biobank Eye and Vision Consortium. Joshi PK, Tsujikawa A, Matsuda F, Whisenhunt KN, Zeller T, van der Spek PJ, Haak R, Meijers-Heijboer H, van Leeuwen EM, Iyengar SK, Lass JH, Hofman A, Rivadeneira F, Uitterlinden AG, Vingerling JR, Lehtimäki T, Raitakari OT, Biino G, Concas MP, Schwantes-An T-H, Igo RP, Jr, Cuellar-Partida G, Martin NG, Craig JE, Gharahkhani P, Williams KM, Nag A, Rahi JS, Cumberland PM, Delcourt C, Bellenguez C, Ried JS, Bergen AA, Meitinger T, Gieger C, Wong TY, Hewitt AW, Mackey DA, Simpson CL, Pfeiffer N, Pärssinen O, Baird PN, Vitart V, Amin N, van Duijn CM, Bailey-Wilson JE, Young TL, Saw S-M, Stambolian D, MacGregor S, Guggenheim JA, Tung JY, Hammond CJ, Klaver CCW. Genome-wide association meta-analysis highlights light-induced signaling as a driver for refractive error. Nature Genetics. 2018;50:834–848. doi: 10.1038/s41588-018-0127-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Trapletti A, Hornik K. tseries: Time Series Analysis and Computational Finance. 0.10-51Cran. 2021 https://CRAN.R-project.org/package=tseries
  110. Uhlen M, Oksvold P, Fagerberg L, Lundberg E, Jonasson K, Forsberg M, Zwahlen M, Kampf C, Wester K, Hober S, Wernerus H, Björling L, Ponten F. Towards a knowledge-based human protein atlas. Nature Biotechnology. 2010;28:1248–1250. doi: 10.1038/nbt1210-1248. [DOI] [PubMed] [Google Scholar]
  111. Uhlén M, Karlsson MJ, Hober A, Svensson A-S, Scheffel J, Kotol D, Zhong W, Tebani A, Strandberg L, Edfors F, Sjöstedt E, Mulder J, Mardinoglu A, Berling A, Ekblad S, Dannemeyer M, Kanje S, Rockberg J, Lundqvist M, Malm M, Volk A-L, Nilsson P, Månberg A, Dodig-Crnkovic T, Pin E, Zwahlen M, Oksvold P, von Feilitzen K, Häussler RS, Hong M-G, Lindskog C, Ponten F, Katona B, Vuu J, Lindström E, Nielsen J, Robinson J, Ayoglu B, Mahdessian D, Sullivan D, Thul P, Danielsson F, Stadler C, Lundberg E, Bergström G, Gummesson A, Voldborg BG, Tegel H, Hober S, Forsström B, Schwenk JM, Fagerberg L, Sivertsson Å. The human secretome. Science Signaling. 2019;12:eaaz0274. doi: 10.1126/scisignal.aaz0274. [DOI] [PubMed] [Google Scholar]
  112. UniProtConsortium UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Research. 2021;49:D480–D489. doi: 10.1093/nar/gkaa1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Veilleux CC, Kirk EC. Visual acuity in mammals: effects of eye size and ecology. Brain, Behavior and Evolution. 2014;83:43–53. doi: 10.1159/000357830. [DOI] [PubMed] [Google Scholar]
  114. Wang L, Lin H, Shen Y, Huang S, Gu J, Su H, Qi Y. A new locus for inherited nuclear cataract mapped to the long arm of chromosome 1. Molecular Vision. 2007;13:1357–1362. [PubMed] [Google Scholar]
  115. Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview Version 2--a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–1191. doi: 10.1093/bioinformatics/btp033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Wertheim JO, Murrell B, Smith MD, Kosakovsky Pond SL, Scheffler K. RELAX: detecting relaxed selection in a phylogenetic framework. Molecular Biology and Evolution. 2015;32:820–832. doi: 10.1093/molbev/msu400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Wickham H. Wickham, Ggplot2: Elegant Graphics for Data Analysis. Cham: Springer-Verlag; 2016. [DOI] [Google Scholar]
  118. Wu J, Yonezawa T, Kishino H. Rates of molecular evolution suggest natural history of life history traits and a Post-K-Pg nocturnal bottleneck of placentals. Current Biology. 2017;27:3025–3033. doi: 10.1016/j.cub.2017.08.043. [DOI] [PubMed] [Google Scholar]
  119. Zhao H, Rossiter SJ, Teeling EC, Li C, Cotton JA, Zhang S. The evolution of color vision in nocturnal mammals. PNAS. 2009;106:8980–8985. doi: 10.1073/pnas.0813201106. [DOI] [PMC free article] [PubMed] [Google Scholar]

Editor's evaluation

George H Perry 1

The authors use a comparative genomics approach to predict gene function, in particular genes that have a role in eye development. After identifying the convergent loss of SERPINE3 with vision loss across mammals, the authors confirmed its involvement in eye development by characterizing zebrafish knockouts. This work highlights the power of comparative genomics to generate hypotheses that can be experimentally validated. This work is relevant to a broad audience interested in evolution and adaptation as well as for those studying eye development and eye pathologies.

Decision letter

Editor: George H Perry1
Reviewed by: Stephen Treaster2

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "Vision-related convergent gene losses reveal SERPINE3's unknown role in the eye" for consideration by eLife. Your article has been reviewed by 2 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and George Perry as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Stephen Treaster (Reviewer #2).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

The reviewers were positive but also consistent in their major concerns about the pipeline and its output, requiring more clarity/data and at least some level of additional supporting validation, perhaps including manual curation. I agree that these concerns need to be addressed, including some further cross-checking of the database Hopefully this is possible with existing cross-species RNA-seq datasets, and does not require the generation of new experimental data.

Overall, congratulations on this nice study, and I look forward to seeing the manuscript at its next evolutionary stage!

Reviewer #1 (Recommendations for the authors):

I have only one comment: The authors identify a very large number of putative gene losses in many different species. Given that usually the majority of such bioinformatically identified gene losses are false positives due to annotation or sequencing artifacts, a manual curation is usually critical – even with the most sophisticated algorithms. According to the methods this has not been performed. I am aware that the authors are world experts in identifying gene losses, but I am still skeptical that all of these are true events (given the sheer number). I recommend that the authors use existing RNA-Seq data sets (for example from the naked mole rat) or PCR from genomic DNA to confirm at least a subset of the ones that they are showing.

Reviewer #2 (Recommendations for the authors):

Many of the filters are explained in the previous publications generating the dataset, however, there seem to be four additional thresholds to reuse that dataset for this analysis. While these filters may all be appropriate, it is opaque to the reader. (1) visual acuity cutoffs, (2) percent of high and low acuity lineages with assembly gaps, (3) requiring an intact score in >80% of high acuity lineages and lost in <10% in high acuity lineages, (4) requiring the gene be lost in at least three of the seven independent low-acuity lineages. There is a brief expansion on filter 1 by trying two other close acuity cutoffs. However, these other acuity cutoffs seem to have dramatic consequences that are not discussed. 20 of the 26 top hits vanish when using a cutoff of 2 instead of 1. With such an impact, it is difficult to interpret what exactly is being measured or what the "right" cutoff may be to get the best measure of vision loss. This is exacerbated by the unintuitive nature of visual acuity scores. Unlike the iconic convergence examples, the operational difference between visual acuity scores of 0.5, 1, and 2, or even 2 and 20, is not obvious. If a continuous measure is going to be binarized, there should be a more apparent argument for the cutoffs, particularly since the 0.5 cutoff has more significant results than 1, but is not the focus of the manuscript.

Filters 3 and 4 appear to be redundant with each other and convergence analysis itself. The analysis is already scoring genes for intactness in conjunction with the trait, so it seems unnecessary to exclude genes from the analysis based on patterns of intactness in conjunction with the trait. It's the same measurement, twice. The purpose of these filters is unclear. The consequences of these filters on p-value distributions, multiple hypothesis corrections, and top hits are unclear, and possibly inappropriate. The lack of a negative control again exacerbates these concerns. While it is non-trivial to select a control set (or many sets) of lineages with comparable topological relationships as the experimental group, it is necessary to demonstrate the robustness of the method and dataset.

Including entire gene lists in the supplemental tables for each method (as opposed to just the top 26 of a single method) would helpfully provide the raw p-values for the distributions mentioned above, allow more comparison and quantification of differences between cutoffs, and allow for future intersections with other datasets. This would dramatically increase the utility of the overall convergence results for the field. In conjunction, there is a truly impressive amount of genomic work here that is unlikely to be replicated by someone else that wants to do a similar analysis on a different phenotype, but the dataset itself does not seem to be available.

eLife. 2022 Jun 21;11:e77999. doi: 10.7554/eLife.77999.sa2

Author response


Essential revisions:

The reviewers were positive but also consistent in their major concerns about the pipeline and its output, requiring more clarity/data and at least some level of additional supporting validation, perhaps including manual curation. I agree that these concerns need to be addressed, including some further cross-checking of the database Hopefully this is possible with existing cross-species RNA-seq datasets, and does not require the generation of new experimental data.

Overall, congratulations on this nice study, and I look forward to seeing the manuscript at its next evolutionary stage!

We thank the editor for these encouraging words.

As detailed below, we have analyzed publicly-available RNA-seq data of different species and find in all cases no relevant expression of the remnants of the inactivated SERPINE3 gene. We also performed additional validations of the inactivating mutations and every reported SERPINE3 loss has at least one validated mutation.

Reviewer #1 (Recommendations for the authors):

I have only one comment: The authors identify a very large number of putative gene losses in many different species. Given that usually the majority of such bioinformatically identified gene losses are false positives due to annotation or sequencing artifacts, a manual curation is usually critical – even with the most sophisticated algorithms. According to the methods this has not been performed. I am aware that the authors are world experts in identifying gene losses, but I am still skeptical that all of these are true events (given the sheer number). I recommend that the authors use existing RNA-Seq data sets (for example from the naked mole rat) or PCR from genomic DNA to confirm at least a subset of the ones that they are showing.

We apologize for the lack of clarity in the manuscript describing our initial validation of inactivating SERPINE3 mutations. This description was included in the section “Investigating SERPINE3 in additional genomes”, which is placed downstream of the section describing the screen, since we validated SERPINE3 not only in the species included in our screen. We have now changed the title of this section to “Investigating and validating SERPINE3 losses in additional genomes” and rephrased it to improve clarity:

“Since assembly base errors can mimic false gene losses (32, 85), we validated inactivating mutations via two approaches. We considered a mutation as real if it is either shared with an independent assembly of the same or a closely-related species, or if alignments of raw DNA sequencing reads support the mutation. Support by raw DNA reads was assessed by aligning the genomic sequence around the mutation against the NCBI short read archive (SRA queried via NCBI megablast) (86) (Figure 2 – source data 2). With the exception of okapi, steenbok and fox, where SERPINE3 does not evolve under relaxed selection (Figure 2 – source data 4), we considered a gene loss as real if there is at least one validated inactivating mutation. We did not validate SERPINE3 mutations in the genomes of cyclops roundleaf bat and Stoliczka's trident bat, as both species have each eight mutations across several exons and their PacBio HiFi read-based assemblies have very high base accuracies (QV>60, indicating less than one base error per 1 Mb).”

We now also added a sentence about this validation step in the Results section “SERPINE3 became dispensable in many mammals that do not primarily rely on vision”.

“We only considered SERPINE3 losses for which at least one inactivating mutation could be validated (see Methods, Figure 2 – source data 1-2). “

We initially validated only mutations for species, where only 1 or 2 mutations in SERPINE3 were observed that are not shared with other assemblies or species. We now extended this analysis to all species, where SERPINE3 mutations are not shared, irrespective of the number of observed mutations.

With the exception of dikdik and polecat, where putative mutations are likely base errors, we could confirm 28 inactivating mutations in 11 other species.

Details on the validation with raw DNA sequencing reads are provided in the new additional source data “Figure 2 – source data 2” and summarized in “Figure 2 – source data 1”.

I am aware that the authors are world experts in identifying gene losses, but I am still skeptical that all of these are true events (given the sheer number). I recommend that the authors use existing RNA-Seq data sets (for example from the naked mole rat) or PCR from genomic DNA to confirm at least a subset of the ones that they are showing.

As suggested, we used publicly available RNA-seq data to evaluate whether inactivated SERPINE3 genes are still expressed. RNA-seq data of eyes, where SERPINE3 expression is expected, is available for several bat species. We blasted the full-length SERPINE3 sequences as annotated by TOGA against these RNA-Seq data sets and obtained a very low number of aligning reads (<10) for all bats. As a positive control, we blasted GLUL, a gene that is expressed in brain and eye, which revealed at least 9000 aligning RNA-seq reads.

For the common vampire bat, we also mapped the entire eye RNA-seq data set against the genome. A new “Figure 2 —figure supplement 1” shows the absence of SERPINE3 expression and the lack of spliced reads that would indicate transcript processing:

For the naked mole rat and other species like blind mole rat, damara mole rat and armadillo, eye RNA-seq data is not available. Therefore, we analyzed available brain RNA-seq data for these species, as our expression tests in zebrafish show that SERPINE3 is also expressed in the brain. As for eye tissue, we found no indication that remnants of inactivated SERPINE3 genes are still expressed in the brain.

These data are shown in a new source Data file “Figure 2 – source data 3“ and mentioned in the main text:

“Further supporting SERPINE3 losses, analyses of publicly-available RNA-seq data indicates that remnants of inactivated SERPINE3 genes do not show relevant expression (Figure 2 —figure supplement 1, Figure 2 – source data 3). “

Reviewer #2 (Recommendations for the authors):

Many of the filters are explained in the previous publications generating the dataset, however, there seem to be four additional thresholds to reuse that dataset for this analysis. While these filters may all be appropriate, it is opaque to the reader. (1) visual acuity cutoffs, (2) percent of high and low acuity lineages with assembly gaps, (3) requiring an intact score in >80% of high acuity lineages and lost in <10% in high acuity lineages, (4) requiring the gene be lost in at least three of the seven independent low-acuity lineages. There is a brief expansion on filter 1 by trying two other close acuity cutoffs. However, these other acuity cutoffs seem to have dramatic consequences that are not discussed.

We thank the reviewer for pointing this out. We addressed this point as follows:

First, we have now simplified the workflow by dropping filter 3. This filter was inspired by previous work, but is not necessary for our vision-related screen as it affects only three genes. This increases the number of genes retrieved from the screen from 26 to 29. The three additional genes are PKD1L2, GJA10, LBHD1, of which PKD1L2 and GJA10 have a known eye-related function. These three genes are now included in the updated Figure 1.

Second, we have now added a flowchart as Figure 1 – supplement figure 1, which describes each step and lists how many genes passed each filter.

Third, we explain the rationale behind the first two filters in the revised Methods section. These filters exclude genes that cannot show strong associations because of incomplete data or lack of variation:

“Using the human Ensembl version 87 gene annotation, we excluded protein-coding genes that have missing data due to assembly incompleteness for more than 50% of low- or high-acuity species and genes that had identical %intact values across species, since such genes cannot show associations with lower visual acuity. This resulted in 13,172 genes. Figure 1 —figure supplement 1 provides a flowchart.”

Fourth, we explored the impact of the last filter, which reduced the candidate gene set from 68 to 31. At an FDR of 0.05, we retrieved 68 genes. We now also tested whether this gene set is enriched in eye-related genes. As shown in a new Figure 1 – source data 9A, these 68 genes are significantly enriched in ‘’visual perception” (corrected P-value 9.1E-08) and “sensory perception of light stimulus” (corrected P-value 9.1E-08) functions. The last filter for losses in at least three independent low-acuity lineages attempts to filter out gene losses in ≤2 lineages that are less likely to be related to the poor vision phenotype. By focusing on genes that are convergently lost, we aim at enriching for the most promising candidates. Indeed, the 37 genes that were removed during this filtering step are not enriched in “visual perception ”, which we now show in Figure 1 – source data 9B, which indicates that this filter is helpful. The rationale behind this filter is now better explained in the Methods section “Forward genomics screen”:

“Finally, to obtain candidates that are convergently lost, we required that a gene was lost in at least three of the seven independent low-acuity lineages. This filtering step removed 37 genes and these genes are not enriched in any GO-term (Figure 1 – source data 9B). Of the remaining 31 genes, two genes are no longer contained in the latest Ensembl annotation (version 106) and were therefore dropped. The other 29 genes are presented in Figure 1 and considered in the enrichment analysis (below).”

20 of the 26 top hits vanish when using a cutoff of 2 instead of 1. With such an impact, it is difficult to interpret what exactly is being measured or what the "right" cutoff may be to get the best measure of vision loss. This is exacerbated by the unintuitive nature of visual acuity scores. Unlike the iconic convergence examples, the operational difference between visual acuity scores of 0.5, 1, and 2, or even 2 and 20, is not obvious. If a continuous measure is going to be binarized, there should be a more apparent argument for the cutoffs, particularly since the 0.5 cutoff has more significant results than 1, but is not the focus of the manuscript.

We agree with the reviewer that it is not trivial to pick a ‘right’ cutoff for visual acuity. We present the results with a cutoff of 1 as this separates the echolocating bats from the non-echolocating Pteropodid bats, which have larger eyes and rely more on vision. However, the ‘right’ cutoff likely does not exist. For this reason, we ran our screen three times, testing not only a cutoff of 1, but also the more inclusive cutoff 1.5 and the more strict cutoff of 0.5. Importantly, SERPINE3 is a significant hit for all three cutoffs, indicating that this gene tends to be lost in species with lower acuity values, irrespective of the exact cutoff.

Given the difficulty of finding a ‘right’ cutoff, we now also investigated an additional phenotype definition that is independent of visual acuity. To this end, we considered a molecular loss signature of known vision genes and defined 10 “poor vision” species that have lost ≥5 known “visual perception” genes. It is reassuring that these 10 species are similar to low-acuity species defined by visual acuity values <1 and only two species differ. Supporting a robust association for SERPINE3, this screen also ranked SERPINE3 among the top hits.

We added the details to the Methods section:

“In addition to using phenotype definitions based on visual acuity values, we also tested a phenotype definition based on a molecular signature of losses of known vision-related genes. To this end, we ranked all 49 species based on their number of lost genes annotated with the GO-term “visual perception” (total of 93 genes, Figure 1 – source data 1). The set of ten species that have lost five or more visual perception genes is similar to the set of low-acuity species used in the original screen. The differences are that mouse and deer mouse are substituted by manatee and goat. Re-running the screen based on this molecular loss pattern of vision genes, retrieved SERPINE3 at rank 5 (Figure 1 – source data 7).”

and summarized it in the main text:

“To show that SERPINE3 loss is robustly associated with poor vision, we varied the visual acuity thresholds used to classify species as having low-acuity vision and explored an alternative approach for defining poor vision species based on a molecular loss signature of known vision-related genes. All three modified screens consistently retrieved SERPINE3 as one of the top-ranked hits (Figure 1 – source data 5-7), showing that this association is robust to the selected thresholds and phenotype definition.”

Filters 3 and 4 appear to be redundant with each other and convergence analysis itself. The analysis is already scoring genes for intactness in conjunction with the trait, so it seems unnecessary to exclude genes from the analysis based on patterns of intactness in conjunction with the trait. It's the same measurement, twice.

The reviewer is right that filter 3 was not really necessary. As explained above, we have now removed this filter, which only affected three genes.

The purpose of these filters is unclear. The consequences of these filters on p-value distributions, multiple hypothesis corrections, and top hits are unclear, and possibly inappropriate. The lack of a negative control again exacerbates these concerns. While it is non-trivial to select a control set (or many sets) of lineages with comparable topological relationships as the experimental group, it is necessary to demonstrate the robustness of the method and dataset.

As mentioned above, we have now explained the rationale behind the filters. We also ran a negative control screen. In contrast to the real screen for genes preferentially lost in low-acuity species, this control screen did not retrieve any eye-related genes. The results of the control screen are provided in Figure 1 – source data 4.

Including entire gene lists in the supplemental tables for each method (as opposed to just the top 26 of a single method) would helpfully provide the raw p-values for the distributions mentioned above, allow more comparison and quantification of differences between cutoffs, and allow for future intersections with other datasets. This would dramatically increase the utility of the overall convergence results for the field. In conjunction, there is a truly impressive amount of genomic work here that is unlikely to be replicated by someone else that wants to do a similar analysis on a different phenotype, but the dataset itself does not seem to be available.

We apologize for not listing the entire dataset in the submitted version. This is now corrected by providing a new source Data file, Figure 1 – source data 2, which lists all genes that were screened, the raw P-values and FDR values, and which genes passed the filter for losses in at least three independent lineages.

Figure 1 – source data 4-7 now provides the full list of genes together with their P-values, FDRs and filters for all additional screens conducted in this manuscript (two different visual acuity cutoffs, using a gene loss signature to define the phenotype, negative control screen).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Indrischek H, Hiller M, Brand M. 2022. Microscopic eye phenotype of SERPINE3 cbg17 and cbg18 Knockout lines. Dryad Digital Repository. [DOI]

    Supplementary Materials

    Figure 1—source data 1. Genome assemblies used in this study and visual acuity values of mammalian species.
    Figure 1—source data 2. List of candidate genes of the forward genomics screen based on visual acuity <1 including a short description.
    Figure 1—source data 3. Enrichment of candidate genes in different processes/functions.
    Figure 1—source data 4. A control forward genomics screen does not retrieve vision-related genes.
    Figure 1—source data 5. List of candidate genes of the forward genomics screen based on a visual acuity threshold <1.5.
    Figure 1—source data 6. List of candidate genes of the forward genomics screen based on visual acuity threshold <0.5.
    Figure 1—source data 7. List of candidate genes of the forward genomics screen based on the molecular loss signature of known vision genes.
    Figure 1—source data 8. Reading frame %intact values for all investigated genes and species.
    Figure 1—source data 9. Enrichment of candidate genes in different stages of the filtering process.
    Figure 2—source data 1. SERPINE3 intactness in 447 mammalian assemblies.
    Figure 2—source data 2. Verification of inactivating mutations in SERPINE3 with raw DNA sequencing reads.
    Figure 2—source data 3. Expression of SERPINE3 remnants in public RNA-seq datasets.
    Figure 2—source data 4. Analysis of relaxation or intensification of selection in SERPINE3 branches of interest.
    Figure 3—source data 1. qPCR analysis of a zebrafish organ series using the normalizer gene actb.
    Figure 3—source data 2. qPCR analysis of a zebrafish organ series using the normalizer gene rpl13a.
    Figure 3—source data 3. qPCR analysis of zebrafish eye subtissues.
    Figure 3—source data 4. qPCR analysis of mouse organs.
    Figure 3—source data 5. Data sets supporting expression of SERPINE3 in tissues of the eye in vertebrates other than mouse and zebrafish.
    Figure 4—source data 1. qPCR analysis of zebrafish serpine3cbg17 homozygotes (KO) vs WT.
    Figure 4—source data 2. Macroscopic eye phenotype in serpine3 knockout lines.
    Figure 4—source data 3. Single-nucleotide polymorphism near SERPINE3 (13: 51, 341, 032–51, 362,101) are associated with human eye phenotypes.
    Figure 4—source data 4. Raw gel images of genotyping of the serpine3cbg17 line with two primers (labeled image is shown in Figure 4 - figure supplement 2A).
    Figure 4—source data 5. Raw gel images of genotyping of the serpine3cbg17 line with a mix of three primers (labeled image is shown in Figure 4 - figure supplement 2B).
    Figure 4—source data 6. Raw gel images of genotyping of the serpine3cbg18 line with two primers (labeled image is shown in Figure 4 - figure supplement 2C).
    Supplementary file 1. Annotated protein alignment of intact and complete mammalian SERPINE3.jvp format is to be opened in Jalview.
    elife-77999-supp1.zip (68.7KB, zip)
    Supplementary file 2. Predicted structure of human SERPINE3 (AlphaFold 2).
    elife-77999-supp2.zip (58.9KB, zip)
    Supplementary file 3. Comparison of 3D structures of crystallized SERPINs to the predicted human SERPINE3.
    elife-77999-supp3.xls (29KB, xls)
    Transparent reporting form

    Data Availability Statement

    All data needed to evaluate the conclusions in the paper are present in the paper and the Supplementary Materials. The annotated protein alignment of intact and complete mammalian SERPINE3 genes are in Supplementary File 1, the predicted structure of human SERPINE3 are in Supplementary File 2 and the raw microscopy images of fish eyes are available on Dryad at DOI 10.5061/dryad.ncjsxksxt. TOGA annotations of SERPINE3 and a visualization of gene-inactivating mutations are available at our UCSC genome browser mirror https://genome.senckenberg.de.

    The following dataset was generated:

    Indrischek H, Hiller M, Brand M. 2022. Microscopic eye phenotype of SERPINE3 cbg17 and cbg18 Knockout lines. Dryad Digital Repository.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES