Abstract
Genetically encoded biosensors based on engineered fluorescent proteins (FPs) are essential tools for monitoring the dynamics of specific ions and molecules in biological systems. Arsenic ion in the +3 oxidation state (As3+) is highly toxic to cells due to its ability to bind to protein thiol groups, leading to inhibition of protein function, disruption of protein–protein interactions, and eventually to cell death. A genetically encoded biosensor for the detection of As3+ could potentially facilitate the investigation of such toxicity both in vitro and in vivo. Here, we designed and developed two prototype genetically encoded arsenic biosensors (GEARs), based on a bacterial As3+ responsive transcriptional factor AfArsR from Acidithiobacillus ferrooxidans. We constructed FRET-based GEAR biosensors by insertion of AfArsR between FP acceptor/donor FRET pairs. We further designed and engineered single FP-based GEAR biosensors by insertion of AfArsR into GFP. These constructs represent prototypes for a new family of biosensors based on the ArsR transcriptional factor scaffold. Further improvements of the GEAR biosensor family could lead to variants with suitable performance for detection of As3+ in various biological and environmental systems.
Keywords: genetically encoded biosensor, arsenic biosensors (GEARs), FRET and FP-based arsenic biosensors
1. Introduction
Arsenic and arsenic compounds ubiquitously exist in the natural environment in different forms including organic, inorganic and arsine gas. Common organic arsenic compounds include arsanilic acid (C6H8AsNO3), methylarsonic acid (CH5AsO3), dimethylarsinic acid (cacodylic acid, C2H7AsO2), and arsenobetaine (C5H11AsO2). The inorganic compounds, which are the most toxic, include trivalent and pentavalent compounds. Arsenic trioxide (As2O3), sodium arsenite (NaAsO2), and arsenic trichloride (AsCl3) are the most common trivalent compounds, and arsenic pentoxide (As2O5), arsenic acid (H3AsO4), and arsenates (e.g., lead arsenate PbHAsO4 and calcium arsenate Ca3(AsO4)2) are the most common pentavalent compounds [1]. Anthropogenic and naturally arsenic-contaminated groundwater and soil are the major sources of arsenic introduction into the food chain, resulting in human exposure to excessive arsenic [2]. High levels of arsenic exposure lead to health problems in humans, including many arsenicosis diseases such as skin disease, respiratory disorders, cardiovascular disorders, developmental neurotoxicity, and various cancers [3,4,5].
Methods to detect and quantify arsenic and arsenic compounds in vitro or in vivo are important to help assess arsenic contamination and its toxicity. For the in vitro detection of arsenic and its compounds, a variety of tools have been developed [6]. However, none of these detection methods provide compatibility within cell and in vivo detection of arsenic. Genetically encodable proteinaceous indicators with optical output, on the other hand, offer distinctive advantages as they can be used either in vitro or in vivo [7]. One strategy for the development of genetically encoded biosensors is to use changes in Förster resonance energy transfer (FRET) efficiency as a reporter of binding-induced conformational changes in a suitably designed fusion protein. A target molecule (analyte) binding domain (also referred to here as the sensing domain) is genetically fused between two fluorescent proteins (FPs) that serve as FRET donor (a more blue-shifted FP) and acceptor (a more red-shifted FP). Upon analyte binding to the sensing domain, a change in distance and/or orientation between the two FPs leads to alteration of FRET efficiency and a ratiometric change in fluorescence emission profile [8,9]. Another important type of genetically encoded biosensor is based on a single FP into which a sensing domain has been genetically inserted. Upon analyte binding, the sensing domain undergoes a conformational change that leads to an alteration of the FP chromophore environment, and a consequent change in the fluorescence emission intensity [10].
The use of an appropriate sensing domain in a genetically encodable biosensor can enable the biosensor to have exquisite specificity for the target molecule (that is, analyte) of interest. Appropriate sensing domains for construction of an arsenic biosensor are the arsenic binding proteins from bacterial arsenic detoxification systems. Expression of detoxification genes located in the ars operons is controlled by the As3+-responsive repressor ArsR. Upon binding to As3+, ArsR activates transcription of genes in the ars operon that encode for proteins involved in As3+ detoxification [11]. Among the ArsR family, three ArsR proteins have been characterized to have distinctive As3+ binding sites [12]. From Escherichia coli plasmid R773, the As3+ binding domain EcArsR contains three cysteines at positions Cys32, Cys34 and Cys37, which are proposed to form the As3+ binding site [12,13]. Recently, a FRET-based As3+ indicator, designated SenALiB, based on EcArsR and the ECFP/mVenus FRET pair was reported [14]. The second ArsR ortholog was discovered in Acidithiobacillus ferrooxidans (AfArsR). The As3+ binding site of AfArsR is likely composed of three cysteine residues, Cys95, Cys96, and Cys102, which are located at the flexible end of the C-terminus [15,16]. The third ArsR repressor has been identified in Corynebacterium glutamicum (CgArsR). In CgArsR, the three cysteines that form the binding site are located between the two dimeric subunits: Cys15 and Cys16 are located in one dimer subunit and Cys55 is located in the other [16,17]. Structural and biophysical studies of AfArsR and CgArsR have provided insight into the As3+ binding mode and its specificity towards As3+, therefore suggesting the feasibility of using such scaffolds for the design of biosensors for detecting arsenic [15,16]. In this study, we report our efforts to design, construct, and characterize prototypes of genetically encoded arsenic biosensors (GEARs) based on the Acidithiobacillus ferrooxidans As3+ responsive transcription factor, AfArsR.
2. Materials and Methods
2.1. DNA Construction and Mutagenesis
All synthetic DNA primers were purchased from Integrated DNA Technologies (IDT, Coralville, IA, USA). CloneAmp HiFi polymerase (TakaraBio, Kusatsu, Japan) was used for PCR amplification. PCR products were purified using GeneJET gel extraction kit (Thermo Scientific, Waltham, MA, USA) according to the manufacturer’s protocols. InFusion Assembly (TakaraBio, Kusatsu, Japan) was used for assembly of gene inserts and plasmid vectors. The resulting plasmids were used to transform E. coli DH10B electrocompetent cells (Thermo Scientific, Waltham, MA, USA). The DNA sequencing reactions were performed and analyzed at the University of Alberta Molecular Biology Service Unit by dye terminator cycle sequencing using the BigDye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems, Waltham, MA, USA). The DNA encoding AfArsR was synthesized by IDT. For construction of FRET-based GEAR-CV1, AfArsR gene was amplified using primers with overlap regions with the FRET donor and acceptor pair mCerulean3 and cpVenus, then assembled into a PCR-amplified pBAD-KIRIN1 plasmid vector. For the replacement of FRET donor and acceptor, FP mTFP1 and mCitrine gene were PCR-amplified and double digested with XhoI/KpnI and EcoR1/HindIII, respectively. The digested product was then ligated to similarly digested pBAD-GEAR-CV1 plasmid. For construction of single FP-based GEAR-G1, AfArsR gene was amplified using primers with overlap regions with GFP, then assembled into a PCR-amplified pBAD-GINKO1 plasmid vector. Site-directed mutagenesis, deletion, and linker saturation mutagenesis were performed using a QuikChange site-directed mutagenesis kit (Agilent, Santa Clara, CA, USA) following the manufacturer’s protocol. The transformed cells were spread on agar plates supplemented with 0.1 mg/mL ampicillin and 0.02% L-arabinose for 16–18 h. The colony fluorescence was inspected by using a custom-built colony screener under blue light.
2.2. Protein Expression and Purification
The protein was expressed as His-tagged recombinant proteins in DH10B E. coli cells. The culture was incubated at 37 °C for 4 h when OD achieved 0.6–0.8. Then, L-arabinose was added to a final concentration at 0.02%. Then, the culture was transferred to 30 °C for overnight incubation for a maximum of 18 h. Bacterial culture was harvested at 10,000 rpm at 4 °C for 10 min. Cell pellet was resuspended in Tris-buffered saline (TBS, 150 mM NaCl, 20 mM Tris, pH 7.5) and lysed by using sonication. The His-tagged protein from the collected supernatant was purified by affinity chromatography using Ni-NTA beads. Protein-bound beads were washed with wash buffer (20 mM imidazole, 50 mM Tris pH 7.5, 300 mM NaCl) and eluted with an elution buffer (50 mM NaH2PO4.H2O, 300 mM NaCl, 500 mM Imidazole). The eluted protein fractions were buffer exchanged by using a PD-10 Columns (GE Healthcare Life Sciences, Chicago, IL, USA) desalting column and then concentrated by using 10 kDa cut-off centrifuge filter columns (Amicon, Merck Millipore, Burlington, MA, USA) and stored in 1× TBS (pH 7.5).
2.3. Fluorescence Measurement
For FRET-based GEARs the fluorescence spectrum was measured using a Tecan Safire2 microplate reader with excitation at 430 nm and emission from 450 nm to 600 nm. The fluorescence spectrum of the single-based GEARs was measured with excitation at 400–480 nm and emission from 480 nm to 600 nm. For measurement of Kd,app of As3+, purified protein was diluted into a series of buffers with As3+; different concentrations of sodium arsenite (Sigma-Aldrich, St. Louis, MO, USA) as the source of As3+ with 5 mM β-mercaptoethanol (BME) were added. The titration experiments were performed in triplicate. Data were analyzed and plotted with GraphPad Prism (8.0, GraphPad Software, San Diego, CA, USA).
2.4. Computational Structure Prediction of Genetically Encoded As3+ Biosensors
The 3D structure for GEAR-G1 was generated by using the Robetta server, which simultaneously used de novo and homology modeling methods. Structure refinement [18] and validation were carried out by using standard protocol of molecular dynamics (MD) simulation. MD simulations for the top selected model of GEAR-G1, generated by Robetta, were performed by using the GROMACS simulation package (GROMACS 2020.4) [19]. MD simulation of the protein complex GEAR-G1 was carried out for 150 ns in water using CHARMM 36 m force field for protein. The trajectory and energy files were written after every 10 ps. The system was solvated in a truncated octahedral box, containing TIP3P water molecules. The GEAR-G1 protein was centered in the simulation box within minimum distance to the box edge, with 5535 atoms overall which were neutralized by adding 7 K+ ions to the system. The steepest descent method was used to perform the minimization steps of 5000. To remove the steric clashes the convergence was achieved within the maximum force <1000 (KJ mol−1 nm−1). The system was equilibrated at NVT (volume and temperature at constant number of particles) and NPT (pressure and temperature at constant number of particles or molecules) ensembles for 100 ps (50,000 steps) and 1000 ps (1,000,000 steps), respectively, using time steps of 0.2 and 0.1 fs, respectively, at 300 K to ensure a fully converged system for production run. Production runs were performed at constant temperature of 300 K and at 1 atm pressure (using NPT ensemble) using weak coupling velocity-rescaling (modified Berendsen thermostat) [20] and Parrinello–Rahman algorithms [21], respectively. Relaxation times were set to τ T = 0.1 ps and τ P = 2.0 ps. All bond lengths involving hydrogen atoms were kept rigid at ideal bond lengths using the Linear Constraint Solver (lincs) algorithm, allowing for a time step of 2 fs. The Verlet scheme was used for the calculation of non-bonded interactions. Periodic boundary conditions (PBC) were used in all x, y, z directions. Interactions within a short-range cutoff of 1.2 nm were calculated in each time step. Particle mesh ewald (PME) was used to calculate the electrostatic interactions and forces to account for a homogeneous medium outside the long-range cutoff. The production was run for 150 ns for the system and for the analysis such as RMSF and PCA, only the last 100 ns was used, and the first 50 ns were discarded [22].
2.5. Statistical Analysis
Data are expressed as individual data points or mean ± SD. t-tests were used with statistical significance labeled on the figures.
3. Results and Discussion
3.1. Development of Genetically Encoded As3+ Biosensors Based on FRET
We first designed a FRET-based genetically encoded As3+ biosensor by fusing a FP FRET pair to the N- and C-termini of AfArsR (Figure 1A). We rationalized that, upon binding of As3+ with the three cysteine residue at position Cys95, Cys96, and Cys102, the conformational change of AfArsR would bring the fused FP FRET donor and acceptor closer together, thus increasing the FRET efficiency and resulting in a ratiometric fluorescence change (Figure 1B). We constructed the FRET-based As3+ biosensor by using cyan FP (CFP) mCerulean3 [23] as the FRET donor and yellow FP (YFP) variant cpVenus173 [24] as acceptor [25,26]. mCerulean3 is linked to the N-terminus of the full length AfArsR (residue 1–118) and cpVenus173 is linked to the C-terminus. The resulting construct was designated as GEAR-CV1. To characterize GEAR-CV1, we expressed and purified the protein and tested its FRET efficiency change in response to As3+ addition. The emission spectrum (450 nm to 600 nm) of GEAR-CV1 protein was measured before and after addition of 1 mM As3+. We observed that, in response to As3+, the mCerulean3 emission (~475 nm) decreased by ~10% and cpVenus (~530 nm) emission increased by ~5%, indicating an overall increase in FRET efficiency (Figure 1C). The maximum FRET acceptor and donor fluorescence emission intensity ratio (R = F530/F475) change (ΔR/Rmin) associated with the addition of As3+ was calculated to be 15.8 ± 0.2%. The titration results showed that there was a concentration-dependent increase in FRET ratio for GEAR-CV1 with an apparent Kd (Kd,app = concentration at half maximal change of ΔR/Rmin) of 84.9 µM (Figure 1D). These data established GEAR-CV1 as a functional FRET-based As3+ biosensor prototype.
With the prototype biosensor constructed, we sought to use site-directed mutagenesis to investigate the potential contributions of cysteine residues in the AfArsR arsenic binding site for the sensing of As3+. Previous studies have demonstrated that cysteine residues 95, 96, and 102 are all involved in the As3+ binding (Figure 2A) [15,16]. Among these three residues, Cys95 and Cys96 have been shown to be essential for As3+ binding, while Cys102 is not strictly required. Specifically, the Cys102Ser mutant still maintains the ability to bind As3+, albeit with a reduced affinity [15]. Accordingly, we created a series of cysteine-mutated variants based on GEAR-CV1, including three single-mutants (Cys95Ala, Cys96Ala, and Cys102Ala) (Figure 2B–D), one double-mutant (Cys95Ala/Cys96Ala) (Figure 2E), and one triple-mutant (Cys95Ala/Cys96Ala/Cys102Ala) (Figure 2F). Among all these mutants, only the Cys102Ala single-mutant retained a statistically significant FRET change (7.2 ± 0.9%) upon the addition of As3+ (Figure 2D). All other mutants showed no response to As3+ (Figure 2), which is consistent with the previous biochemical study on AfArsR Cys mutations [15]. These results confirmed the distinctive roles of the cysteine residues in As3+ binding, and also suggested the feasibility of using a FRET-based approach for the investigation of As3+ binding residues. These non-binding mutants of GEAR-CV1 also demonstrate that the fluorescent proteins mCerulean3 and cpVenus do not directly bind and respond to As3+.
We next attempted to improve the FRET ratio change (ΔR/Rmin) of GEAR-CV1. Upon inspection of the crystal structure of AfArsR, we noticed that the C-terminal region of AfArsR appears to be largely unstructured [16]. Therefore, we hypothesized that the C-terminal region might not be essential for As3+ binding, and that deletion of the C-terminal region in the GEAR-CV1 could effectively bring the FRET donor and acceptor closer in distance, potentially further increasing the FRET efficiency change. Accordingly, we constructed the variant GEAR-CV2 with ten C-terminal residues of AfArsR (Gly-Glu-Thr-Arg-Ser-Pro-Ser-Val-Gln-Glu) deleted (Figure 3A). Compared to the 15.8 ± 0.2% maximum ΔR/Rmin of the template GEAR-CV1 (Figure 3B), the As3+ response test of GEAR-CV2 revealed the maximum ΔR/Rmin to be 22 ± 3.5% (Figure 3C), which indicates a substantial improvement in FRET ratio change. This FRET change is also larger than the previously reported ~10% for the SenALiB As3+ biosensor [14]. It is worth noting that the FRET ratio of GEAR-CV2 in the As3+ unbound state is increased relative to GEAR-CV1 (Figure 3C), indicating that the FRET donor and acceptor are closer in proximity due to the deletion of AfArsR C-terminal residues.
In an effort to further improve the FRET ratio change, we explored the use of alternative FP FRET donors and acceptors. We designed three additional FRET-based As3+ biosensor prototypes by replacing the donor and acceptor with mTFP1 and mCitrine, respectively [27,28]. Based on GEAR-CV2, we constructed GEAR-TV1 (mTFP1/cpVenus) (Figure 3D), GEAR-CC1 (mCerulean3/mCitrine) (Figure 3E), and GEAR-TC1 (mTFP1/mCitrine) (Figure 3F). In vitro characterization revealed that the FRET ratio changes ΔR/Rmin for all these three constructs (GEAR-TV1 7 ± 2.3%, GEAR-CC1 11 ± 1.8%, GEAR-TC1 2.8 ± 0.6%) were no greater than that of GEAR-CV2 (22 ± 3.5%). Overall, the structure-guided deletion of the C-terminal residues, but not the replacements of the donor and acceptor FPs, improved the As3+-induced response of the FRET-based As3+ prototype. Accordingly, the engineered GEAR-CV2 is a promising template for further genetically encoded FRET biosensor development leading to eventual application for As3+ detection.
3.2. Development of Genetically Encoded As3+ Biosensors Based on a Single FP
Following on from our success at constructing the prototype FRET-based GEAR-CV1 and GEAR-CV2, we further explored the possibility of constructing a single FP-based As3+ biosensor using AfArsR as a binding domain. We designed and constructed a genetically encoded As3+ biosensor based on a single green FP (GFP). We used an insertion-type biosensor topology, in which the intact AfArsR domain was inserted into GFP in close proximity with the chromophore (Figure 4A). Specifically, residues 146–147 of GFP were replaced with AfArsR, with two residue-long linkers at both connection points. The conformational change of AfArsR upon binding to As3+ was expected to alter the chromophore environment, thus changing the fluorescence emission intensity (Figure 4B). This topology has been previously reported for the construction of Ca2+, glutamate, K+, and citrate biosensors [26,29,30,31] among many others. We found that insertion of intact AfArsR into the GFP scaffold resulted in a prototype green fluorescent indicator, designated as GEAR-G1. Upon addition of 1 mM As3+ to purified GEAR-G1, the fluorescence emission intensity change (ΔF/F0) is 31.6% (Figure 4C).
To improve the response of GEAR-G1 biosensor toward As3+, we focused our rational engineering efforts on the linkers that connect GFP to the AfArsR domain, and the residues of GFP (residue 145 as numbered in GFP = residue 146 as numbered in GEAR-G1; and residue 148 as numbered in GFP = residue 263 as numbered in GEAR-G1) that are connected to the linkers (Figure 4D). Based on structural and mechanistic insights gained from analyzing highly engineered single FP-based biosensors [32], the sequence consensus suggested that mutating the residues at Met146 and Gln263 to Phe146 and His263, respectively, could potentially improve the biosensor. We thus performed site-directed mutagenesis to obtain GEAR-G1-Met146Phe/Gln263His. Starting from this variant, we optimized the linker regions by randomizing linker1 (Glu147-Pro148) and linker2 (Gly261-Asn262) using saturation mutagenesis. Upon screening the linker mutagenesis libraries, we identified GEAR-G1-Met146Phe/Glu147Asp/Pro148Ser/Gly261Phe/Asn262Asp/Gln263His with optimized linker sequences. This linker optimized variant, designated as GEAR-G2, has a maximum ΔF/F0 of 44.9% upon binding of As3+ (Figure 4E). Altogether, the rational approaches of consensus engineering and linker optimization substantially improved the GEAR-G1 prototype biosensor in terms of As3+-induced fluorescent emission intensity change. The development and further improvement of the prototype GEAR-G2 As3+ biosensor provides strong support for the conclusion that the ArsR scaffold can be used for the construction of genetically encoded single FP-based indicators. Nonetheless, the established GEAR-G2 biosensor exhibits relatively low sensitivity, as expected for a prototype biosensor. Further optimization efforts are likely to eventually lead to high-performance biosensors with greater sensitivity and improved utility.
3.3. Computational Structure Prediction of Genetically Encoded As3+ Biosensors
To obtain further insight into the structure and mechanism of the GEAR-G1 biosensor, we employed computational structure prediction and simulations. We used multiple online servers including i-TASSER, SWISS MODEL, Phyre2, and Rosetta [33,34,35,36] to generate models of the protein where they were compared and selected. Our selection criteria for the model was based on the perseverance of the secondary structure elements in the protein complex, especially for the AfArsR arsenic binding site in the protein complex. Ultimately, the model generated by Robetta was chosen (Figure 5A) to be carried forward for further simulations as it is more structurally preserved compared to others.
Having selected the Rosetta-generated model for further structure validation, we ran molecular dynamics (MD) simulations for 150 ns and compared four different structures of GEAR-G1 at different time scales of the MD simulation: 0 ns, 50 ns, 100 ns, and 150 ns (Figure 5B). It was observed that the C-terminal helix of GFP in GEAR-G1, which was visible at 0 ns, had completely transformed into a coil structure within 50 ns of simulation. While the structure of GFP in GEAR-G1 was well-maintained throughout the MD simulation, a conformational change in the region of residues 80, 81, and 82 can be observed. Additionally, some minor structural changes can also be seen in the AfArsR region. The Dictionary of Secondary Structure of Protein (DSSP) graph provided information regarding α-helix, β-sheet and loop content in the protein [37]. In the DSSP graph of GEAR G1 (Figure 5C), no major changes were observed except for a few minor changes in coils and short helix (Figure 5C, shown in red and cyan), possibly due to the formation of coils in the C-terminal of GFP in GEAR-G1. As shown in Figure 5D, the root means square deviation (RMSD) and radius of gyration (Rg) values are reasonably stable throughout the last 100 ns of MD simulation (i.e., 50–150 ns) with average values of 0.44 ± 0.04 nm and 2.36 ± 0.02 nm, respectively. The root mean square fluctuation (RMSF) plot (Figure 5D, bottom panel) indicated some noticeable fluctuations for residues 180–195, 215–225, and 245–255, which mostly represent the loop and coil regions of the AfArsR domain. The GFP domain is rather stable except for the highly flexible C-terminus. Overall, these computational structure analyses showed that the GEAR-G1 secondary structure elements were conserved with only small variations during the 150 ns simulation of GEAR-G1.
To investigate the correlated motions between residues of GEAR-G1 protein complex, we performed dynamic cross correlation matrix analysis on the trajectory obtained from the last 100 ns of MD simulation. The cyan and purple colors indicate positive and negative correlation motions between fluctuating residues, respectively. In Figure 5E, the diagonal, cyan-colored region shows a positive correlation of topologically proximate residues. Moreover, the analysis also suggests highly correlated intra-residual motion within GFP and AfArsR residues, as well as an inter-residual cross correlation between GFP and AfArsR residues.
To confirm the correlated dynamic nature of GEAR-G1, we performed principal component analysis (PCA) using the α-carbon (Cα) position covariance. The first PC (PC1) accounts for 33.7% variance, which shows the largest variation in protein dynamic. The second PC (PC2) captures 23.2% of variance in protein, which is the second most important direction, and it is orthogonal to the PC1 axis. Altogether, the first three PC components account for more than 63% of protein variance (Figure 6A). The first PC highlights a twisting motion of the whole AfArsR region with the GFP; the N-terminal region of GFP exhibits a pronounced twisting effect (Figure 6B). On the other hand, the PC2 displays a scissoring motion (bending motion) of the AfArsR and GFP regions, which depicts the negative correlation motion (Figure 6C–E). Based on PC1, PC2 and PC3 variances, the conformers are clustered into four groups (colored as blue, cyan, pink, and red) based on the k-means algorithm values (Figure 6C–E) [38]. The red color region shows that there is less movement, the white region shows intermediate movement, while the blue region has the most significant movement or flexibility. The PCA identified the dominant regions of protein during simulation. The GFP domain (shown in red color) is pretty much stable, with the only exception being the flexible loops at the C-terminus. On the contrary, the AfArsR domain showed sufficient fluctuations as represented in blue color in Figure 6B, suggesting that the AfArsR domain is structurally highly flexible.
4. Conclusions
Over the past two decades the development and application of genetically encoded biosensors has steadily broadened the scope of scientific questions that cell biologists can ask and answer [39,40]. With the rapid expansion of the biosensor toolkit, a wider range of molecules now can be detected using FP-based biosensors to benefit more areas of biological sciences [26,29,41,42,43,44]. Here, we have further broadened this target range of genetically encoded biosensors by establishing the ‘GEAR series’ of FRET-based and single FP-based genetically encoded As3+ biosensor prototypes. Using rational engineering approaches, we successfully improved the performance of these biosensors. We also used the newly developed FRET-based biosensor GEAR-CV1 to study the role of the AfArsR As3+ binding site residues using site-directed mutagenesis and fluorescence spectroscopy, providing further support for the conclusions of previous studies [15,16]. We thus expect this could serve as a generally useful approach for investigation of mutational effect on target binding as long as the binding-induced conformational change can be measured by an observable FRET change. Using computational structure prediction and simulations, we further obtained valuable structural and mechanistic insights for sensor optimization. Future effort should be directed towards experimental structure determination in order to reveal and provide insight into the detailed sensing mechanism.
The newly developed GEAR biosensor prototypes, together with the EcArsR-based SenALiB [14], are promising templates for future development of high-performance As3+ biosensors. Further optimization using directed evolution is likely to yield new variants with substantially improved sensitivity for in vitro and in vivo detection of As3+. Engineered and improved biosensors with proper brightness, sensitivity, and selectivity are expected to unlock new possibilities for investigation of cellular signaling associated with As3+ dynamics and toxicity in the native cellular environment. Moreover, these biosensors serve as the first examples using the ArsR transcriptional regulator for genetically encoded biosensor engineering. The transcriptional factor ArsR superfamily has a diversified range of binding targets [45]; therefore, it represents a promising resource to be exploited for the future engineering of fluorescent biosensors.
Acknowledgments
We thank Landon Zarowny, Sheng-Yi Wu, Rochelin Dalangin, Xiaocen Lu, and Shuce Zhang for technical support and helpful discussion. We thank the University of Alberta Molecular Biology Services Unit (MBSU) for DNA sequencing support, Christopher W. Cairo (University of Alberta, Canada) for providing access to instrumentation.
Author Contributions
Y.S. and R.E.C. conceived this project. S.S.K. performed plasmid construction, protein purification, and in vitro protein characterization. S.S.K. and M.Q.F. performed computational structure prediction and analysis. Y.S. and S.S.K. analyzed data, prepared figures, and wrote the manuscript. Y.S., R.E.C. and H.B. supervised the project. All authors contributed to manuscript editing. All authors have read and agreed to the published version of the manuscript.
Funding
This research supported by the International Research Support Initiative Program from Higher Education Commission of Pakistan (IRSIP, HEC) and by HEC Pakistan NRPU funded project #20-347/NRPU/R&D/HEC/2014/1360. Research in the lab of REC was supported by grants from the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Canadian Institutes of Health Research (CIHR). The APC was funded by grants from the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Canadian Institutes of Health Research (CIHR).
Data Availability Statement
The data supporting this research are available upon request.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Chung J.-Y., Yu S.-D., Hong Y.-S. Environmental Source of Arsenic Exposure. J. Prev. Med. Public Health. 2014;47:253–257. doi: 10.3961/jpmph.14.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Shahid M., Niazi N.K., Dumat C., Naidu R., Khalid S., Rahman M.M., Bibi I. A Meta-Analysis of the Distribution, Sources and Health Risks of Arsenic-Contaminated Groundwater in Pakistan. Environ. Pollut. 2018;242:307–319. doi: 10.1016/j.envpol.2018.06.083. [DOI] [PubMed] [Google Scholar]
- 3.Kuo C.-C., Moon K.A., Wang S.-L., Silbergeld E., Navas-Acien A. The Association of Arsenic Metabolism with Cancer, Cardiovascular Disease, and Diabetes: A Systematic Review of the Epidemiological Evidence. Environ. Health Perspect. 2017;125:087001. doi: 10.1289/EHP577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rahman M.A., Rahman A., Khan M.Z.K., Renzaho A.M.N. Human Health Risks and Socio-Economic Perspectives of Arsenic Exposure in Bangladesh: A Scoping Review. Ecotoxicol. Environ. Saf. 2018;150:335–343. doi: 10.1016/j.ecoenv.2017.12.032. [DOI] [PubMed] [Google Scholar]
- 5.Di Giovanni P., Di Martino G., Scampoli P., Cedrone F., Meo F., Lucisano G., Romano F., Staniscia T. Arsenic Exposure and Risk of Urothelial Cancer: Systematic Review and Meta-Analysis. Int. J. Environ. Res. Public Health. 2020;17:3105. doi: 10.3390/ijerph17093105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Yogarajah N., Tsai S.S.H. Detection of Trace Arsenic in Drinking Water: Challenges and Opportunities for Microfluidics. Environ. Sci.: Water Res. Technol. 2015;1:426–447. doi: 10.1039/C5EW00099H. [DOI] [Google Scholar]
- 7.Greenwald E.C., Mehta S., Zhang J. Genetically Encoded Fluorescent Biosensors Illuminate the Spatiotemporal Regulation of Signaling Networks. Chem. Rev. 2018;118:11707–11794. doi: 10.1021/acs.chemrev.8b00333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Campbell R.E. Fluorescent-Protein-Based Biosensors: Modulation of Energy Transfer as a Design Principle. Anal. Chem. 2009;81:5972–5979. doi: 10.1021/ac802613w. [DOI] [PubMed] [Google Scholar]
- 9.Wiens M.D., Shen Y., Li X., Salem M.A., Smisdom N., Zhang W., Brown A., Campbell R.E. A Tandem Green-Red Heterodimeric Fluorescent Protein with High FRET Efficiency. Chembiochem. 2016;17:2361–2367. doi: 10.1002/cbic.201600492. [DOI] [PubMed] [Google Scholar]
- 10.Frommer W.B., Davidson M.W., Campbell R.E. Genetically Encoded Biosensors Based on Engineered Fluorescent Proteins. Chem. Soc. Rev. 2009;38:2833–2841. doi: 10.1039/b907749a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Murphy J.N., Saltikov C.W. The ArsR Repressor Mediates Arsenite-Dependent Regulation of Arsenate Respiration and Detoxification Operons of Shewanella Sp. Strain ANA-3. J. Bacteriol. 2009;191:6722–6731. doi: 10.1128/JB.00801-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chen J., Rosen B.P. Biosensors for Inorganic and Organic Arsenicals. Biosensors. 2014;4:494–512. doi: 10.3390/bios4040494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Saltikov C.W., Olson B.H. Homology of Escherichia Coli R773 arsA, arsB, and arsC Genes in Arsenic-Resistant Bacteria Isolated from Raw Sewage and Arsenic-Enriched Creek Waters. Appl. Environ. Microbiol. 2002;68:280–288. doi: 10.1128/AEM.68.1.280-288.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Soleja N., Manzoor O., Khan P., Mohsin M. Engineering Genetically Encoded FRET-Based Nanosensors for Real Time Display of Arsenic (As3+) Dynamics in Living Cells. Sci. Rep. 2019;9:11240. doi: 10.1038/s41598-019-47682-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Qin J., Fu H.-L., Ye J., Bencze K.Z., Stemmler T.L., Rawlings D.E., Rosen B.P. Convergent Evolution of a New Arsenic Binding Site in the ArsR/SmtB Family of Metalloregulators. J. Biol. Chem. 2007;282:34346–34355. doi: 10.1074/jbc.M706565200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Prabaharan C., Kandavelu P., Packianathan C., Rosen B.P., Thiyagarajan S. Structures of Two ArsR As (III)-Responsive Transcriptional Repressors: Implications for the Mechanism of Derepression. J. Struct. Biol. 2019;207:209–217. doi: 10.1016/j.jsb.2019.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Santha S., Pandaranayaka E.P.J., Rosen B.P., Thiyagarajan S. Purification, Crystallization and Preliminary X-Ray Diffraction Studies of the Arsenic Repressor ArsR from Corynebacterium Glutamicum. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 2011;67:1616–1618. doi: 10.1107/S1744309111038966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Park H., Ovchinnikov S., Kim D.E., DiMaio F., Baker D. Protein Homology Model Refinement by Large-Scale Energy Optimization. Proc. Natl. Acad. Sci. USA. 2018;115:3054–3059. doi: 10.1073/pnas.1719115115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lemkul J.A. From Proteins to Perturbed Hamiltonians: A Suite of Tutorials for the GROMACS-2018 Molecular Simulation Package [Article v1.0] Living J. Comput. Mol. Sci. 2019;1:5068. doi: 10.33011/livecoms.1.1.5068. [DOI] [Google Scholar]
- 20.Braun E., Moosavi S.M., Smit B. Anomalous Effects of Velocity Rescaling Algorithms: The Flying Ice Cube Effect Revisited. J. Chem. Theory Comput. 2018;14:5262–5272. doi: 10.1021/acs.jctc.8b00446. [DOI] [PubMed] [Google Scholar]
- 21.Okumura H., Itoh S.G., Okamoto Y. Explicit Symplectic Integrators of Molecular Dynamics Algorithms for Rigid-Body Molecules in the Canonical, Isobaric-Isothermal, and Related Ensembles. J. Chem. Phys. 2007;126:084103. doi: 10.1063/1.2434972. [DOI] [PubMed] [Google Scholar]
- 22.Kuznetsov A., Jarv J. Mapping the ACE2 Binding Site on the SARS-CoV-2 Spike Protein S1: Molecular Recognition Pattern. Proc. Eston. Acad. Sci. 2020;69:228. [Google Scholar]
- 23.Markwardt M.L., Kremers G.-J., Kraft C.A., Ray K., Cranfill P.J.C., Wilson K.A., Day R.N., Wachter R.M., Davidson M.W., Rizzo M.A. An Improved Cerulean Fluorescent Protein with Enhanced Brightness and Reduced Reversible Photoswitching. PLoS ONE. 2011;6:e17896. doi: 10.1371/journal.pone.0017896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Nagai T., Yamada S., Tominaga T., Ichikawa M., Miyawaki A. Expanded Dynamic Range of Fluorescent Indicators for Ca2+ by Circularly Permuted Yellow Fluorescent Proteins. Proc. Natl. Acad. Sci. USA. 2004;101:10554–10559. doi: 10.1073/pnas.0400417101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Thestrup T., Litzlbauer J., Bartholomaus I., Mues M., Russo L., Dana H., Kovalchuk Y., Liang Y., Kalamakis G., Laukat Y., et al. Optimized Ratiometric Calcium Sensors for Functional in Vivo Imaging of Neurons and T Lymphocytes. Nat. Methods. 2014;11:175–182. doi: 10.1038/nmeth.2773. [DOI] [PubMed] [Google Scholar]
- 26.Shen Y., Wu S.-Y., Rancic V., Aggarwal A., Qian Y., Miyashita S.-I., Ballanyi K., Campbell R.E., Dong M. Genetically Encoded Fluorescent Indicators for Imaging Intracellular Potassium Ion Concentration. Commun. Biol. 2019;2:18. doi: 10.1038/s42003-018-0269-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ai H.W., Olenych S.G., Wong P., Davidson M.W., Campbell R.E. Hue-Shifted Monomeric Variants of Clavularia Cyan Fluorescent Protein: Identification of the Molecular Determinants of Color and Applications in Fluorescence Imaging. BMC Biol. 2008;6:13. doi: 10.1186/1741-7007-6-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Griesbeck O., Baird G.S., Campbell R.E., Zacharias D.A., Tsien R.Y. Reducing the Environmental Sensitivity of Yellow Fluorescent Protein. Mechanism and Applications. J. Biol. Chem. 2001;276:29188–29194. doi: 10.1074/jbc.M102815200. [DOI] [PubMed] [Google Scholar]
- 29.Zhao Y., Shen Y., Wen Y., Campbell R.E. High-Performance Intensiometric Direct- and Inverse-Response Genetically Encoded Biosensors for Citrate. ACS Cent. Sci. 2020;6:1441–1450. doi: 10.1021/acscentsci.0c00518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Wu J., Abdelfattah A.S., Zhou H., Ruangkittisakul A., Qian Y., Ballanyi K., Campbell R.E. Genetically Encoded Glutamate Indicators with Altered Color and Topology. ACS Chem. Biol. 2018;13:1832–1837. doi: 10.1021/acschembio.7b01085. [DOI] [PubMed] [Google Scholar]
- 31.Zarowny L., Aggarwal A., Rutten V.M.S., Kolb I., GENIE Project. Patel R., Huang H.-Y., Chang Y.-F., Phan T., Kanyo R., et al. Bright and High-Performance Genetically Encoded Ca2+ Indicator Based on mNeonGreen Fluorescent Protein. ACS Sens. 2020;5:1959–1968. doi: 10.1021/acssensors.0c00279. [DOI] [PubMed] [Google Scholar]
- 32.Nasu Y., Shen Y., Kramer L., Campbell R.E. Structure- and Mechanism-Guided Design of Single Fluorescent Protein-Based Biosensors. Nat. Chem. Biol. 2021;17:509–518. doi: 10.1038/s41589-020-00718-x. [DOI] [PubMed] [Google Scholar]
- 33.Waterhouse A., Bertoni M., Bienert S., Studer G., Tauriello G., Gumienny R., Heer F.T., de Beer T.A.P., Rempfer C., Bordoli L., et al. SWISS-MODEL: Homology Modelling of Protein Structures and Complexes. Nucleic Acids Res. 2018;46:W296–W303. doi: 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kelley L.A., Mezulis S., Yates C.M., Wass M.N., Sternberg M.J.E. The Phyre2 Web Portal for Protein Modeling, Prediction and Analysis. Nat. Protoc. 2015;10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Yang J., Zhang Y. I-TASSER Server: New Development for Protein Structure and Function Predictions. Nucleic Acids Res. 2015;43:W174–W181. doi: 10.1093/nar/gkv342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Song Y., DiMaio F., Wang R.Y.-R., Kim D., Miles C., Brunette T., Thompson J., Baker D. High-Resolution Comparative Modeling with RosettaCM. Structure. 2013;21:1735–1742. doi: 10.1016/j.str.2013.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Klose D.P., Wallace B.A., Janes R.W. 2Struc: The Secondary Structure Server. Bioinformatics. 2010;26:2624–2625. doi: 10.1093/bioinformatics/btq480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Spellmon N., Sun X., Sirinupong N., Edwards B., Li C., Yang Z. Molecular Dynamics Simulation Reveals Correlated Inter-Lobe Motion in Protein Lysine Methyltransferase SMYD2. PLoS ONE. 2015;10:e0145758. doi: 10.1371/journal.pone.0145758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Lin M.Z., Schnitzer M.J. Genetically Encoded Indicators of Neuronal Activity. Nat. Neurosci. 2016;19:1142–1153. doi: 10.1038/nn.4359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Shen Y., Nasu Y., Shkolnikov I., Kim A., Campbell R.E. Engineering Genetically Encoded Fluorescent Indicators for Imaging of Neuronal Activity: Progress and Prospects. Neurosci. Res. 2020;152:3–14. doi: 10.1016/j.neures.2020.01.011. [DOI] [PubMed] [Google Scholar]
- 41.Shen Y., Rosendale M., Campbell R.E., Perrais D. pHuji, a pH-Sensitive Red Fluorescent Protein for Imaging of Exo− and Endocytosis. J. Cell Biol. 2014;207:419–432. doi: 10.1083/jcb.201404107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Rizza A., Walia A., Lanquar V., Frommer W.B., Jones A.M. In Vivo Gibberellin Gradients Visualized in Rapidly Elongating Tissues. Nat. Plants. 2017;3:803–813. doi: 10.1038/s41477-017-0021-9. [DOI] [PubMed] [Google Scholar]
- 43.Lobas M.A., Tao R., Nagai J., Kronschläger M.T., Borden P.M., Marvin J.S., Looger L.L., Khakh B.S. A Genetically Encoded Single-Wavelength Sensor for Imaging Cytosolic and Cell Surface ATP. Nat. Commun. 2019;10:711. doi: 10.1038/s41467-019-08441-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Zhang J.-F., Liu B., Hong I., Mo A., Roth R.H., Tenner B., Lin W., Zhang J.Z., Molina R.S., Drobizhev M., et al. An Ultrasensitive Biosensor for High-Resolution Kinase Activity Imaging in Awake Mice. Nat. Chem. Biol. 2021;17:39–46. doi: 10.1038/s41589-020-00660-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Busenlehner L.S., Pennella M.A., Giedroc D.P. The SmtB/ArsR Family of Metalloregulatory Transcriptional Repressors: Structural Insights into Prokaryotic Metal Resistance. FEMS Microbiol. Rev. 2003;27:131–143. doi: 10.1016/S0168-6445(03)00054-8. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data supporting this research are available upon request.