Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2016 Mar 30;25(3):259–271. doi: 10.1111/imb.12220

Insect transformation with piggyBac: getting the number of injections just right

M Gregory 1,2,, L Alphey 1,2,3,, N I Morrison 2, S M Shimeld 1
PMCID: PMC4982070  PMID: 27027400

Abstract

The insertion of exogenous genetic cargo into insects using transposable elements is a powerful research tool with potential applications in meeting food security and public health challenges facing humanity. piggyBac is the transposable element most commonly utilized for insect germline transformation. The described efficiency of this process is variable in the published literature, and a comprehensive review of transformation efficiency in insects is lacking. This study compared and contrasted all available published data with a comprehensive data set provided by a biotechnology group specializing in insect transformation. Based on analysis of these data, with particular focus on the more complete observational data from the biotechnology group, we designed a decision tool to aid researchers' decision‐making when using piggyBac to transform insects by microinjection. A combination of statistical techniques was used to define appropriate summary statistics of piggyBac transformation efficiency by species and insect order. Publication bias was assessed by comparing the data sets. The bias was assessed using strategies co‐opted from the medical literature. The work culminated in building the Goldilocks decision tool, a Markov‐Chain Monte‐Carlo simulation operated via a graphical interface and providing guidance on best practice for those seeking to transform insects using piggyBac.

Keywords: piggyBac, insect transformation, moths, transposases, transposable element, genetic, DNA transposable elements genetic vectors, germ‐line mutation, Monte‐Carlo, Markov‐Chain, decision model, goldilocks, insects, microinjection, transformation efficiency, survival, embryo, sterile insect technique, fluorescent proteins

Introduction

Insect transformation using piggyBac

The ability to integrate genetic constructs into the genome of organisms has utility in mitigating some of the global challenges facing humanity (Morales et al., 2007; Bazuin et al., 2011; Kim & Pyykko, 2011). Insect germline transformation (synonymous with insect transgenesis) can be employed to alter the phenotype of an insect by gene insertion (Fraser, 2012) and represents a research area attracting global interest (Tamura et al., 2000; Handler & Harrell, 2001; Perera et al., 2002; Sarkar et al., 2003; Morrison et al., 2010; Raphael et al., 2011).

Various methods can be employed to achieve genetic transformation. Transposable element (transposon) vector systems (Piégu et al., 2015) were developed for Drosophila melanogaster using the P element (Rubin & Spradling, 1982). Although P works in only a very limited range of insect species, similar systems using other Class II transposable elements were developed for non‐Drosophila insects. The most commonly used of these is the piggyBac element, originally discovered in cell lines of the cabbage looper moth, Trichoplusia ni (Fraser et al., 1983; Sarkar et al., 2003; Zimowska & Handler, 2006). It has been used for germline transformation in multiple insect orders (Table 1). A recent review by Kim & Pyykko (2011) summarized the molecular structure and mobility of piggyBac. The 2472‐bp‐long element is structured with two sets of inverted repeats at both ends and a central transposase‐encoding open reading frame (Fraser, 2012). The insertion site of piggyBac is quasi‐random, with a cut‐and‐paste insertion at the short genome motif site of TTAA (O'Brochta, 2003; Wu & Burgess, 2004; Zhuang et al., 2010).

Table 1.

Summary of the earliest successful transformation of insect species using piggyBac. Modified from Morrison et al. (2010)

Family Species name(s) Reference
Mosquitoes
Culicidae Yellow fever mosquito, Aedes aegypti (Kokoza et al., 2001)
Asian tiger mosquito, Aedes albopictus (Labbé et al., 2010)
Aedes fluviatilis (Rodrigues et al., 2006)
New World malaria mosquito, Anopheles albimanus (Perera et al., 2002)
African malaria mosquito, Anopheles gambiae (Grossman et al., 2001)
Indo‐Pakistan malaria mosquito, Anopheles stephensi (Ito et al., 2002; Nolan et al., 2002)
Fruit flies
Drosophilidae Common fruit fly, Drosophila melanogaster (Handler & Harrell, 1999)
Spotted‐wing drosophila, Drosophila suzukii (Schetelig et al., 2013)
Tephritidae Mexican fruit fly, Anastrepha ludens (Condon et al., 2007)
Caribbean fruit fly, Anastrepha suspensa (Handler & Harrell, 2001b)
Oriental fruit fly, Bactrocera dorsalis (Handler et al., 1998)
Queensland fruit fly, Bactrocera tryoni (Raphael et al., 2011)
Mediterranean fruit fly, Ceratitis capitata (Handler et al., 1998)
Other Diptera (pest, myiasis, biting flies)
Muscidae Housefly, Musca domestica (Hediger et al., 2001)
Calliphoridae Australian sheep blowfly, Lucilia cuprina (Heinrich et al., 2002)
New World screwworm, Cochliomyia hominivorax (Allen et al., 2004)
Diopsidae Stalk‐eyed flies, Teleopsis dalmanni (Warren et al., 2010)
Wasps, bees and ants
Hymenoptera Sawfly, Athalia rosae (Sumitani et al., 2003)
Honeybee, Apis mellifera Schulte et al., 2014
Beetles
Coccinellidae Harlequin ladybird, Harmonia axyridis (Kuwayama et al., 2006)
Tenebrionidae Red flour beetle, Tribolium castaneum (Berghammer et al., 1999)
Butterflies and moths
Nymphalidae Squinting bush brown butterfly, Bicyclus anynana (Marcus et al., 2004)
Gelechiidae Pink bollworm, Pectinophora gossypiella (Peloquin et al., 2000)
Bombycidae Silkworm, Bombyx mori (Tamura et al., 2000)
Plutellidae Diamondback moth, Plutella xylostella (Martins et al., 2012)
Crambidae Asian corn borer, Ostrinia furnacalis (Liu et al., 2012)
Tortricidae Codling moth, Cydia pomonella (Ferguson et al., 2011)

For insect transformation, piggyBac constructs and the respective source of helper transposase are typically microinjected into preblastoderm embryos, with the offspring of the injection survivors examined for the expression of a marker gene, typically a fluorescent protein. A recent review described the transposon vectors as having an ‘experimentally effective frequency, [however] the process remains relatively laborious and time‐consuming. Frequencies on the order of 0.1% to 10% are achievable, with higher frequencies less probable than lower ones' (Fraser, 2012).

Alternatives do exist, including electroporation, ultrasonic activation and use of a ‘gene gun’ (Wells, 2004; Mehier‐Humbert & Guy, 2005; Al‐Dosari & Gao, 2009). Other transposable elements are used for insect germline transformation, and other molecular methods are available. This study was restricted to piggyBac as the most widely used method and correspondingly the one for which most data are available. This may provide a benchmark against which the efficiency of other methods may be compared.

Meta‐analyses of data from multiple primary studies can be used to improve the efficiency of the scientific process (Brandt et al., 2013) while simultaneously dispelling misconceptions (McClain et al., 2015). Meta‐analyses are usually associated with clinical trials and the medical literature, but recent co‐opting of this technique has proven its applicability and usefulness to other scientific disciplines (Castellanos & Verdú, 2012). Meta‐analyses facilitate the elucidation of effect sizes and interstudy variation despite noisy backgrounds associated with a typical single observational study.

Here we provide a description of the transformation efficiency of insect transgenesis using piggyBac as the vector. We draw upon a systematic literature analysis and an analysis of an unpublished data set provided by the biotechnology company, Oxitec Ltd. The application of the decision tools and information therein provides researchers with an approximation of what to expect when conducting insect transgenesis using piggyBac, complementing other attempts in the literature to characterize and quantify costs of genetic control (Alphey et al., 2011).

Results and discussion

Systematic review of transformation efficiencies in published literature

Design and implementation of the meta‐analysis followed guidelines in Khoshdel et al. (2006). Meta‐analyses tend to be conducted in the medical literature, so methods were co‐opted as appropriate (Reade et al., 2008; Cooper and Patall, 2009). The structure of the Experimental procedures section follows Sim et al. (2011). A checklist for evaluation of meta‐analysis quality is described by Huf et al. (2011). Full details can be found in the Supporting Information.

Data sources

A summary of the literature search can be seen in Fig. 1 (described further in the ‘Meta‐analysis’ section below). The following checklist was applied to candidate studies to be included in the analysis following discovery:

Figure 1.

Figure 1

Flow chart of the paper selection process using the search term, across three life science relevant databases [SCOPUS, PUBMED and Web of Knowledge (WOK)]. 32 publications describing 86 unique experiments provided microinjection data and transformation efficiency estimates across a range of insect species.

  1. The species transformed is an insect.

  2. The insect germline was transformed using the piggyBac vector.

  3. The piggyBac vector was microinjected into embryos.

Injection data are included: specifically number of injected embryos, number of injection survivors and number of independent transgenic lines generated per unique construct (the derived variables; survival and transformation efficiency, were calculated from the raw data).

Typical reasons for non‐inclusion were: different methods of transgenesis; transgenesis in cell lines rather than the whole organism (for example in Mandrioli & Wimmer, 2003) and interdatabase duplication. Following this process, 32 studies remained (Fig. 1; Table 1). Additional details concerning the data extraction methods, summary statistic of choice and bias considerations are given in the Supporting Information.

Transformation efficiency by insect order and species from published data

Germline transformation or transgenesis has been achieved across a diverse range of insect orders (Table 1). Some authors have hinted at a difference between transformation efficiencies amongst orders, with the Lepidoptera efficiencies being lower compared to the Diptera for example (Marec et al., 2005). The data were plotted to examine this at order level (Fig. 2A) and at species level (Fig. 2B). Most of the transformation efficiency estimates (52/74) were from transformed dipteran species, with 22 of those of the genus Drosophila. All medians were between 0.001 and 0.1 except for the coleopteran estimate of 0.237 (see for discussion of appropriate statistics and methods to describe the distributions). The lower whisker in the Coleoptera is the data point provided by the only non‐Tribolium castaneum transformed beetle; the ladybird Harmonia axyridis at 0.0370 (see Supplementary 5.1.3 for Bayesian methods to produce a posterior probability distribution for the transformation efficiency of a species). The Hymenoptera have only one representative so were excluded from comparison.

Figure 2.

Figure 2

(A) A box‐and‐whisker plot of the meta‐analysis transformation efficiency data subsetted by insect order. Each subset was comprised of n equal to: Coleoptera, seven; Diptera, 52; Hymenoptera, one; and Lepidoptera, 14. The dark horizontal line represents the median, the box the interquartile range (IQR) and the whiskers 1.5 times the IQR, and the black asterisks are supplementary to the scatterplot and identify horizontally adjacent outliers within an order. The Hymenoptera and Lepidoptera do not have whiskers plotted as all non‐anomalous data are found within the IQR. A horizontal jitter plot is superimposed onto the boxplot showing the transformation efficiency of each unique construct species combination found within the literature search. The grand mean by insect order is represented by the empty diamond. (B) Published transformation efficiencies in insects found by this systematic analysis, sorted by species (alphabetical order by species). The order to which the species belongs is represented by the shape of the points (Diptera; triangle, Coleoptera; circle, Lepidoptera; cross and Hymenoptera; plus). Mean estimate from individual experiments shown by the shapes. Horizontal lines represent the upper and lower 95% confidence intervals of each experimental mean calculated using Bayesian methods (with prior distribution provided by beta distribution fitted to the combined data; shape 1 = 0.73 and shape 2 = 5.67) in R with the package ‘binom’ (Dorai‐Raj, 2014). Each experimental construct species combination has its own mean and confidence interval; the transparency of the points and intervals allows overlap to be visualized.

Outliers are a common characteristic in each order, with some transformation efficiencies of over 0.3 occurring. Following inspection it was found that the outliers were produced by less precise transformation efficiency mean estimates owing to a relatively small number of trials for certain piggyBac–insect combinations. For example, the lepidopteran Bombyx mori produced an outlier experiment with five transformed lines from 27 G0. As pointed out by Fraser (2012), the more extreme the transformation efficiencies the less frequently those efficiencies are observed (see Supplementary 5.1.1.3 for more detail).

Analysis of an extensive unpublished data set of piggyBac transgenesis experiments

The biotechnology company Oxitec has collected a data set of over 250 000 insect injection experiments (Fig. 3A) using piggyBac (Table 2), more than doubling the data set used for the meta‐analysis (119 557 injections). The data were collated and subjected to exploratory data analysis of the derived variables microinjection survival and transformation efficiency to establish typical values and any discrepancy from the published data set.

Figure 3.

Figure 3

(A) The experimental unit setup of variables and statistics of interest after a given construct was injected into an embryo n1 times, with probability of survival s. Values not observed or recorded are annotated with ‘NA’ (not available). The outcome of each trial, n, is assumed to be independent from the outcome of all other trials. After injection the embryo has p1 probability of producing transgenic offspring. The embryo will either die, or develop into a transient (G0T) (injection survivors showing transient expression of the fluorescent marker) or nontransient fertile adult (G0) (injection survivor not showing transient expression of the fluorescent marker or a transient adult that was not detected as transient). Considering the G0 survivors, n2 – a proportion, p2, will produce at least one transgenic offspring (known as ‘transformation efficiency’ in Warren et al., 2010 and Martins et al., 2012) This particular transformation efficiency (X/G0 and X/n2) leads to a lower calculated rate than if only fertile G0 individuals were considered; as we have no estimate of the infertility or fertility rate in G0; this could not be corrected for. Multiple transgenic G1 from the same G0 parent pool are assumed to represent a single transformation event unless shown otherwise with molecular tests. Of the G0T, n3 ‐ a proportion, p3 will produce transgenic offspring. The proportions p1–p3 are bounded between zero and one and are derived from how many times an event, transgenesis (X), did or did not occur (the numerator). The phrases transgenic efficiency and transgenic rate are used interchangeably for p2 (p1 is not given a name despite recommendations from Warren et al., 2010). (B) The correlation between microinjection survival and the achieved transformation efficiency is described. Each point represents an evaluation unit, the number of successes of a unique construct injected into a species of preblastoderm embryo, divided by the number of trials. The survival axis corresponds to number of injection survivors (to adulthood) divided by the number of embryos injected. The transformation efficiency is derived from the number of injection survivors divided by the number of independent transgenic lines generated. The figure does not group the data into species hence the more uniform and poly‐modal shape along the survival axis. A box‐and‐whisker plot on each axis describes the density of the data.

Table 2.

Summary of the data set; subsets organized by species. Information includes number of unique constructs injected into a species (the evaluation unit), sum of embryos injected (n1), sum of injection survivors to fertile adults (n2 or G0) and total number of independent transgenic lines created in that species in the compiled Oxitec data set. Accurate as of March 2014

Species Sum of unique constructs Sum of embryos microinjected Sum of microinjection survivors Sum of independent transgenic lines
Aedes aegypti 46 71 252 3314 239
Aedes albopictus 10 37 235 5 339 89
Ceratitis capitata 26 21 858 5977 167
Bactrocera oleae 5 27 500 760 23
Drosophila suzukii 4 3287 138 4
Pectinophora gossypiella 37 55 605 9296 100
Plutella xylostella 34 68 547 21 761 108
Tribolium castaneum 2 5227 572 8
Tuta absoluta 2 7244 601 3

Data entry and checking

A rectangular data set was compiled using data accrued from more than 6 years of research involving the microinjection of nine insect species’ embryos with exogenous DNA.

Transformation efficiency distribution

A summary of the data was provided by plotting the injection survival to adulthood against the transformation efficiency for each evaluation unit (Fig. 3B) by insect order. This facilitated comparison to the meta‐analysis as well as Fraser's (2012) description of the piggyBac transformation efficiency interpretation. This confirmed the comments of Fraser, with most of the data distributed between 0.01 and 0.1 (117/166 or 70% observations lay within this range).

Interestingly we see a clustering within is clearer, with the Lepidoptera tending to have lower transformation efficiencies (0–0.1) compared to the Diptera, where it is not uncommon to have transformation efficiencies above 0.1. This does not necessarily mean that Lepidoptera are more difficult to transform, as the Lepidoptera injected embryos are more likely to survive. This may be accounted for by differences in injection methodology or by the hardiness of the embryo.

This supports previous work that has compared the variability between efficiencies in the Diptera and Lepidoptera. Lobo et al. (2002) compared the mobility of piggyBac in embryos from different insect families using a transposition assay. The rate of transposition in dipteran species was higher than that of T. ni, which harbours the piggyBac transposon (Mohammed & Coates, 2004).

Publication bias

For those species for which we have approximately 30 or more experiments with transformation efficiency data, we plotted all the data and highlighted those that have been published (Fig. S8). We observed many more zero and near‐zero transformation efficiencies than would be expected given the meta‐analysis findings hinting at bias (see Supplementary 5.1.1.5 for quantification of the bias). We also observed that atypically high transformation efficiencies tend to be associated with a lower number of injection survivors (quantified in Supplementary 5.1.4.5). This could be caused by researchers stopping their inspection and screening of G0 crosses when they feel they have enough lines generated, thereby overestimating the efficiency. Conversely, the lower efficiencies associated with the higher number of injection survivors crossed could be caused by researchers not stopping until they have success. It is also unclear whether all zero successes experiments are recorded.

Interspecies variation in survival

The distributions were visualized using a boxplot and scatterplot hybrid (Fig. 4A). The distributions located further away from the bounds (zero and one) tend to be less skewed and more variable. Extreme values near one or zero are improbable, indeed if zero survival were achieved the results may have been discarded; furthermore, 100% survival does not occur even with uninjected embryos in optimal conditions. Seventeen of the 166 experiments did not include survival data owing to the number of injections or injection survivors missing for the experiment in question. There were no recorded experiments with a zero survival. The maximum survival was achieved in Plutella xylostella, with 0.76 compared to the lowest nonzero survival of Aedes aegypti at 0.0028.

Figure 4.

Figure 4

(A) The proportional survival of Oxitec insect research species, from embryo to fertile adult, following microinjection of a piggyBac vector. A horizontal jittered scatterplot is overlaid on a boxplot, summarizing the survival distribution for each species. Species are grouped by insect order (Diptera, Lepidoptera and Coleoptera) from left to right. The asterisks show the points that were outliers. For each species there are 10 Aedes albopictus, 39 Aedes aegypti, 26 Ceratitis capitata, five Bactrocera oleae, four Drosophila suzukii, 29 Plutella xylostella, 32 Pectinophora gossypiella, two Tuta absoluta and two Tribolium castaneum data, each representing a unique construct experiment. One outlier at 0.67 survival was removed from Pl. xylostella to improve ease of reading. (B). The transformation efficiency of different genetic constructs vectored by piggyBac into the germline of different insect species. Transformation efficiency is defined as the number of independent transgenic lines divided by the number of fertile injection survivors crossed, given the unique construct species combination. Few data were found above 0.2 transformation efficiency so the y axis was limited to this range. For each species there are 10 Aedes albopictus, 39 Aedes aegypti, 26 Ceratitis capitata, 5 Bactrocera oleae, four Drosophila suzukii, 31 Plutella xylostella, 35 Pectinophora gossypiella, two Tuta absoluta and two Tribolium castaneum data. (C) A lattice plot of the transformation efficiency of constructs injected into different species by size (in bp). Most constructs are 10 000–15 000 bp in length. As these are injected more frequently there is a cluster of points around this range for each species. Small (< 10 kbp) or large (> 15 kbp) are injected more rarely and so data for these are sparse. The data are subsetted into a species pane with the species label above. Each datum is transparent; dark points represent overlap.

Interspecies variation in transformation efficiency

Most of the data are found between 1 and 10% (Fig. 4B). However, some species appear to be highly clustered, with all Pl. xylostella data found between 0 and 5% inclusive. As pointed out by Fraser (2012) the more extreme the transformation efficiencies the less frequently those efficiencies are observed. This can be envisioned as a long tail or a skewed positive distribution. This is observed with the efficiencies far away from the main cluster, as seen in the Diptera and Lepidoptera. For those species with greater than 10 data the interquartile range tends to increase as the median moves away from zero. The tails of both mosquito distributions extends above 20%, with outliers for Ae. aegypti as high as 66% (the maximum achieved). Closer inspection reveals the datum responsible comprised of two transgenic lines from three G0. This species was described in an earlier study as having a typical transformation efficiency of only 8% (Nimmo et al., 2006).

The Lepidoptera have consistently lower transformation efficiencies compared to Diptera. However, both orders have their share of zero transformation efficiency experiments, with 27 in total between them. Those species with lower median transformation efficiency have more zero transformation efficiency experiments. The two Lepidoptera, Pl. xylostella (9/31) and Pectinophora gossypiella (9/35), have nine each compared to one Ceratitis capitata (1/26) and five Ae. aegypti (5/39) zero experiments. 19.4% of experiments (27/139) ended without germline transformation, in contrast to the meta‐analysis literature rate of 12% (9/75). This could suggest that those species with lower transformation efficiencies are more likely to have a construct abandoned, or microinjection of DNA is mechanically more difficult, with more injection survivors not containing any plasmid DNA, or simply because of natural variation in the insertion rate.

Despite the nature of the interexperimental variation, such as different constructs injected, different engineers and rearing methods, transformation efficiency is remarkably consistent within some species, particularly the Lepidoptera. This may partly arise as a relic owing to the enforced bounding at zero; however, it does suggest that given a new construct, it is possible to provide a reliable prediction of the transformation efficiency.

As a caveat, some constructs encoded dominant lethal genes as part of a repressible system, and so one might expect a proportion of transgenics to be lost owing to transient (episomal) expression. We might expect this to produce a bi‐modal distribution of transformation efficiency in a species. This may be evident in Ae. aegypti (Fig. 4B).

Interspecies variation by construct size

Experimental evidence for other transposons suggests a negative correlation between the size of the construct and the transformation efficiency of the vector (Delattre et al., 2000). This could be a consequence of larger DNA molecules diffusing smaller distances (Lukacs et al., 2000), larger plasmids being injected at a lower molar concentration relative to smaller plasmids or a physical limitation of the vector. Typically a construct injected will be between 10 000 and 15 000 bp in length. It is therefore not recommended to extrapolate the data and try to identify a trend where very few values lie outside the typical range. This is compounded in some species by the low number of constructs injected; each point has a large effect on the overall trend, whereby removal or addition of one datum can change the inference made (Fig. 4C).

Other factors may impact the survival and transformation efficiency of a construct, including operator skill and the coding potential or structure of the construct, for which features such as secondary structure may be important. However, exploration of these factors carries the risk of false‐positives associated with data dredging (Smith & Ebrahim, 2002) and also suffers from limited data and therefore it was not investigated further.

Goldilocks decision‐making: how to get the number of injections just right with piggyBac

Injecting too few embryos can result in no or very few transgenic lines. If the lines generated do not show the desired phenotype the investigator is left uncertain as to whether the construct needs to be redesigned or it failed because of position effects. The other extreme involves an excessive number of injections, as the investigator urgently seeks to generate at least one transformed line. Historical data provide an opportunity to estimate the transformation efficiency and guide future experiments.

An example using Pl. xylostella data

The Pl. xylostella survival and transgenesis efficiency are highly skewed, rendering the mean a poor descriptor of the central location of either distribution (Fig. 5B). The median is a better metric as it is more robust to extreme values and because several (9/34) zero values were also present. A representation of the black box model is shown in Fig. 5A. The probability of the embryo failing to achieve G0 status is 1 – s (where s is the median survival of an embryo to G0 postinjection). The embryo survives injection, hatches and the larva develops to adulthood with probability s. The G0 (assumed fertile adult) is then crossed and the offspring G1 are screened for the transgene. A G0 gives rise to a unique insertion event with probability X/G0 (where X is the number of independent lines produced). The published and Oxitec data give the probability of transformation of Pl. xylostella under this model as 0.0065 and 0.0043, respectively (Table S4).

Figure 5.

Figure 5

(A) A simple black box model depicting the trajectory of one embryo injected with a novel construct never before injected. The probability that the embryo survives injection, s, is based on previous injection data, as is the transformation efficiency, X/G0. For a given number of independent injections, n, the number of transgenic lines can be estimated given that particular simulation of the model. The stochasticity at each branch adds variability to the output. Accordingly the simulation for n injections should be repeated an appropriate number of times to provide a discrete probability distribution of the frequency of transgenic lines produced. (B) A jittered boxandwhisker plot of the survival (n = 29) and transformation efficiency (n = 33) distributions in diamondback moth (Plutella xylostella). The median values are 0.28 and 0.0043, respectively. Outliers are annotated with an adjacent asterisk. (C) A diagram describing the approximate structure of the program used to model the number of independent transgenic lines produced from n injections over B simulations. The pseudocode is simplified and does not match precisely how the MATLAB/R function works. The reader is advised to start from the top and read from left to right. The variables ‘s’ and ‘transformation efficiency’ are input into the model beforehand and are fixed following the example of using the median survival and transformation efficiency. The branching steps provide the stochasticity of the model whereby u and v are drawn from a random uniform distribution between 0 and 1. These numbers are compared with the input variables, thus determining the fate of the injected embryo. At the first branch it is determined whether the embryo survives (it remains ‘alive’, coded as 0) and at the second whether it gives rise to at least one transgenic offspring. State is a placeholder variable that deals with the previous logic branch by converting the alive embryo to the current value of v (multiplying by v, if alive state equals v, if dead state equals zero). At the end of the loop, the process is repeated, unless the desired injection number has been reached. The number of transgenics is recorded in a vector X and summed upon reaching n injects. This provides an integer, which again is stored in a vector of B length. This vector provides the information to draw a discrete probability distribution of the expected number of transgenic lines produced from n injections simulated B times. (D) The Goldilocks application interface for helping researchers get the number of injections just right. The output updates when the slider inputs are changed. The most recent data for Plutella xylostella are used.

A decision‐making model for insect transgenesis

A Markov‐Chain Monte‐Carlo simulation (pseudocode in Fig. 5C) was used to model the system in which the number of embryos injected, n, gives a binary vector of successes or failures. The final state of each embryo either gives rise to a transgenic‐bearing G0 (1) or it does not (0; dead or nontransgenic). Stochasticity is built into the model at each branch where a pseudorandom number (between 0 and 1) is generated and tested against the input parameters as appropriate. The simulation can be run repeatedly to estimate the discrete probability distribution of the total number of transgenic lines produced by n injections. The model is named ‘Goldilocks’, to assist researchers in getting the number of injections just right (Fig. 5D). A graphic interface allows the user to adjust sliders to the appropriate values for a species of interest (suggested values are provided in Table 3).

Table 3.

The median survival and transformation efficiency achieved in species transformed at Oxitec. The statistics in bold are based on many experiments and are probably more reliable. The lowest transformation efficiencies belong to the Lepidoptera

Species Order Survival Transformation efficiency
Aedes aegypti Diptera 0.093 0.059
Aedes albopictus Diptera 0.042 0.074
Bactrocera oleae Diptera 0.028 0.029
Ceratitis capitata Diptera 0.230 0.045
Drosophila suzukii Diptera 0.046 0.014
Pectinophora gossypiella Lepidoptera 0.149 0.009
Plutella xylostella Lepidoptera 0.278 0.004
Tribolium castaneum Coleoptera 0.124 0.014
Tuta absoluta Lepidoptera 0.086 0.004

Conclusion

This paper provides a rigorous description of the distribution of microinjection survival and piggyBac transformation efficiencies in different insect species. The paper inspection of the published literature leading on to a detailed exploration and analysis of an unpublished observational data set. This allows insight into publication bias and misconceptions of what is a typical survival or transformation efficiency in a given species. Combined with the Goldilocks decision model, researchers can use this analysis to minimize wasted effort and resources resulting from an inappropriate number of injections being carried out. Owing to the simplicity of assumptions and versatility of the model, Goldilocks can be applied to other germline transformation methods, given that survival and transformation efficiency data are available.

Experimental procedures

Meta‐analysis

The following electronic databases were searched from inception to March 2013, repeated in October 2013 and March 2015, to identify relevant experiments and or studies: Web of Knowledge, PUBMED and SCOPUS databases. The key terms used for the search were: piggyBac AND insect*. The database search results were refined by manual inspection and identification of publications with relevant transformation efficiencies. The title and then abstract were read.

Papers were checked for duplicates and removed as appropriate. Each included publication was read by M.G. and data extracted if it met the predefined criteria. If data were missing it was assigned a NA placeholder (NA, not available). The insect species, insect order, unique piggyBac construct ID (from the relevant paper to avoid duplication), number of embryos injected, injection survivors and independent transgenic lines derived from those injection survivor crosses were transliterated. The publication search and selection was repeated again 6 months after the initial study selection by the same reviewer and compared (March and October, 2013). Aside from two publications that were newly published, the second search found nine additional relevant publications, possibly because of a more systematic review approach and familiarity with the procedure. Prior to manuscript preparation the search was repeated in March 2015, finding only one new, recently transformed insect. The assumed publication bias will probably have under‐represented the number of failures to transform, as publication of successful transformation of a novel species will tend to be preferred (further details in Supplementary 5.1.1.1).

Data were explored, plotted and modelled using the open‐source R language for statistical computing (R Core Team, 2015). The full data set is available from https://github.com/mammykins/piggyBac-data.

Oxitec data

The data were transliterated from the original laboratory books as well as student theses (Bilski, 2012; Ant, 2013; Harvey‐Samuel, 2014); contemporary data were also collected from present researchers, and where published cross‐checked against publications. Data were organized in a ‘tidy’ dataframe (Wickham, 2014) and validated by re‐entry. Missing data were treated as described in the Supporting Information (5.1.4.2). The full data set is available from https://github.com/mammykins/piggyBac-data. The observational data were explored, plotted and modelled using R.

The decision‐making model

The model was initially developed in matlab 2012a Student Version (The MathWorks, Inc., Natick, MA, USA) then re‐coded in R using R studio (http://www.rstudio.com/) and R shiny (http://www.rstudio.com/products/shiny/) to develop a web application for insect transgenesis researchers to use. The model is available online at https://mammykins.shinyapps.io/App-gold and can be implemented locally by using the code from https://github.com/mammykins/Goldilocks-decision-tool.

Supporting information

Additional Supporting Information may be found in the online version of this article at the publisher's web‐site:

Figure S1. A simple method to describe the distribution of the meta‐analysis transformation efficiency values is to plot rank on value. Relative rank is calculated (p = r/n) as the proportion of values in the number of trials for the pooled data whose ranks are less than or equal to that value. Percentile information can be retrieved from the figure, for example the median is at relative rank 0.5, upper quartile at 0.75 and lower quartile at 0.25. The rug plot along the x‐axis is a uni‐variate scatter of the transformation efficiency estimates pooled from the literature (n = 74). Most (∼83%) of the published transformation efficiency data are between 0 and 20%.

Figure S2. Funnel plot suggesting bias for transformation efficiency data with confidence interval based on sample size generated using Wilson's method (mean, solid line; 95% confidence interval, dotted and dashed line; 99% confidence interval, dashed line). The plot shows all the nonzero transformation efficiency data collected from the literature (64 nonzero experiments plotted with nine zero experiments; another 13 had missing data).

Figure S3. The funnel plot reveals some patient researchers with over 6000 injections in some species before success. Funnel plot of survival data with confidence interval based on sample size generated using Wilson's method (mean, solid line; 95% confidence interval, dotted and dashed line; 99% confidence interval, dashed line). The plot shows all the nonzero transformation efficiency data collected from the literature (64 nonzero experiments plotted with nine zero experiments; another 13 had missing data).

Figure S4. In the absence of publication bias we might expect a symmetrical funnel plot. The bounded nature of the proportion data limits the effectiveness of the plot as the Pearson–Klopper confidence intervals are bounded at zero, limiting the size of the 95% confidence interval, which is calculated by subtracting the lower interval from the upper interval at the 95% confidence interval. As the transformation efficiency is not relative to any conventional control, unlike medicine, this removes the relative nature and expected symmetry of the plot.

Figure S5. Histogram of the transformation efficiency of the meta‐analysis pooled data with bin width of 0.05. The data are positively skewed and bounded between zero and one. A conventional histogram with associated density curve is shaded grey and a solid line. The area under the curve and between the axes integrates to unity and provides a visual representation of the probability of a transformation efficiency falling in a given interval. The cumulative density histogram is also provided as white bars and a dashed density curve. The graph shows why the use of a mean (0.097 and SD of 0.120) to summarize the data is inappropriate as it is asymmetrical. The bounded nature of the transformation efficiency (between zero and one) also creates problems for the normal approximation.

Figure S6. A histogram of the nonzero transformation efficiencies of experiments from the systematic literature review. A beta distribution curve with parameter estimates was overlaid, fitted from the data using the ‘fitdistr’ function from the MASS package in R (Brian Ripley, 1998). The bin width and number of measurements in total forming the histogram were multiplied by the beta distribution so that the curve could be normalized to the correct height. The area under the curve and between the axes integrates to unity and provides a visual representation of the probability of a transformation efficiency falling in a given interval.

Figure S7. Bayesian tri‐plot for the mean transformation efficiency probability density in Tribolium castaneum. The prior was formulated using 38 transgenics given 95 fertile G0 crosses (after Lorenzen et al., 2003); the likelihood represents the data of 36 transgenics given 152 fertile G0 crosses (after Lorenzen et al., 2003). The prior and likelihood are combined using Bayes' theorem to create the posterior distribution, which provides a probabilistic parameter estimate of the transformation efficiency in Tribolium castaneum given previous information and recent experimental evidence.

Figure S8. The mean and binomial confidence intervals appear to overestimate the transformation efficiency consistently in all four well‐studied species shown (Aedes aegypti = 30, Plutella xylostella = 33, Ceratitis capitata = 26, Pectinophora gossypiella = 35). The axes are not constant so care should be taken when comparing between species. The data includes some experiments included in publications highlighted black. The funnel plot provides transformation efficiency data with confidence interval based on sample size generated using Wilson's method (mean, solid line; 95% confidence interval, dotted and dashed line; 99% confidence interval; dashed line). A benefit of the funnel plot is that it highlights the ever‐present danger of mistaking variation owing to chance for correlation or causation.

Figure S9. Output from the Goldilocks simulation model – helping researchers to get the number of injections just right. The top and bottom row are simulations involving 500 and 1000 injections, respectively. The left column uses the median statistics from the published data and the right from a more complete Oxitec data set identifying publication bias. The literature provides an overly optimistic view of the chances of successful transformation given a number of injections.

Table S1. A timeline of Oxitec publications involving transformation events of an insect species using piggyBac. Injection data included number of microinjections, number of injection survivors and number of independent transgenic lines including details of construct and helper concentrations used.

Table S2. Only Oxitec in‐house data are summarized (experiments carried out in partnership with Oxitec are not included). An experiment is defined as a unique construct injected into the given insect species. On occasion an experiment will be unsuccessful in that transgenesis is not achieved, described as a zero transgenics experiment. The number of injection survivors crossed and their progeny screened for transgenics varied.

Table S3. The probability mass function of the transformation efficiency data found in the literature offers a complete empirical probability mass function version in graphical form. Technically zero is a point not an interval. The probability (0–1) that a transformation efficiency of a publication randomly sampled from the literature sample will fall into given intervals. Convention for 3 significant figures (3.s.f).

Table S4. The alpha and beta shape parameter estimates for the fitted beta distribution applied to three different subsets of the Oxitec transformation efficiency data facilitating empirical Bayesian shrinkage towards a beta prior. To utilize these parameters for improving transformation efficiency estimation adjust estimates by: empirical Bayes estimate = (successes + α) / (number of trials + α + β) (Jiang & Zhang, 2010). Prior to subsetting, observations with fewer than 20 injection survivors were removed. Lepidoptera consist of Plutella xylostella and Pectinophora gossypiella, MedFly just Ceratitis capitata and mosquitoes both Aedes albopictus and Aedes aegypti.

Table S5. The diamondback moth parameters to be used in two separate uses of the model to compare differences or the bias produced by reliance on an incomplete data set (published). Both survival and transformation efficiency medians are given for the published and Oxitec data sets.

Acknowledgements

L.A. is supported by core funding from the UK Biotechnology and Biological Sciences Research Council (BBSRC) to the Pirbright Institute (BBS/E/I/00001892). M.G. was supported by a BBSRC‐funded Collaborative Awards in Science and Engineering (CASE) DPhil studentship at Oxitec and the University of Oxford throughout the completion of this work. Thanks to Adam Walker for his assistance during the data collection stage of this project.

References

  1. Al‐Dosari, M.S. and Gao, X. (2009) Nonviral gene delivery: principle, limitations, and recent progress. AAPS J 11: 671–681. http://doi.org/10.1208/s12248-009-9143-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Allen, M.L. , Handler, A.M. , Berkebile, D.R. and Skoda, S.R. (2004) piggyBac transformation of the New World screwworm, Cochliomyia hominivorax, produces multiple distinct mutant strains. Med Vet Entomol 18(1): 1–9. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/15009439 [DOI] [PubMed] [Google Scholar]
  3. Alphey, N. , Alphey, L. and Bonsall, M.B. (2011) A model framework to estimate impact and cost of genetics‐based sterile insect methods for dengue vector control. PLoS ONE 6: e25384 http://doi.org/10.1371/journal.pone.0025384 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ant, T. (2013) Genetic control of the olive fruit fly, Bactrocera oleae. The University of Oxford, Oxford, UK. [Google Scholar]
  5. Bazuin, S. , Azadi, H. and Witlox, F. (2011) Application of GM crops in sub‐Saharan Africa: lessons learned from green revolution. Biotechnol Adv 29: 908–912. http://doi.org/10.1016/j.biotechadv.2011.07.011 [DOI] [PubMed] [Google Scholar]
  6. Berghammer, A. , Bucher, G. , Maderspacher, F. and Klingler, M. (1999) A system to efficiently maintain embryonic lethal mutations in the flour beetle Tribolium castaneum . Dev Genes Evol 209(6): 382–389. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/10370121 [DOI] [PubMed] [Google Scholar]
  7. Bilski, M. (2012) Engineered genetic sterility of pest insects. University of Oxford, Oxford, UK. [Google Scholar]
  8. Brandt, K. , Sanderson, R. , Leifert, C. and Seal, C. (2013) Methods for comparing data across differently designed agronomic studies: examples of different meta‐analysis methods used to compare relative composition of plant foods grown using organic or conventional production methods. J Agric Food Chem 61: 7173–7180. [DOI] [PubMed] [Google Scholar]
  9. Castellanos, M.C. and Verdú, M. (2012) Meta‐analysis of meta‐analyses in plant evolutionary ecology. Evol Ecol 26: 1187–1196. http://doi.org/10.1007/s10682-012-9562-6 [Google Scholar]
  10. Condon, K.C. , Condon, G.C. , Dafa'alla, T.H. , Forrester, O.T. , Phillips, C.E. , Scaife, S. et al (2007) Germ‐line transformation of the Mexican fruit fly. Insect Mol Biol 16(5): 573–580. http://doi.org/10.1111/j.1365-2583.2007.00752.x [DOI] [PubMed] [Google Scholar]
  11. Cooper, H. and Patall, E. (2009) The relative benefits of meta‐analysis conducted with individual participant data versus aggregated data. Psychol Methods 14: 165–176. http://doi.org/10.1037/a0015565 [DOI] [PubMed] [Google Scholar]
  12. Delattre, M. , Tatout, C. and Coen, D. (2000) P‐element transposition in Drosophila melanogaster: influence of size and arrangement in pairs. Mol Gen Genet 263: 445–454. [DOI] [PubMed] [Google Scholar]
  13. Dorai‐Raj, S. (2014) Package “binom”: Binomial Confidence Intervals for Several Parameterizations Retrieved from http://cran.r-project.org/web/packages/binom/index.html.
  14. Ferguson, H.J. , Neven, L.G. , Thibault, S.T. , Mohammed, A. and Fraser, M. (2011) Genetic transformation of the codling moth, Cydia pomonella L., with piggyBac EGFP. Transgenic Res 20(1): 201–214. http://doi.org/10.1007/s11248-010-9391-8 [DOI] [PubMed] [Google Scholar]
  15. Fraser, M.J. (2012) Insect transgenesis: current applications and future prospects. Annu Rev Entomol 57: 267–289. http://doi.org/10.1146/annurev.ento.54.110807.090545 [DOI] [PubMed] [Google Scholar]
  16. Fraser, M.J. , Smith, G.E. , Max, D. and Summers, M.A.X.D. (1983) Acquisition of host cell DNA sequences by Baculoviruses: relationship between host mutants of Autographa californica and Galleria mellonella nuclear polyhedrosis viruses. J Virol 47: 187–200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Grossman, G.L. , Rafferty, C.S. , Clayton, J.R. , Stevens, T.K. , Mukabayire, O. and Benedict, M.Q. (2001) Germline transformation of the malaria vector, Anopheles gambiae, with the piggyBac transposable element. Insect Mol Biol 10(6): 597–604. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/11903629 [DOI] [PubMed] [Google Scholar]
  18. Handler, A.M. and Harrell, R.A. (1999) Germline transformation of Drosophila melanogaster with the piggyBac transposon vector. Insect Mol Biol 8(4): 449–457. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/10634970 [DOI] [PubMed] [Google Scholar]
  19. Handler, A.M. and Harrell, R. (2001) Transformation of the Caribbean fruit fly, Anastrepha suspensa, with a piggyBac vector marked with polyubiquitin‐regulated GFP. Insect Biochem Mol Biol 31: 199–205. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/11164342 [DOI] [PubMed] [Google Scholar]
  20. Handler, A.M. , McCombs, S.D. , Fraser, M.J. and Saul, S.H. (1998) The lepidopteran transposon vector, piggyBac, mediates germ‐line transformation in the Mediterranean fruit fly. Proc Natl Acad Sci USA 95: 7520–7525. http://doi.org/10.1073/pnas.95.13.7520 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Harvey‐Samuel, T. (2014) Genetic control of the diamondback moth (Plutella xylostella L.). The University of Oxford, Oxford, UK. [Google Scholar]
  22. Hediger, M. , Niessen, M. , Wimmer, E.A. , Dübendorfer, A. and Bopp, D. (2001) Genetic transformation of the housefly Musca domestica with the lepidopteran derived transposon piggyBac. Insect Mol Biol 10(2): 113–119. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/11422506 [DOI] [PubMed] [Google Scholar]
  23. Heinrich, J.C. , Li, X. , Henry, R.A. , Haack, N. , Stringfellow, L. , Heath, A.C.G. et al (2002) Germ‐line transformation of the Australian sheep blowfly Lucilia cuprina . Insect Mol Biol 11(1): 1–10. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/11841497 [DOI] [PubMed] [Google Scholar]
  24. Huf, W. , Kalcher, K. , Pail, G. , Friedrich, M.E. , Filzmoser, P. and Kasper, S. (2011) Meta‐analysis: fact or fiction? How to interpret meta‐analyses. World J Biol Psychiatry 12: 188–200. http://doi.org/10.3109/15622975.2010.551544. [DOI] [PubMed] [Google Scholar]
  25. Ito, J. , Ghosh, A. , Moreira, L. and Wimmer, E. (2002) Transgenic anopheline mosquitoes impaired in transmission of malaria parasite. Nature 417: 387–388. [DOI] [PubMed] [Google Scholar]
  26. Jiang, W. and Zhang, C. (2010) Empirical Bayes in‐season prediction of baseball batting averages. Inst Math Stat Collect 6: 263–273. http://doi.org/10.1214/10-IMSCOLL618 [Google Scholar]
  27. Khoshdel, A. , Attia, J. and Carney, S.L. (2006) Basic concepts in meta‐analysis: a primer for clinicians. Int J Clin Pract 60: 1287–1294. http://doi.org/10.1111/j.1742-1241.2006.01078.x [DOI] [PubMed] [Google Scholar]
  28. Kim, A. and Pyykko, I. (2011) Size matters: versatile use of PiggyBac transposons as a genetic manipulation tool. Mol Cell Biochem 354: 301–309. http://doi.org/10.1007/s11010-011-0832-3 [DOI] [PubMed] [Google Scholar]
  29. Kokoza, V. , Ahmed, A. , Wimmer, E.A. and Raikhel, A.S. (2001) Efficient transformation of the yellow fever mosquito Aedes aegypti using the piggyBac transposable element vector pBac[3xP3‐EGFP afm]. Insect Biochem Mol Biol 31(12): 1137–1143. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/11583926 [DOI] [PubMed] [Google Scholar]
  30. Kuwayama, H. , Yaginuma, T. , Yamashita, O. and Niimi, T. (2006) Germ‐line transformation and RNAi of the ladybird beetle, Harmonia axyridis . Insect Mol Biol 15(4): 507–512. http://doi.org/10.1111/j.1365-2583.2006.00665.x [DOI] [PubMed] [Google Scholar]
  31. Labbé, G.M.C. , Nimmo, D.D. and Alphey, L. (2010) piggybac‐ and PhiC31‐mediated genetic transformation of the Asian tiger mosquito, Aedes albopictus (Skuse). PLoS Negl Trop Dis 4(8): e788 http://doi.org/10.1371/journal.pntd.0000788 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Liu, D. , Yan, S. , Huang, Y. , Tan, A. , Stanley, D.W. and Song, Q. (2012) Genetic transformation mediated by piggyBac in the Asian corn borer, Ostrinia furnacalis (Lepidoptera: Crambidae). Arch Insect Biochem Physiol 80(3): 140–150. http://doi.org/10.1002/arch.21035 [DOI] [PubMed] [Google Scholar]
  33. Lobo, N.F. , Hua‐Van, A. , Li, X. , Nolen, B.M. and Fraser, M.J. (2002) Germ line transformation of the yellow fever mosquito, Aedes aegypti, mediated by transpositional insertion of a piggyBac vector. Insect Mol Biol 11: 133–139. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/11966878 [DOI] [PubMed] [Google Scholar]
  34. Lorenzen, M.D. , Berghammer, A.J. , Brown, S.J. , Denell, R.E. , Klingler, M. and Beeman, R.W. (2003) piggyBac‐mediated germline transformation in the beetle Tribolium castaneum . Insect Mol Biol 12(5): 433–440. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/12974948 [DOI] [PubMed] [Google Scholar]
  35. Lukacs, G. , Haggie, P. , Seksek, O. , Lechardeur, D. , Freedman, N. and Verkman, A. (2000) Size dependent DNA mobility in cytoplasm and nucleus. J Biol Chem 275: 1625–1629. [DOI] [PubMed] [Google Scholar]
  36. Mandrioli, M. and Wimmer, E. (2003) Stable transformation of a Mamestra brassicae (Lepidoptera) cell line with the lepidopteran‐derived transposon piggyback. Insect Biochem Mol Biol 33: 1–5. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/12459194 [DOI] [PubMed] [Google Scholar]
  37. Marcus, J.M. , Ramos, D.M. and Monteiro, A. (2004) Germline transformation of the butterfly Bicyclus anynana . Proc Biol Sci 271(Suppl 5): S263–S265. http://doi.org/10.1098/rsbl.2004.0175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Marec, F. , Neven, L. and Robinson, A. (2005) Development of genetic sexing strains in Lepidoptera: from traditional to transgenic approaches. J Econ Entomol 98: 248–259. Retrieved from http://www.bioone.org/doi/abs/10.1603/0022-0493-98.2.248 [DOI] [PubMed] [Google Scholar]
  39. Martins, S. , Naish, N. , Walker, A.S. , Morrison, N.I. , Scaife, S. , Fu, G. et al (2012) Germline transformation of the diamondback moth, Plutella xylostella L., using the piggyBac transposable element. Insect Mol Biol 44: 414–421. [DOI] [PubMed] [Google Scholar]
  40. McClain, C.R. , Balk, M.A. , Benfield, M.C. , Branch, T.A. , Chen, C. , Cosgrove, J. et al (2015) Sizing ocean giants: patterns of intraspecific size variation in marine megafauna. PeerJ 2: e715 http://doi.org/10.7717/peerj.715 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Mehier‐Humbert, S. and Guy, R.H. (2005) Physical methods for gene transfer: improving the kinetics of gene delivery into cells. Adv Drug Deliv Rev 57: 733–753. http://doi.org/10.1016/j.addr.2004.12.007 [DOI] [PubMed] [Google Scholar]
  42. Mohammed, A. and Coates, C.J. (2004) Promoter and piggyBac activities within embryos of the potato tuber moth, Phthorimaea operculella, Zeller (Lepidoptera: Gelechiidae). Gene 342: 293–301. http://doi.org/10.1016/j.gene.2004.08.008 [DOI] [PubMed] [Google Scholar]
  43. Morales, M.E. , Mann, V.H. , Kines, K.J. , Gobert, G.N. , Fraser, M.J. , Kalinna, B.H. et al (2007) piggyBac transposon mediated transgenesis of the human blood fluke, Schistosoma mansoni . FASEB J 21: 3479–3489. [DOI] [PubMed] [Google Scholar]
  44. Morrison, N.I. , Franz, G. , Koukidou, M. , Miller, T.A. , Saccone, G. , Luke, S. et al (2010) Genetic improvements to the sterile insect technique for agricultural pests. AsPac J Mol Biol Biotechnol 18: 275–295. [Google Scholar]
  45. Nimmo, D.D. , Alphey, L. , Meredith, J.M. and Eggleston, P. (2006) High efficiency site‐specific genetic engineering of the mosquito genome. Insect Mol Biol 15: 129–136. http://doi.org/10.1111/j.1365-2583.2006.00615.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Nolan, T. , Bower, T.M. , Brown, A. , Crisanti, A. and Catteruccia. (2002) piggyBac‐mediated germline transformation of the malaria mosquito Anopheles stephensi using the red fluorescent protein dsRED as a selectable marker. J Biol Chem 277(11): 8759–8762. [DOI] [PubMed] [Google Scholar]
  47. O'Brochta, D. (2003) Gene vector and transposable element behavior in mosquitoes. J Exp Biol 206: 3823–3834. http://doi.org/10.1242/jeb.00638 [DOI] [PubMed] [Google Scholar]
  48. Peloquin, J.J. , Thibault, S.T. , Staten, R. and Miller, T.A. (2000) Germ‐line transformation of pink bollworm (Lepidoptera: gelechiidae) mediated by the piggyBac transposable element. Insect Mol Biol 9(3): 323–333. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/10886417 [DOI] [PubMed] [Google Scholar]
  49. Perera, O.P. , Harrell, R., II and Handler, A.M. (2002) Germ‐line transformation of the South American malaria vector, Anopheles albimanus, with a piggyBac/EGFP transposon vector is routine and highly efficient. Insect Mol Biol 11: 291–297. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/12144693 [DOI] [PubMed] [Google Scholar]
  50. Piégu, B. , Bire, S. , Arensburger, P. and Bigot, Y. (2015) A survey of transposable element classification systems – a call for a fundamental update to meet the challenge of their diversity and complexity. Mol Phylogenet Evol 86: 90–109. http://doi.org/10.1016/j.ympev.2015.03.009 [DOI] [PubMed] [Google Scholar]
  51. R Core Team, R . (2015) R: a language and environment for statistical computing. Vienna, Austria. https://www.r-project.org/
  52. Raphael, K. , Shearman, D.C. , Streamer, K. , Morrow, J.L. , Handler, A.M. and Frommer, M. (2011) Germ‐line transformation of the Queensland fruit fly, Bactrocera tryoni, using a piggyBac vector in the presence of endogenous piggyBac elements. Genetica 139: 91–97. http://doi.org/10.1007/s10709-010-9500-x [DOI] [PubMed] [Google Scholar]
  53. Reade, M.C. , Delaney, A. , Bailey, M.J. and Angus, D.C. (2008) Bench‐to‐bedside review: avoiding pitfalls in critical care meta‐analysis–funnel plots, risk estimates, types of heterogeneity, baseline risk and the ecologic fallacy. Crit Care 12: 220 http://doi.org/10.1186/cc6941 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Rodrigues, F.G. , Oliveira, S.B. , Rocha, B.C. and Moreira, L.A. (2006) Germline transformation of Aedes flaviatilis with the piggyBac transposable element. Mem Inst Oswaldo Cruz 101: 755–757. [DOI] [PubMed] [Google Scholar]
  55. Rubin, G.M. and Spradling, A.C. (1982) Genetic transformation of Drosophila with transposable element vectors. Science 218: 348–353. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/6289436. [DOI] [PubMed] [Google Scholar]
  56. Sarkar, A. , Sim, C. , Hong, Y.S. , Hogan, J.R. , Fraser, M.J. , Robertson, H.M. et al (2003) Molecular evolutionary analysis of the widespread piggyBac transposon family and related “domesticated” sequences. Mol Genet Genomics 270: 173–180. [DOI] [PubMed] [Google Scholar]
  57. Schetelig, M.F. and Handler, A.M. (2013) Germline transformation of the spotted wing drosophilid, Drosophila suzukii, with a piggyBac transposon vector. Genetica 141: 189–193. http://doi.org/10.1007/s10709-013-9717-6 [DOI] [PubMed] [Google Scholar]
  58. Schulte, C. , Theilenberg, E. , Muller‐Borg, M. , Gempe, T. and Beye, M. (2014) Highly efficient integration and expression of piggyBac‐derived cassettes in the honeybee (Apis mellifera). PNAS 111(24): 9003–9008. http://doi.org/10.1073/pnas.1402341111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Sim, H. , Shin, B.C. , Lee, M.S. , Jung, A. , Lee, H. and Ernst, E. (2011) Acupuncture for carpal tunnel syndrome: a systematic review of randomized controlled trials. J Pain 12: 307–314. http://doi.org/10.1016/j.jpain.2010.08.006. [DOI] [PubMed] [Google Scholar]
  60. Smith, G.D. and Ebrahim, S. (2002) Data dredging, bias, or confounding. BMJ 325: 1437–1438. http://doi.org/10.1136/bmj.325.7378.1437 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Sumitani, M. , Yamamoto, D. , Oishi, K. , Lee, J. and Hatakeyama, M. (2003) Germline transformation of the sawfly, Athalia rosae (Hymenoptera: Symphyta), mediated by a piggyBac‐derived vector. Insect Biochem Mol Biol 33(4): 449–458. http://doi.org/10.1016/S0965-1748(03)00009-2 [DOI] [PubMed] [Google Scholar]
  62. Tamura, T. , Thibert, C. , Royer, C. , Kanda, T. , Abraham, E. , Kamba, M. et al (2000) Germline transformation of the silkworm Bombyx mori L. using a piggyBac transposon‐derived vector. Nat Biotechnol 18: 81–84. http://doi.org/10.1038/71978 [DOI] [PubMed] [Google Scholar]
  63. Warren, I.A. , Fowler, K. and Smith, H. (2010) Germline transformation of the stalk‐eyed fly, Teleopsis dalmanni . BMC Mol Biol 11(1): 86 http://doi.org/10.1186/1471-2199-11-86 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Wells, D.J. (2004) Gene therapy progress and prospects: electroporation and other physical methods. Gene Ther 11: 1363–1369. http://doi.org/10.1038/sj.gt.3302337. [DOI] [PubMed] [Google Scholar]
  65. Wickham, H. (2014) Tidy data. J Stat Sci 59: 1–23. [Google Scholar]
  66. Wu, X. and Burgess, S.M. (2004) Integration target site selection for retroviruses and transposable elements. Cell Mol Life Sci 61(19‐20): 2588–2596. http://doi.org/10.1007/s00018-004-4206-9 [DOI] [PubMed] [Google Scholar]
  67. Zhuang, L. , Wei, H. , Lu, C. and Zhong, B. (2010) The relationship between internal domain sequences of piggyBac and its transposition efficiency in BmN cells and Bombyx mori . Acta Biochim Biophys 42: 426–431. http://doi.org/10.1093/abbs/gmq039.Original [DOI] [PubMed] [Google Scholar]
  68. Zimowska, G.J. and Handler, A.M. (2006) Highly conserved piggyBac elements in noctuid species of Lepidoptera. Insect Biochem Mol Biol 36: 421–428. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Additional Supporting Information may be found in the online version of this article at the publisher's web‐site:

Figure S1. A simple method to describe the distribution of the meta‐analysis transformation efficiency values is to plot rank on value. Relative rank is calculated (p = r/n) as the proportion of values in the number of trials for the pooled data whose ranks are less than or equal to that value. Percentile information can be retrieved from the figure, for example the median is at relative rank 0.5, upper quartile at 0.75 and lower quartile at 0.25. The rug plot along the x‐axis is a uni‐variate scatter of the transformation efficiency estimates pooled from the literature (n = 74). Most (∼83%) of the published transformation efficiency data are between 0 and 20%.

Figure S2. Funnel plot suggesting bias for transformation efficiency data with confidence interval based on sample size generated using Wilson's method (mean, solid line; 95% confidence interval, dotted and dashed line; 99% confidence interval, dashed line). The plot shows all the nonzero transformation efficiency data collected from the literature (64 nonzero experiments plotted with nine zero experiments; another 13 had missing data).

Figure S3. The funnel plot reveals some patient researchers with over 6000 injections in some species before success. Funnel plot of survival data with confidence interval based on sample size generated using Wilson's method (mean, solid line; 95% confidence interval, dotted and dashed line; 99% confidence interval, dashed line). The plot shows all the nonzero transformation efficiency data collected from the literature (64 nonzero experiments plotted with nine zero experiments; another 13 had missing data).

Figure S4. In the absence of publication bias we might expect a symmetrical funnel plot. The bounded nature of the proportion data limits the effectiveness of the plot as the Pearson–Klopper confidence intervals are bounded at zero, limiting the size of the 95% confidence interval, which is calculated by subtracting the lower interval from the upper interval at the 95% confidence interval. As the transformation efficiency is not relative to any conventional control, unlike medicine, this removes the relative nature and expected symmetry of the plot.

Figure S5. Histogram of the transformation efficiency of the meta‐analysis pooled data with bin width of 0.05. The data are positively skewed and bounded between zero and one. A conventional histogram with associated density curve is shaded grey and a solid line. The area under the curve and between the axes integrates to unity and provides a visual representation of the probability of a transformation efficiency falling in a given interval. The cumulative density histogram is also provided as white bars and a dashed density curve. The graph shows why the use of a mean (0.097 and SD of 0.120) to summarize the data is inappropriate as it is asymmetrical. The bounded nature of the transformation efficiency (between zero and one) also creates problems for the normal approximation.

Figure S6. A histogram of the nonzero transformation efficiencies of experiments from the systematic literature review. A beta distribution curve with parameter estimates was overlaid, fitted from the data using the ‘fitdistr’ function from the MASS package in R (Brian Ripley, 1998). The bin width and number of measurements in total forming the histogram were multiplied by the beta distribution so that the curve could be normalized to the correct height. The area under the curve and between the axes integrates to unity and provides a visual representation of the probability of a transformation efficiency falling in a given interval.

Figure S7. Bayesian tri‐plot for the mean transformation efficiency probability density in Tribolium castaneum. The prior was formulated using 38 transgenics given 95 fertile G0 crosses (after Lorenzen et al., 2003); the likelihood represents the data of 36 transgenics given 152 fertile G0 crosses (after Lorenzen et al., 2003). The prior and likelihood are combined using Bayes' theorem to create the posterior distribution, which provides a probabilistic parameter estimate of the transformation efficiency in Tribolium castaneum given previous information and recent experimental evidence.

Figure S8. The mean and binomial confidence intervals appear to overestimate the transformation efficiency consistently in all four well‐studied species shown (Aedes aegypti = 30, Plutella xylostella = 33, Ceratitis capitata = 26, Pectinophora gossypiella = 35). The axes are not constant so care should be taken when comparing between species. The data includes some experiments included in publications highlighted black. The funnel plot provides transformation efficiency data with confidence interval based on sample size generated using Wilson's method (mean, solid line; 95% confidence interval, dotted and dashed line; 99% confidence interval; dashed line). A benefit of the funnel plot is that it highlights the ever‐present danger of mistaking variation owing to chance for correlation or causation.

Figure S9. Output from the Goldilocks simulation model – helping researchers to get the number of injections just right. The top and bottom row are simulations involving 500 and 1000 injections, respectively. The left column uses the median statistics from the published data and the right from a more complete Oxitec data set identifying publication bias. The literature provides an overly optimistic view of the chances of successful transformation given a number of injections.

Table S1. A timeline of Oxitec publications involving transformation events of an insect species using piggyBac. Injection data included number of microinjections, number of injection survivors and number of independent transgenic lines including details of construct and helper concentrations used.

Table S2. Only Oxitec in‐house data are summarized (experiments carried out in partnership with Oxitec are not included). An experiment is defined as a unique construct injected into the given insect species. On occasion an experiment will be unsuccessful in that transgenesis is not achieved, described as a zero transgenics experiment. The number of injection survivors crossed and their progeny screened for transgenics varied.

Table S3. The probability mass function of the transformation efficiency data found in the literature offers a complete empirical probability mass function version in graphical form. Technically zero is a point not an interval. The probability (0–1) that a transformation efficiency of a publication randomly sampled from the literature sample will fall into given intervals. Convention for 3 significant figures (3.s.f).

Table S4. The alpha and beta shape parameter estimates for the fitted beta distribution applied to three different subsets of the Oxitec transformation efficiency data facilitating empirical Bayesian shrinkage towards a beta prior. To utilize these parameters for improving transformation efficiency estimation adjust estimates by: empirical Bayes estimate = (successes + α) / (number of trials + α + β) (Jiang & Zhang, 2010). Prior to subsetting, observations with fewer than 20 injection survivors were removed. Lepidoptera consist of Plutella xylostella and Pectinophora gossypiella, MedFly just Ceratitis capitata and mosquitoes both Aedes albopictus and Aedes aegypti.

Table S5. The diamondback moth parameters to be used in two separate uses of the model to compare differences or the bias produced by reliance on an incomplete data set (published). Both survival and transformation efficiency medians are given for the published and Oxitec data sets.


Articles from Insect Molecular Biology are provided here courtesy of Wiley

RESOURCES