Skip to main content
NPJ Vaccines logoLink to NPJ Vaccines
. 2018 Oct 4;3:50. doi: 10.1038/s41541-018-0088-y

Raman spectroscopy-based identification of toxoid vaccine products

Anja Silge 1,2, Thomas Bocklitz 1,2,3, Bjoern Becker 4, Walter Matheis 4, Juergen Popp 1,2,3,, Isabelle Bekeredjian-Ding 4,
PMCID: PMC6172244  PMID: 30323957

Abstract

Vaccines are complex biomedicines. Manufacturing is time consuming and requires a high level of quality control (QC) to guarantee consistent safety and potency. An increasing global demand has led to the need to reduce time and cost of manufacturing. The evolving concepts for QC and the upcoming threat of falsification of biomedicines define a new need for methods that allow the fast and reliable identification of vaccines. Raman spectroscopy is a non-destructive technology already established in QC of classical medicines. We hypothesized that Raman spectroscopy could be used for identification and differentiation of vaccine products. Raman maps obtained from air-dried samples of combination vaccines containing antigens from tetanus, diphtheria and pertussis (DTaP vaccines) were summarized to compile product-specific Raman signatures. Sources of technical variance were emphasized to evaluate the robustness and sensitivity in downstream data analysis. The data management approach corrects for spatial inhomogeneities in the dried sample while offering a proper representation of the original samples inherent chemical signature. Reproducibility of the identification was validated by a leave-one-replicate-out cross-validation. The results highlighted the high specificity and sensitivity of Raman measurements in identifying DTaP vaccine products. The results pave the way for further exploitation of the Raman technology for identification of vaccines in batch release and cases of suspected falsification.

Quality control: Rapid and reliable vaccine analysis

A light-based identification method offers a fast, reliable and non-destructive method to analyze vaccines. Using vaccines for tetanus, diphtheria, and pertussis, a German research team led by the Paul Ehrlich Institute and University of Jena and Leibniz-IPHT showed that Raman spectroscopy — which identifies substances based on how they scatter laser light — is able to identify distinctive signatures of vaccines. In their experiments, Raman spectroscopy was sensitive enough to detect subtle differences in vaccine formulation, such as the specific combination of vaccine and adjuvant components. However, a data analytics technique was required to correct for sample quality variation caused by their preparation. Raman spectroscopy already sees use in classical medicines, and its application to vaccines could help to reduce the time and cost of quality control while benefitting the unmet need for rapid analysis of vaccine quality and identity.

Introduction

Due to their size and complexity, the quality of biomedicines cannot be characterized nor controlled by conventional chemical methods. Instead, regulatory requirements for biomedicines include a tightly controlled manufacturing processes and batch release by both the manufacturers and official medicines control laboratories (OMCL). The methods used for product characterization and batch testing are defined in the marketing authorisation and need to be approved by the regulatory authority. The regional pharmacopoieia, e.g., European Pharmacopoieia (Ph. E.), provides the legislative framework for product testing and regulatory bodies such as the European Directorate for Quality of Medicines (EDQM) prequalify methods for these purposes, including the biological standards to be used to obtain comparability. Among the methods established for quality control of classical medicines the so called “non-invasive”, e.g., non-destructive, techniques, such as near-infrared and Raman spectroscopy have been applied for molecular imaging and analytics in process analytical technology (PAT) and are implemented in quality by design (QbD) concepts.15 Recent technical developments in the field of Raman technology now enable manufacturers to use this technique for analysis of more complex biological products including protein mixtures in bioreactors and cell-based and tissue-engineered products.68 Raman microspectroscopy is an inelastic light scattering-based method useful for the non-destructive analysis of biochemical samples. It provides a wealth of molecular information on a specimen by the sample’s own inherent vibrational signatures. As the biochemical composition of a sample is mirrored in the Raman spectrum, mathematical methods including analytical modelling translate the physically recorded Raman data into higher level information, which can further be exploited for comparative analyses. The fingerprint-like specificity of the spectral signatures can be utilized to setup a reference database of tested biological products for identification purposes.911

Vaccines used for prophylaxis of infectious diseases range among the eldest and most complex biomedicines. Today, a remarkable number of them still consist of mixtures of attenuated microorganisms or detoxified virulence factors that induce protective immune memory, i.e., either opsonizing or neutralizing antibody responses. The microbial antigens contained in a vaccine are often poorly immunogenic, such as inactivated toxins and are, therefore, provided together with adjuvants that augment the immune response.12 The most common adjuvants are poorly soluble aluminium salt-containing gels that adsorb the antigens, thereby forming a suspension.13,14 Furthermore, preservatives and excipients may be present in vaccines as well as traces of chemical substances, which usually are residuals from the manufacturing procedure. Thus, any vaccine consists of a unique mixture of chemically distinct substances.15 The final composition is considered to be characteristic for every single vaccine product.

Combination vaccines against tetanus, diphtheria and pertussis have successfully been employed over decades. Their active substances are detoxified tetanus, diphtheria and pertussis toxins and, dependent on the product, other Bordetella pertussis-derived antigens. Notably, in the Western hemisphere, use of acellular pertussis vaccines predominates because the former whole-cell pertussis vaccine was less well tolerated. Some vaccines additionally contain Hepatitis B antigens and/or inactivated poliovirus. In this study, the authors used tetanus-diphtheria-acellular pertussis (DTaP) vaccines commercially available on the German market.

DTaP combination vaccines are used for primary immunization and booster vaccination in all age groups. They differ in vaccine antigen content and the adjuvants used, e.g., aluminium hydroxide (Al(OH)3) and/or aluminium phosphate (AlPO4). As in many countries around the world, in the EU, an important prerequisite for marketing of these vaccines is the governmental batch release granted to the marketing authorization holder. For DTaP vaccines, this implies that vaccines are experimentally tested using standardized assays for confirmation of antigen identity and content as well as potency at intermediate and final product stages. Pass and fail criteria need to be defined, and any confirmed out-of-specification results lead to rejection of the vaccine batch by the manufacturer or the authorities.

There is also a growing need for rapid testing procedures that confirm identity and preserved safety and efficacy of medicinal products after market entry. This new demand arises from the increased awareness on falsification of medicines, which also affects biomedicines such as monoclonal antibodies16 and potentially vaccines. The development of methods for rapid verification of vaccine identity, antigen concentration, integrity and potency represent a significant requirement in this context.

Raman spectroscopy has already been applied for identification of falsified classical medicines.17 In this study, we investigated the potential of Raman spectroscopy for identification and differentiation of DTaP vaccines. The study is driven by a so far unmet need for rapid and reliable methods that support OMCL decision making on product identity and quality. The Raman spectroscopy is a non-invasive technology that provides rapid information on easily accessible product characteristics. Since Raman spectra contain the complete information on the Raman-active chemical content we hypothesized that Raman spectra obtained from vaccine products should be specific for the products and could be used for product identification.

Results

Raman spectroscopic analysis of vaccine samples

In the present study, we hypothesized that Raman microspectroscopy could be used to rapidly identify and distinguish vaccine products. Our working hypothesis was that due to differences in formulations vaccine products of the same vaccine type would still differ in their Raman spectra because spectra arising from the multitude of components and excipients would form a unique combined Raman spectral fingerprint that could be used as a product identifier.

For standardization and to obtain optimal Raman spectral information the vaccine suspensions were dried before measurements. A microscope image of an air-dried vaccine sample (dTaP-IPV) is depicted in the 3D data cube in the left part of Fig. 1. The overlaid grid visualizes the arranged sample raster for the mapping. To obtain a comprehensive Raman spectral signature of a unique vaccine product, spectra of all pixels were sequentially arranged in a 2D data matrix. In a first step, the 2D data matrices derived from the replicate measurements were summarized simply by averaging. Figure 2 depicts the averaged Raman spectra derived from the mapped replicates of each vaccine product. The mean spectra reveal that each vaccine may have a specific spectral signature. The grey shaded area around the spectra visualizes the standard deviations of the mean spectra of replicate measurements.

Fig. 1.

Fig. 1

Principle of vaccine Raman spectral data management. Left: Arrangement and output of Raman measurements of an air-dried vaccine droplet is visualized. The microscopic image in the 3D data cube shows the air-dried vaccine sample dTaP-IPV1. The overlaid grid visualizes the arranged sample raster of the Raman spectroscopic mapping approach. The hyperspectral data cube is reoriented into a 2D data matrix. To obtain the Raman spectral signature of a distinct vaccine product, the data matrices of the respective replicates were averaged. Right: The processing for the statistical modelling is sketched. The 2D data table of each replicate was reduced from the 100 original spectra to 10 representative spectra. Ten randomly chosen raster points were successive picked without replacement and averaged. The matrices of all vaccine replicates were applied for statistical modelling and cross-validation. The prediction result for one replicate was assembled by the prediction result of its ten spectra. If the prediction result was not uniform for all spectra of one replicate, the prediction output was based on majority rule voting to identify the vaccine product specification

Fig. 2.

Fig. 2

Mean Raman spectra of the investigated vaccine products. Mean spectra present the average of the n = 1000 (T1), n = 1000 (T2), n = 1000 (dT1), n = 2100 (dT2), n = 1100 (IPV), n = 1000 (dTaP), n = 1100 (dTaP-IPV1), n = 1100 (dTaP-IPV2), n = 1000 (dT-IPV), n = 1000 (DTaP-IPV-HepB), n = 1000 (HepB) measured single Raman spectra. The Raman spectra were collected by the mapping based measurements of technical replicates (air-dried vaccine droplets). The mean spectra visualize the averaged Raman signature of the indicated vaccine products, including the Raman data of the measured replicates with varying drying times. The standard deviations of the means are visualized by the grey shades around each spectrum. All spectra are normalized for an effective comparison and shifted vertically for visualization

Since the scope of the present study was to use Raman microspectroscopy for the identification and differentiation of vaccine products, our initial analysis was focused on the investigation of technical replicates to determine whether the preparation and background noise could represent a limitation as observed in many high-throughput biological experiments.

Technical reasons for variance of Raman spectra

While examining air-dried vaccine samples by means of Raman microspectroscopy two sources of technical variance were identified. First, the drying procedure of the colloidal vaccine droplets caused a spatial inhomogeneity of the vaccine material during evaporation, which is known as the coffee-ring effect.18 Therefore, concentration gradients within the air-dried droplet resulted in spatial inhomogeneities and therefore in variation of Raman signal intensity at different grid points (Fig. 1 3D data cube, Fig. S1). Second, vaccine products containing phenoxyethanol (Table 1) showed variations within their Raman signature that roughly correlated with the drying time. In Fig. 3a–c the mean spectra of dTaP-IPV2, IPV, and dT2 are depicted. Each spectrum summarizes the Raman signatures of one replicate mapped after the indicated drying interval. The mean spectra of the first replicates were measured after a drying interval of 20 min. These spectra were dominated by Raman signals (highlighted by asterisks) that were previously assigned to phenoxyethanol by Badawi et al.19. These signals diminished in intensity the longer the vaccine suspensions were dried at room temperature, albeit to varying extents: For the vaccine products dTaP-IPV2 and IPV the signals disappear almost completely after a drying period of 2 h (Fig. 3a, b) while in dT2 the phenoxyethanol signals remain prominent in the Raman signature within the observed time span. Furthermore, the dried spots of a vaccine suspension are not identical preparations, they rather represent technical replicates. Therefore, slight differences in the drying behaviour were obtained in the microscope images of distinct vaccine spots (Fig. S1). Also the evaporation behaviour of the phenoxyethanol is subjected to such fluctuations, Fig. 3a. We suggest, each replicate has its individual drying kinetic. Therefore, the phenoxyethanol Raman signals can still be present after 100 min drying time in one spot while they were disappeared after 90 min drying in another spot (Fig. 3a). Aside from such fluctuations, the trend of the decreasing Raman signal intensities of phenoxyethanol with ongoing drying time due to evaporation is evident in the mean spectra of Fig. 3a, b. For the statistical modelling, it is important to be aware of such variations and subsequent to include them for a robust training data set.

Table 1.

Vaccine products used in this study

Vaccine product (antigen composition) Adjuvant Phenoxyethanol Manufacturer
T1 Al(OH)3 No A
T2 Al(OH)3 No C
dT1 Al(OH)3 No C
dT2 AlPO4 Yes A
dTaP Al(OH)3+AlPO4 No B
dTaP-IPV1 Al(OH)3+AlPO4 No B
dTaP-IPV2 AlPO4 Yes A
dT-IPV Al(OH)3 Yes A
IPV none Yes A
DTaP-IPV-HepB Al(OH)3+AlPO4 No B
HepB Al(OH)3 No B

Vaccine antigens: T tetanus, d diphtheria (low antigen content), D diphtheria (high antigen content), aP acellular pertussis antigens, IPV inactivated poliovirus, HepB hepatitis B

Fig. 3.

Fig. 3

Technical variance of Raman spectra depending on the length of drying. For a detailed view, the Raman data of the three vaccine products dTaP-IPV2, IPV, dT2 were split into the data subsets of the replicate measurements. The mean spectra of the separate technical replicates are plotted according to drying time (from bottom to top). The standard deviations are visualized by the grey shades around each spectrum. Prominent signals of phenoxyethanol are highlighted by asterisks in analogy to Badawi et al.19. The signal intensities of phenoxyethanol decrease to different extents during the course of drying. All spectra were normalized for an effective comparison and shifted vertically for visualization

Data evaluation by linear discrimination analysis (LDA)

The aim of the statistical analysis of the vaccine Raman data is to find the discriminating features in the spectral data and model the differences between the vaccine products so that unknown samples could be identified by their Raman signature. Applying linear discrimination analysis (LDA), the statistical properties of the particular vaccine types were calculated. To correct for the spatial inhomogeneities in the mapping area of the vaccine material, single Raman spectra of one map were summarized to subsets of averaged spectra by a randomized averaging procedure. The procedure is visualized in the right part of Fig. 1. Ten spectra from the 2D data matrix were successively picked without replacement and averaged. Therefore, the data set of each replicate was reduced from the original 100 to 10 spectra. The reduced data matrices (Fig. 1) were utilized for statistical modelling. To consider the vaccine formulation dependent evaporation behaviour of the phenoxyethanol, Raman data from replicates recorded after different drying times were included into the present vaccine data set.

In Fig. 4a the 3D scatterplot of the discriminant functions (LD2–LD4) are plotted to visualize one discrimination possibility between vaccine Raman data. The Raman data derived from the replicate measurements of the particular vaccine types are represented by colour-coded squares. The ellipsoids visualize the 95% confidence interval for the tested batches of each product. The scatterplots reveal that the vaccine Raman data cluster according to their Raman spectral similarities close together or in distinct areas of the spectral feature space. The sizes of the ellipsoids and the spatial spread of the data points visualize the variance of the data within each type and therefore the variance of the technical replicates.

Fig. 4.

Fig. 4

Classification of the vaccine products are represented by the discriminant functions LD2, LD3 and LD4 in a 3D scatterplot. Raman data derived from the replicate measurements of the particular vaccine products are represented by colour-coded squares, ellipsoids visualize the 95% confidence interval for each type. Vaccine Raman data cluster according to their Raman spectral similarities close together or in distinct areas of the spectral feature space

Discrimination of similar vaccine formulations

The performance of the classifier for vaccines with equivalent formulations was investigated and the results are shown in Fig. 5. The radar plot provides information on the statistical properties of the vaccine model at a glance. Big differences between the LD values of the compared vaccine products in one particular direction reveal big spectral differences in the Raman data. Consequently, the smaller the overlapping surfaces in the radar plot become the more distinct the differences of the Raman signatures of the compared vaccines are. Figure 5a depicts the comparison of the vaccine products containing the same vaccine antigens but produced by different manufacturers. Both dTaP-IV vaccine products differ in their Raman spectral profiles shown in Fig. 2b, which becomes evident in the small surface overlap in the radar plot. Also, both dT vaccine products differ in their Raman spectral profile (Fig. 2a). The larger surface area of dT1 when compared to dT2 indicates a clear statistical difference of the spectral features. The Raman signatures of both tetanus vaccines (T1 and T2) in Fig. 2a look very similar but the radar plot of the tetanus vaccines in Fig. 5a reveals a clear offset of the surface areas and therefore clearly distinguishable Raman data. Similarly, vaccine products containing the same adjuvants were compared in Fig. 5b. The offsets of the surface areas indicate, that the differences between the respective vaccine products are clearly reflected by the spectroscopic data.

Fig. 5.

Fig. 5

Discrimination of similar vaccine formulations. The radar plot provides information on the statistical properties of the vaccine model. The predicted scores for each LD function are depicted. Big differences between the LD values of the compared vaccine products in one particular direction reveal big spectral differences. a Analysis of vaccine products containing the same composition of active substances. Vaccine products containing the same set of vaccine antigens display differences in their Raman spectra that can be exploited for product identification. b Comparison of vaccine products with similar adjuvants. Subtle differences derived from the active substances can be recognized in the Raman spectra of vaccines containing the same adjuvant

Optimization of LDA modelling

Classification rates were estimated using a leave-one-replicate-out cross-validation procedure to ensure the ability of the approach to work in routine laboratories. The vaccine model was tested by leave-one-replicate-out cross-validation meaning that a LDA model was constructed based on training data that included all but one experimental replicate, while the left-out replicate served as validation data for testing. This was performed once for all replicates of all vaccine products. The prediction result for one measured replicate (technically independent Raman maps) was assembled by the prediction result of its ten pre-arranged spectra (Fig. 1). If the prediction result was not uniform for spectra of one replicate, the prediction output was based on majority rule voting. The results for all vaccines are summarized in Table 2. In the diagonal, the percentage of correctly predicted replicates is shown. The sensitivities and specificities for each vaccine type are indicated below the table, together with the number of measured replicates and the number of averaged spectra included in the statistical analysis. The numbers of the original Raman spectra derived from the mapping measurements are indicated in the last row of the table.

Table 2.

Confusion table of the cross-validation. Classification rates were estimated using a hold-out validation procedure to ensure applicability in routine laboratories.

True label [%] T1 T2 dT1 dT2 dTaP dTaP-IPV1 dTaP-IPV2 dT-IPV IPV DTaP-IPV-HepB HepB
Predicted vaccine label [%] T1 100 0 0 0 0 0 0 0 0 0 0
T2 0 90 20 0 0 0 0 0 0 0 0
dT1 0 10 80 0 0 0 0 0 0 0 0
dT2 0 0 0 95 0 0 0 0 0 0 0
dTaP 0 0 0 0 100 0 0 0 0 0 0
dTaP-IPV1 0 0 0 0 0 100 0 0 0 0 0
dTaP-IPV2 0 0 0 5 0 0 100 0 0 0 0
dT-IPV 0 0 0 0 0 0 0 100 9 0 0
IPV 0 0 0 0 0 0 0 0 91 0 0
DTaP-IPV-HepB 0 0 0 0 0 0 0 0 0 100 0
HepB 0 0 0 0 0 0 0 0 0 0 100
Sensitivity [%] 100 90 80 95 100 100 100 100 91 100 100
Specificity [%] 100 98 99 100 100 100 99 99 100 100 100
Number of replicates 10 10 10 21 10 11 11 10 11 10 10
Number of averaged spectra 100 100 100 210 100 110 110 100 110 100 100
Number of original spectra 1000 1000 1000 2100 1000 1100 1100 1000 1100 1000 1000

The table-diagonal shows the percentage of correctly predicted replicates

Discussion

The present work demonstrates the potential of the Raman spectroscopy to analyse and classify complex manufacturer-specific vaccine formulations with one single analytical method. The methodological approach presented in this study provides a good basis for development of an easy and reliable system for vaccine product identification based on a vaccine’s inherent Raman spectral features. The fingerprint-like specificity of a vaccine product’s spectral signatures observed in this study could be exploited to setup a reference database of well-standardized Raman spectra of vaccine products to estimate batch-to-batch variation. Thus, this study addresses an unmet need for a cost-effective, label-free and reliable screening method with minimal preparation and measurement effort in vaccine control.

The examination of dried fluids containing proteins in submicromolar concentrations by the Raman spectroscopy is a well-established and straightforward approach, known as drop coat deposition Raman spectroscopy (DCDRS).2022 Here, we combined the spectral analysis of thin layers of air-dried vaccine solution with statistical modelling for vaccine product identification.

The drying of 1 µl of vaccine sample permitted us to record a reproducible and representative spectral fingerprint of a vaccine. The data management approach used corrects for spatial inhomogeneities in the dried sample while offering a proper representation of the original sample’s inherent chemical signature. To equalize spatial inhomogeneities caused by local bulky spots or areas with less vaccine substance within the air-dried spot, a randomized averaging process was introduced to the data of the Raman maps. Subsequently, the product-specific spectral features sampled from spatially distributed vaccine material were summed up and intensified while infrequently appearing artefacts were averaged out. The randomized averaging procedure of the mapping data makes the introduced approach reproducible and more robust against technical variations.

Because, a Raman spectrum is the sum of the Raman signals of all chemical components within a sample, the information on multiple vaccine components and the excipients is probed almost instantaneously. The pattern of spectroscopic variations (i.e., changes in peak heights, widths, positions, relative heights and widths, etc.) reflects the pattern of variation within the vaccine formulations, which consist of a variable number of different components. However, a spectrum with more than 1000 wavenumbers does not contain a thousand pieces of independent information. The resulting spectrum is formed by the superposition and the reciprocal influence of the underlying spectral values simultaneously recorded from all chemical constituents, e.g., vaccine antigens, adjuvants, buffer components and other excipients. These signals are further influenced by the presence of solvents, the pH and physical properties, such as formation of crystals or amorphous particles within the vaccine suspension. Thus, the mixture itself and other interfering effects complicate the analysis of the spectral data. Well-established computational methods were applied to correct for the influence of the instrumental setup or background noise.23,24

A common method for dimension reduction is the principal component analysis (PCA).25,26 PCA transforms a set of possibly correlated response variables into a new set of non-correlated variables, referred to as principal components (PC). The output of the PCA is the components in the order of significance. Components with less significance (assigned to noise) can be ignored. Therefore, the dimension of the data is reduced without loss of information.27 Subsequently, the data were classified by LDA to capture the particular combination of spectral values that form the unique spectra of defined vaccine formulations. The output of this analysis was a vaccine product-specific LD value that could be utilized as a product-specific identifier once systematic batch-to-batch analysis has been carried out.

Notably, classification rates were estimated using a leave-one-replicate-out cross-validation procedure to ensure the applicability of the approach in routine laboratories. The classification rules defined for the LD value can subsequently be applied for the allocation of new and unknown samples. Reproducibility and robustness of the model was evaluated by a leave-one-replicate-out cross-validation.28

In combination with batch-to-batch variances technical variability can hamper the statistical analysis of the data, thus, increasing the risk for misinterpreted results. Thus, the technical variance information is a crucial point to evaluate the robustness and sensitivity in downstream data analysis.29 Technical reproducibility for the Raman spectroscopic analysis of the air-dried vaccine suspension was demonstrated to be effective within a time frame of 2 h. The present study revealed that a proper adaption of the drying intervals for different vaccine products containing the volatile excipient phenoxyethanol considerably increases the classification accuracy. For example, with longer drying intervals and subsequent decreasing phenoxyethanol signal intensities, the misclassification between dT2 for dTaP-IPV2 was reduced. Nevertheless, shorter drying intervals were beneficial to differentiate the investigated IPV product from the dT-IPV vaccine. Our data suggest that for Raman measurement of vaccines containing phenoxyethanol a time series of different drying intervals with at least three time points (including one droplet preparation with a drying period of 20 min, 2 h and one time point in-between, respectively) should be recorded for an optimal result.

Notably, phenoxyethanol is used as preservative in some vaccine products.30 It is a highly volatile solvent and its drying kinetic within a complex vaccine formulation depends on the present molecular interactions. In forensic science, determining the phenoxyethanol concentrations on an ink stroke on paper is a common method to trace the course of drying.31,32 The data presented suggest that, similarly, the observed phenoxyethanol evaporation kinetic is a specific feature of the vaccine formulation.

Another characteristic feature of the DTaP vaccines is that vaccine antigens are typically adsorbed to adjuvants, such as Al(OH)3 and AlPO4.14 The results of the present study reveal that despite the signals derived from the same adjuvant in different vaccine products the sensitivity of the Raman technology is high enough to allow the detection of subtle differences in spectra that arise from the specific vaccine formulation, e.g., the combination of antigenic components and adjuvant (Fig. 4b).

The potential confusion of T2 and dT1 was due to very similar spectral profiles, which could not be explained by spectral changes during the course of drying. A possible explanation could be that T2 and dT1 are subject to similar manufacturing processes that result in specific excipients or other residuals causing the specific signal overlap in the present vaccine data set. However, differences in the Raman data of these vaccines were captured and can be modelled by LDA. It is, thus, expected that with an increasing sample size the differentiation between T2 and dT1 Raman spectra can be highly improved. Nevertheless, in the present study the possible confusion is limited to these two specific vaccine products; thus, the error rates caused by T2 and dT1 did not question the classification of other vaccines and will not lead to the assignment of false positives or false negatives within the remaining vaccines. In this context it should further be noted that the distinction of vaccines with similar vaccine antigen composition and identical clinical indication, such as T1 and T2 or dT1 and dT2 were not mistaken for one another (Table 2). Thus, the results of the analysis demonstrate that Raman spectroscopy is an appropriate and reliable analytical tool for the differentiation of vaccines of similar composition. The signatures obtained can be exploited for product identification.

The high sensitivities and specificities for vaccine identification found in this study highlight that Raman spectroscopy could serve as a reliable and robust analytical tool delivering few false negative or false positive results. This was especially evident for DTaP vaccine products that displayed validated classification accuracies of 100%. The combination of machine learning and statistical modelling provides unbiased product identification based on the Raman spectra of the final vaccine product. However, the degree of batch-to-batch variation for each vaccine was not systematically investigated in this study.

Since the data indicated that the Raman signature of the individual product is unique, we refrained from an in-depth analysis of the Raman spectra to identify the underlying chemical components. However, other applications might require this analysis to confirm the presence of specific vaccine antigens and adjuvants. These results could, thus, also pave the way for application of the presented workflow for confirmation of vaccine batch consistency: the easily accessible product- and potentially batch-specific information provided by the introduced technical concept can be entered into product-specific databases. This approach is well in-line with the presently pursued quality control concept based on comparative analysis for consistency testing.3335

Materials and methods

Vaccines

Eleven different commercially available vaccine products were tested (Table 1). The vaccines were stored at 2–8 °C until measurement. Unfortunately, some licensed vaccine products could not be included in the study because of unavailability due to supply shortages.

Sample preparation

For Raman measurements vaccine suspensions were extracted from the containers into Eppendorf tubes, homogenized by vortexing and 1 μl was applied onto a CaF2 slide and dried at room temperature. The sample size was predicted using the learning curve36 resulting in five replicates to be necessary. Therefore, we planned with 10 in every group to ensure a valid statistical outcome. Twenty minutes after preparation the samples were visibly dry and the measures started with the first replicate. The last replicate was measured after ~2 h later.

Raman measurements

The air-dried vaccine preparations were mapped by grid defined Raman measurements. A 1000 × 1000 µm sample raster with 10 × 10 measurement points were arranged per droplet. Accordingly 100 spectra were recorded for each sample. Raman maps were recorded with a WITec UATS 300 spectrometer (Ulm, Germany), combined with a Zeiss microscope equipped with a Zeiss EC EPIPLAN objective (20×/0.4). The beam of a diode laser with an excitation wavelength of 514 nm was focused with a power of ~50 mW onto the sample. A piezo-electrically driven scanning stage moved the sample through the laser focus in a raster pattern. The step size between two raster points was set to 100 µm. For each measurement position one spectrum was recorded with an acquisition time of 5 s. The back scattered light was collected (through the same microscope objective), diffracted using a 600 lines/mm grating with a spectral resolution of 5 cm−1 and detected with a Peltier cooled electron-multiplying charge-coupled device (EM-CCD, −65 °C) camera.

Data pre-processing

The statistical language R was applied for chemometrical analysis and computations (R).37 The workflow of the data pre-processing was previously described.38 In summary, a cosmic spike removal followed by wavenumber calibration with 4-acetaminophenol39 background correction with the SNIP algorithm and finally a vector normalization of each Raman spectrum was carried out.

Each Raman map of a technical independent replicate represents a spectral data cube containing 100 single spectra. Spectra of all pixels were sequentially arranged in a 2D data matrix. To obtain a comprehensive Raman spectral signature of a specific vaccine product, the 2D data matrices derived from the replicate measurements were summarized by averaging. For statistics, a correction for the spatial inhomogeneities was required. Therefore, the 2D data table of each replicate was reduced from the 100 original spectra to 10 representative spectra applying an averaging procedure. For this purpose, 10 randomly chosen raster points were successive picked without replacement and averaged. The 10 mean Raman spectra were utilized to construct a reduced data matrix and these matrices were utilized for statistical modelling.

Statistical modelling

As classification model a combination of principal component analysis (PCA) and linear discriminant analysis (LDA) was applied to model the differences between the vaccine products. A dimension reduction via principal component analysis (PCA) was carried out and the first 12 principle components (PCs) were introduced to the LDA. Applying LDA, the statistical properties of the particular vaccine products were calculated, which includes the mean spectra for each type and the covariance matrix calculated across all types. In doing so, for the 11 vaccine product, 10 discriminative functions were determined by the LDA in the spectral space to classify the vaccine data set. The LDA model was validated by a leave-one-replicate-out cross-validation.28 For this purpose, all Raman spectral data except the data of one replicate were used to build a LDA model and then the model was utilized to predict the left-out subset. This method is repeated so that spectra of each replicate are predicted once. A majority vote was applied to determine a comprehensive prediction for a replicate. This voting scheme was applied to avoid mixed predictions, if the prediction result was not uniform for spectra of one replicate.

The radar plot visualizes the vaccine LDA model in a two-dimensional plot. The ten spokes in each plot visualize the 10 discriminative functions of the model. The LD scores contain the class-discriminatory information. For the radar plots,40 the predicted scores for each LD function were min/max normalized to scale the data between 0 and 1. Vaccines with similar score values are localized close together along one spoke line. The bigger the spectral differences are between the respective vaccines, the more different are their score values and the longer is the distance between the scaled scores along one spoke line.

Code availability

The software “R” and the utilized packages “peaks”, “MASS”, “fields”, “scatterplot3d” and “fmsb” are open source software and freely available. The in-house written procedures for data pre-processing can be obtained from T.B. and J.P. upon reasonable request.

Electronic supplementary material

Supplementary Figure S1 (342.3KB, pdf)

Acknowledgements

We thank Kay-Martin Hanschmann and Volker Öppling, PEI, Langen, Germany, for critical reading of the manuscript. The authors did not receive external funding for this study.

Author contributions

A.S., W.M., I.B.-D. and J.P. designed the experiments. A.S. and B.B. performed the Raman measurements. A.S., T.B. and W.M. analysed the data. A.S., W.M., I.B.-D., T.B. and J.P. interpreted the data. A.S., W.M., T.B. and I.B.-D. wrote the manuscript. A.S. and T.B. prepared the figures. All authors reviewed the manuscript.

Data availability

All data generated or analysed during this study are included in this published article and its supplementary information files.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Juergen Popp, Phone: +49 3641 206300, Email: juergen.popp@ipht-jena.de.

Isabelle Bekeredjian-Ding, Phone: +49 6103 77 3701, Email: isabelle.bekeredjian-ding@pei.de.

Electronic supplementary material

Supplementary information accompanies the paper on the npj Vaccines website (10.1038/s41541-018-0088-y).

References

  • 1.Beer TRM, de Baeyens WRG, Ouyang J, Vervaet C, Remon JP. Raman spectroscopy as a process analytical technology tool for the understanding and the quantitative in-line monitoring of the homogenization process of a pharmaceutical suspension. Analyst. 2006;131:1137–1144. doi: 10.1039/b605299a. [DOI] [PubMed] [Google Scholar]
  • 2.Versiani AF, et al. Multi-walled carbon nanotubes functionalized with recombinant Dengue virus 3 envelope proteins induce significant and specific immune responses in mice. J. Nanobiotechnol. 2017;15:26. doi: 10.1186/s12951-017-0259-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Vankeirsbilck T, et al. Applications of Raman spectroscopy in pharmaceutical analysis. Trends Anal. Chem. 2002;21:869–877. doi: 10.1016/S0165-9936(02)01208-6. [DOI] [Google Scholar]
  • 4.Paudel A, Raijada D, Rantanen J. Raman spectroscopy in pharmaceutical product design. Adv. Drug Deliv. Rev. 2015;89:3–20. doi: 10.1016/j.addr.2015.04.003. [DOI] [PubMed] [Google Scholar]
  • 5.Dave VS, Shahin HI, Youngren-Ortiz SR, Chougule MB, Haware RV. Emerging technologies for the non-invasive characterization of physical-mechanical properties of tablets. Int. J. Pharm. 2017;532:299–312. doi: 10.1016/j.ijpharm.2017.09.009. [DOI] [PubMed] [Google Scholar]
  • 6.Abu-Absi NR, et al. Real time monitoring of multiple parameters in mammalian cell culture bioreactors using an in-line Raman spectroscopy probe. Biotechnol. Bioeng. 2011;108:1215–1221. doi: 10.1002/bit.23023. [DOI] [PubMed] [Google Scholar]
  • 7.Esmonde-White KA, Cuellar M, Uerpmann C, Lenain B, Lewis IR. Raman spectroscopy as a process analytical technology for pharmaceutical manufacturing and bioprocessing. Anal. Bioanal. Chem. 2017;409:637–649. doi: 10.1007/s00216-016-9824-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Krafft C, Schie IW, Meyer T, Schmitt M, Popp J. Developments in spontaneous and coherent Raman scattering microscopic imaging for biomedical applications. Chem. Soc. Rev. 2016;45:1819–1849. doi: 10.1039/C5CS00564G. [DOI] [PubMed] [Google Scholar]
  • 9.Schmitt M, et al. Raman-Spektroskopie – der Weg zu einer labelfreien biomedizinischen Diagnostik. Endo Heute. 2012;25:262–267. doi: 10.1055/s-0032-1325591. [DOI] [Google Scholar]
  • 10.Huser T, Chan J. Raman spectroscopy for physiological investigations of tissues and cells. Adv. Drug Deliv. Rev. 2015;89:57–70. doi: 10.1016/j.addr.2015.06.011. [DOI] [PubMed] [Google Scholar]
  • 11.Krafft C, et al. Label-free molecular imaging of biological cells and tissues by linear and nonlinear raman spectroscopic approaches. Angew. Chem. Int. Ed. 2017;56:4392–4430. doi: 10.1002/anie.201607604. [DOI] [PubMed] [Google Scholar]
  • 12.McKee AS, Marrack P. Old and new adjuvants. Curr. Opin. Immunol. 2017;47:44–51. doi: 10.1016/j.coi.2017.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kool M, Fierens K, Lambrecht BN. Alum adjuvant. Some of the tricks of the oldest adjuvant. J. Med. Microbiol. 2012;61:927–934. doi: 10.1099/jmm.0.038943-0. [DOI] [PubMed] [Google Scholar]
  • 14.Baylor NW, Egan W, Richman P. Aluminum salts in vaccines--US perspective. Vaccine. 2002;20(Suppl 3):S18–S23. doi: 10.1016/S0264-410X(02)00166-4. [DOI] [PubMed] [Google Scholar]
  • 15.Josefsberg JO, Buckland B. Vaccine process technology. Biotechnol. Bioeng. 2012;109:1443–1460. doi: 10.1002/bit.24493. [DOI] [PubMed] [Google Scholar]
  • 16.Streit R. Der Herceptin® -Fall. Ein Fälschungsfall von Arzneimitteln größeren Ausmaßes. Bundesgesundheitsblatt. Gesundh. Gesundh. 2017;60:1203–1207. doi: 10.1007/s00103-017-2622-x. [DOI] [PubMed] [Google Scholar]
  • 17.Rebiere H, Martin M, Ghyselinck C, Bonnet PA, Brenier C. Raman chemical imaging for spectroscopic screening and direct quantification of falsified drugs. J. Pharm. Biomed. Anal. 2018;148:316–323. doi: 10.1016/j.jpba.2017.10.005. [DOI] [PubMed] [Google Scholar]
  • 18.Deegan RD, et al. Capillary flow as the cause of ring stains from dried liquid drops. Nature. 1997;389:827–829. doi: 10.1038/39827. [DOI] [Google Scholar]
  • 19.Badawi HM. Vibrational spectra and assignments of 2-phenylethanol and 2-phenoxyethanol. Spectrochim Acta A Mol. Biomol. Spectrosc. 2011;82:63–68. doi: 10.1016/j.saa.2011.06.066. [DOI] [PubMed] [Google Scholar]
  • 20.Hu H, Larson RG. Marangoni effect reverses coffee-ring depositions. J. Phys. Chem. B. 2006;110:7090–7094. doi: 10.1021/jp0609232. [DOI] [PubMed] [Google Scholar]
  • 21.Halvorson RA, Vikesland PJ. Drop coating deposition Raman (DCDR) for microcystin-LR identification and quantitation. Environ. Sci. Technol. 2011;45:5644–5651. doi: 10.1021/es200255y. [DOI] [PubMed] [Google Scholar]
  • 22.Ortiz C, Zhang D, Xie Y, Ribbe AE, Ben-Amotz D. Validation of the drop coating deposition Raman method for protein analysis. Anal. Biochem. 2006;353:157–166. doi: 10.1016/j.ab.2006.03.025. [DOI] [PubMed] [Google Scholar]
  • 23.McCreery RL. Raman spectroscopy for chemical analysis. New York: John Wiley & Sons; 2000. [Google Scholar]
  • 24.Bocklitz TW, Guo S, Ryabchykov O, Vogler N, Popp J. Raman based molecular imaging and analytics. A magic bullet for biomedical applications!? Anal. Chem. 2016;88:133–151. doi: 10.1021/acs.analchem.5b04665. [DOI] [PubMed] [Google Scholar]
  • 25.Cooper JB. Chemometric analysis of Raman spectroscopic data for process control applications. Chemom. Intell. Lab. Syst. 1999;46:231–247. doi: 10.1016/S0169-7439(98)00174-9. [DOI] [Google Scholar]
  • 26.Chiang LH, Russell EL, Braatz RD. Fault diagnosis in chemical processes using Fisher discriminant analysis, discriminant partial least squares, and principal component analysis. Chemom. Intell. Lab. Syst. 2000;50:243–252. doi: 10.1016/S0169-7439(99)00061-1. [DOI] [Google Scholar]
  • 27.Cozzolino D, Cynkar WU, Shah N, Smith P. Multivariate data analysis applied to spectroscopy. Potential application to juice and fruit quality. Food Res. Int. 2011;44:1888–1896. doi: 10.1016/j.foodres.2011.01.041. [DOI] [Google Scholar]
  • 28.Guo S, Bocklitz T, Neugebauer U, Popp J. Common mistakes in cross-validating classification models. Anal. Methods. 2017;9:4410–4417. doi: 10.1039/C7AY01363A. [DOI] [Google Scholar]
  • 29.Glaab E, Schneider R. RepExplore. Addressing technical replicate variance in proteomics and metabolomics data analysis. Bioinformatics. 2015;31:2235–2237. doi: 10.1093/bioinformatics/btv127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Geier DA, Jordan SK, Geier MR. The relative toxicity of compounds used as preservatives in vaccines and biologics. Med. Sci. Monit. 2010;16:SR21–SR27. [PubMed] [Google Scholar]
  • 31.Weyermann C, Kirsch D, Costa Vera C, Spengler B. A GC/MS study of the drying of ballpoint pen ink on paper. Forensic Sci. Int. 2007;168:119–127. doi: 10.1016/j.forsciint.2006.06.076. [DOI] [PubMed] [Google Scholar]
  • 32.Lociciro S, Dujourdy L, Mazzella W, Margot P, Lock E. Dynamic of the ageing of ballpoint pen inks. Quantification of phenoxyethanol by GC-MS. Sci. Justice. 2004;44:165–171. doi: 10.1016/S1355-0306(04)71709-8. [DOI] [PubMed] [Google Scholar]
  • 33.Bruysters MWP, et al. Drivers and barriers in the consistency approach for vaccine batch release testing. Report of an international workshop. Biologicals. 2017;48:1–5. doi: 10.1016/j.biologicals.2017.06.006. [DOI] [PubMed] [Google Scholar]
  • 34.Mattia Fde, et al. The consistency approach for quality control of vaccines - a strategy to improve quality control and implement 3Rs. Biologicals. 2011;39:59–65. doi: 10.1016/j.biologicals.2010.12.001. [DOI] [PubMed] [Google Scholar]
  • 35.Stirling C. Consistency as tool to support in vitro batch potency testing in GMP production. Dev. Biol. 2012;134:115–118. [PubMed] [Google Scholar]
  • 36.Figueroa RL, Zeng-Treitler Q, Kandula S, Ngo LH. Predicting sample size required for classification performance. BMC Med. Inform. Decis. Mak. 2012;12:8. doi: 10.1186/1472-6947-12-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.R Core Team. R: A language and environment for statistical computing (R Foundation for Statistical Computing, Vienna, Austria, 2014).
  • 38.Bocklitz T, Walter A, Hartmann K, Rösch P, Popp J. How to pre-process Raman spectra for reliable and stable models? Anal. Chim. Acta. 2011;704:47–56. doi: 10.1016/j.aca.2011.06.043. [DOI] [PubMed] [Google Scholar]
  • 39.Dörfer T, Bocklitz T, Tarcea N, Schmitt M, Popp J. Checking and improving calibration of Raman Spectra using chemometric approaches. Z. für Phys. Chem. 2011;225:753–764. doi: 10.1524/zpch.2011.0077. [DOI] [Google Scholar]
  • 40.Minato Nakazawa. fsmb: Functions for Medical Statistics Book with some Demographic Data. (CRAN, 2017) https://cran.r-project.org/web/packages/fmsb/index.html.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figure S1 (342.3KB, pdf)

Data Availability Statement

All data generated or analysed during this study are included in this published article and its supplementary information files.


Articles from NPJ Vaccines are provided here courtesy of Nature Publishing Group

RESOURCES