Skip to main content
F1000Research logoLink to F1000Research
. 2019 Jan 29;7:1725. Originally published 2018 Oct 31. [Version 2] doi: 10.12688/f1000research.16758.2

Oral hormone pregnancy tests and the risks of congenital malformations: a systematic review and meta-analysis

Carl Heneghan 1,a, Jeffrey K Aronson 1, Elizabeth Spencer 1, Bennett Holman 2, Kamal R Mahtani 1, Rafael Perera 1, Igho Onakpoya 1
PMCID: PMC6281024  PMID: 30631442

Version Changes

Revised. Amendments from Version 1

We thank the reviewers for their positive comments about our manuscript and we have responded to the points made and revised version 1 of the paper in light of their comments. The major changes affect the introduction which has been revised in line with the peer review comments, We have also made changes to the references due to citation errors; uploaded revised excel data sheets as there was an error in the data in the first version, and made one correction in the text of the results to the ‘Exposure to oral HPTs was associated with a 40% increased risk of all congenital malformations: pooled odds ratio (OR) = 1.40 (95% CI 1.18 to 1.66; P < 0.0001; I 2 = 0%).', which incorrectly stated the increased risk as 37%. We have also revised the forest plots (Figures 2-8) as the effect estimates were incorrectly labelled.

Abstract

Background: Oral hormone pregnancy tests (HPTs), such as Primodos, containing ethinylestradiol and high doses of norethisterone, were given to over a million women from 1958 to 1978, when Primodos was withdrawn from the market because of concerns about possible teratogenicity. We aimed to study the association between maternal exposure to oral HPTs and congenital malformations.

Methods: We have performed a systematic review and meta-analysis of case-control and cohort studies that included data from pregnant women and were exposed to oral HPTs within the estimated first three months of pregnancy, if compared with a relevant control group. We used random-effects meta-analysis and assessed the quality of each study using the Newcastle–Ottawa Scale for non-randomized studies.

Results: We found 16 case control studies and 10 prospective cohort studies, together including 71 330 women, of whom 4,209 were exposed to HPTs. Exposure to oral HPTs was associated with a 40% increased risk of all congenital malformations: pooled odds ratio (OR) = 1.40 (95% CI 1.18 to 1.66; P<0.0001; I 2 = 0%). Exposure to HPTs was associated with an increased risk of congenital heart malformations: pooled OR = 1.89 (95% CI 1.32 to 2.72; P = 0.0006; I 2=0%); nervous system malformations  OR = 2.98 (95% CI 1.32 to 6.76; P = 0.0109 I 2 = 78%); gastrointestinal malformations, OR = 4.50 (95% CI 0.63 to 32.20; P = 0.13; I 2 = 54%); musculoskeletal malformations, OR = 2.24 (95% CI 1.23 to 4.08; P= 0.009; I 2 = 0%); the VACTERL syndrome (Vertebral defects, Anal atresia, Cardiovascular anomalies, Tracheoesophageal fistula, Esophageal atresia, Renal anomalies, and Limb defects), OR = 7.47 (95% CI 2.92 to 19.07; P < 0.0001; I 2 = 0%).

Conclusions: This systematic review and meta-analysis shows that use of oral HPTs in pregnancy is associated with increased risks of congenital malformations.

Keywords: pregnancy; congenital malformations, hormones

Introduction

Oral hormone pregnancy tests (HPTs), such as Primodos (known as Duogynon in Germany), were available as injections from 1950 and in tablet form in the UK from 1956 onwards, before the modern forms of urine pregnancy tests became available 1. Oral HPTs contained ethinylestradiol and large doses of norethisterone (synthetic forms of estrogen and progesterone respectively), the latter in much larger amounts than those included in current combined oral contraceptives (see Table 1). The test principle was that they would induce bleeding similar to menstruation in those who were not pregnant.

Table 1. Doses of ethinylestradiol and norethisterone in various formulations of contraceptive steroids, ordered by increasing dose of norethisterone.

Indication (oral formulation) Ethinylestradiol dose Norethisterone acetate dose
Progestogen-only contraception * - 350 micrograms
Combined oral contraceptive (Loestrin-20) 20 micrograms 1000 micrograms
Combined oral contraceptive (Norimin) 35 micrograms 1000 micrograms
Biphasic combined oral contraceptive (BiNovum) 35 micrograms 500/1000 micrograms
Triphasic combined oral contraceptive (Synphase) 35 micrograms 500/1000/500 micrograms
Combined oral contraceptive (Loestrin-30) 30 micrograms 1500 micrograms
Oral hormone pregnancy test (Primodos) 20 micrograms 10 milligrams
In endometriosis, dysmenorrhoea, dysfunctional
uterine bleeding, and menorrhagia, or to delay
menstruation *
- 10–15 milligrams/day
Breast cancer * - 40 milligrams/day

*Unbranded

In the UK more than a million women took HPTs 2. However, evidence that they should not be used in pregnant women because of a risk of fetal malformations 3 led the then Committee on Safety of Medicines in 1975 to conclude that a warning should be added to the Data Sheets, stating that HPTs should not be taken during pregnancy. ( Supplementary File 1) Warnings about HPTs in pregnancy first emerged in 1956: 4 accumulating concerns over an increased risk of malformations led to their withdrawal in a number of countries at different times. Norway cancelled the indication in pregnancy for HPTs in 1970; when the UK did so in 1978, the manufacturers of Primodos, Schering AG (taken over by Bayer AG in 2008), voluntarily stopped marketing the product; in Germany, Duogynon was taken off the market in 1981 1.

Since Primodos was withdrawn, the discovery of previously confidential documents has led to renewed concerns about its potential to cause harm 5. In 2014, therefore, the Medicines and Healthcare products Regulatory Agency (MHRA) initiated a review, which was published in 2017 and reported that the evidence was insufficient, mixed, and too heterogeneous to support an association between oral HPTs and congenital malformations 3.

To date, there has been no systematic review and meta-analysis of oral HPTs, using all the available data, to assess the likelihood of an association. We have therefore performed a systematic review to obtain all relevant data on hormone pregnancy tests and congenital malformations, used meta-analytical tools to obtain summary estimates of the likelihood of an association, and assessed the potential biases in these estimates.

Methods

Data sources

Full details of our search strategy are provided in Supplementary File 2. We searched Medline, Embase, and Web of Science (which yielded German papers and conference abstracts) and searched for regulatory documents online, including the UK Government’s “Report of the Commission on Human Medicines’ Expert Working Group on Hormone Pregnancy Tests”, which includes the original Landesarchiv Berlin Files 3, and reference lists of retrieved studies from the start of the databases in 1946 to 20 February 2018.

We used the following search terms without date limits or language restrictions: (Primodos OR Duogynon OR "hormone pregnancy test" OR "sex hormones" OR "hormone administration" OR “norethisterone” OR “ethinylestradiol”) AND pregnancy AND (congenital OR malformations OR anomalies). Several comparable high-dose HPTs were available at the same time as Primodos; we performed additional searches for evidence relating to these (See Supplementary File 3 for List of HPTs included in evidence search).

Study selection

We included observational studies of women who were or became pregnant during the study and were exposed to oral HPTs within the estimated first three months of pregnancy and compared them with a relevant control group. When a study was described in more than one publication, we chose the publication that contained the most comprehensive data as the primary publication. We excluded studies where the intervention was oral hormones taken for other reasons (e.g., oral contraception) and it was not possible to extract data on hormone pregnancy tests. We did not restrict the language of publication. We checked additional relevant data and extracted them from the secondary publications when necessary.

Data extraction and risk of bias assessment

Two reviewers (CH and ES) applied inclusion and quality assessment criteria, compared results, and resolved discrepancies through discussion with the other authors. We used a review template to extract data on study type, numbers of pregnancies exposed and not exposed to oral HPTs, and types and numbers of outcomes. Where available, we extracted data about the women studied, including ascertainment of cases, age, parity, setting, exposure to other medications, and confounding variables. In case-control studies, if data were reported on more than one control group, we extracted data where possible for non-disease/non-abnormality controls, and combined control groups if necessary.

The primary outcome of interest was all major congenital malformations. We also categorized outcomes for the congenital anomaly in the offspring at any time into congenital cardiac, gastrointestinal, musculoskeletal, nervous system, and urogenital defects, and the VACTERL syndrome (Vertebral defects, Anal atresia, Cardiovascular anomalies, Tracheoesophageal fistula, Esophageal atresia, Renal anomalies, and Limb defects).

We assessed quality using the Newcastle–Ottawa Scale (NOS) for non-randomized studies included in systematic reviews 6. The scale assesses the selection of study groups (cases and controls), comparability of study groups, including cases and controls, and ascertainment of the outcome/exposure. Each positive criterion scores 1 point, except comparability, which scores up to 2 points. The maximum NOS score is 9, and we interpreted a score of 1 to 3 points as indicating a high risk of bias 7. To determine whether the study had controlled for the most important factors, we selected the items reported in the original paper and resolved disagreements through consensus, using a third author (IO). We examined whether there was a linear relation between methodological quality and study results, by plotting the odds ratios against the NOS scores, using Excel, and assessed the correlations of NOS scores with several confounding variables we collected 8.

Data synthesis and statistical methods

We calculated study-specific odds ratios for outcomes and associated confidence intervals. We meta-analysed the data using a random-effects model. We assessed heterogeneity across studies using the I 2 statistic and publication bias using funnel plots 9. We performed a sensitivity analysis by removing single studies to judge the stability of the effect and to explore the effect on heterogeneity 10, and we described any sources of variation. We also judged robustness by removing studies of low quality from the analysis. To examine whether the observed heterogeneity could be explained by differences in the NOS score, we also performed meta-regression using the NOS score as the covariate against the log OR as weights for traditional meta-regression using Stata version 14.

We planned subgroup analyses for the timing of administration of HPTs in relation to pregnancy and organogenesis and study design (case-control versus cohort) using Cochran’s Q test. We used RevMan v.5.3 for all analyses, except for meta-regression, for which we used Stata version 14. RevMan and Stata estimate the effects of trials with zero events in one arm by adding a correction factor of 0.5 to each arm (trials with zero events in both arms are omitted). We performed a sensitivity analysis by removing studies with zero events from the analyses.

We followed the reporting guidelines of the Meta-Analysis of Observational Studies in Epidemiology (MOOSE). A completed checklist is available as Supplementary File 4 11

Patient involvement

Members of the Association for Children Damaged by HPTs were involved in the original discussions of this review and provided input to the outcome choices, the search, the location of study articles, and translations. We plan to present the study findings to relevant patient groups and make available lay interpretations.

Results

Description of included studies

We retrieved 409 items for screening. After title and abstract screening and removal of duplicates (n = 18), we excluded 354 records as not being relevant to the aim of the review. We assessed the full texts of 37 articles and identified 24 articles for inclusion. Figure 1 shows the PRISMA flow diagram for the inclusion of studies.

Figure 1. Study flow diagram showing inclusion of relevant studies.

Figure 1.

The 24 included articles reported on 26 studies (16 case-control studies and ten prospective cohort studies); one article [Nora 78] included two case-control studies and one prospective study. We found no randomized controlled trials. Of these articles, two were unpublished reports (see Supplementary File 5 for full references). The studies included 71,330 women. The case-control studies included 28,761 mothers, 594 of whom were exposed to HPTs; the cohort studies included 42,569 mothers and 3,615 exposures to HPTs. The studies were published between 1972 and 2014, and all were performed either in Europe or the USA. They mostly recruited women and their infants at maternity centres or hospital paediatrics wards.

The choices of controls in the case-control studies varied; they included, at one extreme, healthy infants born on a date close to the case infants and, at the other extreme, infants with malformations other than those under investigation. Among the prospective cohort studies, the populations tended to be women recruited at antenatal clinics or birth centres (See Table 2. Characteristics of included studies).

Table 2. Characteristics of included studies.

Study ID Study population Setting Confounding variables collected Information on controls including
matching criteria in case-control
studies
Outcomes reported
Case-control studies
Ferencz 1980 Mothers of 110 infants
with conotruncal
abnormalities of the
heart, born 1972–75.
Hospitals served by the
Maryland State Intensive
Care Neonatal Program,
USA.
Maternal health (hospitalisations,
illnesses, treatments); past reproductive
history; index pregnancy factors
including contraception used previously,
fertility treatments, symptoms, illnesses
and medications during pregnancy
including hormones; smoking; alcohol
intake; occupational history of mother
and father; exposure of mother to
fumes, paints and insecticides; family
history including history of congenital
abnormalities in previous children or in
close relatives.
For each case, three unaffected
controls were chosen from the
birth population: two matched on
eight characteristics related to
the likelihood of hormone-taking
(race, maternal age, parity, foetal
losses, gestational age, delivery
mode, time of prenatal registration,
private service), and one also on
the infant’s sex and birthweight; the
third control was chosen at random.
Congenital heart
disease (Conotruncal
malformations of the
heart)
Gal 1972 100 mothers of infants
with spina bifida, and
controls
Hospital in London, UK,
for cases; unclear where
controls were recruited
from
Age, parity, reproductive history,
illnesses, illegitimacy, bleeding
Controls matched for week of
baby's birth; age of mother (5-year
bands), reproductive history, course
of pregnancy, sex of baby.
Spina bifida
Greenberg 1977 Cases identified via
OPCS and matched
controls identified from
general practices of the
cases.
General practices in
the UK
Antenatal, personal, and family history
and drugs prescribed during the first
trimester.
Controls: babies born within 3
months of and based at the same
general practice as matched cases.
Antenatal, personal, and family
history and drugs prescribed during
the first trimester.
Neural tube defects, oral
clefts, limb malformations
and other non-minor
abnormalities
Janerich 1974 108 cases of congenital
limb defects and 108
unaffected controls
New York State, USA Age, parity, race Controls matched on birth date,
mother's race and age +/- 2 years;
and by default, due to adjacent
records for cases and controls
these matched well on county of
residence of the mothers.
Congenital limb defects
Janerich 1977 104 cases with birth
certificate mentioning
CHD, 104 matched
controls
New York State, USA Age, country of residence, date of birth,
race, medications, infections
From adjacent birth record
matched by mother’s age, county of
residence, date of birth, race
Congenital heart disease
Lammer 1986 1,091 mothers of infants
with abnormalities born 1
July 190 to 20 June 1979,
(21% not completed data
collection)
Population register Race, maternal education, family history,
socio-economic status, parity, previous
foetal loss
Control group was composed of
infants with malformations other
than the one under investigation.
e.g. for spina bifida, controls
were those with non-spina bifida
abnormalities.
Major malformations,
including anencephaly,
spina bifida, cleft lip, cleft
palate, Down syndrome,
oesophageal atresia,
small bowel atresia,
rectal anal atresia,
anterior abdominal wall
defects, diaphragmatic
hernia, limb reduction.
Laurence 1971 1968-1970, UK 3 hospital birth centres
in the UK
Non-reported In London the controls were the
next baby with no abnormality born
in the same hospital; in Exeter,
control mothers were matched for
area of birth, parity and month of
conception; in Wales the control
mothers were those who had had
one baby with spina bifida or
anencephaly and had a subsequent
unaffected birth during the study
period; these last were not matched
individually.
Spina bifida and
anencephaly
Levy 1973 76 cases, 76 controls Hospital, Montreal,
Canada
Non-reported Controls were infants with
Mendelian disorders, matched for
date of birth.
Congenital heart defects
(transposition of the great
vessels)
Nora 1975 15 patients with multiple
congenital anomalies.
30 controls (15 with
chromosomal anomalies,
15 with functional heart
murmurs)
University of Colorado
Medical Center, Denver,
and affiliated hospitals,
USA
Age, race, socioeconomic status, area of
residence
Matched for age. 15 controls had
chromosomal abnormalities, 15 had
functional heart murmurs
VACTERL
Nora 1978 case
control 1
32 patients with
VACTERL, 60 controls
Hospital Age, date of birth, sex, gestational age,
race, socioeconomic levels, areas of
residences, parity
Matched as closely as possible for
age, date of birth, sex, gestational
age, race, socioeconomic levels,
areas of residences, parity
VACTERL
Nora 1978 case
control 2 and 3
236 patients with full
variety of cardiac lesions,
412 controls with known
single mutant gene and
chromosomal disorders
Hospital Sex, race, approximate date of birth,
area of residence
Matching was for sex, race,
approximate date of birth, area of
residence
Congenital heart disease
(congenital heart lesions)
Polednak 1983 99 singleton male births
with hypospadias and 99
matched controls
New York State, USA Parity, maternal age, race, area of
residence
Most adjacent birth date, matched
for maternal age, race, area of
residence
Hypospadias
Rothman 1979 390 cases, 1,254
controls. HPTS: 14/388
cases vs 35/1246
controls
State care service for
congenital heart disease
Parity, mother's education level, insulin
use, alcohol, tobacco
Controls: births within same 3
years of the study period; 1,254
respondents from contacts to births
selected randomly from the birth
register.
Congenital heart disease
Sainz 1987 244 cases identified via
the national collaboration
of 42 hospitals registering
congenital abnormalities
between April 1976 to
Sept 1984
Spanish register of
congenital abnormalities
within 42 participating
hospitals
Sex, data and place of birth Controls: unaffected births at same
hospital, matched on sex, date of
birth.
Spina bifida and
anencephaly
Tummler 2014 296 cases, 3,676 infants
with abnormalities
Data from the
Malformation Monitoring
Centre Saxony-Anhalt,
Germany.
Non-reported No information on matching Congenital bladder
exstrophy
Cohort studies
Fleming 1978 RCGP Outcomes of
Pregnancy study 1975:
9,000 women; from
this was selected a
random sample of 500
pregnancies proceeding
to normal outcomes
General practices, UK Non-reported Any malformation
Goujard 1979 3,379 women pregnant
and attending
gynaecology clinics
between 1975 to 1977
Obstetrics and
gynaecology centres,
Paris and Lille, France
Information on current pregnancies
including symptoms and medications
taken; previous pregnancies and general
health backgrounds.
Congenital
malformations, also
congenital heart defects,
skeletal anomalies,
microencephaly.
Hadjigeorgiou
1982
Retrospective cohort,
Alexandra Maternity
Hospital Greece, births
1975-77. 15,535 live
births, 559 exposed
to sex hormones of
which 112 (20%)
exposed to HPTs,
14,976 no hormones,
congenital heart disease
studied confirmed
by cardiologist & lab
tests. Diseases and
medication reported at
admission prior to birth.
Hospital birth centre Cytomegalovirus, infection,
toxoplasmosis, hepatitis, syphilis, rubella,
teratogenic drugs
Congenital heart disease
Haller 1974 3588 pregnant women,
recruited Oct 1969 to
April 1972, University
Hospital Göttingen; 617
(17.2 %) with hormonal
pregnancy test
Hospital birth centre Non-reported Congenital malformations
Kullander 1976 6,376 pregnancies,
Malmo, 1963-5,
resulting in 5,753 live
births, 5,002/5,753 no
abnormality, 751/5,753
with abnormality. 156
women took Primodos.
Sweden Major and minor disease; the woman's
age, parity, marital status, and social
class. Birth weight, placental weight.
Major and minor
malformations
Meire 1978 500 mothers consecutive
births in 3 hospitals in
Bruges, Belgium, 20 had
taken HPTs.
Hospital birth centres Non-reported Oesophageal atresia
Michaelis 1983 13,643 pregnancies Antenatal clinics,
Germany
Detailed general and gynaecological
history, drug intake, exposure to
chemical agents, daily workload,
intercurrent diseases, accidents, surgical
operations and other factors.
Major malformations
Roussel 1968 Pregnancies 1966 to
1967
General practices, UK NR Central nervous
system malformations
including anencephaly,
hydrocephaly,
microcephaly,
meningomyelocele,
myelocele, spina bifida.
Rumeau-
Rouquette 1978
1963-69, recruitment in
12 gynaecology clinics in
Paris; 12,764 women gave
birth to 12,895 children
in hospitals participating
in study; controls were
mothers of unaffected
infants selected at random
among women questioned
in same hospital
Hospital birth centres,
France
Medical history, course of pregnancy,
infectious diseases, inoculations,
reproductive history, social and
occupational category, use of alcohol,
tobacco
Congenital malformations
Torfs 1981 19,906 full term
pregnancies, 227 of
which exposed to HPTs.
Hospital birth centre Age, medical and reproductive history,
socio- economic information, ethnicity
Severe congenital
anomalies including
congenital heart
defects, neuroblastoma,
cleft lip and limb
reduction; non-severe
congenital anomalies
e.g. hypospadias of the
first degree, congenital
dis-location of the hip,
polydactyly.

Quality assessment of included studies

Of the 26 included studies, three were assigned a NOS score of 3 or below and were therefore judged as being at high risk of bias. One was a case-control study (Laurence 1971, a published abstract as a letter) and two were cohort studies (Fleming 1978 and Haller 1974, both unpublished). The NOS scores ranged from 2 to 9 (median 5). Twelve of the 26 included studies scored 7 to 9 and were judged to be at low risk of bias (see Table 3 of NOS scores in the data files). Item 5 of the NOS score addresses comparability of cases and controls based on design or analysis. Of the 16 case control studies, 12 controlled for the most important factor (item 5a) and nine controlled for important additional factors (item 5b). Of the ten cohort studies, six controlled for the most important factor (item 5a) and four controlled for important additional factors (item 5b). The mean Newcastle–Ottawa scale score was 6.1, indicating an overall moderate risk of bias. Table 2 also shows that seven studies did not report the confounding variables collected (Laurence 1971; Levy 1973; Tummler 2014; Fleming 1978; Haller 1974; Moire 1978; Rousel 1968). NOS scores correlated with the increasing number of confounding variables collected (r = 0.83). Supplementary File 6 shows the funnel plots for all congenital malformations and congenital heart disease; because of inadequate numbers of included studies, we did not use more advanced statistical methods to assess publication bias.

Table 3. Newcastle-Ottawa scale scores for included studies.

Newcastle–Ottawa scale case-control studies
Comparability of cases and controls
on the basis of the design or analysis
Study ID Is the case
definition
adequate?
Are the cases
representative?
Selection
of controls
adequate
Definition
of controls
adequate
a) Study
controls for the
most important
factor
b) Study controls
for important
additional factors
Ascertainment
of exposure
adequate
Same method of
ascertainment
for cases and
controls
Non-response
rate adequate
Total
score /9
Ferencz 1980 yes yes yes yes yes yes yes yes yes 9
Gal 1972 unclear unclear yes yes yes yes yes yes unclear 6
Greenberg 1977 yes yes yes yes yes yes unclear yes yes 8
Janerich 1974 no yes yes no yes no yes yes unclear 5
Janerich 1977 yes yes unclear yes yes yes yes unclear yes 7
Hellstrom 1976 yes unclear no yes yes no unclear yes unclear 4
Lammer 1986 yes yes unclear yes yes yes yes yes no 7
Laurence 1971 yes unclear unclear yes no unclear unclear unclear unclear 2
Levy 1973 yes yes no yes yes no unclear unclear unclear 4
Nora 1975 yes yes yes yes yes no yes yes yes 8
Nora 1978
case control 1
yes yes yes no yes yes yes yes yes 8
Nora 1978
case control 2
and 3
yes yes yes no yes yes yes yes unclear 7
Polednak 1983 yes yes yes yes yes no unclear yes unclear 6
Rothman 1979 yes yes no no no yes yes yes unclear 5
Sainz 1987 unclear yes yes unclear yes yes unclear yes unclear 5
Tummler 2014 yes no no no no no no yes yes 3
Newcastle–Ottawa scale cohort studies
Comparability of cases
and controls on the
basis of the design or
analysis
Are the participants
representative
of the exposed
cohort?
Is the non-
exposed cohort
similar to the
exposed?
Is the
ascertainment
of exposure
accurate?
Is there
evidence that
the outcome
of interest was
not present at
start of study?
a) Study
controls
for the
most
important
factor
b) Study
controls for
important
additional
factors
Was the
assessment
of outcome
adequate?
Was follow-up
long enough
for outcomes
to occur?
Was there
adequacy
of follow
up of
cohorts?
Total
score /9
Fleming 1978 yes unclear unclear yes unclear unclear unclear yes unclear 3
Goujard 1979 yes yes unclear yes yes no unclear yes yes 6
Hadjigeorgiou
1982
yes yes yes yes no yes yes yes unclear 7
Haller 1974 unclear unclear unclear yes no no unclear yes unclear 2
Kullander
1976
yes yes yes yes yes unclear yes yes yes 8
Meire 1978 yes no yes yes yes no unclear yes yes 6
Michaelis 1983 yes yes yes yes yes yes yes yes yes 9
Roussel 1968 yes yes yes yes no no unclear yes yes 6
Rumeau-
Rouquette
1978
yes unclear yes yes yes yes yes yes no 7
Torfs 1981 yes yes yes yes yes yes yes yes yes 9

Association of exposure to HPT with the risks of malformations

Nine studies, including 61,642 mothers of infants and 3,274 exposed to HPTs, examined the association in pregnancy with all congenital malformations. Two were case-control studies (Greenberg 1977; Sainz 1987) and seven were cohort studies (Fleming 1987; Goujard 1979; Haller 1974; Kullander 1976; Michaelis 1983; Rumeau-Rouquette 1978; Torfs 1981) ( Figure 2). Exposure to oral HPTs was associated with a 40% increased risk of all congenital malformations: pooled odds ratio (OR) = 1.40 (95% CI 1.18 to 1.66; P < 0.0001; I 2 = 0%). For the two case-control studies only, pooled OR = 1.70 (95% CI 1.01 to 2.86; P = 0.04; I 2 = 63%) and for the seven cohort studies, pooled OR = 1.28 (95% CI 1.05 to 1.56; P = 0.02; I 2 = 0%). The test for subgroup differences was not significant (P = 0.32). In a post-hoc sensitivity analysis, removing the studies that collected no confounding variables (Haller 74 and Fleming 78, both of low quality) did not affect the significance of the result (OR 1.44; 95% CI 1.18 to 1.75; P = 0.0004, I 2 = 11%). The meta-regression showed no association between total NOS score and increased risk (P = 0.51).

Figure 2. Association of exposure to oral HPTs in pregnancy with all malformations in the offspring.

Figure 2.

Seven studies, including 19,267 mothers of infants and 218 exposed to oral HPTs, analysed congenital heart malformations. Five were case-control studies (Ferencz 1980; Janerich 1977; Levy 1973; Nora 1978-2/3) and two were cohort studies (Hadjigeorgiou 1982; Torfs 1981) ( Figure 3). The pooled relative OR = 1.89 (95% CI 1.32 to 2.72; P = 0.0006; I 2 = 0%).

Figure 3. Association of exposure to oral hormone pregnancy tests (HPTs) in pregnancy with congenital heart disease in the offspring.

Figure 3.

In a post-hoc sensitivity analysis, removing one study that collected no confounding variables (Levy 73, a low-quality study) did not affect the significance of the result (OR = 1.88; 95% CI 1.25 to 2.85; P = 0.003, I 2 = 12%) For the five case-control studies only, the pooled OR = 1.87 (95% CI 1.23 to 2.85; P = 0.004; I 2 = 9%); for the two cohort studies the pooled OR = 1.95 (95% CI 0.44 to 8.69; P = 0.38; I 2 = 32%). The meta-regression was not significant (P = 0.94).

For the association between exposure to oral HPTs and nervous system malformations in the offspring, five studies provided data: three case-control studies (Gal 1972; Laurence 1971; Sainz 1987) and two cohort studies (Roussel 1968; Torfs 1981), including 12 486 mothers of infants and 127 exposed ( Figure 4). The pooled OR = 2.98 (95% CI 1.32 to 6.76; P = 0.009; I 2 = 78%). In a post-hoc sensitivity analysis, removing the two studies that collected no confounding variables (Laurence 71; Roussel 68) did not affect the significance of the result and removed the heterogeneity (OR 6.04; 95% CI 3.33 to 10.78; P < 0.00001, I 2 = 0%).

Figure 4. Association of exposure to oral hormone pregnancy tests (HPTs) in pregnancy and nervous system malformations in the offspring.

Figure 4.

Gastrointestinal malformations and exposure to oral HPTs were reported in three studies: a case-control study (Lammer 1986) and two cohort studies (Meire 1978 and Torfs 1981), providing data on 2,722 mothers of infants, including 79 exposed to HPTs ( Figure 5). The pooled OR = 4.50 (95% CI 0.63 to 32.20; P = 0.13; I 2 = 54%). One case-control study (Polednak 1983) and one cohort study (Torfs 1981) examined the relationship between exposure to oral HPTs in pregnancy and urogenital malformations: pooled OR = 2.63 (95% CI 0.84 to 8.28; P = 0.10; I 2 = 0%) ( Figure 6).

Figure 5. Association of exposure to oral hormone pregnancy tests (HPTs) in pregnancy and gastrointestinal malformations in the offspring.

Figure 5.

Figure 6. Association of exposure to oral hormone pregnancy tests (HPTs) in pregnancy and urogenital malformations in the offspring.

Figure 6.

A relation between the exposure to oral HPTs and musculoskeletal malformations was reported in three studies: three case-control studies (Hellstrom 1976; Janerich 1977; Lammer 1986) and one cohort study (Torfs 1981) ( Figure 7), based on 2,464 women, with 79 exposed to HPTs. The pooled OR = 2.24 (95% CI 1.23 to 4.08; P = 0.009; I 2 = 0%). Removal of the zero study events (Torfs 1981) did not affect this result. The association of VACTERL with HPT exposure was reported in two case-control studies (Nora 1978-1 and Nora 1975), based on 135 women and infants and 27 exposed to HPTs; the OR was 7.57 (95% CI 2.92 to 19.07; P < 0.0001; I 2 = 0%) ( Figure 8).

Figure 7. Association of exposure to oral hormone pregnancy tests (HPTs) in pregnancy and musculoskeletal malformations in the offspring.

Figure 7.

Figure 8. Association of exposure to oral hormone pregnancy tests (HPTs) in pregnancy with Vertebral defects, Anal atresia, Cardiovascular anomalies, Tracheoesophageal fistula, Esophageal atresia, Renal anomalies, and Limb defects (VACTERL) syndrome in the offspring.

Figure 8.

Study extraction sheet

Copyright: © 2019 Heneghan C et al.

Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

Discussion

We found 24 articles containing 26 studies that reported the association between exposure to oral hormone pregnancy tests in mothers and malformations in their infants: 16 were case-control studies and ten were prospective cohort studies. The overall quality of the evidence, assessed by the Newcastle–Ottawa Scale, was moderate.

We found significant associations for all congenital malformations pooled and separately for congenital heart malformations, nervous system malformations, musculoskeletal malformations, and the VACTERL syndrome. Many of these pooled analyses had zero heterogeneity, and the direction of effect favoured the controls in 30 of the 32 analyses undertaken (Torfs 81 provided the only effect estimate favouring HPT exposure). The analyses were also robust to sensitivity analyses, and there was no relation between NOS score and increasing risk.

Based on the assumptions that a teratogenic effect of HPTs would be mediated by actions on estrogen and progestogen receptors, and that concentrations of ethinylestradiol and norethisterone in the fetus would be too low to have a significant effect on those receptors, it has been suggested that there is no mechanistic argument for teratogenicity 3. However, other unknown mechanisms might be at play. For example, Isabel Gal first reported concerns of malformations in the children of mothers exposed to HPTs in 1967 12, pointing out that bleeding often occurred in pregnant women soon after exposure and suggesting that that would affect the “equilibrium” of the uterus. Between 5 and 11% of exposed women had bleeding, and the RCGP survey reported induced abortions in about 10% of women 13.

The drugs in Primodos were not tested for animal toxicity and teratogenicity at the time, which, although not unusual, meant that there was a gap in mechanistic understanding. A 2018 study showed that the components in Primodos are associated with dose-dependent and time-related damage in zebrafish embryos, and affect nerve outgrowth and blood vessel patterning in zebrafish 12, 14. Although it is difficult to compare drug actions between species, and evidence from animal studies is limited, the drugs accumulated in the zebrafish embryos, persisted for some time, and led to rapid embryonic damage 12, 14. In contrast, other animal studies have shown minimal effects on embryo development 15. There is also evidence that estradiol and progestogens increase the expression of mRNA for isoforms of vascular endothelial growth factor (VEGF) in Ishikawa cells from human endometrial adenocarcinoma 16.

Strengths and weaknesses

Establishing causal associations in the absence of randomization can be difficult. However, the lack of randomized trials in our analysis should not be seen as a barrier to interpreting our findings. It would have been unethical to randomize individuals to drugs with known concerns, and randomization, like systematic reviews, was not the norm at the time. Furthermore, for questions about harms, the Oxford CEBM levels of evidence puts systematic reviews of case-control studies on a par with systematic reviews of randomized trials 17.

However, observational methods have limitations 18. First, interpretation can be affected by confounding factors. Although most of the studies in this review used matched controls, our analysis was based on raw data from the publications and did not adjust for confounders. Secondly, susceptibility bias can occur, as women with threatened abortions might be more likely to present and take the medication. Both of these problems can be mitigated by careful matching; 13 of the 16 studies controlled for the most important factor, item 5a on the NOS scale. Thirdly, the severity of malformations studied will have led to differing risk estimates across studies. Fourthly, inappropriate methods of ascertainment of the malformations and exposures could have introduced bias. Finally, incomplete and uneven reporting, along with publication bias (since it is likely that unreported studies exist) could introduce bias and alter the effect estimates.

The use of scoring systems to assess quality has been criticized. However, the NOS scale has been used widely in assessing the quality of non-randomized studies 1924. A NOS score between 0 and 9 has previously been used as a potential moderator in meta-regression 25, and has been recommended by the Cochrane Collaboration 26. A weakness of the NOS scale is the possible low agreement between assessors 27. This was particularly the case when authors had limited experience in doing systematic reviews, but training, even of novices, improves agreement 19.

The effects were also stable to sensitivity analyses, and changes in NOS score did not affect the risk estimates. The absence of subgroup differences between study designs for the risk estimates supports the robustness of the findings. We also tried to overcome publication bias by translation and assessment of unpublished data. The sample sizes in the studies for all congenital malformations, congenital heart disease, and nervous system malformations were sufficiently large to suggest that small unpublished studies would have little effect on the estimates unless they were highly heterogeneous. The analyses of gastrointestinal, urogenital, musculoskeletal, and VACTERL malformations were limited by their small sample sizes and low number of events: the interpretation of these effects should, therefore, be treated more cautiously. The significant effect observed for VACTERL should also be treated cautiously, as the confidence intervals for this effect were wide.

Our study has several strengths. We used standard systematic review methods, and by asking a focused question solely on exposure to HPTs, and excluding exposure to other hormones, we have been able to assess the heterogeneity of the effect estimates. However, as with any observational studies, there is always the possibility that an unknown confounder could be the cause of the observed difference. While such a possibility cannot be ruled out, the lack of heterogeneity means that such a confounder would potentially have to act in the same direction, despite many different confounders being collected and controlled for. Confounding factors with variable effects on the effect estimates would have probably led to a high degree of heterogeneity, which would have prevented pooling; this was not the case.

Conclusion

Regulators were first made aware of the link between exposure to HPTs and congenital malformations in 1967. After 1975, the Primodos label was changed to state that the medication should not be used in pregnancy because of a risk of malformations (see Figure 9). The evidence of an association has previously been deemed weak, and previous litigation and reviews have been inconclusive. However, we believe that this systematic review shows an association of oral HPTs with congenital malformations.

Figure 9. Primodos label 1975 and 1978.

Figure 9.

Our results show the benefit of undertaking systematic reviews, a study type not in routine use when most of these studies were done. For example, only one study (Greenberg 1997) out of nine reported a significant effect for all congenital malformations; the pooled estimate was significant. Much of the discussion over the associations of HPTs with congenital malformations at the time these studies were published focused on the lack of significance of individual studies 28, although it was also recognized that the numbers involved were insufficient to reject the hypotheses 29.

Declarations

Data availability

F1000Research: Dataset 1. Study extraction sheet, https://dx.doi.org/10.5256/f1000research.16758.d233494 30

Funding Statement

The Evidence Synthesis Working Group is funded by the National Institute for Health Research School for Primary Care Research (NIHR SPCR) [Project Number 390]. The views expressed are those of the author(s) and not necessarily those of the NIHR, the NHS or the Department of Health.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 2; referees: 3 approved]

Supplementary material

Supplementary File 1. Committee on Safety of Medicines report 1975-colour pdf.

Supplementary File 2. Search Strategy.

Supplementary File 3. List of hormone pregnancy tests (HPTs) included in evidence search.

Supplementary File 4. Completed Meta-analysis Of Observational Studies in Epidemiology (MOOSE) checklist.

Supplementary File 5. Full references for included studies.

Supplementary File 6. Funnel Plots.

References

  • 1. Olszynko-Gryn J, Bjørvik E, Weßel M, et al. : A historical argument for regulatory failure in the case of Primodos and other hormone pregnancy tests. Reprod Biomed Soc Online. 2018;6:34–44. 10.1016/j.rbms.2018.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Website: Medicines and Healthcare products Regulatory Agency. 2015 Call for Evidence.(accessed 1 Jun 2018). Reference Source [Google Scholar]
  • 3. Report of the Commission on Human Medicines’ Expert Working Group on Hormone Pregnancy Tests.GOV.UK. (accessed 24 May 2018). Reference Source [Google Scholar]
  • 4. Britton HG: Pregnancy test. Br Med J. 1956;2:419 10.1136/bmj.2.4989.419 [DOI] [Google Scholar]
  • 5. Review launched to respond to patient concerns about NHS treatments.(accessed 2/08/2018). Reference Source [Google Scholar]
  • 6. Ottawa Hospital Research Institute. (accessed 1 Jun 2018). Reference Source [Google Scholar]
  • 7. Lunny C, Knopp-Sihota JA, Fraser SN: Surgery and risk for multiple sclerosis: a systematic review and meta-analysis of case-control studies. BMC Neurol. 2013;13:41. 10.1186/1471-2377-13-41 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Ernst E, Pittler MH: Re-analysis of previous meta-analysis of clinical trials of homeopathy. J Clin Epidemiol. 2000;53(11):1188. 10.1016/S0895-4356(99)00226-7 [DOI] [PubMed] [Google Scholar]
  • 9. Egger M, Smith GD: meta-analysis bias in location and selection of studies. BMJ. 1998;316:61–6. 10.1136/bmj.316.7124.61 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Higgins JP, Thompson SG: Quantifying heterogeneity in a meta-analysis. Stat Med. 2002;21(11):1539–58. 10.1002/sim.1186 [DOI] [PubMed] [Google Scholar]
  • 11. Stroup DF, Berlin JA, Morton SC, et al. : Meta-analysis of observational studies in epidemiology: a proposal for reporting. Meta-analysis Of Observational Studies in Epidemiology (MOOSE) group. JAMA. 2000;283(15):2008–12. 10.1001/jama.283.15.2008 [DOI] [PubMed] [Google Scholar]
  • 12. Gal I, Kirman B, Stern J: Hormonal pregnancy tests and congenital malformation. Nature. 1967;216(5110):83. 10.1038/216083a0 [DOI] [PubMed] [Google Scholar]
  • 13. Gal I: Risks and benefits of the use of hormonal pregnancy test tablets. Nature. 1972;240(5378):241–2. 10.1038/240241a0 [DOI] [PubMed] [Google Scholar]
  • 14. Brown S, Fraga LR, Cameron G, et al. : The Primodos components Norethisterone acetate and Ethinyl estradiol induce developmental abnormalities in zebrafish embryos. Sci Rep. 2018;8(1):2917. 10.1038/s41598-018-21318-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Maier WE, Herman JR: Pharmacology and toxicology of ethinyl estradiol and norethindrone acetate in experimental animals. Regul Toxicol Pharmacol. 2001;34(1):53–61. 10.1006/rtph.2001.1483 [DOI] [PubMed] [Google Scholar]
  • 16. Archer DF, Navarro FJ, Leslie S, et al. : Effects of levonorgestrel, medroxyprogesterone acetate, norethindrone, and 17beta-estradiol on vascular endothelial growth factor isomers 121 and 165 in Ishikawa cells. Fertil Steril. 2004;81(1):165–70. 10.1016/j.fertnstert.2003.05.028 [DOI] [PubMed] [Google Scholar]
  • 17. OCEBM Levels of Evidence. CEBM.2016; (accessed 29 May 2018). Reference Source [Google Scholar]
  • 18. Sackett DL: Bias in analytic research. J Chronic Dis. 1979;32(1–2):51–63. 10.1016/0021-9681(79)90012-2 [DOI] [PubMed] [Google Scholar]
  • 19. Oremus M, Oremus C, Hall GB, et al. : Inter-rater and test-retest reliability of quality assessments by novice student raters using the Jadad and Newcastle-Ottawa Scales. BMJ Open. 2012;2(4): pii: e001368. 10.1136/bmjopen-2012-001368 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Wallis CJ, Mahar AL, Choo R, et al. : Second malignancies after radiotherapy for prostate cancer: systematic review and meta-analysis. BMJ. 2016;352:i851. 10.1136/bmj.i851 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Croles FN, Nasserinejad K, Duvekot JJ, et al. : Pregnancy, thrombophilia, and the risk of a first venous thrombosis: systematic review and bayesian meta-analysis. BMJ. 2017;359:j4452. 10.1136/bmj.j4452 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Hackshaw A, Morris JK, Boniface S, et al. : Low cigarette consumption and risk of coronary heart disease and stroke: meta-analysis of 141 cohort studies in 55 study reports. BMJ. 2018;360:j5855. 10.1136/bmj.j5855 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Boland MV, Ervin AM, Friedman DS, et al. : Comparative effectiveness of treatments for open-angle glaucoma: a systematic review for the U.S. Preventive Services Task Force. Ann Intern Med. 2013;158(4):271–9. 10.7326/0003-4819-158-4-201302190-00008 [DOI] [PubMed] [Google Scholar]
  • 24. Harausz EP, Garcia-Prats AJ, Law S, et al. : Treatment and outcomes in children with multidrug-resistant tuberculosis: A systematic review and individual patient data meta-analysis. PLoS Med. 2018;15(7):e1002591. 10.1371/journal.pmed.1002591 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Veronese N, Cereda E, Solmi M, et al. : Inverse relationship between body mass index and mortality in older nursing home residents: a meta-analysis of 19,538 elderly subjects. Obes Rev. 2015;16(11):1001–1015. 10.1111/obr.12309 [DOI] [PubMed] [Google Scholar]
  • 26. Bae JM: A suggestion for quality assessment in systematic reviews of observational studies in nutritional epidemiology. Epidemiol Health. 2016;38:e2016014. 10.4178/epih.e2016014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Hartling L, Milne A, Hamm MP, et al. : Testing the Newcastle Ottawa Scale showed low reliability between individual reviewers. J Clin Epidemiol. 2013;66(9):982–93. 10.1016/j.jclinepi.2013.03.003 [DOI] [PubMed] [Google Scholar]
  • 28. Laurence KM: Reply to Gal. Nature. 1972;240:242 10.1038/240242a0 [DOI] [Google Scholar]
  • 29. Torfs CP, Milkovich L, van den Berg BJ, et al. : The relationship between hormonal pregnancy tests and congenital anomalies: a prospective study. Am J Epidemiol. 1981;113(5):563–74. 10.1093/oxfordjournals.aje.a113133 [DOI] [PubMed] [Google Scholar]
  • 30. Heneghan C, Aronson JK, Spencer E, et al. : Dataset 1 in: Oral hormone pregnancy tests and the risks of congenital malformations: a systematic review and meta-analysis. F1000Research. 2019;17:1725 10.5256/f1000research.16758.d233494 [DOI] [PMC free article] [PubMed] [Google Scholar]
F1000Res. 2018 Dec 14. doi: 10.5256/f1000research.18318.r40887

Referee response for version 1

Olalekan A Uthman 1

The manuscript reads well. The manuscript was well written. It has the potential to add to the body of knowledge in the field. The method was described in detail to allow for replication of the study. I have no major concerns.

My only comment: the author should change the abstract conclusion to: “This systematic review and meta-analysis of observational studies shows that the use of oral HPTs in pregnancy is associated with increased risks of congenital malformations”.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2019 Jan 16.
Carl Heneghan 1

Many thanks for these positive comments. The methods and results are clear about the type of studies included, and we consider the conclusions do not require the addition of the study type. This is in line with previous systematic reviews our group has published.

F1000Res. 2018 Nov 29. doi: 10.5256/f1000research.18318.r40291

Referee response for version 1

Jesse Olszynko-Gryn 1, Cyrille Jean 2

This is a timely and much-needed paper that deserves to be widely read and cited. It provides the first systematic review and meta-analysis of old epidemiological data pointing towards a long-acknowledged association between HPTs and birth defects. Most of the paper is devoted to apparently rigorous statistical analysis. We leave constructive criticism of the statistics to other, more appropriately qualified reviewers. Instead, we confine our comments to the historical context and factual details presented in the paper. These, on the whole, are entirely satisfactory. But some minor errors — that do not significantly detract from the overall argument — should be amended:

 

  1. 'Oral hormone pregnancy tests (HPTs), such as Primodos, containing ethinylestradiol and high doses of norethisterone, were given to over a million women from 1958 to 1978’ (p. 1). 

    It is worth clarifying that HPTs were available as injections from 1950 and in tablet form (e.g., Schering’s Orasecron, Roussel’s Amenorone Forte), in the UK, from at least 1956. See, for example, Britton (1956 1); and  https://archive.org/details/b19974760M4180/page/n45?q=amenorone+1956. For an extended discussion, see Olszynko-Gryn (2014), available for download here. Furthermore, not all HPTs contained norethisterone; different companies used other types of synthetic progesterone, and the same goes for ethinylestradiol.

  2. ‘Oral hormone pregnancy tests (HPTs), such as Primodos (known as Duogynon in Germany), were used from 1958 to 1978, before urine pregnancy tests were available’ (p. 3).

    Contrary to popular belief, urine pregnancy tests were in fact widely though unevenly available between 1958 and 1978 and HPTs were never the dominant method of pregnancy testing. For a detailed timeline of pregnancy testing in the UK, please see Olszynko-Gryn et al. (2018 2), esp. pp. 35-36. It would also be helpful to clarify that HPTs were removed from UK market in 1978, but earlier and later elsewhere. See Olszynko-Gryn et al. (2018 2) for details (pp. 41-42).

  3. ‘The test principle was that menstruation would be induced in those who were not pregnant’ (p.3).

    At the time HPTs were variously described as ‘clinical’, ‘hormonal’, or ‘withdrawal bleeding’ pregnancy tests and it would be more precise to refer to their effect as inducing  menstrual-like withdrawal bleeding, which is not identical to menstruation.

  4. Worth mentioning that Gal 1967, though a highly significant intervention, was not the first published warning against HPTs; these began to appear as early as 1956, in response to marketing literature aimed at GPs. See Britton (1956 1) and Olszynko-Gryn et al. (2018 2), p. 36.

  5. ‘However, we believe that this systematic review shows an association of oral HPTs with congenital malformations’ (p. 17).

    More optionally, the authors might consider reflecting on the extent to which the association they identify implies a  causal association. An association between the use of HPTs and birth defects has long been recognised and was rarely in dispute. Many experts explained the association in terms of a suspected though as unknown direct mechanistic effect of HPTs on the developing human embryo. Others, however, preferred to explain the association in terms of underlying factors, e.g., a patient history of miscarriage or birth defects. This view, which still has traction in some quarters, is discussed to some extent in Olszynko-Gryn et al. (2018 2) (pp. 39-41). The authors might usefully offer a fresh perspective based on their findings, in the Conclusion and/or in the interesting discussion of unknown mechanisms on p. 16.

We have read this submission. We believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

References

  • 1. Britton H: Pregnancy Test. BMJ.1956;2(4989) : 10.1136/bmj.2.4989.419 10.1136/bmj.2.4989.419 13329365 [DOI] [Google Scholar]
  • 2. Olszynko-Gryn J, Bjørvik E, Weßel M, Jülich S, Jean C: A historical argument for regulatory failure in the case of Primodos and other hormone pregnancy tests. Reprod Biomed Soc Online.2018;6: 10.1016/j.rbms.2018.09.003 34-44 10.1016/j.rbms.2018.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
F1000Res. 2019 Jan 16.
Carl Heneghan 1

Many thanks for these positive comments. 

We have amended the introduction with the following text: 'Oral hormone pregnancy tests (HPTs), such as Primodos (known as Duogynon in Germany), were available as injections from 1950 and in tablet form in the UK from 1956 onwards, before the modern forms of urine pregnancy tests became available [1]’

We have cited: Olszynko-Gryn J, Bjørvik E, Weßel M, Jülich S, Jean C. A historical argument for regulatory failure in the case of Primodos and other hormone pregnancy tests. Reprod Biomed Soc Online. 2018 Oct 23;6:34-44. Doi: 10.1016/j.rbms.2018.09.003. eCollection 2018 Aug.

We have amended the introduction text as per the reviewer's suggestion :

"Warnings about HPTs in pregnancy first emerged in 1956: accumulating concerns over an increased risk of malformations led to their withdrawal in a number of countries at different times. Norway cancelled the indication in pregnancy for HPTs in 1970; the UK did so in 1978, when the manufacturers of Primodos, Schering AG (taken over by Bayer AG in 2008), voluntarily stopped marketing the product; in Germany, Duogynon was taken off the market in 1981  [ref Olszynko-Gryn J].'

We have amended the introduction text as per the reviewer's suggestion:

‘The test principle was that they would induce bleeding similar to menstruation in those who were not pregnant.’

'Warnings about HPTs in pregnancy first emerged in 1956.'

And referenced the Britton H.G. Pregnancy test. Br. Med. J. 1956;2(18 Aug.):419. paper 

The benefits of our systematic review include that it quantifies the magnitude of the association and tests the robustness of this association across multiple studies by meta-analysis. We, therefore, perceive that they are rationale and the objectives are clear.

F1000Res. 2018 Nov 5. doi: 10.5256/f1000research.18318.r40053

Referee response for version 1

David Healy 1

I am fully supportive of this article on the effects of hormone pregnancy tests as it stands. I have no substantive criticism of the content or methods.

I am of course interested in why the regulator (MHRA) did not find comparable results but this is not a matter that should be addressed in this article.

There is one extra point that this article may speak to which is that from some time it was thought that teratogens caused signature defects - such as the phocomelia of thalidomide. This may now be a minority position (I'm not sure of this point). The findings here do not support that point of view. I can understand if the authors may think that commenting on this point is a matter for others or for another article; I mention it for consideration.

I have one very minor point about the layout which is that in the column where the numbers of women recruited to various studies is mentioned, the right justification of paragraphs leads to an odd spacing between 28 thousand and 671 - this doesn't apply when the page is resized.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2019 Jan 16.
Carl Heneghan 1

Many thanks for these positive comments.

Signature defects are only likely to occur if the timing of exposure is the same in all cases. However, when the timing of exposure varies, fetuses will be affected in different ways, depending on the tissues that are developing at the time of exposure, giving rise to a variety of malformations. The large range of times of exposure during embryogenesis determines which developmental processes are most affected, resulting in a wide variety of potential defects as seen in our review and in alleged Primodos survivors. Furthermore, different fetuses may have different epigenetic susceptibilities to different teratogenic outcomes.

The Primodos components norethisterone acetate and ethinyl estradiol induce developmental abnormalities in zebrafish embryos [Brown S, Fraga LR, Cameron G, Erskine L, Vargesson N. Sci Rep. 2018 Feb 13;8(1):2917. doi: 10.1038/s41598-018-21318-9]. Brown's data in Zebrafish show acetate and ethinylestradiol teratogenicity depends on dose and the embryonic stage of development, embryos at an early stage being more sensitive than those at a later stage.

The comments are interested in why the regulator (MHRA) did not find comparable results but this is not a matter that should be addressed in this article. We agree with this issue - no change required.

We have removed the spacing between numbers to eliminate the odd spacing effect.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Study extraction sheet

    Copyright: © 2019 Heneghan C et al.

    Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

    Data Availability Statement

    F1000Research: Dataset 1. Study extraction sheet, https://dx.doi.org/10.5256/f1000research.16758.d233494 30


    Articles from F1000Research are provided here courtesy of F1000 Research Ltd

    RESOURCES