Abstract
Background
Without an adequate animal model permitting experiments the pathophysiology of endometriosis remains unclear and without a non-invasive diagnosis, information is limited to symptomatic women. Lesions are macroscopically and biochemically variable. Hormonal medical therapy cannot be blinded when recognised by the patient and the evidence of extensive surgery is limited because of the combination of low numbers of interventions of variable difficulty with variable surgical skills. Experience is spread among specialists in imaging, medical therapy, infertility, pain and surgery. In addition, the limitations of traditional statistics and P-values to interpret results and the complementarity with Bayesian inference should be realised.
Objectives
To review and discuss evidence in endometriosis management
Materials and Methods
A PubMed search for blinded randomised controlled trials in endometriosis.
Results
Good-quality evidence is limited in endometriosis.
Conclusions
Clinical experience remains undervalued especially for surgery.
What is new?
Evidence-based medicine should integrate traditional statistical analysis and the limitations of P-values, with the complementary Bayesian inference which is predictive and sequential and more like clinical medicine. Since clinical experience is important for grading evidence, specific experience in the different disciplines of endometriosis should be used to judge trial designs and results. Finally, clinical medicine can be considered as a series of experiments controlled by the outcome. Therefore, the clinical opinion of many has more value than a personal opinion.
Keywords: evidence, statistical inference, Bayesian, endometriosis, evidence-based medicine
Introduction
Medicine is based on observations and experience and statistical analysis to help with the interpretation of observations. For more than a century classical statistical methods or the frequentist approach (Fisher 1925; Neyman and Pearson, 1928) used significance levels, power and P-values to grasp in one value the probability that an observed effect, can be attributed to chance (null hypothesis), taking into account its distribution and power. P-values measure the extremeness of a result given the null hypothesis but do not evaluate whether the hypothesis is true (Lesaffre and Lawson, 2012). This mistake is often made in medicine and is known as the P-value fallacy (Goodman, 1999). Recently the American statistical association published a statement that “P-values do not measure the probability that the studied hypothesis is true or the probability that the data were produced by random chance alone and that they are not a good measure of evidence regarding a model or hypothesis” (Wasserstein and Lazar, 2016). Statistical reporting, therefore, is changing in gynaecological journals (Hardwicke and Goodman, 2020) such as the Journal of Minimally Invasive Gynecology (JMIG) and the British Journal of Obstetrics and Gynaecology (BJOG) (Price et al., 2020; Wilson and Falcone, 2020).
Evidence-based medicine (EBM) was developed to integrate research data corrected for biases into clinical medicine. EBM, initiated in the 1990s when calculators permitted more complex analyses and meta-analyses, embraced P-values for evidence resulting in a pyramid of evidence (Djulbegovic and Guyatt, 2017) with the randomised controlled trials (RCT) (Lawrence and Force, 1989), and later meta- analyses (CEBM, 2009 ) and systematic reviews on top. However, the translation of results in grades of evidence and the integration of research data in clinical medicine and guidelines proved to be difficult (Murad et al., 2016a). Today, this difficulty could be seen as the unconscious conflict between the inappropriate use of P-values as ‘evidence’ for the initial hypothesis, and clinical medicine using a rather Bayesian approach for diagnosis and therapy.
An EBM approach to endometriosis needs specific considerations. The absence of adequate animal models limits experiments, and the pathophysiology remains debated and it is still unclear whether endometriosis is one or several diseases. Without an adequate non-invasive diagnosis, epidemiology is poorly understood. RCTs for medical therapy are hampered by the absence of blinding when the patient recognises therapy (e.g., when affecting menstruation). Extensive surgery is too variable for the available number of interventions to permit RCTs.
Considering our recently changing understanding of statistical inference, we will review the EBM approach to endometriosis and highlight clinical experience and judgment in the hierarchy of evidence.
Materials and Methods
A PubMed search “endometriosis AND “double- blind” AND (recurrences OR pain OR fertility) AND (surgery OR medical OR surgical)” yielded 101 results. These were hand searched to exclude so-called placebo-blinded trials when blinding was inadequate e.g., when active treatment was readily recognised by the patient or the gynaecologist by changes in menstruation or menopausal symptoms or vaginal atrophy. When recognition was not clear and when placebo effects could not be judged, trials comparing 2 hormonal therapies such as dienogest and GnRH agonists after surgery (Ceccaroni et al., 2021) were excluded. In addition, 1 trial dealing with acupuncture and 7 trials describing vitamin D or fatty acid treatment and 1 trial with anti-TNFa and 1 with pentoxifylline treatment was eliminated. After this, only 2 trials of laser vaporisation of superficial endometriosis and 4 trials comparing excision with coagulation or ablation of superficial endometriosis, remained. Prism flow sheets are no included, since not adequately blinded RCT describing hormonal medical therapy of endometriosis and no RCT reporting surgery of severe endometriosis were not found.
Classical and Bayesian statistical inference are complimentary
Without discussing classical and Bayesian statistics in detail, the clinician should grasp the differences and the complementarity. Traditional statistics evaluate the probability that the results of an experiment could be obtained by chance alone, without considering previous knowledge or other experiments. Traditional statistics therefore can only refute but not confirm a hypothesis. Judgement of the validity of a hypothesis rather requires Bayesian reasoning, needing a prior probability and the Bayesian factor indicating how experimental results change that probability (Goodman, 2001; Sellke et al., 2001). The relationship between traditional and Bayesian statistics can be illustrated as follows. If we do not know whether something is true or not the (prior) probability is 50%; an experiment demonstrating an effect with a P-value of 0.05 or 0.01 increases this probability to 71% or 89% respectively. This also emphasises the uncertainty, since the initial hypothesis remains wrong in 29% and 11%, respectively (Nuzzo, 2014). The probability that a hypothesis is correct moreover varies with many other factors besides P-values (Goodman and Greenland, 2007).
Bayesian reasoning is rather sequential and the probability that a hypothesis is true, based on previous knowledge, is updated with the new information. The weather forecast, which is more accurate for tomorrow than for the following days, is daily updated with new information. This is more like medical thinking where a diagnosis is progressively refined, and a therapy updated with new information.
This improved understanding of statistics, as discussed over the last decades, explains that observations in medicine are often poorly reproducible (Boos and Stefanski, 2011; Colquhoun, 2017) and that the conclusion that many research findings in medicine are wrong (Goodman and Greenland, 2007) and often "an accurate measure of the prevailing biases" (Ioannidis, 2005). However, classical and Bayesian statistics are complementary. The former is more suited to evaluate the multiple interactions in a large dataset (multivariate analysis), the former being more suited to predict the future or the accuracy of a hypothesis.
Hierarchy of evidence in EBM
‘Evidence-based medicine is the conscientious, explicit, and judicious use of current best evidence in making decisions about the care of individual patients’ (Sackett et al., 1996). By formalising RCTs, reporting, meta-analyses, systematic reviews, and the prevention of selection or observation biases, data became more reliable, improving the quality of evidence (Djulbegovic and Guyatt, 2017). However, the integration of EBM into clinical medicine was difficult (Djulbegovic and Guyatt, 2017). Today we understand that this was to some extent a consequence of considering P-values erroneously as a confirmation of a hypothesis instead of changing the probability. Although more comparable to clinical decision- making for diagnosis and therapy, Bayesian inference (Lesaffre and Lawson, 2012) is still poorly incorporated in EBM.
Considerations of traditional analysis of treatment and diagnosis
Although well-known, clinicians risk forgetting that traditional statistical analyses require a homogeneous population and that they are not suited to detect subgroups (Koninckx et al., 2020c) with different behaviour, or for rare events. The latter requires (often prohibitively) large groups to be evaluated. A historic example is that it took many years to realise that chloramphenicol, an excellent antibiotic, had a 1/10.000 risk of aplastic anaemia (Polin and Plaut, 1977). Another problem is the publication bias (Lin and Chu, 2018; DeVito and Goldacre, 2019) when small, clinically irrelevant differences reach ‘significance’ because of large groups (since P-values improve with the square root of the number of observations). Diagnostic tests are used to estimate the probability that a patient has or does not have a disease, i.e., the positive (PPV) or negative predictive value (NPV). However, the accuracy of prediction decreases sharply when the prevalence of a disease is low, especially when lower than 5 or 1%. This is shown in Figure 1 using the Bayesian formula (Lesaffre and Lawson, 2012) to calculate this relationship, illustrating that a test with 99% sensitivity and 99% specificity for a disease with a 1% prevalence, has a predictive value of only 50%, the number of false positives and true positives being similar. Unfortunately, reports describing the accuracy of tests rarely specify prevalence, which moreover increase because of a referral bias. Therefore, the predictive values for rare diseases will be higher in tertiary referral centres (Drossman et al., 1997) than in routine clinical practice. More difficult to calculate, is the combined diagnostic accuracy of several tests (Pepe, 2000). This is illustrated by a recent Cochrane review, suggesting using tests sequentially, beginning with the test with the highest sensitivity, and then re-testing the negative group (Nisenblat et al. 2016). Although the added value of a second test and the combined accuracy of tests can be calculated with a Bayesian approach (Broemeling, 2011) as illustrated recently for endometriosis (Chen and Hwang, 2019; Chen et al., 2019), this is still rarely used.
Clinical judgment, experience, and artificial intelligence
Clinical judgment is important although difficult to standardise. Some data cannot be compared, such as the efficacy of treatment versus the severity or incidence of side effects. In surgery, both are moreover linked since surgeon dependent. When blinding is not possible, efficacy and placebo effects cannot be separated. Also, the judgment of imprecision, inconsistency, publication bias and external validity, is difficult. We should realise that the same holds for the grades of evidence used in EBM (Murad et al., 2014; Murad et al., 2016a). More subtle is how the criteria used to judge the quality of RCTs (Mahmood et al., 2021) influence the conclusions of meta-analyses up to becoming misleading (Ioannidis, 2016). Although clinical judgment is difficult to define, its value was illustrated by the observation that the formal evaluation of biases in diagnostic accuracy by Quadas tools (Whiting et al., 2011; Yang et al., 2021) is not much superior to clinical judgment. Clinical experience integrates knowledge with experience in the entire population, including heredity, age, antecedents, and rare events. Clinical experience thus is much larger than evidence derived from trials since many trials are not performed when clinical consequences are minimal, or when prevalence is low as in multimorbidity. Many individual and local preferences moreover were implemented following rare events, accidents or near accidents, which become forgotten years later. Therefore, we should be prudent when changing habits because of lack of evidence. Clinical decision-making (You and Krumholz, 2022) is complex. Considering the age, antecedents, symptoms, clinical exam, blood tests and imaging, the clinician considers a series of potential diagnoses, that range from most likely to rare. The integration of all these variables into a PPV and NPV for each diagnosis considered, including the risk when making a mistake, is a progressive complex experience-based, artificial intelligence-like (Letterie, 2021) process. Finally, it should be realised that clinical experience precedes and guides RCTs (Figure 2). They are performed to confirm an observation, or not performed when the superiority of a treatment or an intervention seems repetitively observed, without exceptions, or when the expected effect is so little that the result will be clinically irrelevant.
Emotional intelligence is rarely considered since difficult to define. However, the interaction of the clinician and the patient through body language and expectations of the patient influences diagnosis and therapy, and similar data can be interpreted differently by clinicians with comparable experience.
Conclusion
These considerations explain the difficulties of the clinical integration of EBM and the ongoing discussions on the hierarchy of evidence (Murad et al. 2016b). Besides the interpretation of data and the fallacy of p-values, grades of evidence were introduced to consider all available evidence including observational series and case reports, emphasising the importance of clinical judgment and experience.
In addition, we did not discuss other limitations of EBM and the hierarchy of evidence, such as publication bias (Vercellini, 2014; DeVito and Goldacre 2019), procedural aspects such as financial bias in funding (Howick, 2019) and drug research (Klemperer, 2010), the epistemological discussion to distinguish justified belief from opinions (Gaeta and Gentile, 2016) and the importance of medicolegal aspects as described in the recent introduction of NUTS (Number of Unnecessary Tests to avoid one Suit) statistics (Allen et al., 2021).
EBM and Endometriosis
Specific problems of endometriosis
Endometriosis is a frequent disease causing pain and infertility and is the most frequent reason for surgery in women (Kempers et al., 1960). Given the likely association with adenomyosis and bleeding disorders (Koninckx et al., 2018), endometriosis can be considered for almost any complaint in gynaecology. Without an adequate animal model permitting experimentation and without a non- invasive diagnosis, the pathophysiology, the natural history, and the epidemiology (Koninckx et al., 2021a) are poorly understood. Data on endometriosis are scanty in some age groups such as adolescence and overall limited to symptomatic women except small series (Moen and Stokstad, 2002). The latter is illustrated by the recent suggestion by statisticians to redefine endometriosis as ‘symptomatic’, thus facilitating data handling but eliminating those who did not undergo a laparoscopy (Goodman and Franasiak, 2018). Even for cystic ovarian endometriosis the accuracy of imaging seldom exceeds 90% while it remains difficult to exclude ovarian cancer, especially in older women (Van Holsbeke et al., 2009).
Good-quality data are limited
Most laparoscopies are performed in symptomatic women. In individual women, it is difficult to ascertain whether endometriosis and pain or infertility are causally related. Only half of the superficial lesions are painful (Demco, 2000) and there are many other causes of pelvic pain or infertility. Medical therapies have an important placebo effect (Koninckx et al., 2008), and blinding is an illusion when the patient recognises active therapy by affecting menstruation or menopausal symptoms. Although most trials require a proven endometriosis diagnosis for inclusion, it is unclear which endometriosis they have after laparoscopy with surgical destruction. Judgment of a trial can vary over time, as illustrated by the not-blinded ENDOCAN trial (Marcoux et al., 1997) showing improvement in fertility following surgery. A Cochrane meta-analysis was subsequently withdrawn (Jacobson et al., 2014). For cystic ovarian endometriosis the results of surgery, ovarian damage and recurrence rates are surgeon- dependent (Muzii and Miller, 2011). Deep endometriosis is highly variable, and surgery is technically difficult and complication prone. Because of the variable skills of the surgeon, and the low number of interventions, RCTs are not realistic. If performed nevertheless, unexpected results risk being criticised as occurred recently in the LACC trial demonstrating higher recurrence rates after laparoscopic surgery for cervical cancer (Ramirez et al., 2018).
Clinical judgment varies with subspecialties
The clinical judgment of endometriosis is complicated by different sub-specialists. Clinical experience is bound to vary between radiologists performing magnetic resonance imaging (MRI), gynaecologists specialising in ultrasonography or endocrinology or medical therapy or surgery, and abdominal surgeons with little expertise in other aspects of gynaecology. An additional difficulty is the degree of commercialisation and industrialisation (Perrotta and Geampana, 2021), especially in infertility and medical therapy.
Clinical judgment varies with our perception of pathophysiology
Management should be based on evidence, but our clinical judgement might vary with our understanding of pathophysiology. The implantation theory (Sampson, 1921; Sampson, 1927) defined endometriosis as ‘endometrial glands and stroma outside the uterus’ and thus as one disease, which became clinically considered progressive and recurrent. According to the genetic-epigenetic (G-E), theory endometriosis starts developing after a cumulative but variable series of cellular incidents (Koninckx et al., 2020a). This is consistent with endometriosis lesions being clonal and variable as observed for aromatase activity and progesterone resistance (Bulun et al., 2019), and for the response to medical therapy (Becker et al., 2017; Vercellini et al., 2021). If lesions are different, traditional statistical analysis with means and standard deviations is inadequate (Koninckx et al., 2020c). Since the risk of G-E incidents increases by the oxidative stress of retrograde menstruation or the peritoneal microbiome, it is logical that susceptible women have an increased risk after puberty, and the remaining group will have a progressively lower risk (Koninckx et al., 2021c). Thus, age becomes an important factor in epidemiology. Pelvic endometriosis lesions grow in the peritoneal cavity which is endocrinologically and immunologically a specific microenvironment. The growth of endometrial lesions is self-limiting (Koninckx et al., 2021c) probably as a consequence of fibrosis and inflammation secondary to the immunologic reaction. This is consistent with the clinical observation that most deep endometriosis lesions that are followed clinically since symptoms were insufficient for surgery, do not grow. Viewed as a G-E-driven disease, recurrences might become preventable by decreasing oxidative stress. This is consistent with the lower recurrence rate of cystic ovarian endometriosis when taking oral contraception. Although not demonstrated yet, we might consider prevention by preventing ascending infections, or by changing the peritoneal microbiome by food intake and exercise (Langweiler, 2020). This is consistent with the observations that the risk of developing endometriosis seems lower when taking food rich in antioxidant as omega3, Vit E, Vit C, and citrus (Harris et al., 2018; Afrin et al. 2021). It is too early to fully understand the effect of vitamins on inflammation and immune response in endometriosis (Halpern et al., 2015). New concepts of pathophysiology should be considered for future trials. This could apply more specifically to endometriosis in adolescence, to the prevention of endometriosis, and to interpret results of endometriosis if heterogeneous and more than one disease.
Non-biomedical health systems
A growing number of reports document the management of endometriosis with complementary therapies (Porpora et al., 2013; Mira et al., 2018; Della Corte et al., 2020), acupuncture, food intake (Nirgianakis et al., 2021) and exercise (Langweiler, 2020), and more recently traditional Chinese medicine (Peng et al., 2021). These reports are difficult to interpret since the indications and results of these treatments poorly fit EBM standards. However, indirect, and circumstantial evidence is too strong to ignore these treatments altogether. We are at the crossroads of understanding the role of food intake and exercise on the peritoneal (Koninckx et al., 2019) and the intestinal microbiome. Both might influence endometriosis onset and growth either directly or through immunology and oxidative stress.
Conclusion
In endometriosis high-quality evidence is very limited and the clinical judgment varies with experience that is different for the subspecialties involved. These differences in experience should be addressed since experience affects the grading of evidence and the recognition of bias. It seems logical that the ranking of evidence for diagnosis, medical therapy and surgery should be performed by those with experience in that subspecialty.
Surgery for severe endometriosis requires specific comments. Data are limited to observational series with referral biases and important differences in technique. However, the surgeons with extensive experience are a small group, who know each other’s surgery and who meet and discuss several times a year and progressively adapt their surgery (Donnez and Donnez, 2021). Therefore, the elements on which this group agrees because of a similar experience have more value than an opinion. Each intervention can be seen as an experiment testing management and measured by outcome. Hopefully, statisticians will help to formalise this collective experience and outcome-based observations into evidence.
Discussion
The principles of EBM (Sackett et al., 1996) are clear, but the hierarchy of the evidence is struggling with a poorly transparent clinical judgment, which for endometriosis might vary with the experience of the subspecialists. It seems logical to match the judgment of evidence with the extent and type of experience in each subspecialty.
Evidence needs to be translated into guidelines. This requires the input of all stakeholders. Also, the interpretation might vary over time with changing understanding of pathophysiology. Whether endometriosis is considered as one or several G-E different diseases will help to understand that some 50% of typical lesions are not painful and that response to medical therapy is absent or inadequate in 10% to 40% respectively (Donnez and Dolmans, 2021; Vercellini et al., 2021). Today it seems logical that superficial, cystic ovarian and deep endometriosis are reported separately since likely different entities (Donnez et al., 1996).
Clinicians have been educated with significances and P-values, and risk having misused them to confirm a hypothesis (Nuzzo, 2014). A full discussion of statistical inference being beyond this manuscript, the differences between traditional (P-values) and Bayesian (probabilities) statistics can be illustrated as follows. A 60% probability of rain is different from a non-significant P-value that it is going to rain. Although important, it will take time to incorporate a Bayesian approach (Lesaffre and Lawson, 2012) and to acknowledge the similarities with clinical medicine. First, consider the progressive approach of clinical medicine and Bayesian inference: a clinician seeing a woman of this age, with these antecedents and these symptoms, considers many differential diagnoses which are refined into a workable probability by additional exams and tests. This finally results in a conclusion or treatment considering the consequences of mistakes and complications. In addition, we have to prevent the gap between statistical inference and clinical understanding from becoming wider, as illustrated by a recent diagnostic test of endometriosis using a ‘penalised regression model and machine learning with random forest’ (Moustafa et al., 2020). This risks not being readily understood by most clinicians.
The strength of evidence needs to be re- evaluated for hormonal medical therapy of endometriosis. First, we need to acknowledge that adequate blinding cannot be done when the patient recognises active therapy. In addition, we need to incorporate the high peritoneal fluid concentrations and progesterone resistance (Donnez and Dolmans, 2021; Koninckx et al., 2022) and that endometriosis lesions are a heterogeneous group (Koninckx et al., 2020c; Donnez, 2021) with variable growth even during medical therapy (Vercellini et al., 2021) or after menopause (de Almeida Asencio et al., 2019). We also should avoid vague clinical terms such as “adequate pain relief”, or ‘women with proven endometriosis’. Notwithstanding these considerations, the clinical treatment of superficial endometriosis could be summarised as follows. Women with proven or suspected endometriosis and pain deserve a trial with medical therapy, but the eventual growth of lesions during therapy should be monitored and if pain relief is inadequate, other options should be considered.
Judgment of surgery remains difficult. Quality is poorly defined, the severity of endometriosis is variable, and cystic and deep endometriosis is technically difficult and complication-prone surgery with oocyte damage, sexual problems and bladder, ureteral and bowel complications (Koninckx et al., 2021b). Randomisation is unrealistic and often unethical when surgeons are not equally trained in the techniques to be compared. Since this is unlikely to change, it seems important to use a Bayesian approach to establish the value of the collective experience of surgeons in technique, results and complications and the importance of granular intra-operative details (Kanters et al., 2018). This is not contradicted by the decision of doing a bowel resection or a conservative excision or a discoid excision being based to a large extent on personal preferences (Koninckx et al., 2020b) since results and complications vary with surgical skills and experience. To convert this collective opinion based on repetitive surgical interventions with the outcome as measurement, into a degree of evidence will be a methodological and statistical challenge.
In conclusion, an EBM approach to endometriosis faces specific challenges. The diagnosis is limited to those undergoing laparoscopy and this decision is based on a variable mixture of clinical exams and symptoms and imaging. The accuracies of imaging such as ultrasound or MRI are well described (Guerriero et al., 2021a; Guerriero et al., 2021b), but the predictive values vary with the prevalence, and their importance in clinical decision-making varies from little (Koninckx et al., 2021b; Koninckx et al., 2021c) to very much (Malzoni et al., 2020). We need to incorporate that the recognition of subtle and deep endometriosis (Taylor et al., 2018) is variable. Medical therapy needs re-appraisal and for extensive surgery the value of the collective judgement of surgeons needs evaluation. This complexity will need better integration of traditional and Bayesian statistical analysis and inference to understand which exams and therapies improve outcomes (Bernstein and Wang, 2021).
Footnotes
Acknowledgements: We do thank Manu Lesaffre and Geert Page, Belgium and Lone Hummelshoj, United Kingdom for fruitful discussions.
Contribution to Author: All authors have discussed endometriosis diagnosis and especially surgery during several yearly meetings and live surgeries. These discussions have resulted in a series of joint publications on endometriosis. PK’s interest in statistics initiated this manuscript with the help of ML author of “Bayesian biostatistics”. The text was subsequently edited and discussed by all authors.
References
- 1.Afrin S, AlAshqar A, El Sabeh M, et al. Diet and Nutrition in Gynecological Disorders: A Focus on Clinical Studies. Nutrients. 2021;13:1747. doi: 10.3390/nu13061747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Allen R, Cai AG, Tepler P, et al. The “NUTS” statistic: Applying an EBM disease model to defensive medicine. J Healthc Risk Manag. 2022;41:9–12. doi: 10.1002/jhrm.21486. [DOI] [PubMed] [Google Scholar]
- 3.Becker CM, Gattrell WT, Gude K, et al. Reevaluating response and failure of medical treatment of endometriosis: a systematic review. Fertil Steril. 2017;108:125–136. doi: 10.1016/j.fertnstert.2017.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bernstein E, Wang TY. Point-of-Care Ultrasonography. JAMA Internal Medicine. 2021;181:1558–1559. doi: 10.1001/jamainternmed.2021.5831. [DOI] [PubMed] [Google Scholar]
- 5.Boos DD, Stefanski LA. P-Value Precision and Reproducibility. The American Statistician. 2011;65:213–221. doi: 10.1198/tas.2011.10129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Broemeling LD. Bayesian Estimation of Combined Accuracy for Tests with Verification Bias. Diagnostics. 2011;1:53–76. doi: 10.3390/diagnostics1010053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bulun SE, Yilmaz BD, Sison C, et al. Endometriosis. Endocrine reviews. 2019;40:1048–1079. doi: 10.1210/er.2018-00242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ceccaroni M, Clarizia R, Liverani S, et al. Dienogest vs GnRH agonists as postoperative therapy after laparoscopic eradication of deep infiltrating endometriosis with bowel and parametrial surgery: a randomized controlled trial. Gynecol Endocrinol. 2021;37:930–933. doi: 10.1080/09513590.2021.1929151. [DOI] [PubMed] [Google Scholar]
- 9.Chen Z, Hwang BS. A Bayesian semiparametric approach to correlated ROC surfaces with stochastic order constraints. Biometrics. 2019;75:539–550. doi: 10.1111/biom.12997. [DOI] [PubMed] [Google Scholar]
- 10.Chen Z, Hwang BS, Kim S. A correlated Bayesian rank likelihood approach to multiple ROC curves for endometriosis. Stat Med. 2019;38:1374–1385. doi: 10.1002/sim.8043. [DOI] [PubMed] [Google Scholar]
- 11.Colquhoun D. The reproducibility of research and the misinterpretation of p-values. R Soc Open Sci. 2017;4:171085. doi: 10.1098/rsos.171085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.de Almeida Asencio F, Ribeiro HA, Ribeiro PA, et al. Symptomatic endometriosis developing several years after menopause in the absence of increased circulating estrogen concentrations: a systematic review and seven case reports. Gynecol Surg. 2019;16:3. [Google Scholar]
- 13.Della Corte L, Noventa M, Ciebiera M, et al. Phytotherapy in endometriosis: an up-to-date review. J Complement Integr Med. 2020;17 doi: 10.1515/jcim-2019-0084. [DOI] [PubMed] [Google Scholar]
- 14.Demco L. Review of pain associated with minimal endometriosis. JSLS. 2000;4:5–9. [PMC free article] [PubMed] [Google Scholar]
- 15.DeVito NJ, Goldacre B. Catalogue of bias: publication bias. BMJ Evidence-Based Medicine. 2019;24:53–54. doi: 10.1136/bmjebm-2018-111107. [DOI] [PubMed] [Google Scholar]
- 16.Djulbegovic B, Guyatt GH. Progress in evidence-based medicine: a quarter century on. Lancet. 2017;390:415–423. doi: 10.1016/S0140-6736(16)31592-6. [DOI] [PubMed] [Google Scholar]
- 17.Donnez J. The heterogeneity of endometriotic lesions could be explained by their progesterone resistance. Hum Reprod. 2021;36:2624–2625. doi: 10.1093/humrep/deab151. [DOI] [PubMed] [Google Scholar]
- 18.Donnez J, Dolmans MM. Endometriosis and Medical Therapy: From Progestogens to Progesterone Resistance to GnRH Antagonists: A Review. J Clin Med. 2021;10:1085. doi: 10.3390/jcm10051085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Donnez J, Nisolle M, Smoes P, et al. Peritoneal endometriosis and “endometriotic” nodules of the rectovaginal septum are two different entities. Fertil Steril. 1996;66:362–368. [PubMed] [Google Scholar]
- 20.Donnez O, Donnez J. Deep endometriosis: The place of laparoscopic shaving. Best Pract Res Clin Obstet Gynaecol. 2021;71:100–113. doi: 10.1016/j.bpobgyn.2020.05.006. [DOI] [PubMed] [Google Scholar]
- 21.Drossman DA, Whitehead WE, Camilleri M. Irritable bowel syndrome: a technical review for practice guideline development. Gastroenterology. 1997;112:2120–2137. doi: 10.1053/gast.1997.v112.agast972120. [DOI] [PubMed] [Google Scholar]
- 22.Fisher RA. Statistical methods for research workers. Edinburgh: Oliver and Boyd; 1925. [Google Scholar]
- 23.Gaeta R, Gentile N. Evidence, discovery and justification: the case of evidence-based medicine. J Eval Clin Pract. 2016;22:550–557. doi: 10.1111/jep.12419. [DOI] [PubMed] [Google Scholar]
- 24.Goodman LR, Franasiak JM. Efforts to redefine endometriosis prevalence in low-risk patients. BJOG. 2018;125:63. doi: 10.1111/1471-0528.14701. [DOI] [PubMed] [Google Scholar]
- 25.Goodman S, Greenland S. Why most published research findings are false: problems in the analysis. PLoS medicine. 2007;4:e168. doi: 10.1371/journal.pmed.0040168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Goodman SN. Toward evidence-based medical statistics. 1: The P value fallacy. Ann Intern Med. 1999;130:995–1004. doi: 10.7326/0003-4819-130-12-199906150-00008. [DOI] [PubMed] [Google Scholar]
- 27.Goodman SN. Of P-values and Bayes: a modest proposal. Epidemiology. 2001;12:295–297. doi: 10.1097/00001648-200105000-00006. [DOI] [PubMed] [Google Scholar]
- 28.Guerriero S, Martinez L, Gomez I, et al. Diagnostic accuracy of sonography for detecting parametrial involvement in women with pelvic endometriosis: a systematic review and meta-analysis. Ultrasound Obstet Gynecol. 2021a;58:669–676. doi: 10.1002/uog.23754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Guerriero S, Pascual MA, Ajossa S, et al. Artificial intelligence (AI) in the detection of rectosigmoid deep endometriosis. Eur J Obstet Gynecol Reprod Biol. 2021b;261:29–33. doi: 10.1016/j.ejogrb.2021.04.012. [DOI] [PubMed] [Google Scholar]
- 30.Halpern G, Schor E, Kopelman A. Nutritional aspects related to endometriosis. Revista da Associação Médica Brasileira. 2015;61:519–523. doi: 10.1590/1806-9282.61.06.519. [DOI] [PubMed] [Google Scholar]
- 31.Hardwicke TE, Goodman SN. How often do leading biomedical journals use statistical experts to evaluate statistical methods? The results of a survey. PLOS ONE. 2020;15:e0239598. doi: 10.1371/journal.pone.0239598. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Harris HR, Eke AC, Chavarro JE, et al. Fruit and vegetable consumption and risk of endometriosis. Hum Reprod. 2018;33:715–727. doi: 10.1093/humrep/dey014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Howick J. Exploring the Asymmetrical Relationship Between the Power of Finance Bias and Evidence. Perspect Biol Med. 2019;62:159–187. doi: 10.1353/pbm.2019.0009. [DOI] [PubMed] [Google Scholar]
- 34.Ioannidis JPA. Why most published research findings are false. PLoS medicine. 2005;2:e124. doi: 10.1371/journal.pmed.0020124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ioannidis JPA. The Mass Production of Redundant, Misleading, and Conflicted Systematic Reviews and Meta-analyses. Milbank Q. 2016;94:485–514. doi: 10.1111/1468-0009.12210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Jacobson TZ, Duffy JMN, Barlow D, et al. WITHDRAWN: Laparoscopic surgery for pelvic pain associated with endometriosis. Cochrane Database Syst Rev. 2009:CD001300. doi: 10.1002/14651858.CD001300.pub3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kanters AE, Shubeck SP, Sandhu G, et al. Justifying our decisions about surgical technique: Evidence from coaching conversations. Surgery. 2018;164:561–565. doi: 10.1016/j.surg.2018.04.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kempers RD, Dockerty MB, Hunt AB, et al. Significant postmenopausal endometriosis. Surg Gynecol Obstet. 1960;111:348–356. [PubMed] [Google Scholar]
- 39.Klemperer D. Drug Research: Marketing before evidence, sales before safety [editorial] . Deutsches Aerzteblatt Online. 2010;107:277–278. doi: 10.3238/arztebl.2010.0277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Koninckx PR, Craessaerts M, Timmerman D, et al. Anti-TNF-alpha treatment for deep endometriosis-associated pain: a randomized placebo-controlled trial. Hum Reprod. 2008;23:2017–2023. doi: 10.1093/humrep/den177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Koninckx PR, Martin DC, Donnez J. Fertil Steril. Do we need to separate initiation and growth to understand endometriosis. 2020a;114:766–767. doi: 10.1016/j.fertnstert.2020.06.008. [DOI] [PubMed] [Google Scholar]
- 42.Koninckx PR, Ussia A, Adamian L, et al. Conservative Surgery of Deep Bowel Endometriosis. Clinical Management of Bowel Endometriosis. 2020b:119–133. [Google Scholar]
- 43.Koninckx PR, Ussia A, Adamyan L, et al. Peritoneal fluid progesterone and progesterone resistance in superficial endometriosis lesions. Human Reproduction. 2022;28:203–211. doi: 10.1093/humrep/deab258. [DOI] [PubMed] [Google Scholar]
- 44.Koninckx PR, Ussia A, Adamyan L, et al. The epidemiology of endometriosis is poorly known as the pathophysiology and diagnosis are unclear. Best Pract Res Clin Obstet Gynaecol. 2021a;71:14–26. doi: 10.1016/j.bpobgyn.2020.08.005. [DOI] [PubMed] [Google Scholar]
- 45.Koninckx PR, Ussia A, Adamyan L, et al. Heterogeneity of endometriosis lesions requires individualisation of diagnosis and treatment and a different approach to research and evidence-based medicine. Facts Views Vis Obgyn. 2020c;11:263. [PMC free article] [PubMed] [Google Scholar]
- 46.Koninckx PR, Ussia A, Keckstein J, et al. Review on endometriosis surgery. Gynecology and Pelvic Medicine. 2021b;4:38. [Google Scholar]
- 47.Koninckx PR, Ussia A, Tahlak M, et al. Infection as a potential cofactor in the genetic-epigenetic pathophysiology of endometriosis: a systematic review. Facts Views Vis Obgyn. 2019;11:209–216. [PMC free article] [PubMed] [Google Scholar]
- 48.Koninckx PR, Ussia A, Wattiez A, et al. The severity and frequency distribution of endometriosis subtypes at different ages: a model to understand the natural history of endometriosis based on single centre/single surgeon data. Facts, Views and Vision in ObGyn. 2021c;13:209–219. doi: 10.52054/FVVO.13.3.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Koninckx PR, Ussia A, Zupi E, et al. Association of Endometriosis and Adenomyosis: Vast Literature but Scant Conclusive Data. J Minim Invasive Gynecol. 2018;25:745–748. doi: 10.1016/j.jmig.2018.03.012. [DOI] [PubMed] [Google Scholar]
- 50.Langweiler MJ. Evidence-Based Medicine and the Potential for Inclusion of Non-Biomedical Health Systems: The Case for Taijiquan. Front Sociol. 2021;5:618167. doi: 10.3389/fsoc.2020.618167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lawrence R, Force USPST. Guide to clinical preventive services. DIANE publishing; 1989. [Google Scholar]
- 52.Lesaffre E, Lawson AB. Bayesian biostatistics. United Kingdom: John Wiley & Sons, ltd; 2012. [Google Scholar]
- 53.Letterie G. Three ways of knowing: the integration of clinical expertise, evidence-based medicine, and artificial intelligence in assisted reproductive technologies. J Assist Reprod Genet. 2021;38:1617–1625. doi: 10.1007/s10815-021-02159-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Lin L, Chu H. Quantifying publication bias in meta-analysis. Biometrics. 2018;74:785–794. doi: 10.1111/biom.12817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Mahmood S, Nona P, Villablanca P, et al. The Meta-Analysis in Evidence-Based Medicine: High-Quality Research When Properly Performed. J Cardiothorac Vasc Anesth. 2021;35:2556–2558. doi: 10.1053/j.jvca.2021.05.025. [DOI] [PubMed] [Google Scholar]
- 56.Malzoni M, Casarella L, Coppola M, et al. Preoperative Ultrasound Indications Determine Excision Technique for Bowel Surgery for Deep Infiltrating Endometriosis: A Single, High-Volume Center. J Minim Invasive Gynecol. 2020;27:1141–1147. doi: 10.1016/j.jmig.2019.08.034. [DOI] [PubMed] [Google Scholar]
- 57.Marcoux S, Maheux R, Berube S. Laparoscopic surgery in infertile women with minimal or mild endometriosis. Canadian Collaborative Group on Endometriosis. N Engl J Med. 1997;337:217–222. doi: 10.1056/NEJM199707243370401. [DOI] [PubMed] [Google Scholar]
- 58. CEBM. Oxford Centre for Evidence-based Medicine – Levels of Evidence. 2009.
- 59.Mira TAA, Buen MM, Borges MG, et al. Systematic review and meta-analysis of complementary treatments for women with symptomatic endometriosis. Int J Gynaecol Obstet. 2018;143:2–9. doi: 10.1002/ijgo.12576. [DOI] [PubMed] [Google Scholar]
- 60.Moen MH, Stokstad T. A long-term follow-up study of women with asymptomatic endometriosis diagnosed incidentally at sterilization. Fertil Steril. 2002;78:773–776. doi: 10.1016/s0015-0282(02)03336-8. [DOI] [PubMed] [Google Scholar]
- 61.Moustafa S, Burn M, Mamillapalli R, et al. Accurate diagnosis of endometriosis using serum microRNAs. Am J Obstet Gynecol. 2020;223:557.:e1-e.11. doi: 10.1016/j.ajog.2020.02.050. [DOI] [PubMed] [Google Scholar]
- 62.Murad MH, Altayar O, Bennett M, et al. Using GRADE for evaluating the quality of evidence in hyperbaric oxygen therapy clarifies evidence limitations. J Clin Epidemiol. 2014;67:65–72. doi: 10.1016/j.jclinepi.2013.08.004. [DOI] [PubMed] [Google Scholar]
- 63.Murad MH, Asi N, Alsawas M, et al. New evidence pyramid. Evid Based Med. 2016a;21:125–127. doi: 10.1136/ebmed-2016-110401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Murad MH, Asi N, Alsawas M, et al. New evidence pyramid. Evid Based Med. 2016b;21:125–127. doi: 10.1136/ebmed-2016-110401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Muzii L, Miller CE. The singer, not the song. J Minim Invasive Gynecol. 2011;18:666–667. doi: 10.1016/j.jmig.2011.06.016. [DOI] [PubMed] [Google Scholar]
- 66.Neyman J, Pearson E. On the use and interpretation of certain test criteria for the purposes of statistical inference. Biometrica. 1928;20A:175–240. [Google Scholar]
- 67.Nirgianakis K, Egger K, Kalaitzopoulos DR, et al. Effectiveness of Dietary Interventions in the Treatment of Endometriosis: a Systematic Review. Reprod Sci. 2022;29:26–42. doi: 10.1007/s43032-020-00418-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Nisenblat VL, Prentice PM, Patrick Bossuyt, et al. Combination of the non-invasive tests for the diagnosis of endometriosis. Cochrane Database Syst Rev. 2016;7:CD012281. doi: 10.1002/14651858.CD012281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Nuzzo R. Statistical errors. Nature. 2014;506:150–152. doi: 10.1038/506150a. [DOI] [PubMed] [Google Scholar]
- 70.Peng J, Wang R, Ding Z, et al. Effect assessment of laparoscopy in combination with traditional Chinese medicine decoction in the treatment of endometriosis: A protocol for systematic review and meta-analysis. Medicine (Baltimore) 2021;100:e26699. doi: 10.1097/MD.0000000000026699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Pepe MS, Thompson ML. Combining diagnostic test results to increase accuracy. Biostatistics. 2000;1:123–140. doi: 10.1093/biostatistics/1.2.123. [DOI] [PubMed] [Google Scholar]
- 72.Perrotta M, Geampana A. Enacting evidence-based medicine in fertility care: Tensions between commercialisation and knowledge standardisation. Sociology of Health & Illness. 2021;43:2015–2030. doi: 10.1111/1467-9566.13381. [DOI] [PubMed] [Google Scholar]
- 73.Polin HB, Plaut ME. Chloramphenicol. N Y State J Med. 1977;77:378–381. [PubMed] [Google Scholar]
- 74.Porpora MG, Brunelli R, Costa G, et al. A promise in the treatment of endometriosis: an observational cohort study on ovarian endometrioma reduction by N-acetylcysteine. Evid. Based. Complement Alternat. J Minim Invasiv Gynecol Med. 2013:240702. doi: 10.1155/2013/240702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Price R, Bethune R, Massey L. Problem with p values: why p values do not tell you if your treatment is likely to work. Postgrad Med J. 2020;96:1–3. doi: 10.1136/postgradmedj-2019-137079. [DOI] [PubMed] [Google Scholar]
- 76.Ramirez PT, Frumovitz M, Pareja R, et al. Minimally Invasive versus Abdominal Radical Hysterectomy for Cervical Cancer. N Engl J Med. 2018;379:1895–1904. doi: 10.1056/NEJMoa1806395. [DOI] [PubMed] [Google Scholar]
- 77.Sackett DL, Rosenberg WM, Gray JA, et al. Evidence based medicine: what it is and what it isn’t. BMJ. 1996;312:71–72. doi: 10.1136/bmj.312.7023.71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Sampson JA. Perforating Hemorrhagic (chocolate)cysts of the ovary. Arch Surg. 1921;45:162–241. [Google Scholar]
- 79.Sampson JA. Peritoneal endometriosis due to the menstrual dissemination of endometrial tissue into the peritoneal cavity. Am J Obstet Gynecol. 1927;14:422–469. [Google Scholar]
- 80.Sellke T, Bayarri M, Berger JO. Calibration of ρ values for testing precise null hypotheses. The American Statistician. 2001;55:62–71. [Google Scholar]
- 81.Taylor HS, Adamson GD, Diamond MP, et al. An evidence-based approach to assessing surgical versus clinical diagnosis of symptomatic endometriosis. Int J Gynaecol Obstet. 2018;142:131–142. doi: 10.1002/ijgo.12521. [DOI] [PubMed] [Google Scholar]
- 82.Van Holsbeke C, Van Calster B, Guerriero S, et al. Imaging in gynaecology: How good are we in identifying endometriomas? Facts Views Vis Obgyn. 2009;1:7–17. [PMC free article] [PubMed] [Google Scholar]
- 83.Vercellini P. Endometriosis: the elusive gray area between evidence-based and evidence-biased medicine. Fertil Steril. 2014;101:45–46. doi: 10.1016/j.fertnstert.2013.09.023. [DOI] [PubMed] [Google Scholar]
- 84.Vercellini P, Sergenti G, Buggio L, et al. Advances in the medical management of bowel endometriosis. Best Pract Res Clin Obstet Gynaecol. 2021;71:78–99. doi: 10.1016/j.bpobgyn.2020.06.004. [DOI] [PubMed] [Google Scholar]
- 85.Wasserstein RL, Lazar NA. The ASA Statement on p-Values: Context, Process, and Purpose. The American Statistician. 2016;70:129–133. [Google Scholar]
- 86.Whiting PF, Rutjes AWS, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155:529–536. doi: 10.7326/0003-4819-155-8-201110180-00009. [DOI] [PubMed] [Google Scholar]
- 87.Wilson JR, Falcone T. The JMIG Issues New Guidelines on Statistical Reporting and p-values. J Minim Invasive Gynecol. 2020;27:1–3. doi: 10.1016/j.jmig.2019.10.006. [DOI] [PubMed] [Google Scholar]
- 88.Yang B, Mallett S, Takwoingi Y, et al. QUADAS-C: A Tool for Assessing Risk of Bias in Comparative Diagnostic Accuracy Studies. Ann Intern Med. 2021;174:1592–1599. doi: 10.7326/M21-2234. [DOI] [PubMed] [Google Scholar]
- 89.You SC, Krumholz HM. The Evolution of Evidence-Based Medicine: When the Magic of the Randomized Clinical Trial Meets Real-World Data. Circulation. 2022;145:107–109. doi: 10.1161/CIRCULATIONAHA.121.057931. [DOI] [PubMed] [Google Scholar]