Abstract
Background and Purpose
Urodynamic studies (UDS) are routinely obtained prior to surgery for stress urinary incontinence (SUI) despite a lack of evidence that UDS information has an actual impact on outcome. The primary aim of this non-inferiority randomized clinical trial is to determine whether women with symptomatic, uncomplicated SUI who undergo only a basic office evaluation (BOE) prior to SUI surgery (No UDS arm) have non-inferior treatment outcomes compared to women who have BOE and UDS (UDS arm). Secondary aims are: 1) to determine how often physicians use preoperative UDS results to alter clinical and surgical decision-making, 2)to compare the amount of improvement in incontinence outcomes, and 3) to determine the incremental cost and utility of performing UDS compared with not performing UDS
Methods
After an initial basic office evaluation, women planning surgery for uncomplicated SUI who consent to study participation will be randomized to receive preoperative UDS or No UDS. Treatment will be planned and performed by the surgeon utilizing all the data available to them. We will compare results from the basic office evaluation (No UDS) with results from the basic office evaluation and preoperative UDS.
Results
The primary outcome will be measured at 12 months using responses to the Urogenital Distress Inventory and the Patient Global Index – Improvement.
Conclusions
Randomized trials comparing the effects of different diagnostic alternatives on treatment outcomes pose study design challenges. A non-inferiority design is appropriate when comparing a less invasive and less expensive alternative with a standard of care approach.
Keywords: Randomized trial, Stress Urinary Incontinence, Urodynamic studies, Non-inferiority trial
INTRODUCTION
Urinary incontinence is a common and costly condition that affects 15% to 50% of women of all ages and significantly impairs quality of life. [1, 2] [3] Stress urinary incontinence (SUI) is one of the two major subtypes of urinary incontinence and affects nearly half of these incontinent women. Surgery is one of the most common treatments for SUI; 135,000 inpatient surgeries for SUI were performed in the U.S. in 1998.[4] Urodynamic studies (UDS) are diagnostic investigations often performed preoperatively to better characterize these patient’s incontinence symptoms.
Diagnostic testing is intended to guide clinical management and improve outcomes. Despite its widespread use, there is no evidence that preoperative urodynamic testing improves treatment outcomes in women with uncomplicated stress urinary incontinence. Agur et al [5] have suggested that some urodynamic parameters correlate with adverse outcomes such as decreased efficacy, irritative bladder symptoms, or postoperative voiding difficulty. However, results from a secondary analysis of the Stress Incontinence Surgical Treatment Efficacy Trial (SISTEr) reported that in 655 women who received urodynamics before surgery, typical urodynamic parameters often given as reasons for performing UDS [e.g. Valsalva Leak Point Pressures (VLPP), presence or absence of detrusor overactivity (DO), urodynamic stress incontinence (USI)] did not predict stress incontinence success or failure, although there was a trend toward a lower success rate in women without USI. [6] In the same group of patients, preoperative UDS measures were unable to predict the likelihood of voiding dysfunction after surgery.[7]
Several professional organizations including the International Urogynecological Association (IUGA) Guidelines for Research and Practice, and the Royal College of Obstetrics and Gynecology (RCOG) recommend performance of UDS prior to surgical management of SUI. Not all professional societies and organizations are in agreement regarding the role of UDS in the preoperative assessment of SUI. In 2006, the National Institute for Health and Clinical Excellence (NICE) in the U.K. states that “the use of multichannel cystometry is not routinely recommended before surgery in women with a clearly defined clinical diagnosis of pure SUI”. [8] A large U.K. tertiary referral center has challenged this report noting that only 5% of their patients with urinary incontinence had pure SUI, and of these, a quarter will have other urodynamic diagnoses. These authors suggested that the NICE recommendations are “unwise” and argue that “a randomized trial of preoperative urodynamics is required to inform clinical practice…” [5]. This sentiment for a RCT is supported by an evidence based Cochrane review which concluded that “A larger definitive trial is needed, in which people are randomly allocated to management according to urodynamic findings or to standard management based on history and clinical examination.”.[9] In addition to a lack of evidence that preoperative UDS improves outcomes in uncomplicated patients with SUI, urodynamic testing has other negatives. It is expensive (approximately $1000 USD), time-consuming (approximately 1 hour), invasive (requiring a urinary catheter), and may be uncomfortable and/or embarrassing to patients.
Although studies are needed to determine the benefits of urodynamic information for all urinary incontinence disorders, there is less controversy about performing UDS preoperatively in women with more complex clinical presentations, including concomitant stress and urge incontinence (symptoms of leakage with a strong urge and leakage with stress provocation, both bothersome to the patient), or those who have failed a previous surgery, as findings may help guide primary approaches to treatment. There is also less controversy about the use of UDS prior to any conservative non-surgical therapy, as it is generally agreed that it is not necessary as the risk-benefit ratio of the proposed therapy has a good therapeutic index. A UDS performed for multiple clinical diagnoses does not answer the question of the value of UDS for a specific diagnosis. Therefore we have designed and implemented a narrower and more feasible study that addresses the role of preoperative UDS for uncomplicated SUI in selected patients desiring surgical intervention for predominant SUI following a basic office evaluation (BOE). We propose to study the clinical utility of UDS in women with stress predominant incontinence because of the high volume use and unresolved benefit of UDS for this condition.
METHODS
Study Design
UDS is currently considered standard of care. This trial is designed to compare a less invasive alternative, a basic office evaluation (BOE) without UDS (No UDS), with standard of care (i.e. a BOE and UDS, or the UDS arm) to determine whether the No UDS arm is non-inferior to the UDS arm. Because we anticipate that any difference seen between the UDS and No UDS groups may be subtle, we selected a randomized, non-inferiority design. By randomly assigning participants to receive UDS or not, confounding factors due to surgeon and patient preferences as well as differences by clinical center will be minimized.
At the conclusion of the study, if the No UDS arm is no more than marginally inferior to the UDS arm, it will be possible to deem the No UDS arm non-inferior to the UDS arm and conclude that UDS is not necessary rather than continuing with it as standard of care prior to surgery in this patient population. If this study was designed as a typical superiority trial and the success rates in the No UDS and UDS arms were very similar, the conclusion drawn would be an inability to detect a difference between the two arms. Such a conclusion would not be the same as finding that the No UDS arm is non-inferior to the UDS arm and, therefore, is less clinically relevant. The design of the trial is depicted in Figure 1.
Randomization
Participants are randomized either to UDS or No UDS using an automated randomization system, with a back-up system of sealed randomization envelopes in the case of technical problems. Randomization is stratified by surgeon with institutional balancing, using permuted blocks. Neither patients nor surgeons are blinded to group assignment.
The protocol was approved by the Data and Safety Monitoring Board (DSMB) of the Urinary Incontinence Treatment Network (UITN) and Institutional Review Boards of all participating centers. Written informed consent is obtained from all women prior to study enrollment. Recruitment is estimated to take approximately 2.5 years following initiation of enrollment which began in November 2008.
Study Aims
The primary aim of the study is to determine if women who are surgical candidates for treatment of uncomplicated predominant SUI after a basic office evaluation only (No UDS arm) have non-inferior treatment outcomes compared to women who receive both a basic office evaluation and preoperative UDS (UDS arm).
The trial has three main secondary aims. The first is to determine, among women randomized to UDS, how often physicians use UDS to alter clinical decision-making in selecting and/or performing SUI surgery. The second aim is to compare improvement in incontinence outcomes between groups. The third aim is to determine the incremental cost and utility of performing UDS as compared to not performing UDS.
Study Population
The study population consists of women with predominant stress urinary incontinence (SUI) and no history of incontinence surgery. The women are appropriate surgical candidates and are seeking such correction. Inclusion and exclusion criteria are displayed in Table 1.
Table 1.
Inclusion Criteria
|
Exclusion Criteria
|
Physician Diagnosis and Management
Prior to randomization, the surgeon will specify the diagnosis and planned treatment based on results of the basic office evaluation and clinical judgement. If randomized to UDS, the surgeon will report any changes in diagnosis, treatment plan (for example, the plan could change to a non-surgical therapy), or modifications to the existing treatment plan as a result of the additional information provided by UDS.
Measures
Primary and secondary outcomes will be assessed with validated instruments throughout the study. The schedule of measurements is included in Table 2.
Table 2.
Screen | Baseline | Operative | 3 months | 12 months | |
---|---|---|---|---|---|
UDI | ✓ | ✓ | ✓ | ||
PGI-I | ✓ | ✓ | |||
PGI-S | ✓ | ✓ | ✓ | ||
Stress Test | ✓ | ✓ | |||
UDS | ✓ | ||||
Additional Treatment | ✓ | ✓ | |||
SF-12 | ✓ | ✓ | ✓ | ||
IIQ-7 | ✓ | ✓ | ✓ | ||
ISI | ✓ | ✓ | ✓ | ||
MESA | ✓ | ✓ | ✓ | ||
Voiding Function | ✓ | ✓ | |||
Patient Satisfaction | ✓ | ✓ | |||
Patient Preparedness | ✓ | ||||
Adverse Events | ✓ | ✓ | ✓ | ✓ | |
Urine Dipstick | ✓ | ||||
Costs & Utilities | ✓ | ✓ | |||
Sociodemographic | ✓ | ||||
Comorbidities | ✓ | ✓ | ✓ | ||
H&P | ✓ | ✓ | ✓ | ||
PVR | ✓ | ✓ | |||
Medication Audit | ✓ | ✓ | ✓ | ||
Prolapse Assessment | ✓ | ||||
Urethral Mobility | ✓ | ||||
Operative Measures | ✓ | ||||
MD Dx & Tx Info. | ✓ | ✓ | ✓ | ✓ |
UDI-Urogenital Distress Inventory[10]; PGI-I- Patient Global Impression of Improvement[11–13]; PGI-S-Patient Global Impression of Severity [11–13]; UDS-Urodynamics; SF-12-Short Form-12; IIQ-7-Incontinence Impact Questionnaire-7; ISI-Incontinence Severity Index[15]; MESA-Medical and social Aspects of Aging questionnaire[14]; H&P- History and Physical; PVR- Post void residual volume; MD Dx- Physician diagnosis; Tx Info- Treatment information.
Primary Outcome Measure
The primary outcome is measured at 12 months and includes a traditional, objective measure and a patient-oriented subjective report. The Urogenital Distress Inventory (UDI) is a 20-item patient-reported measure, that assesses symptoms of stress incontinence, urge incontinence, urgency, frequency, and voiding dysfunction [10]. This instrument has established validity, reliability, and responsiveness to change. In addition, we selected the Patient Global Index – Improvement (PGI-I) which is a patient-reported measure of perceived improvement after treatment of stress and urge incontinence that has been shown to have good construct validity in incontinence trials.[11],[12] The PGI-I correlates significantly with incontinence episode frequency (r=0.36), pad test weights (r=0.20) and quality of life (IQOL: r=−0.50) in women with predominant SUI.[13] Significant reductions in values are typically observed after SUI surgery. The PGI-I asks subjects to check the one number that best describes how their urinary tract condition (bladder) is now, compared to how it was before they received treatment for urinary leakage. Response choices are: 1) “very much better”, 2) “much better”, 3) “a little better”, 4) “no change”, 5) “a little worse”, 6) “much worse”, and 7) “very much worse”. Specifically, treatment success is defined as a 70% reduction in the UDI from baseline to 12 months and a score of “1” or “2” (i.e., “very much better” or “much better”) on the PGI-I at 12 months. Thus, treatment failure is defined by the occurrence of one of the following: a less than 70% reduction in the UDI from baseline to 12 months or a score of 3 or greater on the PGI-I at 12 months.
Secondary Outcome Measures
Self-reported stress-type urinary incontinence symptoms will be measured using selected items from the Medical, Epidemiologic and Social Aspects of Aging Project (MESA).[14] Quality of life measures include general health-related quality of life measured by the Short Form-12 Health Survey and condition-specific quality of life measured by the Incontinence Impact Questionnaire (IIQ-7)[10]. Incontinence severity is measured with the Hunskaar/Sandvik Incontinence Severity Index (ISI) [15] and the Patient Global Impression Index of Severity[13]. Patient satisfaction with treatment outcome is measured with a 5-point Likert scale (very dissatisfied to very satisfied) that asks patients to rate satisfaction with how treatment has affected 1) urine leakage; 2) urine leakage related to feeling of urgency; 3) urine leakage related to physical activity, coughing or sneezing; and 4) frequency of urination. Finally, as it is plausible that patients undergoing UDS will feel more prepared for surgery and the post-operative period than those who do not receive UDS, preparedness is measured with an 11-item questionnaire developed for this purpose.
To measure how often physicians alter their clinical decision making based on the results of UDS, specifically designed data collection forms have been developed to record the surgeons clinical diagnosis, treatment plan, confidence in treatment plan, and patient counseling. These forms are administered after the basic office evaluation and again after UDS for the subjects who were randomized to UDS. Urodynamic results and interpretation, using ICS definitions, are recorded on a separate data form and a final form characterizes any treatment changes or modifications because of UDS findings.
Voiding function is evaluated preoperatively by a post-void residual (PVR). Any use of a catheter and/or medical therapy to facilitate bladder emptying at or beyond 6 weeks post-surgery, or any surgical therapy to facilitate bladder emptying at anytime post-surgery is considered voiding dysfunction. For an objective measure of stress incontinence outcome, a provocative stress test standardized to volume (i.e. 300ml) is performed at the 12 month visit for direct observation of urine leakage.[16] To minimize potential bias, a qualified examiner blinded to the randomization group and not the study surgeon will perform this stress test. Additional treatment and evaluation information will be reported by the patient on specific survey instruments at 3 and 12 months..
Data on medical and non-medical costs are being collected. Direct costs of medical care both within the study (i.e. costs associated with surgery) and medical care utilization outside of the study will be assessed. For study-related care, data are collected regarding type of surgery, number of inpatient days, complications/morbidity (any adverse deviation from the normal intraoperative or postoperative course), medications, and additional therapies. Intervention costs will be calculated as the product of resource utilization and unit costs. Medication cost will be estimated using the minimum average wholesale price of commonly prescribed medications. Marginal use of resources (provider visits) will be estimated between groups. Direct costs will be calculated using a proxy for societal cost, Medicare resource-based relative value scale charges for physician services.
For assessing direct non-medical costs, participants report numbers of pads, protection, laundry, dry cleaning, personal hygiene products, household protection, and household cleaning products consumed specifically for their incontinence. Data are also collected regarding indirect resources related to incontinence (time spent on incontinence-related healthcare, limits on employment or volunteer work due to incontinence). The Incontinence Resource Use Questionnaire was derived from other published expense surveys.[17–19] The average national cost of each product will be determined by a survey of several retail and wholesale stores. Finally, health-related quality of life (utilities) for health outcomes is measured with the Health Utilities Index Mark 3 (HUI3).[20]
Adverse events are reported in accordance with the Department of Health and Human Services (DHHS) code of federal regulations (Title 45, Part 46). The UITN DSMB meets a minimum of two times per year to monitor all of the Network’s trials.
Study Arms
All patients receive a basic office evaluation which is a routine part of the surgeons’ preoperative clinical assessment. Core components are standardized per protocol and include the following: self-reported stress-type UI symptoms, a provocative stress test at any volume, post void residual (PVR) by any method, dipstick urinalysis, a standing, straining prolapse exam and assessment of urethral mobility (i.e. by either a Q-tip test, visual inspection, palpation, point Aa on POP-Q exam, or a lateral cystogram).
NO UDS
The patients who after the basic office evaluation randomize to the NO UDS arm will proceed with their planned surgical intervention.
UDS
Patients in the UDS arm receive core urodynamic investigations including non-instrumented uroflowmetry (NIF) with a comfortably full bladder, PVR obtained with catheter (after NIF), filling CMG with VLPP determination attempts, and a pressure flow study (PFS). Testing is performed according to the local high quality standards of urodynamic practices at each study center. Specific testing and procedural details of the core urodynamic procedures conform to ICS Good Urodynamic Practice guidelines[21], and testing results and interpretation of results conform to ICS nomenclature.[22]. Optional UDS investigations (e.g. Urethral Pressure Profiles, Videourodynamics) may be performed if those investigations are customary practice for the patient population at a study center. The UITN has significant experience demonstrating the development of standardized, multi-center, high quality urodynamic studies in previous trials. [23, 24]
Sample Size Determination for Primary Endpoint
As a non-inferiority trial, we determined our sample size by defining P1 as the proportion of women in the UDS arm who have a treatment success (defined by the primary outcome measure) and P2 as the proportion of women in the No UDS group who succeed. The equivalence margin or delta (δ) is the largest clinically relevant difference that would be allowed for the two arms to differ by and still say that the No UDS arm is non-inferior to the UDS arm. Alpha (α) is the probability of declaring that the No UDS group is not inferior to the UDS group given that the No UDS group is indeed inferior by at least an amount δ. Power is the probability of declaring that the No UDS group is not inferior to the UDS group given that the No UDS group is not inferior. The sample size calculation is performed under the assumption that the treatment success rates for the two groups are identical. It is also assumed that a test of the difference in proportions (P1 – P2) using the normal approximation to the binomial distribution will be used to determine non-inferiority.
For sample size calculation, we assume an alpha (α) value of 0.05 and that the true proportion of successes in each group is 70% with an equivalence margin or δ of 11%. This equivalence margin was chosen based on the belief that this was the largest amount that we would allow the success rate to differ and still deem the no UDS group non-inferior, while still producing a realistic sample size. Using these assumptions, 270 women are needed in each study arm to have 80% power for determining whether the No UDS arm is non-inferior to the UDS arm (Hintze, J. (2004). NCSS and PASS. Number Cruncher Statistical Systems. Kaysville, UT). To account for a dropout rate of 10%, a sample size of 300 women per arm (600 women total) will be required.
Analytic Approach for the Primary Endpoint
At the end of the study, we will construct a two-sided 95% confidence interval for the difference in the proportion of successes in the UDS group and the No UDS group (P1 – P2) to determine whether the No UDS arm is non-inferior to the UDS arm. Decisions regarding non-inferiority will be made according to Figure 2 based on the confidence interval. Should the upper limit of the two-sided confidence interval for the difference be less than the delta limit, only then can non-inferiority for the No UDS group compared with the UDS group be declared. According to Figure 2, this occurs only when the upper end of the confidence limit does not cross the dotted line (the δ value of 11%), as in scenarios 3, 4 and 6. In the two inconclusive scenarios in Figure 2 (scenarios 2 and 5), the confidence limit crosses the δ value, so it is not certain that the No UDS arm is truly non-inferior to the UDS arm. In scenario 1, both the lower and upper limit of the confidence interval exceed δ; in this case, the No UDS group can be declared inferior to the UDS group.
The study plans to also test for the possibility that the No UDS group is not just non-inferior, but superior to the UDS group, while maintaining the planned overall α level of 0.05 [25]. As described above, the test for non-inferiority will be done and non-inferiority will be determined according to the two-sided level 1-α confidence interval. If it cannot be concluded that the No UDS arm is non-inferior to the UDS arm, no further testing will be done. However, if non-inferiority can be concluded as in scenarios 3, 4 and 6, then the same confidence interval will be used to assess whether the No UDS arm is better than the UDS arm. Specifically, if the interval does not overlap zero, then the No UDS arm is deemed superior to the UDS arm (scenario 3). If the interval does overlap zero, then the No UDS arm is still declared non-inferior to the UDS arm.
To minimize bias towards non-inferiority, only women treated “per protocol” will be considered in the analysis of the primary endpoint.[26] “Per protocol” is defined as considering only women in the UDS arm who have UDS performed and only women in the No UDS arm who only have an office assessment (and no UDS); e.g. include only those who receive the assessment that they are randomized to in the analysis. Intention-to-treat (ITT) analysis, which includes all patients by randomization group regardless of the treatment received, will also be performed, but will be considered secondary.
Analytic Approach for the Secondary Endpoints
Descriptive statistics will be calculated to measure how often physicians alter their clinical decision-making based on the results of UDS before a planned intervention. Specifically, the rates of altered diagnoses and number of modifications to the initial treatment will be compared using chi-square tests or t tests, as appropriate. To compare the amount of improvement in incontinence outcomes in women randomized to the two diagnostic methods, the women in the two groups will be compared regarding the two components of the primary outcome: the percent improvement in UDI and PGI-I score. The mean percent improvement in UDI in the two groups and the 95% confidence interval of the difference in the mean improvement between the groups will also be computed. For PGI-I score, ordinal logistic regression analysis will be used to estimate the relative odds of a lower score (more improvement) in the PGI-I of the No UDS vs. UDS diagnostic groups and its 95% confidence interval.
An underlying, but untested, premise of this trial is that availability of UDS results can influence how a physician performs the incontinence surgery and therefore the outcome of surgery. To address this important clinical question, a sub-group analysis will be used to compare the outcomes of women who received surgical treatment in the two randomization groups. For this analysis, a two-sided 95% confidence interval for the difference in the proportion of treatment successes between the two arms will be conducted only among women who received surgery. The definition of treatment success for this comparison is the primary outcome measure plus a negative standard volume stress test.
Additional analyses will be performed comparing the two arms in terms of quality of life, urinary incontinence severity, voiding dysfunction, and other measures as necessary. Multivariable linear regression will be used for continuous measures and logistic regression for dichotomous measures controlling for relevant baseline characteristics. P values less than 0.05 will be considered to be statistically significant and no adjustments will be made for multiple comparisons. However, when secondary measures are reported, readers will be informed of the susceptibility to chance findings.
Cost Analysis
Intervention costs will be calculated as the product of resource utilization and unit costs. Standard costs (DRG, CPT, and AWP) will be applied to each utilization. Medication cost will be estimated using the minimum average wholesale price of commonly prescribed medications.[27] Marginal use of resources (provider visits) will be estimated between groups. Direct costs will be calculated using a proxy for societal cost, Medicare resource-based relative value scale charges for physician services.[28] Cost-utility and cost-effectiveness analyses will be performed to compare the two groups. For cost-utility analyses, net cost and net QALY will be estimated in each arm of the model, and the marginal cost per quality adjusted life year (QALY) will be calculated and compared across groups. For cost-effectiveness analyses, we will determine the marginal cost between the groups per treatment outcome.
DISCUSSION
Recent systematic reviews by the National Institute for Health and Clinical Excellence[8] and the Cochrane group[9] have called into question the utility of preoperative UDS for the uncomplicated stress incontinent patient. However, experts do emphasize that this tentative conclusion is derived from flawed and limited evidence in the literature. Consensus expert opinion has called for the development and implementation of a randomized trial to evaluate the utility of preoperative UDS for this surgical population.[5, 9] This trial attempts to evaluate the clinical utility of UDS and we found that designing a trial of a diagnostic test poses several challenges.
It is generally accepted in medicine that diagnostic studies are performed for specific indications to answer specific questions. The ultimate value of a diagnostic test is its ability to change clinical decision-making, and by doing so, improve outcomes. An underlying assumption of this current trial is that UDS is a diagnostic tool that may alter clinical diagnosis, change clinical decision-making, and change outcomes in women planning surgery for SUI. If UDS changes the diagnosis, a different treatment is possibly in the best interests of the patient. If UDS reveals that the patient may be higher risk for treatment failure or treatment complication, patient counseling may be different and the treatment may be altered in a way to reduce the risk of a complication. In the design of a study evaluating a diagnostic test the treating physician has to be able to use that test to change decisions and change treatment and these treatments may have different efficacies. Ultimately however, since we are stratifying the randomization by each surgeon, the efficacy will be a product of the decision-making and treatment produced by adding the UDS.
It is possible that UDS improves outcomes by altering diagnosis and treatments so that the efficacy of the SUI treatment is reduced, but the subject has fewer untoward results like urgency, frequency, urge incontinence, or voiding dysfunction. For this reason, our primary outcome measure needs to be an instrument that broadly measures bladder storage and emptying function and patient assessment of her urinary condition. For all these reasons, we chose the Urogenital Distress Inventory and the Patient Global Impression Index as measures that can capture these outcomes.
In many ways the design of this study investigating a diagnostic test is similar to study designs of pragmatic clinical trials. Practitioners are given freedom to treat the patients normally, the outcome has to be relevant to everyday life, and we are evaluating the overall “package” of care. [29] We hope that this trial, like a pragmatic clinical trial, will deliver evidence of effectiveness or ineffectiveness in an everyday clinical context. Unlike a standard superiority trial, to minimize bias toward non-inferiority, our primary outcome will be “per protocol” rather than an intention-to-treat analysis, although we will perform an intention-to-treat analysis as a secondary analysis. Also unlike a pragmatic clinical trial, this study could not have broad inclusion criteria. We restricted this study to uncomplicated stress predominant incontinence patients who did not have previous incontinent surgery or prolapse. The reason for this decision is that if complicated patients were included, a result showing inferiority of the No UDS arm does not answer the question if uncomplicated patients would also have inferior outcomes without UDS. In contrast, if this study shows that the No UDS arm has inferior outcomes to the UDS arm then nearly all clinicians would extrapolate those same results to an even more complicated patient population and a strong argument could be made for universal preoperative UDS.
We chose a non-inferiority design for several reasons. We consider UDS before surgery to be the generally accepted standard procedure throughout much of the world. The question to be addressed in this study is whether women diagnosed with uncomplicated stress urinary incontinence following a basic office evaluation and managed without preoperative UDS have non-inferior outcomes compared to those who receive the more invasive and expensive preoperative UDS. When comparing two approaches to diagnostic testing and a less invasive, less expensive diagnostic regimen is being compared to a more invasive, more expensive standard then a non-inferiority design is more appropriate than the typical superiority trial design that is used when studying two different randomized treatments. At the conclusion of the study, it will be determined whether it can be ruled out “beyond a reasonable doubt” that baseline office assessment alone (No UDS arm) is inferior to the baseline office visit plus UDS (UDS arm) by a clinically significant amount. Four possible conclusions will be drawn: 1) No UDS is better than UDS (superiority), 2) No UDS is non-inferior to UDS (non-inferiority), 3) No UDS is inferior to UDS (inferiority), or 4) unable to show that No UDS is either better than or non-inferior to UDS (inconclusive).
For a non-inferiority trial, the primary outcome of the trial is not expected to differ. In this case, with the effectiveness outcome of the cost-effectiveness analysis expected to be the same, the cost-effectiveness analysis can reduced to a cost-minimization analysis where the study of interest is the total cost difference between women who receive pre surgical UDS and those who do not. The less costly assessment will be a preferred choice from the cost standpoint. The study focus will be the incremental cost. That is to say, we are interested in the cost components that are different between comparison arms. Those cost components that are likely to be the same between comparison arms will have zero cost increments; therefore, their information will not be critical to collect if the study budget is the concern.
In superiority trials, early stopping rules are used to monitor for safety and traditionally would stop a trial if a treatment difference is so large that it is unethical to continue treating the other group of patients with the “inferior” treatment. Interim monitoring of the outcomes for making a decision to end the trial early is not appropriate in this trial for several reasons. In non-inferiority trials, such as this one, the argument of stopping the trial early when there is definitive evidence of non-inferiority does not apply. Say, for example, that early data suggests that the No UDS arm is non-inferior to the UDS arm. In this case, it is ethical to continue current practice (use of UDS), and one would want extremely strong evidence to abandon a study in favor of changing standard practice. This trial is comparing two commonly used diagnostic assessment methods, not two treatments. Treatment of patients is a matter of each clinician’s clinical judgment. Thus, there would be no need to stop assessing patients by one method or the other due to a difference in treatment outcomes. Also, the more invasive of the two arms is the UDS procedure; however, this is current medical practice and thus any adverse events have been well documented. The “less invasive” arm is simply an office visit and does not pose any significant threat to the patients.
As described above, there will not be a formal statistical stopping rule based on the outcome measure and thus the outcomes will not be analyzed until the end of the trial. However, all subjects in the study will be overseen by three levels of protection: the UITN Data Safety and Monitoring Board, the clinical site Institutional Review Board and the local principal investigator. The DSMB will receive twice yearly information on recruitment and retention, data quality, baseline characteristics and adverse events by randomization arm. Using this information, should the DSMB find the trial unsafe due to excessive adverse events, excessively slow recruitment or other unforeseen safety reasons, then they could recommend stopping the trial.
The major strength of this study is that it is designed to address a common condition, and a common preoperative practice for which there is little evidence. A multi-center, multi-surgeon study with the randomization scheme stratified by surgeon minimizes individual surgeon treatment effects and should provide generalizable results for surgeons and this patient population. The potential weakness in this trial is that if the results show that the NoUDS arm is not inferior, these results are only applicable to this population and more complicated patients may have inferior results. Additional studies on more complicated patients would be needed to answer the same question for this group and certainly there are many more complicated SUI patients that are excluded from this trial. The largest groups of women with SUI who are excluded from this are those with previous surgery and those with anterior or apical prolapse. In a large community population, approximately 30% of women with SUI had co-occurrence of pelvic organ prolapse. [30]
CONCLUSIONS
When evaluating women with stress urinary incontinence, preoperative urodynamic studies can be time-consuming, invasive, and expensive. It is not evident that these investigations commonly alter diagnosis, treatment plans, or treatment outcomes. Randomized trials evaluating diagnostic tests are just as needed as randomized trials evaluating therapeutic interventions. Non-inferiority trial designs are most appropriate for these studies especially when differences between groups are anticipated to be small and subtle. A cost –minimization analysis may substitute for a cost-effectiveness analysis. The information obtained from this trial will be helpful in determining the best preoperative management of women with stress urinary incontinence.
Acknowledgments
Supported by cooperative agreements from the National Institute of Diabetes and Digestive and Kidney Diseases, U01 DK58225, U01 DK58229, U01 DK58234, U01 DK58231, U01 DK60379, U01 DK60380, U01 DK60393, U01 DK60395, U01 DK60397, and U01 DK60401.
Appendix A- The Urinary Incontinence Treatment Network (UITN)
STEERING COMMITTEE
Elizabeth A. Gormley, Chair (Dartmouth Hitchcock Medical Center, Lebanon, NH); Larry Sirls, MD, Salil Khandwala, MD (William Beaumont Hospital, Royal Oak, MI and Oakwood Hospital, Dearborn, MI; U01 DK58231); Linda Brubaker, MD, Kimberly Kenton, MD (Loyola University Medical Center, Maywood, IL; U01 DK60379); Holly E. Richter, PhD, MD, L. Keith Lloyd, MD (University of Alabama, Birmingham, AL; U01 DK60380); Michael Albo, MD, Charles Nager, MD (University of California, San Diego, CA; U01 DK60401); Toby C. Chai, MD, Harry W. Johnson, MD (University of Maryland, Baltimore, MD; U01 DK60397); Halina M. Zyczynski, MD, Wendy Leng, MD (University of Pittsburgh, Pittsburgh, PA; U01 DK 58225); Philippe Zimmern, MD, Gary Lemack, MD (University of Texas Southwestern, Dallas, TX; U01 DK60395); Stephen Kraus, MD, Thomas Rozanski, MD (University of Texas Health Sciences Center, San Antonio, TX; U01 DK58234); Peggy Norton, MD, Ingrid Nygaard, MD (University of Utah, Salt Lake City, UT; U01 DK60393); Sharon Tennstedt, PhD, Anne Stoddard, ScD (New England Research Institutes, Watertown, MA; U01 DK58229); Debuene Chang, MD, Marva Moxey-Mims, MD, Rebekah Rasooly, MD (National Institute of Diabetes & Digestive & Kidney Diseases).
CO-INVESTIGATORS
Amy Arisco, MD; Jan Baker, APRN; Diane Borello-France, PT, PhD; Kathryn L. Burgio, PhD; Ananias Diokno, MD; Melissa Fischer MD; MaryPat Fitzgerald, MD; Chiara Ghetti, MD; Patricia S. Goode, MD; Robert L. Holley, MD; Margie Kahn, MD; Jerry Lowder, MD; Karl Luber, MD; Emily Luckacz, MD; Alayne Markland, DO, MSc; Shawn Menefee, MD; Pamela Moalli, MD; Elizabeth Mueller, MD; Pradeep Nagaraju MD; Kenneth Peters, MD; Elizabeth Sagan, MD; Joseph Schaffer, MD; Amanda Simsiman, MD; Robert Starr, MD; Gary Sutkin, MD; R. Edward Varner, MD.
STUDY COORDINATORS
Laura Burr, RN; JoAnn Columbo, BS, CCRC; Tamara Dickinson, RN, CURN, CCCN, BCIA-PMDB; Rosanna Dinh, RN, CCRC; Judy Gruss, RN; Alice Howell, RN, BSN, CCRC; Chaandini Jayachandran, MSc; Kathy Jesse, RN; D. Lynn Kalinoski, PhD; Barbara Leemon, RN; Kristen Mangus; Karen Mislanovich, RN; Elva Kelly Moore, RN; Caren Prather, RN; Sylvia Sluder, CCRP; Mary Tulke, RN; Robin Willingham, RN, BSN; Kimberly Woodson, RN, MPH; Gisselle Zazueta-Damian.
BIOSTATISTICAL COORDINATING CENTER:
Kimberly J. Dandreo, MSc; Liyuan Huang, MS; Rose Kowalski, MA; Heather Litman, PhD; Marina Mihova, MHA; Anne Stoddard, ScD (Co-PI); Kerry Tanwar, BA; Sharon Tennstedt, PhD (PI); Yan Xu, MS.
DATA SAFETY AND MONITORING BOARD
J. Quentin Clemens MD, (Chair) Northwestern University Medical School, Chicago IL; Paul Abrams MD, Bristol Urological Institute, Bristol UK; Diedre Bland MD, Blue Ridge Medical Associates, Winston Salem NC; Timothy B. Boone, MD, The Methodist Hospital, Baylor College of Medicine, Houston, TX; John Connett PhD, University of Minnesota, Minneapolis MN; Dee Fenner MD, University of Michigan, Ann Arbor MI; William Henderson PhD, University of Colorado, Aurora CO; Sheryl Kelsey PhD, University of Pittsburgh, Pittsburgh PA; Deborah J. Lightner, MD, Mayo Clinic, Rochester, MN; Deborah Myers MD, Brown University School of Medicine, Providence RI; Bassem Wadie MBBCh, MSc, MD, Mansoura Urology and Nephrology Center, Mansoura, Egypt; J. Christian Winters, MD, Louisiana State University Health Sciences Center, New Orleans, LA
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
For a list of UITN investigators see appendix A
REFERENCES
- 1.Diokno AC, Brock BM, Brown MB, Herzog AR. Prevalence of urinary incontinence and other urological symptoms in the noninstitutionalized elderly. J Urol. 1986 Nov;136(5):1022–1025. [PubMed] [Google Scholar]
- 2.Hampel C, Wienhold D, Benken N, Eggersmann C, Thuroff JW. Definition of overactive bladder and epidemiology of urinary incontinence. Urology. 1997 Dec;50(6A Suppl):4–14. doi: 10.1016/s0090-4295(97)00578-5. discussion 5–7. [DOI] [PubMed] [Google Scholar]
- 3.Simeonova Z, Milsom I, Kullendorff AM, Molander U, Bengtsson C. The prevalence of urinary incontinence and its influence on the quality of life in women from an urban Swedish population. Acta Obstet Gynecol Scand. 1999 Jul;78(6):546–551. [PubMed] [Google Scholar]
- 4.Waetjen LE, Subak LL, Shen H, et al. Stress urinary incontinence surgery in the United States. Obstet Gynecol. 2003 Apr;101(4):671–676. doi: 10.1016/s0029-7844(02)03124-1. [DOI] [PubMed] [Google Scholar]
- 5.Agur W, Housami F, Drake M, Abrams P. Could the National Institute for Health and Clinical Excellence guidelines on urodynamics in urinary incontinence put some women at risk of a bad outcome from stress incontinence surgery? BJU Int. 2008 Nov 18; doi: 10.1111/j.1464-410X.2008.08121.x. [DOI] [PubMed] [Google Scholar]
- 6.Nager CW, FitzGerald M, Kraus SR, et al. Urodynamic measures do not predict stress continence outcomes after surgery for stress urinary incontinence in selected women. J Urol. 2008 Apr;179(4):1470–1474. doi: 10.1016/j.juro.2007.11.077. [DOI] [PubMed] [Google Scholar]
- 7.Lemack GE, Krauss S, Litman H, et al. Normal preoperative urodynamic testing does not predict voiding dysfunction after Burch colposuspension versus pubovaginal sling. J Urol. 2008 Nov;180(5):2076–2080. doi: 10.1016/j.juro.2008.07.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.NICE. Urinary Incontinence, the management of urinary incontinence in women. National Institute for Health and Clinical Excellence (NICE) 2006:1–36. http://www.nice.org.uk/nicemedia/pdf/CG40NICEguideline.pdf(NICE clincal guideline 40) [Google Scholar]
- 9.Glazener CM, Lapitan MC. Urodynamic investigations for management of urinary incontinence in adults. Cochrane Database Syst Rev. 2002;(3):CD003195. doi: 10.1002/14651858.CD003195. [DOI] [PubMed] [Google Scholar]
- 10.Shumaker SA, Wyman JF, Uebersax JS, McClish D, Fantl JA. Health-related quality of life measures for women with urinary incontinence: the Incontinence Impact Questionnaire and the Urogenital Distress Inventory. Continence Program in Women (CPW) Research Group. Qual Life Res. 1994 Oct;3(5):291–306. doi: 10.1007/BF00451721. [DOI] [PubMed] [Google Scholar]
- 11.Wagner TH, Patrick DL, Bavendam TG, Martin ML, Buesching DP. Quality of life of persons with urinary incontinence: development of a new measure. Urology. 1996 Jan;47(1):67–71. doi: 10.1016/s0090-4295(99)80384-7. discussion -2. [DOI] [PubMed] [Google Scholar]
- 12.Patrick DL, Martin ML, Bushnell DM, et al. Quality of life of women with urinary incontinence: further development of the incontinence quality of life instrument (I-QOL) Urology. 1999 Jan;53(1):71–76. doi: 10.1016/s0090-4295(98)00454-3. [DOI] [PubMed] [Google Scholar]
- 13.Yalcin I, Bump RC. Validation of two global impression questionnaires for incontinence. Am J Obstet Gynecol. 2003 Jul;189(1):98–101. doi: 10.1067/mob.2003.379. [DOI] [PubMed] [Google Scholar]
- 14.Herzog AR, Diokno AC, Fultz NH. Urinary incontinence: medical and psychosocial aspects. Annu Rev Gerontol Geriatr. 1989;9:74–119. doi: 10.1007/978-3-662-40455-3_3. [DOI] [PubMed] [Google Scholar]
- 15.Hunskaar S, Vinsnes A. The quality of life in women with urinary incontinence as measured by the sickness impact profile. J Am Geriatr Soc. 1991 Apr;39(4):378–382. doi: 10.1111/j.1532-5415.1991.tb02903.x. [DOI] [PubMed] [Google Scholar]
- 16.Shull BL, Halaska M, Hurt G. Physical Examination. In: Abrams P, Saad K, Wein A, et al., editors. Incontinence:Proceedings of the 1st Interntional Consultation on Incontinence. St. Helier, England: Monaco: Health Publications Ltd.; 1998. 1998. [Google Scholar]
- 17.Subak L, Van Den Eeden S, Thom D, Creasman JM, Brown JS. Urinary incontinence in women: Direct costs of routine care. Am J Obstet Gynecol. 2007 Dec;197(6):596, e1–e9. doi: 10.1016/j.ajog.2007.04.029. [DOI] [PubMed] [Google Scholar]
- 18.Dowell CJ, Bryant CM, Moore KH, Simons AM. Calculating the direct costs of urinary incontinence: a new test instrument. BJU Int. 1999 Apr;83(6):596–606. doi: 10.1046/j.1464-410x.1999.00993.x. [DOI] [PubMed] [Google Scholar]
- 19.Subak LL, Brown JS, Kraus SR, et al. The "costs" of urinary incontinence for women. Obstet Gynecol. 2006 Apr;107(4):908–916. doi: 10.1097/01.AOG.0000206213.48334.09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Feeny D, Furlong W, Boyle M, Torrance GW. Multi-attribute health status classification systems. Health Utilities Index. Pharmacoeconomics. 1995 Jun;7(6):490–502. doi: 10.2165/00019053-199507060-00004. [DOI] [PubMed] [Google Scholar]
- 21.Schafer W, Abrams P, Liao L, et al. Good urodynamic practices: uroflowmetry, filling cystometry, and pressure-flow studies. Neurourol Urodyn. 2002;21(3):261–274. doi: 10.1002/nau.10066. [DOI] [PubMed] [Google Scholar]
- 22.Abrams P, Cardozo L, Fall M, et al. The standardisation of terminology in lower urinary tract function: report from the standardisation sub-committee of the International Continence Society. Urology. 2003 Jan;61(1):37–49. doi: 10.1016/s0090-4295(02)02243-4. [DOI] [PubMed] [Google Scholar]
- 23.Nager CW, Albo ME, Fitzgerald MP, et al. Process for development of multicenter urodynamic studies. Urology. 2007 Jan;69(1):63–67. doi: 10.1016/j.urology.2006.08.1118. discussion 7–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Nager CW, Albo ME, Fitzgerald MP, et al. Reference urodynamic values for stress incontinent women. Neurourol Urodyn. 2007;26(3):333–340. doi: 10.1002/nau.20348. [DOI] [PubMed] [Google Scholar]
- 25.Dunnett CW, Gent M. An alternative to the use of two-sided tests in clinical trials. Stat Med. 1996 Aug 30;15(16):1729–1738. doi: 10.1002/(SICI)1097-0258(19960830)15:16<1729::AID-SIM334>3.0.CO;2-M. [DOI] [PubMed] [Google Scholar]
- 26.Snapinn SM. Noninferiority trials. Curr Control Trials Cardiovasc Med. 2000;1(1):19–21. doi: 10.1186/cvm-1-1-019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Cardinale V. Drug Topic: Red Book. Montvale, NJ: Medical Economics Co, Inc; 1997. [Google Scholar]
- 28.Health Care Consultants of America. Physicians Fee & Coding Guide. Augusta, GA: 2000. [Google Scholar]
- 29.Macpherson H. Pragmatic clinical trials. Complement Ther Med. 2004 Jun–Sep;12(2–3):136–140. doi: 10.1016/j.ctim.2004.07.043. [DOI] [PubMed] [Google Scholar]
- 30.Lawrence JM, Lukacz ES, Nager CW, Hsu JW, Luber KM. Prevalence and co-occurrence of pelvic floor disorders in community-dwelling women. Obstet Gynecol. 2008 Mar;111(3):678–685. doi: 10.1097/AOG.0b013e3181660c1b. [DOI] [PubMed] [Google Scholar]