Abstract
Objective
Wide-scale adoption of electronic medical records (EMRs) has created an unprecedented opportunity for the implementation of Rapid Learning Systems (RLS) that leverage primary clinical data for real-time decision support. In cancer, where large variations among patient features leave gaps in traditional forms of medical evidence, the potential impact of a RLS is particularly promising. We developed the Melanoma Rapid Learning Utility (MRLU), a component of the RLS, providing an analytical engine and user interface that enables physicians to gain clinical insights by rapidly identifying and analyzing cohorts of patients similar to their own.
Materials and Methods
A new approach for clinical decision support in Melanoma was developed and implemented, in which patient-centered cohorts are generated from practice-based evidence and used to power on-the-fly stratified survival analyses. A database to underlie the system was generated from clinical, pharmaceutical, and molecular data from 237 patients with metastatic melanoma from two academic medical centers. The system was assessed in two ways: (1) ability to rediscover known knowledge and (2) potential clinical utility and usability through a user study of 13 practicing oncologists.
Results
The MRLU enables physician-driven cohort selection and stratified survival analysis. The system successfully identified several known clinical trends in melanoma, including frequency of BRAF mutations, survival rate of patients with BRAF mutant tumors in response to BRAF inhibitor therapy, and sex-based trends in prevalence and survival. Surveyed physician users expressed great interest in using such on-the-fly evidence systems in practice (mean response from relevant survey questions 4.54/5.0), and generally found the MRLU in particular to be both useful (mean score 4.2/5.0) and useable (4.42/5.0).
Discussion
The MRLU is an RLS analytical engine and user interface for Melanoma treatment planning that presents design principles useful in building RLSs. Further research is necessary to evaluate when and how to best use this functionality within the EMR clinical workflow for guiding clinical decision making.
Conclusion
The MRLU is an important component in building a RLS for data driven precision medicine in Melanoma treatment that could be generalized to other clinical disorders.
Keywords: Rapid Learning, Learning Health System, Precision Medicine, Melanoma, Interactive Data Analysis
Graphical Abstract

1. INTRODUCTION
The promise of leveraging vast medical record data to guide clinical decision making has created growing support for the development of “Rapid Learning Systems” (RLS) that gather and leverage practice-based clinical evidence for real-time clinical decision support [1]. The need for such systems is particularly evident within the field of oncology, where controlled clinical trial evidence is only available to guide therapy in a minority of patients [2–5]. We developed an analytical engine component of the RLS for Melanoma, called the Melanoma Rapid Learning Utility (MRLU). The analytic engine and graphical user interface (GUI) enables users to quickly build and analyze cohorts of patients through an interactive web application. The MRLU could be useful as a key component of future systems aimed at providing evidence-based practice in the era of electronic medical records (EHRs).
In 2012, the Institute of Medicine (IoM) released a landmark report calling for an immediate and fundamental shift in U.S. healthcare. Noting that traditional pathways of knowledge generation and transmission “can no longer keep pace,” the IoM called for “computing capabilities and analytic approaches to develop real-time insights from routine patient care” [6]. The IoM report highlights a rapidly growing interest in the community to develop and implement rapid learning healthcare systems (RLS) that are capable of gathering and leveraging clinical evidence to enable real-time, precision decision support in the clinic [1]. The RLS is an example of the repurposing of primary clinical data to improve healthcare, which falls under the broader term of the “learning health system.”[7,8]
Electronic health record (EHR) data has long been recognized as being a potential source of “practice-based evidence” that can supplement traditional forms of medical evidence in guiding clinical decision making [9]. Rapid learning systems represent a modern paradigm for precision clinical practice, in which knowledge mined from electronic medical records is seamlessly integrated into the clinical workflow of physicians [10–13]. A functional RLS therefore requires several components, including clinical databases supplied with EHR data, information pipelines that facilitate rapid transformation and filtration of clinical data to identify cohorts of interest, analytic platforms, and clinical decision support (CDS) utilities that provide physicians with relevant clinical insights at point of care (Figure 1). While CDS for precision medicine is a primary goal of rapid learning, the implementation of RLS will also provide infrastructure that will support and be enriched by nearly all areas of clinical informatics.
Figure 1. Key components of a rapid learning system.
(A) Patient data and outcomes extracted from electronic medical records and used to build up large clinical databases. (B) Data from clinical databases is consolidated and transformed. (C) Patient cohorts of interest are defined and extracted from featurized clinical databases. (D) Data analysis conducted to identify or confirm clinical trends. (E) Results may be shared, motivating new basic and public health research or facilitate the design of clinical trials. (F/G) Clinical knowledge bases developed from a combination of the scientific literature, clinical guidelines, and dynamically generated results from rapid learning data analysis. (H) Clinical decision support utilities employing leverage clinical knowledge bases for personalized medicine. (I) Patient outcomes may be impacted by CDS utilities. The MRLU focuses primarily on (C) and (D), though our work also involved components (A) and (B).
The need for rapid learning systems is particularly evident in oncology [2–5]. Tumor molecular profiling offers great opportunity to identify those patients most likely to benefit from targeted therapies, but it results in small sub-populations of patients that can be used for evidence generation. With many tumor molecular alterations occurring in as few as 1% of patients, it is challenging to learn from the clinical outcomes of the mere 5% of cancer patients who participate in clinical trials [14]. While randomized prospective clinical trials and clinical practice guidelines remain the predominant evidence base for clinical decision making in oncology, in order to realize the promise of precision cancer medicine, it will be important to be able to learn from the experiences of all cancer patients. Oncology is thus an ideal environment in which to deploy RLS, with the prospect of informing decision making about treatments based on observational evidence when clinical trial data is not available.
Though much has been done in describing the opportunities and requirements of RLS, few efforts have yet been undertaken to actually produce a RLS for cancer. The Athena and CancerLinQ projects have recently been undertaken to aggregate data and implement clinical practice guidelines and initial analyses [15,16]. However, these and other related efforts remain limited with respect to providing an analytical engine with physician-facing analytic tools for real-time decision support; in fact, to our knowledge, efforts to build analytical engines and physician-facing tools that enable real-time exploration of clinical data have been limited to date [17–19].
In this paper, we present the Melanoma Rapid Learning Utility (MRLU), which we believe is the first implementation of an analytical engine and physician-facing tool that could be used at the core of a RLS for Melanoma. We chose Melanoma as the focus of our RLS engine because Melanoma is one of the few cancers whose incidence rates and total deaths continue to increase [20], unlike most other cancers, and thus efforts to improve decision making for this disease could be particularly impactful. The large heterogeneity both within and between patients with respect to tumor features and tumor response is particularly challenging in Melanoma treatment planning [21,22]. This inherent complexity adds to the existing challenge of finding and evaluating clinical evidence to guide patient care, and makes Melanoma an excellent candidate for rapid learning.
The scope of work necessary to produce and deploy a functioning RLS in a hospital setting is large. In building the MRLU, we focused our efforts specifically on developing an interactive analytic interface that could provide easy comparisons of treatment efficacy for patients with user-specified characteristics. By making latent clinical knowledge physician-accessible, such tools may foster a virtuous “learning” cycle, in which data-driven decision-making is informed by clinical encounters from the past, the results of which will be captured in the same EMR system and therefore be used to guide care in the future. The resultant pool of continuously expanding and improving health data is central to the notion of a learning health system.
To deploy a fully operational RLS, however, the other components of the RLS (Figure 1) would be required, including the incorporation of practice guidelines and other clinical knowledge bases. Fully operational RLS infrastructures will therefore be capable of more robust rule-based and statistical learning than the MRLU. Our work represents an essential step towards such systems, and, when integrated with clinical information systems and additional components, could provide physicians with access to real-time, practice-based insights for precision medical practice.
2. MATERIALS AND METHODS
2.1 Data
We acquired clinical, demographic, tumor molecular profiling, and pharmacy data from clinical databases for 200 patients at the Vanderbilt-Ingram Cancer Center and 37 patients from Stanford Hospitals and Clinics with metastatic melanoma. All patient data were de-identified through the creation of new, random patient IDs and the offsetting of all dates. The data collected included:
Clinical data (~240 rows, 1 per patient): sex, age, days to death from first drug treatment, institution of origin.
Pharmaceutical data (~2000 rows, 1 per prescription per patient): drug class, drug name, and ordering date.
Tumor molecular profiling data (~240 rows, 1 per patient): BRAF status, NRAS status, and a Boolean indicating whether other mutations were reported.
Pharmaceutical data were normalized by generic, trade, and development names and mapped onto their drug class according to the MyCancerGenome index of anticancer agents [23]. Combination therapies were identified as consisting of two or more drugs of the same class that were administered on the same date. Unique combinations of treatments were identified and represented within the MRLU as distinct treatment regimens. For example, we separated those patients receiving Carboplatin and Paclitaxel together from those patients receiving only Paclitaxel. Eight patients were identified who received combination therapies in which there were drugs of two or more classes. Since no more than any 2 of these patients received the same treatment protocol, and multi-class protocols could introduce potential biases in outcomes analyses stratified by drug class, we excluded these 8 patients from the cohort. All other patients were included.
The timestamp of the first anti-cancer therapy was labeled as the baseline time point for each patient, and each subsequent treatment time point was considered a follow up. The baseline time point served as a reference point used in conjunction with other timestamps in the medical records to calculate the patient age at first treatment, time to second treatment, and days to death relative to the baseline time point. While calculating the time to second treatment and the days to death relative to the initiation of first treatment, we captured the date of right censoring for all patients (i.e., those who either left the database or whose treatments were ongoing at the time of our data analysis). These data were added to the database as well as genetic results compiled from the EHR.
2.2 System Implementation
An overview of the MRLU and its architecture are shown in Figure 2. The MRLU was implemented using a MySQL database and RStudio’s Shiny package [24]. Shiny enables the rapid prototyping of interactive JavaScript web pages back-ended by the R statistical computing language. The MySQL database contains all the patient data and serves as a data cache for accessing the analyzing the data by Shiny applications. The MRLU accesses the data in its MySQL database through the DBI and RMySQL packages and displays them using Hadley Wickham’s ggplot2 [25–27]. We deployed Shiny Server to host the MRLU on a GNU/Linux server [28].
Figure 2. System Diagram.
Shiny application links JavaScript frontend with R, MySQL backend.
We defined the requirements for the MRLU and its interface during several consultations with practicing clinical oncologists at the Stanford and Vanderbilt cancer centers. During this process, we determined that the core functionality required by clinician users of the tool could be partitioned into three basic tasks: (1) cohort selection, (2) outcomes analysis, and the (3) examination and download of raw data. We sought in developing the MRLU to not only maximize usability, but also to minimize the risk of inadvertent data dredging or selection bias (see Results).
-
Cohort selection (Figure 3) is the process of identifying a group of patients matching particular characteristics of interest. In the RLS scenario, a clinician would describe to the MRLU the characteristics of a patient for whom decision support about treatment is sought. The cohort is specified by means of a series of drop-down, slider, and checkbox filters. As inclusion criteria are tightened and/or relaxed via the user interface, data are fetched from the MySQL database, and an R data frame is constructed on the server-side that consists of those patients who match the user-specified criteria. The data frame is manipulated and displayed using an R plotting routine (ggplot2) in a series of five plots that depict the sex, age, institution, and treatment distributions of the current cohort.
Features to enable expressivity of the user in defining cohorts and analyzing their treatments and outcomes were heavily emphasized in design decisions of the MRLU; even excluding age-based inclusion criteria, the simple menu options allow for more than 8 million combinations of inclusion criteria, outcome variables, and stratification variables. Nevertheless, we sought to maintain a very simple and intuitive user interface. As such, most of the menu options are hidden from view by default and are only dynamically generated when made relevant by other query options or opened explicitly. Furthermore, much of the content of the filters are generated from the contents of the data itself, which allows the engine to be robust to changes in the content of the database. For example, the addition of pharmacy data with previously unlisted drugs or drug classes will not necessitate any adjustment to the interface or server code.
Outcomes analysis (Figure 4) of a given selected patient cohort is conducted using the R survival package [29,30]. The user selects an outcome variable in the MRLU system (survival or time to next treatment) as well as a stratification variable (drug class, drug name, mutation type, BRAF status, NRAS status, or sex) to be used in conjunction with the filters defined in the cohort selection step described previously. This input is then used to construct a Cox proportional hazards model [31]. The MRLU builds a model that is stratified based on the user-defined stratification variable, that controls for all other variables available, and that right censors all patients for whom the data were cut off (due to end of observation in the case of survival, and end of observation or death in the case of time to next treatment). The system displays a Kaplan-Meier plot of the cohort, as well as the specific R code used to generate the model. In addition, summary statistics for each included class including median survival and 95% confidence intervals are provided. The MRLU also invokes the survival package’s implementation of the Mantel-Haenszel test to evaluate the equality of the survival curves, and it reports the chi-square and p-value results of the statistical test, as well as the R code used to generate the result. To lower the risk of inadvertent data dredging (see Discussion), the MRLU also displays a warning that encourages users to conduct only one test at a time, and to avoid adjusting parameters post hoc in search of statistical significance.
Examination and download of raw data, as with cohort selection above, is handled through a combination of R and Shiny functionality. The user is presented with menus to toggle between clinical and pharmaceutical data for those patients that match the inclusion criteria established in the cohort selection step. If viewing pharmaceutical data, the user can choose between displaying all drug entries in the table or merely the first entry for each unique drug used on a patient. By default, the tables are sorted according to patient identification number, though the data can be sorted or searched on the content of any column. Finally, if the user chooses to download the cohort to a file, the underlying R data frame is converted to CSV format before being sent to the user.
Figure 3. Cohort Assembly Page.
Displaying summary visualizations for all for all female, NRAS- patients between the ages of 50 and 75. (Red: Patients from Stanford. Green: Patients from Vanderbilt)
Figure 4. Survival Outcomes Analysis.
The MRLU displays Kaplan-Meier curves, summary statistics, chi-squared statistics and p-values, as well as the number of patients in each stratified cohort. In this case, the MRLU shows survival plots of patients when stratifying on drug class for all female, NRAS- patients between the ages of 50 and 75. Patients who were treated with immunotherapy have longer survival in this instance than those who received chemotherapy or therapeutic antibody, though this particular result is not statistically significant.
2.3 System Evaluation
We conducted a preliminary evaluation of the MRLU focusing on two aspects of the system: (1) computational integrity, evidenced by the ability of the MRLU to recapitulate known biomedical knowledge about Melanoma treatment effectiveness for particular genetic mutations, and (2) clinician users’ experience using the system to glean information about Melanoma patients in their clinical practice. For the first evaluation, we compared the output of the MRLU with known statistics from the biomedical literature. Specifically, we used MRLU to (1) evaluate the frequency of BRAF positivity in Melanoma patients and to compare it with published facts, (2) to evaluate the response rate of BRAF+ Melanoma patients to BRAF inhibitor therapy such as Vemurafenib, comparing that with published facts, and (3) to evaluate the differences of frequency of Melanoma deaths according to age and gender [32–34].
For the second evaluation, 13 practicing physicians at the Stanford and Vanderbilt Cancer Centers used the system in the context of a hypothetical patient. Each user completed a survey that explained how to use the system to build a cohort of similar patients to the one under consideration and how to run analyses to compare potential therapies. The physicians were also asked a series of questions regarding their views on Rapid Learning Systems in general and their assessments of the utility of the MRLU in particular. They assessed the usability of the MRLU interface, and provided opinions on limitations of the system and needs for future work using Likert scale ratings. The survey itself is available in the Supplemental Data. To assess inter-user agreement among survey respondents, pairwise agreement percentages were computed using both raw Likert scores, as well as scores binned as negative (Likert scores 1–2), neutral (Likert score 3), or positive (Likert scores 4–5). Pairwise agreement rates were computed over all combinations of reviewers, as well as mean agreement rates for each reviewers versus all other reviewers, mean agreement rates for each question, and mean agreement rate overall.
3. RESULTS
3.1 System Implementation
A fully functional implementation of the MRLU was completed, loaded with data, and deployed on a live server, which is publicly available at http://mrlu.stanford.edu:3838/mrlu/. We outline the workflow of the system and results from the process of implementation.
Upon opening the browser, the user is presented with a set of summary plots that show the makeup of the entire dataset with respect to sex, age, BRAF status, NRAS status, drug name and class (of first drug used), and hospital. A corresponding set of filters enable the user to adjust the cohort to set specific patient characteristics. As filters are adjusted, the summary plots update in real-time. This allows the system, before any analysis is conducted, to leverage the judgment of the physician-user in selecting a cohort that matches the patient as closely as possible while being cognizant of the number of patients in the cohort (so that judgment is based on sufficient cohort size) as well as expected distributions for the variables not used for matching.
With cohort inclusion criteria established via the filters, the user selects a stratification variable (drug name, drug class, mutation type, BRAF status, NRAF status, or sex) as well as the outcome variable (survival or time to next treatment), over which the Kaplan-Meier analysis will be performed as described in Methods section (Figure 4). As noted above, summary statistics, confidence intervals, chi-square statistics, and p-values are all displayed in conjunction with the plot itself.
Once the preliminary analysis is completed, the cohort data is made immediately available for browsing, searching, sorting, and download.
3.2 Evaluation 1: Confirmation of Known Clinical Trends
An initial assessment of the Melanoma Rapid Learning Utility confirmed that the MRLU successfully identified several known clinical trends. This includes the frequency of BRAF mutations in the Melanoma population [35], as well as a survival rate of approximately 50% in response to BRAF inhibitor therapy such as Vemurafenib among those patients whose tumors had a BRAF mutation [33]. Similarly, in accordance with the literature, the system shows that males account for roughly 56% of cases and 57% of deaths among patients under 65, whereas males account for 68% of cases and 67% of deaths among those patients >65 years of age [36].
Furthermore, the physicians who used our system reported that overall survival outcomes are close to what would be expected for a metastatic Melanoma patient receiving current therapy with immunotherapy and/or BRAF inhibitor therapy [33,34].
3.3 Evaluation 2: Physician Survey
The physician feedback on both the vision for and the implementation of the MRLU system was very favorable (Figures 5–7, Supplemental Data).
Figure 5. Survey Results: RLS systems at large.
Results from survey of 13 practicing oncologists from Stanford and Vanderbilt, assessing the anticipated benefit of RLS systems in various settings. Respondents were asked to provide feedback on a Likert scale from 1–5, with 5 being the most positive. Lines indicate the mean rating for each question.
Figure 7. Clinical Adoption Survey Results.
Breakdown of survey responses to questions pertaining to clinical adoption (questions were select all that apply with option to input additional custom “other” responses). *Manual entry as “other” option.
When asked various questions regarding the benefit that would be provided by Rapid Learning Systems in both clinical and research settings, the mean response was a score of 4.54 (CI: 4.38–4.70) on a Likert scale from 1–5, 5 being the most positive (Figure 5). The greatest enthusiasm (4.62/5, CI: 4.31–4.92) was for the benefit that would be provided to non-academic centers that could leverage data from academic centers for clinical decision support.
Assessment of the utility of the MRLU was also positive, with a mean rating of 4.21/5 (CI: 4.00 – 4.41) (Figure 6a). Enthusiasm was greatest for the MRLU in context of clinical research, with average ratings of 4.38/5 (CI: 3.99–4.78) in its current state, and 4.62/5 (CI: 4.22–5.01) if only additional variables were made available. While responses were also generally positive for the utility of the system in a clinical setting, the mean ratings were lower in this context, 4/5 (CI: 3.34–4.66) “as is,” and 4.23/5 (CI: 3.79–4.67) with additional variables. When asked separately if the system currently implements the analytic capabilities necessary for clinical usage, the mean score was 3.75/5 (CI: 3.27–4.23), which could indicate that further capabilities would be requested by many clinicians were the MRLU to be rolled out into clinical practice. Clinicians did, however, find the system usable, with a mean rating of 4.42/5 (CI: 4.14 – 4.71) for interpretability and navigability (Figure 6b). Taken together, these data (overall mean score 4.37/5, CI: 4.25–4.49) indicate that clinicians are generally eager to see Rapid Learning Systems in both the clinical and research settings, and view the progress made towards that aim by the MRLU as valuable in its current state and potentially even more valuable with simple additions to its underlying database.
Figure 6. Survey Results: MRLU system.
Survey results assessing (a) the utility of MRLU, and (b) the usability of the MRLU interface. Lines indicated mean ratings.
We were particularly interested to gauge physician attitudes towards what would be required from a practice-based evidence tool like the MRLU in order for it to be trusted in practice (Figure 7). When asked how many patients matching their own would be expected in order for their outcomes to be clinically useful, there was wide discordance: 23% of respondents indicated that they would use outcomes data from even one similar patient, whereas 30% indicated that they would expect more than 500 patients. Strikingly, only one clinician felt that (s)he could only answer this question if able to perform a power analysis on the data. In contrast, expectations for confounding variables represented in the database were much more uniform, with a majority of clinicians indicated that they would want to see primary tumor characteristics, co-occurring medical conditions, location of metastases, and additional genetic features. However, the lack of features was generally not listed a major hindrance to the future clinical adoption of these systems, with the strongest concern being an efficient workflow in using the system and, potential distrust of the data by clinicians.
Inter-rater reliability results were mixed, though generally positive (Supplemental Data). While the rates of exact agreement were moderately low for the raw 1–5 scale Likert data (overall mean 0.41, CI 0.37–0.44), agreement rates were much higher when the scores were binned into negative, neutral, and positive ratings (mean agreement 0.80, CI: 0.75–0.85). This indicates that discordance among the respondents was likely due to the subjectivity inherent, for example, in distinguishing a positive rating of 4/5 from an extremely positive rating of 5/5 –two responses which, taken together, comprise 87.3% of all recorded ratings. Nevertheless, we do note that, unsurprisingly, the two questions with the lowest average rating also demonstrated the lowest rate of agreement. These questions, which asked respondents to rate the clinical utility of the MRLU in its current implementation, each had only 47% agreement of respondents on the 1–3 scale of binned scores. Notably, however, this discordance largely vanished (mean pairwise agreement 72%) when respondents were asked to assess the clinical utility of the system if additional patient variables were added. Similarly, the distribution of one-versus-all reviewer agreement rates was somewhat left-skewed, with two reviewers having average agreement rates more than two standard deviations below the mean. Given the presence of these apparent outliers, evaluations of future systems could be performed to study whether reviewer characteristics such as age or computer-experience, other reviewer factors such as practice attitudes, or mere chance drives discordance.
4. DISCUSSION
The MRLU we developed is a prototype of the analytical engine component of a RLS for Melanoma (Figure 1). This analytical engine enables physician users, who in many cases lack statistical programming skills or user-friendly tools that access clinical data, to use primary clinical data to interactively build cohorts and run retrospective outcomes analyses tailored to patients like their own. Our results are promising in that our system was able to recapitulate known facts about Melanoma in an observation cohort. This raises confidence that the MRLU may be useful clinically in providing insight into the disease for patient characteristics for which there is not yet published evidence. In addition, the results of our survey suggest that our interface is usable and potentially clinically useful. However, additional studies with a larger amount of patient data and more detailed evaluation of potential clinical benefit will be needed. In particular, the utility of the MRLU may also be limited by the absence of a number of important clinical features in its database. Our clinical experts identified a number of variables that should be included in MRLU before it would be contemplated for clinical use. Since our system is extensible, adding such variables would not be expected to be difficult if available in the EMR. Further, our user evaluation indicates that many clinical practitioners desired to use such systems.
4.1 MRLU in Context of Existing Systems
The MRLU is complementary to recently developed enterprise-level platforms such as i2b2, STRIDE, RedCAP, and tranSMART [37–40]. These are data warehouse systems that also provide tools for integrating and querying medical record and research data at a large scale. They can fulfill an important role as one of the components of the RLS (Component A in Figure 1); however, while some of these systems do provide analytics features, they have been generally been designed and optimized for research rather than real-time use by physicians for rapid learning in a clinical setting [41]. Similarly, several industry-sponsored visual analytic tools have also been designed to facilitate the transformation and navigation of electronic medical record data, as well as for visualizing “care pathways” among similar patients in order to facilitate intervention planning (Component B in Figure 1) [17–19]. These latter tools lack analyses needed by a RLS analytical engine (e.g., treatment-stratified Kaplan-Meier, etc.).
Another effort related to our work is the CancerLinQ project, which has created a system based on aggregating primary clinical data from a group of community oncology practices [16]. Their system provides many features of the RLS (Components A, B, and C in Figure 1), but is still in the early stages of development. Its analytical engine only permits rudimentary data exploration, such as providing histograms of patient characteristics, and it lacks the types of analytical capabilities we have developed in MRLU, such as comparative survival analysis in user-defined patient cohorts. Prototype efforts such as our MRLU could inform further development of efforts such as CancerLinQ. Similar, disease-specific RLS tools like the MRLU have also been reported for other diseases, including for lung cancer [42].
4.2 Statistical Concerns
Interactive data analysis for real-time clinical decision support is complicated by a range of statistical challenges. As highlighted above, we designed the MRLU analytical engine for flexibility—the user can easily adjust almost all aspects of the analysis. In making this design decision, the MRLU can accommodate a wide range of clinical questions that can be quickly posed by the user and answered in near real-time. However, this flexibility raises the risk of inadvertent errors in judgment by users lacking statistical expertise. Furthermore, as discussed above, our survey corroborates previous work suggesting that many clinicians may not be extremely concerned by statistical limitations while using these systems[13]. As such, before deploying tools such as ours in the clinic, careful considerations should be made to ensure that statistical models are appropriately designed and interpreted.
The first statistical challenge faced by the MRLU arises during cohort selection in the form of the bias-variance tradeoff. In order for models built with the MRLU to be relevant to a given patient, selection criteria must be applied to avoid high bias. Conversely, excessive selection criteria will lead to overfit models of high variance. This tradeoff is particularly complicated to manage in the case of Melanoma decision support, as even the largest institutions will have relatively small, yet still widely heterogeneous, datasets. Ultimately this problem can only be addressed by integrating data from many participating institutions, which is the vision for all learning health systems [7].
In order to assist the user in managing the bias-variance tradeoff as effectively as possible, the MRLU’s summary plots on the cohort selection page (Figure 3) convey descriptive information. As prospective users apply filters to narrow in on patients of a target demographic, plots depict the makeup of the cohort they have built. A note encourages the physician to confirm that the cohort both reflects the characteristics of the patient and fits with his or her professional judgment. Nevertheless, it is impossible to provide sufficient automation to completely preclude the inadvertent misuse of the system by inexperienced users.
Similarly, the system provides no advanced statistical analyses for managing potential pitfalls such as outlier effects and violations of the proportional hazards assumption. As such, these considerations can only be managed through the manual inspection and/or analysis of the cohort data, and the MRLU thus provides this capability. Though our interface is designed to make this as easy as possible, physicians will likely not have the time nor the expertise to conduct such an analysis in a clinical setting. In addition, while our analytical engine seeks to limit confounders by controlling for all variables other than the stratification variable, the inherent complexity of data gathering in routine care leaves open the possibility of other forms of latent confounding [43].
Another risk inherent within any interactive statistical analysis platform is that of inadvertent data dredging. A user of the MRLU can quickly examine many clinical hypotheses in a single sitting. Due to the principle of multiple hypothesis testing, this ease of use can be dangerous, since it could enable a user running sufficient queries to find statistically significant trends by chance [44]. While the risk of data dredging can never be fully eliminated, the physical separation of the cohort selection page from that of the outcomes analysis was designed to encourage users to carefully formulate a cohort for a single question of interest before seeing whether or not significant results will be obtained. The MRLU produces a warning to users about multiple testing, encouraging them to utilize the tool for one test at a time, and warning them of the false-positive risk in rapid and repeated adjustments of the query in search of statistical significance. Nevertheless, probably the best way to prevent statistical misuse of analytical engines such as MRLU is to engage experts in interpreting the analyses.
To further limit the statistical risks inherent in rapid learning, we will present, in a future work, a hybrid system in which a rapid MRLU analytical engine will be linked to a series of interfaces, some of which will allow expert statisticians to design statistical models, others of which will allow physicians to apply these models to their patients without the capacity to define patient cohorts or new models themselves.
4.3 Looking Forward
Broader philosophical and legal questions that surround RLS pertain to where exactly the output of these tools should stand within the existing hierarchy of evidence-based practice [12,13,45,46]. The results from controlled clinical trials provide the highest, most reliable evidence, while retrospective, observational evidence can be less reliable. Rapid learning tools based on engines such as the MRLU are no substitute for traditional forms of medical evidence when they exist and are relevant to the patient of interest. In the future, we plan to implement in MRLU a mechanism to identify and present clinical guidelines and prominent articles from the literature that relate to the user’s query. However, in the (common) circumstance of cancer patients for which there is no clinical trial evidence available, the MRLU could be particularly useful, by leveraging evidence from clinical practice to provide information to physicians that may help in making treatment decisions. While our survey indicates that RLS analytical engines such as the MRLU would likely see clinical use, prospective research studies that relate their use to clinical outcomes will ultimately be needed to show clinical value.
As highlighted in the section Statistical Concerns, an increasingly important component of systems such as MRLU will be the ability to expand the number of patients and data types used for analyses across many institutions. We are currently expanding and generalizing MRLU to address this challenge by building a scalable, decentralized system. In this system, a front-end interface derived from the MRLU will be used to execute distributed statistical models using data from participating institutions. A server at each institution will compute intermediate statistical results on local data, and computational results will be sent back to the coordinating server. This approach will allow centers to participate in inter-institutional computations without sharing any granular patient data. Each site would do a onetime mapping of certain key patient data fields to those used by the system, and this could expand over time to include new data types in future. The menus and utilities in the system that use these fields would dynamically update based on the data types available from the connected institutions. This system could scale up to including many patients as more sites participate, and these institutions would have the freedom to withdraw at any time.
Finally, while the MRLU was developed specifically for use in Melanoma, the key functionality – integrating genetic variants, treatments, and survival outcomes – is relevant to many types of cancer (and other disease). As such, small adaptations to the covariates stored in and analyzed by the system would allow it to scale across cancer types. Since the menus and model can easily be adapted to fit the data at hand, the rate-limiting steps in such adaptation would almost certainly be data acquisition and clinician interest.
Our MRLU is just a portion of the complete RLS (Components C and D in Figure 1). Clearly, the other components are needed, and the MRLU must be combined with the other infrastructure in order to realize the RLS. On the other hand, we believe our results provide useful insights into design considerations, feasibility and potential utility of the analytical engine component of the RLS.
5. CONCLUSION
The MRLU is an analytical engine and user interface that represents a component of the RLS. It can provide real-time, data-driven clinical decision support for Melanoma treatment planning. In a preliminary evaluation, the MRLU successfully recapitulated known biomedical knowledge about Melanoma treatment, and it showed promise for clinical utility when used by oncologists. Given its flexible architecture, it is extensible to other types of cancer and to incorporating more and richer data for greater future clinical utility in the future. We plan to incorporate the MRLU into the rest of the learning system infrastructure and may ultimately enable EHR-driven evidence to be incorporated into medical practice.
Supplementary Material
Highlights.
We developed the MRLU, an analytical engine of clinical cancer data
MRLU is one component of a Rapid Learning System (RLS) for cancer
MRLU allows clinicians to build/analyze custom patient cohorts in near real-time
MRLU leverages clinical, pharmacologic, and genetic features for survival analysis
Physician evaluation results suggest MRLU could help clinical decision making
Acknowledgments
This project has been funded from National Cancer Institute, National Institutes of Health, under grants U01CA142555 and U01 CA190214, and a seed grant from the Big Data for Human Health Stanford University and Oxford University. This project was also supported by award Number T32GM007753 from the National Institute of General Medical Sciences. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of General Medical Sciences or the National Institutes of Health.
Philip Lavori, PhD and Balasubramanian Narasimhan, PhD from Stanford University provided consultation in the development of the MRLU. Vanessa Sochat, Linda Szabo, and Luke Yancy Jr. from Stanford University also provided early consultation relevant to user interface design.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Etheredge LM. A rapid-learning health system. Health Aff (Millwood) 2007;26:w107–18. doi: 10.1377/hlthaff.26.2.w107. [DOI] [PubMed] [Google Scholar]
- 2.Abernethy AP, Etheredge LM, Ganz PA, Wallace P, German RR, Neti C, et al. Rapid-learning system for cancer care. J Clin Oncol. 2010;28:4268–74. doi: 10.1200/JCO.2010.28.5478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Lambin P, Roelofs E, Reymen B, Velazquez ER, Buijsen J, Zegers CML, et al. “Rapid Learning health care in oncology” - an approach towards decision support systems enabling customised radiotherapy. Radiother Oncol. 2013;109:159–64. doi: 10.1016/j.radonc.2013.07.007. [DOI] [PubMed] [Google Scholar]
- 4.Shaikh AR, Butte AJ, Schully SD, Dalton WS, Khoury MJ, Hesse BW. Collaborative biomedicine in the age of big data: the case of cancer. J Med Internet Res. 2014;16:e101. doi: 10.2196/jmir.2496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Shrager J, Tenenbaum JM. Rapid learning for precision oncology. Nat Rev Clin Oncol. 2014;11:109–18. doi: 10.1038/nrclinonc.2013.244. [DOI] [PubMed] [Google Scholar]
- 6.Smith M, Saunders R, Stuckhardt L, McGinnis JM America C on the LHCS in, Medicine I of. A Continuously Learning Health Care System. 2013 [PubMed] [Google Scholar]
- 7.Friedman CP, Wong AK, Blumenthal D. Achieving a nationwide learning health system. Sci Transl Med. 2010;2:57cm29. doi: 10.1126/scitranslmed.3001456. [DOI] [PubMed] [Google Scholar]
- 8.Abernethy AP, Etheredge LM, Ganz PA, Wallace P, German RR, Neti C, et al. Rapid-learning system for cancer care. J Clin Oncol. 2010;28:4268–74. doi: 10.1200/JCO.2010.28.5478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.MARGISON FR. Measurement and psychotherapy: Evidence-based practice and practice-based evidence. Br J Psychiatry. 2000;177:123–30. doi: 10.1192/bjp.177.2.123. [DOI] [PubMed] [Google Scholar]
- 10.Nordyke RA, Kulikowski CA. An informatics-based chronic disease practice: case study of a 35-year computer-based longitudinal record system. J Am Med Inform Assoc. 5:88–103. doi: 10.1136/jamia.1998.0050088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ebadollahi S, Sun J, Gotz D, Hu J, Sow D, Neti C. Predicting Patient’s Trajectory of Physiological Data using Temporal Trends in Similar Patients: A System for Near-Term Prognostics. AMIA Annu Symp Proc. 2010;2010:192–6. [PMC free article] [PubMed] [Google Scholar]
- 12.Frankovich J, Longhurst CA, Sutherland SM. Evidence-based medicine in the EMR era. N Engl J Med. 2011;365:1758–9. doi: 10.1056/NEJMp1108726. [DOI] [PubMed] [Google Scholar]
- 13.Longhurst CA, Harrington RA, Shah NH. A “green button” for using aggregate patient data at the point of care. Health Aff (Millwood) 2014;33:1229–35. doi: 10.1377/hlthaff.2014.0099. [DOI] [PubMed] [Google Scholar]
- 14.Finn R. Surveys Identify Barriers to Participation in Clinical Trials. J Natl Cancer Inst. 2000;92:1556–8. doi: 10.1093/jnci/92.19.1556. [DOI] [PubMed] [Google Scholar]
- 15.Elson SL, Hiatt RA, Anton-Culver H, Howell LP, Naeim A, Parker BA, et al. The Athena Breast Health Network: developing a rapid learning system in breast cancer prevention, screening, treatment, and care. Breast Cancer Res Treat. 2013;140:417–25. doi: 10.1007/s10549-013-2612-0. [DOI] [PubMed] [Google Scholar]
- 16.Sledge GW, Miller RS, Hauser R. CancerLinQ and the future of cancer care. Am Soc Clin Oncol Educ Book. 2013:430–4. doi: 10.1200/EdBook_AM.2013.33.430. [DOI] [PubMed] [Google Scholar]
- 17.Catherine Plaisant RMASJLDHBSKPC. LifeLines: Using Visualization to Enhance Navigation and Analysis of Patient Records. n.d. [PMC free article] [PubMed] [Google Scholar]
- 18.Gotz D, Wongsuphasawat K. Interactive intervention analysis. AMIA Annu Symp Proc. 2012;2012:274–80. [PMC free article] [PubMed] [Google Scholar]
- 19.Perer A, Gotz D. Abstr Hum Factors Comput Syst - CHI EA ‘13. New York, New York, USA: ACM Press; 2013. Data-driven exploration of care plans for patients. CHI ’13 Ext; p. 439. [DOI] [Google Scholar]
- 20.Jemal A, Simard EP, Dorell C, Noone A-M, Markowitz LE, Kohler B, et al. Annual Report to the Nation on the Status of Cancer, 1975–2009, Featuring the Burden and Trends in Human Papillomavirus (HPV)-Associated Cancers and HPV Vaccination Coverage Levels. JNCI J Natl Cancer Inst. 2013;105:175–201. doi: 10.1093/jnci/djs491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Yancovitz M, Litterman A, Yoon J, Ng E, Shapiro RL, Berman RS, et al. Intra- and inter-tumor heterogeneity of BRAF(V600E))mutations in primary and metastatic melanoma. PLoS One. 2012;7:e29336. doi: 10.1371/journal.pone.0029336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Menzies AM, Haydu LE, Carlino MS, Azer MWF, Carr PJA, Kefford RF, et al. Inter- and intra-patient heterogeneity of response and progression to targeted therapy in metastatic melanoma. PLoS One. 2014;9:e85004. doi: 10.1371/journal.pone.0085004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ashton JL, Lovly CM. Anticancer Agents. [accessed February 27, 2014];My Cancer Genome. n.d http://www.mycancergenome.org/content/other/molecular-medicine/anticancer-agents.
- 24.Chang W, Cheng J, Allaire JJ, Xie Y, McPherson J. shiny: Web Application Framework for R. 2015 [Google Scholar]
- 25.on Databases RSIG. DBI: R Database Interface 2014
- 26.Ooms J, James D, DebRoy S, Wickham H, Horner J. RMySQL: Database Interface and “MySQL” Driver for R. 2015. [Google Scholar]
- 27.Wickham H. ggplot2: elegant graphics for data analysis. Springer; New York: 2009. [Google Scholar]
- 28.RStudio I. [accessed September 17, 2015];Shiny Server. n.d https://www.rstudio.com/products/rstudio/#Server.
- 29.Therneau Terry M, Grambsch Patricia M. Modeling Survival Data: Extending the {C}ox Model. New York: Springer; 2000. [Google Scholar]
- 30.Therneau TM. A Package for Survival Analysis in S. 2015. [Google Scholar]
- 31.Fox J. Cox Proportional-Hazards Regression for Survival Data. [accessed April 2013];Appendix to An R and S-PLUS Companion to Applied Regression. 2002 See http.Stat Ethz ch/CRAN/doc/contrib/Fox-Companion/appendix-Cox-Regression.Pdf.
- 32.Curtin JA, Fridlyand J, Kageshita T, Patel HN, Busam KJ, Kutzner H, et al. Distinct Sets of Genetic Alterations in Melanoma. N Engl J Med. 2005;353:2135–47. doi: 10.1056/NEJMoa050092. [DOI] [PubMed] [Google Scholar]
- 33.Flaherty KT, Puzanov I, Kim KB, Ribas A, McArthur GA, Sosman JA, et al. Inhibition of mutated, activated BRAF in metastatic melanoma. N Engl J Med. 2010;363:809–19. doi: 10.1056/NEJMoa1002011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Balch CM, Gershenwald JE, Soong S-J, Thompson JF, Ding S, Byrd DR, et al. Multivariate analysis of prognostic factors among 2,313 patients with stage III melanoma: comparison of nodal micrometastases versus macrometastases. J Clin Oncol. 2010;28:2452–9. doi: 10.1200/JCO.2009.27.1627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Balch CM, Gershenwald JE, Soong S-J, Thompson JF, Atkins MB, Byrd DR, et al. Final version of 2009 AJCC melanoma staging and classification. J Clin Oncol. 2009;27:6199–206. doi: 10.1200/JCO.2009.23.4799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Polsky D, Wang SQ, Hendi A, Kim C, Karen J. Skin Cancer Facts. [accessed March 10, 2013];Ski Cancer Found. n.d http://www.skincancer.org/skin-cancer-information/skin-cancer-facts#Melanoma.
- 37.Murphy SN, Mendis ME, Berkowitz DA, Kohane I, Chueh HC. Integration of clinical and genetic data in the i2b2 architecture. AMIA Annu Symp Proc; 2006; p. 1040. [PMC free article] [PubMed] [Google Scholar]
- 38.Lowe HJ, Ferris TA, Hernandez PM, Weber SC. STRIDE--An integrated standards-based translational research informatics platform. AMIA Annu Symp Proc. 2009;2009:391–5. [PMC free article] [PubMed] [Google Scholar]
- 39.Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009;42:377–81. doi: 10.1016/j.jbi.2008.08.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Szalma S, Koka V, Khasanova T, Perakslis ED. Effective knowledge management in translational medicine. J Transl Med. 2010;8:68. doi: 10.1186/1479-5876-8-68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Canuel V, Rance B, Avillach P, Degoulet P, Burgun A. Translational research platforms integrating clinical and omics data: a review of publicly available solutions. Brief Bioinform. 2015;16:280–90. doi: 10.1093/bib/bbu006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Dekker A, Vinod S, Holloway L, Oberije C, George A, Goozee G, et al. Rapid learning in practice: a lung cancer survival decision support system in routine patient care data. Radiother Oncol. 2014;113:47–53. doi: 10.1016/j.radonc.2014.08.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Brookhart MA, Stürmer T, Glynn RJ, Rassen J, Schneeweiss S. Confounding Control in Healthcare Database Research. Med Care. 2010;48:S114–20. doi: 10.1097/MLR.0b013e3181dbebe3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Sainani KL. The Problem of Multiple Testing. PM&R. 2009;1:1098–103. doi: 10.1016/j.pmrj.2009.10.004. [DOI] [PubMed] [Google Scholar]
- 45.Schilsky RL, Michels DL, Kearbey AH, Yu PP, Hudis CA. Building a Rapid Learning Health Care System for Oncology: The Regulatory Framework of CancerLinQ. J Clin Oncol. 2014;32:2373–9. doi: 10.1200/JCO.2014.56.2124. [DOI] [PubMed] [Google Scholar]
- 46.Ginsburg GS, Kuderer NM. Comparative effectiveness research, genomics-enabled personalized medicine, and rapid learning health care: a common bond. J Clin Oncol. 2012;30:4233–42. doi: 10.1200/JCO.2012.42.6114. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.







