Skip to main content
Journal of Assisted Reproduction and Genetics logoLink to Journal of Assisted Reproduction and Genetics
. 2020 Sep 29;37(11):2817–2824. doi: 10.1007/s10815-020-01950-z

AI in the treatment of fertility: key considerations

Jason Swain 1, Matthew Tex VerMilyea 2, Marcos Meseguer 3, Diego Ezcurra 4,; Fertility AI Forum Group
PMCID: PMC7642046  PMID: 32989510

Abstract

Artificial intelligence (AI) has been proposed as a potential tool to help address many of the existing problems related with empirical or subjective assessments of clinical and embryological decision points during the treatment of infertility. AI technologies are reviewed and potential areas of implementation of algorithms are discussed, highlighting the importance of following a proper path for the development and validation of algorithms, including regulatory requirements, and the need for ecosystems containing enough quality data to generate it. As evidenced by the consensus of a group of experts in fertility if properly developed, it is believed that AI algorithms may help practitioners from around the globe to standardize, automate, and improve IVF outcomes for the benefit of patients. Collaboration is required between AI developers and healthcare professionals to make this happen.

Keywords: fertility, AI, algorithms, embryos

Introduction

The treatment of infertility has continually evolved since the birth of Louise Brown in 1978. However, a constant factor during this evolution has remained; that success in assisted reproductive technologies (ART) is in the details. Indeed, ART success relies on a complex series of interrelated events, with each procedural step having several variables and impacting the other. Therefore, to excel in ART and achieve the ultimate goal of a healthy singleton live birth in the shortest time span possible, it is necessary to optimize the success and efficiency in each of these procedural steps and variables.

Despite continued advancements in the development of new drugs, innovative stimulation protocols, and lab technologies, success rates continue to still tend in only average around one-third of patients taking home a baby [1]. Notably, reported success rates of some centers are considerably higher than others. These significant differences in ART outcomes observed between countries [2, 3] and even individual clinics [4] are likely a result of patient variability and the minute details of these ART procedural steps.

In recent years, the development and implementation of artificial intelligence (AI) technology have shown the potential to address inefficiencies in various steps of ART, including the standardization of some IVF laboratory processes and particularly in embryo selection. Indeed, AI is also being proposed for clinical applications including diagnostics and precise treatment paths in combination with digital devices remotely collecting real-time data to be analyzed.

What is AI?

Simply put, AI is widely described as a machine behaving in a way that would be called “intelligent” if viewed by a human. It is a broad term that includes machine learning (ML) and deep learning (DL). AI refers to any program with the ability to solve problems, learn from experiences, and perform tasks as human beings normally do. Machine learning is a subset of AI that includes statistical techniques which allow machines to improve on experienced tasks, learning from data without a constant supervision [5]. ML algorithms are developed to identify patterns in given data to make predictions and support decisions. Lastly, deep learning is a subset of machine learning that uses multilayer neural networks inspired by the human brain to learn from large amounts of data. In ART, clinical data, patient demographics, and laboratory data can be used to train models to support clinicians and embryologists to answer multiple questions like predicting a stimulation protocol [6], the probability of an oocyte to reach blastocyst stage, the quality of the blastocysts obtained in a treatment, and the likelihood to achieve a pregnancy [7] or eventually live birth.

Artificial intelligence approach in medicine and ART

The recent increase in computer power and data availability with the developments in machine learning has led to an augment of predictive algorithms in most medical fields [8]. While the beginnings of AI dates back to the early twentieth century, from the concepts presented by Alan Turing [9, 10] and Warren McCulloch and Walter Pitts [11], it was not until 2012 that the ML was widely accepted as a viable form of AI [12]. Most health professionals (72%) show great interest in implementing AI in their work, claiming that this technology is an extension, not the extinction of professional capacity in health care [13]. In recent years, there has been significant growth in the number of AI publications in medicine and health, with 61.6% of the papers dated between 2008 and 2017 [14], making it one of the most promising fields of study in health science. In just 1 year, publications on this subject have increased sevenfold [5]. The American Society for Reproductive Biology (ASRM) [15] and European Society for Human Reproduction and Embryology (ESHRE) [8] have already reported on different applications of machine learning ranging from sperm identification and morphology, identifying empty follicles, predicting the stages of embryonic cells and the formation of blastocysts from oocytes, assessing the quality of human blastocysts, predicting live births following embryo transfer, improving the selection of embryos, and defining optimal IVF stimulation protocols [5]. Until July 2019, the FDA (Food and Drug Administration) had approved 39 algorithms based on AI in medicine [16]. Most of those algorithms are for pattern recognition and suggest dichotomous (yes/no) diagnostic decisions, leading to sporadic implementation and use to support clinical and therapeutic decisions (some of them are summarized in Table 1). None of these tools are broadly available and only work on selected platforms or setups. It has been shown that predictive models built on data and domain-specific knowledge can support and improve or solve specific problems. Those examples can be used to inspire developments of similar approaches in different fields [16, 44]. It has been shown that deep learning algorithms have performances similar to the healthcare professional, and also it has been reported that ML applications might perform better than healthcare professionals, but validation and prospective clinical studies are necessary to confirm those results [44]. In conclusion, AI-enabled decision support algorithms allow medical teams to make more accurate diagnoses [13]. However, it is important to consider the limitations of those studies: most of them assess diagnostic accuracy and performances of models in isolation, thus not reflecting clinical practice. A major limitation to the interpretation of claims reporting a superior performance of these algorithms versus humans is that analytics are performed on previously generated data in silico, not prospectively in real-world clinical conditions [16].

Table 1.

Peer-reviewed publications of AI algorithms compared with doctors [16]

Specialty images Publication
Radiology/neurology CT head, acute neurological events Titano et al. [17]
CT head for brain hemorrhage Arbabshirani et al. [18]
CT head for trauma Chilamkurthy et al. [19]
CXR for metastatic lung nodules Nam et al. [20]
CXR for multiple findings Singh et al. [21]
Mammography for breast density Lehman et al. [22]
Wrist X-ray Lindsey et al. [23]
Pathology Breast cancer Ehteshami Bejnordi et al. [24]
Lung cancer (+ driver mutation) Coudray et al. [25]
Brain tumors (+ methylation) Capper et al. [26]
Breast cancer metastases Steiner et al. [27]
Breast cancer metastases Liu et al. [28]
Dermatology Skin cancers Esteva et al. [29]
Melanoma Haenssle et al. [30]
Skin lesions Han et al. [31]
Ophthalmology Diabetic retinopathy Gulshan et al. [32]
Diabetic retinopathy Abramoff et al. [33]
Diabetic retinopathy Kanagasingam et al. [34]
Congenital cataracts Long et al. [35]
Retinal diseases (OCT) De Fauw et al. [36]
Macular degeneration Burlina et al. [37]
Retinopathy of prematurity Brown et al. [38]
AMD and diabetic retinopathy Kermany et al. [39]
Gastroenterology Polyps at colonoscopy Mori et al. [40]
Polyps at colonoscopy Wang et al. [41]
Cardiology Echocardiography Madani et al. [42]
Echocardiography Zhang et al. [43]

Furthermore, the term “validation” about the testing of AI algorithms is used in multiple ways. In a survey of 400 AI papers presented at major conferences, just 6% included code for the papers’ algorithms and 30% included test data and 54% included pseudocode which is a limited summary of an algorithm. Reproducibility is especially challenging in ML for health applications because of the availability, quality, and consistency of clinical or biomedical data. Specifically, in our field, infertility is increasingly affecting couples, and the interest in assisted reproduction technologies is growing. There are a multitude of AI solutions for patients that are only related to conception and pregnancy. Even though many researchers and commercial companies are developing potential AI IVF/ART tools for fertility, there are currently very few products or broadly accepted tools under development that have consistently provided clinical improvement, and many of them cannot be utilized without restrictions, limiting clinical implementation (some examples are summarized in Table 2).

Table 2.

Examples of some AI technologies related to ART

Target Objective Company
Embryo assessment

EEVA viability status

Geri Assess 2.0

https://hcp.merckgroup.com/en/fertility/technologieshttps://hcp.merckgroup.com/en/fertility/technologies
Viability and implantation potential www.Presagen.com
Viability and implantation potential www.IVFVision.ai
Viability and implantation potential www.FutureFertility.com
Viability and implantation potential www.Binflux.com
Viability and implantation potential www.IVY.ai
Viability and implantation potential STORK
Viability and implantation potential Fairtility
Euploidy/aneuploidy status www.Presagen.com
Euploidy/aneuploidy status ERICA—www.embryoranking.com
Oocyte assessment Viability status VIOLET—www.FutureFertility.com
Stimulation protocol

Predict live birth

Predict pregnancy

www.UNIVFY.com

www.apricity.life

Uterine receptivity Predict receptivity potential www.FutureFertility.com

Notwithstanding the large number of existing AI publications, there is no standardization of the required material, and there are no clear minimum criteria for the acceptance of AI and ML studies. This creates a conundrum in terms of the literature and trying to ensure high quality data. Indeed, due to this issue, in the field of human reproduction and embryology, numerous groups are participating in the development of AI algorithms for ART with significant variability in the expertise and resulting published research. These ART/AI publications should enforce common minimum standards to permit fully transparent peer-review of the studies. These include utilizing sufficient amounts of high-quality data and proper development paths for AI algorithms, in full compliance with existent international regulations. Notably, what these minimum criteria are difficult to define and will depend on the dataset utilized.

Clinically relevant endpoints for AI in fertility

The potential of AI opportunities to investigate in fertility is very broad and largely extends through the entire IVF patient’s journey (Fig. 1).

Fig. 1.

Fig. 1

Summary scheme of how AI can improve IVF treatments throughout the process

Data-driven solutions might help detect early indicators of infertility allowing an earlier identification of potential patients to refer for treatment as well as support doctors during a consultation to define a precise treatment strategy, providing estimations of success in terms of pregnancy and take-home baby rates.

Before starting treatment, we may also be able to define the best protocol of stimulation to apply for each patient to maximize success. We can also define the need of adjusting drugs and doses, during treatment, based on real-time biomarkers of follicle development. In the embryology and andrology laboratories, we could use data-driven solutions to evaluate gamete viability, embryo implantation potential, and uterine receptivity status before embryo transfer, therefore allowing us to target defined windows of implantation potential. Finally, we may be able to provide individualized support for the luteal phase. All this would allow the improvement of pregnancy while reducing the time to achieve a healthy live birth.

In summary, widely accepted clinically relevant endpoints to target are as follows: (1) the birth of a single and healthy baby, (2) decrease of miscarriages, (3) higher implantation and clinical and ongoing pregnancy rates, (4) objective assessments of cytoplasmic and genetic health of gametes and embryos, and (5) standardization of operator assessments.

The quality and quantity of data is key

The availability of data has increased in the last decades, and the rapid developments and advances of machine learning have sparked enthusiasm for applying data science techniques to solve problems and improve current status in clinical care [45]. However, it often happens that only retrospective data is available, leading to a very expensive post-process for sorting and labeling/annotating the data (if possible) or making it necessary to collect new data. Labels are indeed the most valuable part of data for any supervised machine learning model. Data quality and data quantity are challenges that need to be overcome to develop reliable and robust algorithms [46]. The amount of available data is particularly relevant in ART, given the small general population of patients being treated. Given a problem, the information content of the collected data is relevant as a model can only learn from the data that is used for its training. The complexity of the problem to solve and the complexity of the model itself are influencing factors for data sufficiency. Domain experts can help to estimate how representative the collected data is for a given defined problem. Additionally, measures from information theory (e.g., entropy, mutual information, and information gain) can be used to quantify the amount of information present in the data and guide data collection and data modeling, assessing, for example, the introduction of new variables or the collection of new samples [47]. Moreover, data size requirements might depend on the problem dimensionality: a higher number of features likely lead to a larger amount of data needed for analysis and modeling [48]. In the case of a high number of variables, the search problem space can be large, while available data might be sparse with potentially redundant information. To evaluate and quantify the needed sample size while training a model, learning curves can be used, describing the model performance as a function of the training size. While the number of training samples increases, the model performance increases rapidly at the beginning, subsequently slowing down until a point where the model performance improves only marginal with an increasing number of samples. Several studies have been published to characterize performances and generalization error with data size [49, 50].

An additional challenge to consider may be the heterogeneity of data sources generated by different types of equipment and not centralized data collection systems. Every ART clinic has implemented different workflows and standard procedures that need to be considered during data analysis and modeling. The standardization of the processes generating data is a promising and important effort that all involved organizations should likely embrace. A definition and consequent implementation of quality standard procedures would facilitate the modeling of data coming from different sources reducing data variability. This would also help in a broad validation of any algorithm before potential production and commercialization. Data collection is the first imperative step, but it requires coming with quality and quantity, depending on the problem in question. Moreover, it is crucial to develop and implement proper modeling approaches to better understand the data, extract information, and make reliable generalizations under proper assumptions and limitations.

Proper development path of AI algorithms for fertility

The development of AI-based solutions for medical decisions in many therapeutic areas is challenging and might be even more complex for ART for several reasons including (1) small general population of patients being treated, leading to small data sets, (2) AI solutions infertility are complicated to patent and protect, (3) lack of standardization leads to no homogeneous data sets, (4) data is stored in diverse ecosystems, (5) data protection laws and regulations differ by country, (6) no technical ecosystem that allows applying AI solutions at scale, and (7) there are unclear regulatory approval routes that differ by country. In the treatment of fertility, data generation is fragmented and makes the application of AI solutions a challenge.

We must also take into consideration that many healthcare providers do not fully understand the technology and therefore shy away from its applicable relevance not supporting its development. Indeed, the complexity of fertility procedures applied by different doctors and embryologists and the need for continuous training to secure the collection of high-quality data to be used for the development of any AI solution can make it difficult to understand and accept.

To build a successful decision support system, several factors need to be considered. A minimal potential workflow to address and model a clinically relevant problem can be summarized as follows: (1) identify and define clinically a relevant problem, (2) design the study including patients and clinicians, (3) collect a large amount of quality data, automatically if possible, in an ecosystem that allows collection and storage of entire cycle data (electronic medical record), (4) label or annotate the data, (5) correct for biases and errors, (6) develop model/s to solve the considered problem rigorously following ML principles and best practice rules to avoid overfitting and to obtain a robust solution, (7) validate the selected solution, and (8) monitor and transparently provide oversight and accountability of the methodology applied for training, test, and validation. Depending on the scope of the solution and the possibility to commercialize the technology, different levels of validation will be required. Additionally, a more systematic recording process should be implemented to automatize tasks such as identify unusual or outlier laboratory results. A robust predictive algorithm that provides probability estimations needs to be trained, tested, and properly validated in randomized clinical trials, to support clinically relevant data-driven decisions. Eventually, it is important to consider potential ethical issues and develop a friendly user interface for healthcare providers to facilitate the use of such a system.

Regulations related to the development of medical software solutions

Medical software allows healthcare operators to access clinical information stored in electronic health records and additionally may support clinical decisions. These systems must be designed, built, validated, and maintained following strict global regulations and standards to ensure reliability and safety. The International Electrotechnical Commission specifies the life cycle requirements for the development of medical software and software within medical devices, harmonized by the EU/USA. It describes a set of processes, activities, and tasks, establishing a common framework for the life cycle processes of medical device software. Thus, it should be taken into consideration that the development of any AI-related technologies for fertility must follow the International Standard IEC 62304 (https://www.iso.org/obp/ui/#iso:std:iec:62304:ed-1:v1:en).

The Organization for Economic Cooperation and Development (OECD), a unique forum where the governments of 36 member states with market economies work with each other, as well as with more than 70 non-member economies to promote economic growth, prosperity, and sustainable development, defined AI principles that must be supported and in particular the need to “commit to transparency and responsible disclosure” in the use of AI systems. Essential points to consider should be the following: (1) requirement of disclosure to provide public assurances of liability taken by different organizations that are developing AI; (2) existence of rules for evaluating the level of potential damage to the technology; (3) ensure transparency, without revealing source code, trade secrets, or intellectual property; (4) any AI system on the market that is making determinations or recommendations with potentially significant implications for people should be able to explain and contextualize how and why it reached a particular conclusion; and (5) it is necessary to test AI for biases, carrying out test for fairness, bias, robustness, and security and take corrective measures if necessary [51].

Conclusions

We must recognize the importance of continually innovating in the fertility space to improve outcomes for the benefit of patients. Objectivity must be brought to the decision-making process in every step of ART to help more precisely determine the right treatment for the circumstance. This includes assessing the cause of infertility; determining type and decision points of the stimulation protocol; and accurately assessing gametes viability, embryo implantation, and uterine receptivity potential. The use of algorithms/AI to help guide human decision-making in each of these critical areas in ART has the potential to reduce subjectivity and improve consistency. However, it is necessary to be cautious and ensure that appropriate approaches are followed in developing accurate predictive algorithm models.

Another potential issue with universal use of AI in treatment of infertility arises from clinic competition and intellectual property. Will developed algorithms/AI become a commodity, specific for a particular clinic, or will these decision-making tools be widely available and applicable between centers? Certainly the hurdles that accompany data confidentiality and clinic competition will need to be addressed.

In summary, fertility relevant areas where AI applications could be focusing in the medium or short term should be the treatment-related areas like diagnostics; the selection of precise drugs and doses as well as adjustment during treatment; the homogeneity of the evaluation by the operators; and assessment of genetic quality, viability of blastocysts, implantation, and clinical and ongoing pregnancy rates. A very fundamental aspect to build any predictive algorithm is data, in a sufficient quantity and quality. Although the minimum data size requirements might be difficult to determine as it depends on the final point to consider, the amount of data required for training, test, and validation of an algorithm to reach a valuable and accurate clinical support is critical. A precise definition of quality standards would help in a potential reduction of variability and promote collaborations and projects. It is necessary to secure that any trained, validated, and robust models follow IEC 62304 regulatory requirements. Finally, it is important to re-emphasize that if our intention is to utilize AI-based algorithms and to potentially improve precision and outcomes for the benefit of patients, we must start by securing not only those algorithms that were based on proper paths for development and validation but also on a foundation of big quality data ecosystems.

Acknowledgments

Fertility AI Forum Group: Gerard Letterie, Integramed; Pascual Sánchez, Ginemed; Geoff Trew, The Fertility Partnership; Jason Swain, CCRM Management Co.; Marcos Meseguer, IVIRMA; Dan Nayot, Trio Fertility; Alison Campbell, CARE; Ian Huang, Storck–Binflux; Jan Choma, Cognexa; Kevin Loewke, DANA; María Paola Piqueras, Ginemed; Paul Nader, Baby Sentry; Michael Schindler, Meditex; Marck Marcom, Ideas EMR; Ed Vom, Planet Innovation; Eleanora Lippolis, Merck; Sebastian Bohl, Merck, Jan Kirsten, Merck; Daniel Abshagen, Merck; Diego Ezcurra, Merck.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Diego Ezcurra, Email: diego.ezcurra@emdserono.com.

Fertility AI Forum Group:

Diego Ezcurra, Gerard Letterie, Pascual Sánchez, Geoff Trew, Jason Swain, Marcos Meseguer, Dan Nayot, Alison Campbell, Ian Huangv, Jan Choma, Kevin Loewke, María Paola Piqueras, Paul Nader, Michael Schindler, Eleanora Lippolis, Sebastian Bohl, Jan Kirsten, and Daniel Abshagen

References

  • 1.European IVF-Monitoring Consortium (EIM) for the European Society of Human Reproduction and Embryology (ESHRE) Calhaz-Jorge C, et al. Assisted reproductive technology in Europe, 2012: results generated from European registers by ESHRE. Hum Reprod. 2016;31:1638–1652. doi: 10.1093/humrep/dew151. [DOI] [PubMed] [Google Scholar]
  • 2.de Mouzon J, Goossens V, Bhattacharya S, Castilla JA, Ferraretti AP, Korsak V, Kupka M, Nygren KG, Andersen AN, The European IVF-monitoring (EIM) Consortium, for the E. S. of H. R. and E. (ESHRE) Assisted reproductive technology in Europe, 2006: results generated from European registers by ESHRE. Hum Reprod. 2010;25:1851–1862. doi: 10.1093/humrep/deq124. [DOI] [PubMed] [Google Scholar]
  • 3.Adamson GD, de Mouzon J, Chambers GM, Zegers-Hochschild F, Mansour R, Ishihara O, Banker M, Dyer S. International committee for monitoring assisted reproductive technology: world report on assisted reproductive technology, 2011. Fertil Steril. 2018;110:1067–1080. doi: 10.1016/j.fertnstert.2018.06.039. [DOI] [PubMed] [Google Scholar]
  • 4.Centres for Disease Control and Prevention. Assisted reproductive technology: ART trends 2002–2011. http://www.cdc.gov/art/ART2011/section5.htm#f43. Accessed 16 Aug 2016.
  • 5.Curchoe CL, Bormann CL. Artificial intelligence and machine learning for human reproduction and embryology presented at ASRM and ESHRE 2018. J Assist Reprod Genet. 2019;36:591–600. doi: 10.1007/s10815-019-01408-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Letterie GS, MacDonald A. A computerized decision –support system for day to day management of ovarian stimulation cycles during in vitro fertilization. Fertil Steril. 2019;112:e28. doi: 10.1016/j.fertnstert.2019.07.206. [DOI] [PubMed] [Google Scholar]
  • 7.Khosravi P, et al. Robust automated assessment of human blastocyst quality using deep learning. bioRxiv. 2018;394882. 10.1101/394882.
  • 8.Hoo-Chang S, et al. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans Med Imag. 2016;35:1285–1298. doi: 10.1109/TMI.2016.2528162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Turing AM. On computable numbers, with an application to the entscheidungsproblem. a correction. Proc Lond Math Soc. 1938;s2-43:544–546. doi: 10.1112/plms/s2-43.6.544. [DOI] [Google Scholar]
  • 10.Turing AM. Computing machinery and intelligence. Mind. 1950;236:433–460. doi: 10.1093/mind/LIX.236.433. [DOI] [Google Scholar]
  • 11.Mcculloch W, Pitts W. A logical calculus of the ideas immanent in nerous activity (reprinted from 1943) Bull Math Biol. 1990;52:99–115. doi: 10.1016/S0092-8240(05)80006-0. [DOI] [PubMed] [Google Scholar]
  • 12.Krizhevsky A, Sutskever I, Hinton G. ImageNet classification with deep convolutional neural networks. Handb. Approx. Algorithms Metaheuristics. 2007;45-1–45–16. 10.1201/9781420010749.
  • 13.The AI effect. How artificial intelligence is making health care more human. MIT Technology reviews insights. 2019.
  • 14.Tran D, Cooke S, Illingworth PJ, Gardner DK. Deep learning as a predictive tool for fetal heart pregnancy following time-lapse incubation and blastocyst transfer. Hum Reprod. 2019;34:1011–1018. doi: 10.1093/humrep/dez064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Shen D, Wu G, Suk H. Deep learning in medical image analysis. Physiol Behav. 2017;176:139–148. doi: 10.1016/j.physbeh.2017.03.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25:44–56. doi: 10.1038/s41591-018-0300-7. [DOI] [PubMed] [Google Scholar]
  • 17.Titano JJ, Badgeley M, Schefflein J, Pain M, Su A, Cai M, Swinburne N, Zech J, Kim J, Bederson J, Mocco J, Drayer B, Lehar J, Cho S, Costa A, Oermann EK. Automated deep-neural-network surveillance of cranial images for acute neurologic events. Nat Med. 2018;24:1337–1341. doi: 10.1038/s41591-018-0147-y. [DOI] [PubMed] [Google Scholar]
  • 18.Arbabshirani MR, Fornwalt BK, Mongelluzzo GJ, Suever JD, Geise BD, Patel AA, Moore GJ. Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical work flow integration. NPJ Digit Med. 2017;1:9. doi: 10.1038/s41746-017-0015-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Chilamkurthy S, et al. Articles Deep learning algorithms for detection of critical findings in head CT scans: a retrospective study. Lancet. 2018;6736:1–9. doi: 10.1016/S0140-6736(18)31645-3. [DOI] [PubMed] [Google Scholar]
  • 20.Nam JG, Park S, Hwang EJ, Lee JH. Development and validation of deep learning – based automatic detection algorithm for malignant pulmonary nodules on chest radiographs. Radiology. 2019;290:218–228. doi: 10.1148/radiol.2018180237. [DOI] [PubMed] [Google Scholar]
  • 21.Singh R, et al. Deep learning in chest radiography: detection of findings and presence of change. PLoS ONE. 2018;13(13):1–12. doi: 10.1371/journal.pone.0204155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lehman CD, et al. Mammographic breast density assessment using deep learning: clinical implementation. Radiology. 2018;00:1–7. doi: 10.1148/radiol.2018180694. [DOI] [PubMed] [Google Scholar]
  • 23.Lindsey R, Daluiski A, Chopra S, Lachapelle A, Mozer M, Sicular S, Hanel D, Gardner M, Gupta A, Hotchkiss R, Potter H. Deep neural network improves fracture detection by clinicians. Proc Natl Acad Sci USA. 2018;115:11591–11596. doi: 10.1073/pnas.1806905115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Bejnordi BE, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA. 2017;318:2199–2210. doi: 10.1001/jama.2017.14585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Coudray N, et al. images using deep learning. Nat Med. 2018;24:1559–1569. doi: 10.1038/s41591-018-0177-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Capper D, Jones DTW, Sill M, Hovestadt V, Schrimpf D, Sturm D, Koelsche C, Sahm F, Chavez L, Reuss DE, Kratz A, Wefers AK, Huang K, Pajtler KW, Schweizer L, Stichel D, Olar A, Engel NW, Lindenberg K, Harter PN, Braczynski AK, Plate KH, Dohmen H, Garvalov BK, Coras R, Hölsken A, Hewer E, Bewerunge-Hudler M, Schick M, Fischer R, Beschorner R, Schittenhelm J, Staszewski O, Wani K, Varlet P, Pages M, Temming P, Lohmann D, Selt F, Witt H, Milde T, Witt O, Aronica E, Giangaspero F, Rushing E, Scheurlen W, Geisenberger C, Rodriguez FJ, Becker A, Preusser M, Haberler C, Bjerkvig R, Cryan J, Farrell M, Deckert M, Hench J, Frank S, Serrano J, Kannan K, Tsirigos A, Brück W, Hofer S, Brehmer S, Seiz-Rosenhagen M, Hänggi D, Hans V, Rozsnoki S, Hansford JR, Kohlhof P, Kristensen BW, Lechner M, Lopes B, Mawrin C, Ketter R, Kulozik A, Khatib Z, Heppner F, Koch A, Jouvet A, Keohane C, Mühleisen H, Mueller W, Pohl U, Prinz M, Benner A, Zapatka M, Gottardo NG, Driever PH, Kramm CM, Müller HL, Rutkowski S, von Hoff K, Frühwald MC, Gnekow A, Fleischhack G, Tippelt S, Calaminus G, Monoranu CM, Perry A, Jones C, Jacques TS, Radlwimmer B, Gessi M, Pietsch T, Schramm J, Schackert G, Westphal M, Reifenberger G, Wesseling P, Weller M, Collins VP, Blümcke I, Bendszus M, Debus J, Huang A, Jabado N, Northcott PA, Paulus W, Gajjar A, Robinson GW, Taylor MD, Jaunmuktane Z, Ryzhova M, Platten M, Unterberg A, Wick W, Karajannis MA, Mittelbronn M, Acker T, Hartmann C, Aldape K, Schüller U, Buslei R, Lichter P, Kool M, Herold-Mende C, Ellison DW, Hasselblatt M, Snuderl M, Brandner S, Korshunov A, von Deimling A, Pfister SM. DNA methylation-based classification of central nervous system tumours. Nature. 2018;555:469–474. doi: 10.1038/nature26000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Steiner DF, MacDonald R, Liu Y, Truszkowski P, Hipp JD, Gammage C, Thng F, Peng L, Stumpe MC. Impact of deep learning assistance on the histopathologic review of lymph nodes for metastatic breast cancer. Am J Surg Pathol. 2018;42:1636–1646. doi: 10.1097/PAS.0000000000001151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Liu Y, Kohlberger T, Norouzi M, Dahl GE, Smith JL, Mohtashamian A, Olson N, Peng LH, Hipp JD, Stumpe MC. Artificial intelligence – based breast cancer Nodal. Arch Pathol Lab Med. 2019;143:859–868. doi: 10.5858/arpa.2018-0147-OA. [DOI] [PubMed] [Google Scholar]
  • 29.Esteva A, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nat Publ Group. 2017;542:115–118. doi: 10.1038/nature21056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Haenssle HA, Fink C, Schneiderbauer R, Toberer F, Buhl T, Blum A, Kalloo A, Hassen ABH, Thomas L, Enk A, Uhlmann L, Reader study level-I and level-II Groups. Alt C, Arenbergerova M, Bakos R, Baltzer A, Bertlich I, Blum A, Bokor-Billmann T, Bowling J, Braghiroli N, Braun R, Buder-Bakhaya K, Buhl T, Cabo H, Cabrijan L, Cevic N, Classen A, Deltgen D, Fink C, Georgieva I, Hakim-Meibodi LE, Hanner S, Hartmann F, Hartmann J, Haus G, Hoxha E, Karls R, Koga H, Kreusch J, Lallas A, Majenka P, Marghoob A, Massone C, Mekokishvili L, Mestel D, Meyer V, Neuberger A, Nielsen K, Oliviero M, Pampena R, Paoli J, Pawlik E, Rao B, Rendon A, Russo T, Sadek A, Samhaber K, Schneiderbauer R, Schweizer A, Toberer F, Trennheuser L, Vlahova L, Wald A, Winkler J, Wölbing P, Zalaudek I. Man against machine: diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists. Ann Oncol. 2018;29:1836–1842. doi: 10.1093/annonc/mdy166. [DOI] [PubMed] [Google Scholar]
  • 31.Han SS, Kim MS, Lim W, Park GH, Park I. Classification of the clinical images for benign and malignant cutaneous tumors using a deep learning algorithm. J Invest Dermatol. 2018;138:1529–1538. doi: 10.1016/j.jid.2018.01.028. [DOI] [PubMed] [Google Scholar]
  • 32.Gulshan V, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. 2016;94043:1–9. doi: 10.1001/jama.2016.17216. [DOI] [PubMed] [Google Scholar]
  • 33.Abràmoff MD, Lavin PT, Birch M, Shah N, Folk JC. Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices. NPJ Digit Med. 2018. 10.1038/s41746-018-0040-6. [DOI] [PMC free article] [PubMed]
  • 34.Kanagasingam Y, Xiao D, Vignarajan J, Preetham A, Mehrotra A. Evaluation of artificial intelligence – based grading of diabetic retinopathy in primary care. JAMA. 2018;1:1–6. doi: 10.1001/jamanetworkopen.2018.2665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Long E, et al. An artificial intelligence platform for the multihospital collaborative management of congenital cataracts. Nat Biomed Eng. 2017;0024:1–8. [Google Scholar]
  • 36.De Fauw J, et al. Clinically applicable deep learning for diagnosis and referral in retinal disease. Nat Med. 2018;24:1342–1354. doi: 10.1038/s41591-018-0107-6. [DOI] [PubMed] [Google Scholar]
  • 37.Burlina PM, Joshi N, Pekala M, Pacheco KD, Freund DE, Bressler NM. Automated grading of age-related macular degeneration from color fundus images using deep convolutional neural networks. JAMA Ophthalmol. 2017;135:1170–1176. doi: 10.1001/jamaophthalmol.2017.3782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Brown JM, Campbell JP, Beers A, Chang K, Ostmo S, Chan RVP, Dy J, Erdogmus D, Ioannidis S, Kalpathy-Cramer J, Chiang MF, for the Imaging and Informatics in Retinopathy of Prematurity (i-ROP) Research Consortium Automated diagnosis of plus disease in retinopathy of prematurity using deep convolutional neural networks. JAMA Ophthalmol. 2018;136:803–810. doi: 10.1001/jamaophthalmol.2018.1934. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kermany DS, Goldbaum M, Cai W, Lewis MA. Identifying medical diagnoses and treatable diseases by image-based deep learning resource. Cell. 2018;172:1122–1131.e1129. doi: 10.1016/j.cell.2018.02.010. [DOI] [PubMed] [Google Scholar]
  • 40.Mori Y, Kudo SE, Misawa M, Saito Y, Ikematsu H, Hotta K, Ohtsuka K, Urushibara F, Kataoka S, Ogawa Y, Maeda Y, Takeda K, Nakamura H, Ichimasa K, Kudo T, Hayashi T, Wakamura K, Ishida F, Inoue H, Itoh H, Oda M, Mori K. Real-time use of artificial intelligence in identification of diminutive polyps during colonoscopy. Ann Intern Med. 2018;169:357–366. doi: 10.7326/M18-0249. [DOI] [PubMed] [Google Scholar]
  • 41.Wang P, Xiao X, Glissen Brown JR, Berzin TM, Tu M, Xiong F, Hu X, Liu P, Song Y, Zhang D, Yang X, Li L, He J, Yi X, Liu J, Liu X. Development and validation of a deep-learning algorithm for the detection of polyps during colonoscopy. Nat Biomed Eng. 2018;2:741–748. doi: 10.1038/s41551-018-0301-3. [DOI] [PubMed] [Google Scholar]
  • 42.Madani A, Arnaout R, Mofrad M. Fast and accurate view classification of echocardiograms using deep learning. NPJ Digit Med. 2018;1–8. 10.1038/s41746-017-0013-1. [DOI] [PMC free article] [PubMed]
  • 43.Zhang J, Gajjala S, Agrawal P, Tison GH, Hallock LA, Beussink-Nelson L, Lassen MH, Fan E, Aras MA, Jordan CR, Fleischmann KE, Melisko M, Qasim A, Shah SJ, Bajcsy R, Deo RC. Fully automated echocardiogram interpretation in clinical practice feasibility and diagnostic accuracy. Circulation. 2018;138:1623–1635. doi: 10.1161/CIRCULATIONAHA.118.034338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Fujisawa Y, Otomo Y, Ogata Y, Nakamura Y, Fujita R, Ishitsuka Y, Watanabe R, Okiyama N, Ohara K, Fujimoto M. Deep-learning-based, computer-aided classifier developed with a small dataset of clinical images surpasses board-certified dermatologists in skin tumour diagnosis. Br J Dermatol. 2019;180:373–381. doi: 10.1111/bjd.16924. [DOI] [PubMed] [Google Scholar]
  • 45.Celi LA, Csete M, Stone D. Optimal data systems: the future of clinical predictions and decision support. Curr Opin Crit Care. 2014;20:573–580. doi: 10.1097/MCC.0000000000000137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Hee, K. Is data quality enough for a clinical decision?: apply machine learning and avoid bias. Proc. - 2017 IEEE Int. Conf. Big Data, Big Data 2017 2018-Janua, 2612–2619. 2017.
  • 47.MacKay DJC. Information theory, inference, and learning algorithms. Cambridge University Press 2003; 2005. 10.1166/asl.2012.3830.
  • 48.Bellman RE. Dynamic Programming: Princeton University Press; 2010.
  • 49.Cho J et al. How much data is needed to train a medical image deep learning system to achieve neces-sary high accuracy. Conf Pap ICLR 2016 HOW. 2016.
  • 50.Hestness J et al. Deep learning scaling is predictable, empirically. arXiv:1712.00409. 2017.
  • 51.Hagemann BR, Leclerc J. Precision regulation for artificial intelligence. IBM Policy Lab 1–5.

Articles from Journal of Assisted Reproduction and Genetics are provided here courtesy of Springer Science+Business Media, LLC

RESOURCES