Abstract
Artificial intelligence (AI) will likely transform many aspects of healthcare, and physicians will need to adapt and lead. The expanding range of AI tools calls for physicians to become competent in their proper use if we are to achieve better patient experience, population health and health equity, and with greater efficiency, while enhancing physician satisfaction. This viewpoint proposes a practical and manageable set of core competencies for physicians in using AI tools effectively and ethically and suggests methods for acquiring these competencies.
Keywords: artificial intelligence, clinical practice, competencies, core, physicians
Introduction
Artificial intelligence (AI), broadly defined as computer systems capable of mimicking human intelligence, will likely transform many aspects of healthcare, and physicians will need to adapt and lead. The advent of generative AI in the form of large language models (LLMs) is already impacting clinical practice with ambient AI scribes and AI‐enabled decision support. 1 All specialties and all tasks (diagnosis, treatment, prognostication) are likely to be influenced, as will various clerical and administrative functions performed by health professionals and information searching by patients. AI tools for predicting risk of sepsis, interpreting radiology or endoscopic images, and detecting clinical deterioration are already deployed in various Australian healthcare organisations. 2 Health professionals and patients in general hold positive views of using AI to assist care if certain safeguards are met. 3 Repetitive, routine tasks may be done by AI tools, enabling greater attention to higher‐order, irreducibly human elements of care. However, the evidence‐based medicine movement taught clinicians to appraise new interventions critically before large‐scale adoption, and AI tools are no exception. Their expanding range calls for physicians to become competent in their proper use in achieving the Quintuple Aim of better population health, patient experience and health equity together with greater efficiency and physician satisfaction.
In this viewpoint, we propose a practical set of core competencies for physicians in using AI tools effectively and ethically (Table 1) and suggest methods for acquiring these competencies. We define competencies as the demonstrable knowledge, skills, abilities and behaviours that contribute to individual and organisational performance. While several groups have proposed AI competencies for health professionals, 4 , 5 , 6 none target practising physicians, most focus on undergraduate curricula and all predate the advent of generative AI.
Table 1.
Proposed list of core competencies in AI
| Domain | Competencies – the competent physician |
|---|---|
|
Foundational knowledge of AI tools ‘What are these AI tools?’ |
|
|
Critical appraisal of AI tools ‘Should this AI tool be used?’ |
|
|
Use of AI tools ‘How to best use this AI tool’ |
|
|
Patient interaction with AI tools ‘How to involve patients when using AI tools?’ |
|
|
Professionalism and AI tools ‘What are our responsibilities in advancing the use of AI tools?’ |
|
Foundational knowledge of AI tools
While not expected to become AI experts, physicians will need to understand key concepts and terminology pertaining to machine learning (ML) models used for classification or prediction tasks, deep learning (DL) models applied to computer vision or speech recognition and generative AI used to generate different outputs (text, video and audio) from different inputs or prompts. 7 , 8 Familiarity with the key steps in developing and evaluating AI models and derivative tools lends an appreciation of where in this pipeline faults and limitations in either model or tool may develop (Figure 1). 9 Physicians will need to be alert to the potential for AI tools to influence their cognitive processes negatively, resulting in loss of situational awareness, de‐skilling and over‐reliance on AI‐generated advice presented with undue confidence. 10 Being aware of AI tools specific to their specialty that are effective, safe and ready for implementation is important, as is staying attuned to the evolving regulatory, legal and technical standards that will likely apply to such tools. 11
Figure 1.

Key steps in developing and evaluating AI models and tools.
Critical appraisal of AI tools
Physicians must be assured that AI tools meet a pressing need, are accurate and reliable for their assigned tasks, pose minimal safety concerns, can cope with the nuances of clinical practice, do not cause cognitive overload or alert fatigue and can be integrated into clinical workflows. 12 In appraising any particular AI tool, key performance metrics applied to their models according to type and function need to be understood. These might comprise sensitivity, specificity, predictive values and discrimination curves for ML classification or prediction tasks, 7 and accuracy, relevance, completeness and latency metrics for LLMs. 8 Models should be externally validated on unseen datasets from other institutions separate from original training datasets. Models and tools trained on static, retrospective datasets must maintain acceptable performance when evaluated prospectively using dynamic, real‐time data feeds from electronic medical records (EMRs) and/or other sources which resemble the operational environment in which they will function.
Appraisal guides (Table 2) help assess the clinical suitability of a particular AI tool and its impact on Quintuple Aim outcomes, including costs and environmental impacts of the tool itself. 13 For specific AI tools, physicians should request and be able to interpret a factsheet or model card that details its function and context, training datasets, model performance, bias evaluation, failure mode analyses, safety assessment, technical architecture, data storage, prompt engineering (for LLMs), user testing and conditions of use. 14 Tools lacking such information should be avoided. Physician users should also request a risk assessment that considers both the likelihood and impacts of (tool inaccuracy) (due to unrecognised hallucinations (fabricated information), omissions and bias), cybersecurity breaches, sparse validation on local contexts, infrastructure constraints, opportunity costs and carbon footprint. 15 Many studies comparing AI tools with standard clinical practice use retrospective designs and highly curated settings, with high risk of bias or irrelevance to routine care. 16 The required level of methodological rigour should be proportionate to the level of risk inherent to the task the tool is performing. A decision support tool will necessarily require a high evidence standard, in most cases a randomised controlled trial, while for less critical documentation tasks or data retrieval, observational studies may suffice.
Table 2.
Basic appraisal guide for AI tools
| What is the purpose and context of the tool? |
|
|
| How accurate and reliable is the tool? |
|
|
|
|
| Is the tool able to be integrated into clinical workflows with minimal disruption? |
|
|
|
| Is use of the tool likely to realise clinically meaningful improvements in care? |
|
| Does the tool carry risk of harming patients or compromising the physician‐patient relationship? |
| Does the tool satisfy current governance, ethical and regulatory standards? |
| Is the tool financially and environmentally sustainable over its lifecycle? |
Use of AI tools
Physicians must be able to identify the clinical circumstances in which use of a tool is appropriate regarding errors and biases that may apply to specific scenarios. For example, some tools may not work well in certain minority populations such as First Nations people (insufficient or unrepresentative training data), in particular sites (different equipment settings, mode of operation or, for LLMs, use of vernacular language) or in different contexts (highly time‐sensitive or person‐sensitive encounters). 17 Physicians must demand the necessary training and support be provided in using AI tools proficiently, participate in user testing of human‐computer interfaces and visualisations of outputs during prototype development and have input into the procurement process for deployable applications. In using LLMs effectively, physicians need to construct specific, contextualised instructions (or prompts) that enable the chatbot tool to generate the most nuanced, relevant and factually correct output, with reduced risk of ‘hallucinations'. Physicians will also need to adjust to any tool‐related restructuring of EMR functionality (from embedded or stand‐alone tools), current workflows, professional roles and interactions with other team members or with consumers accessing AI tools for information prior to any clinical encounter.
Tools will automate clerical tasks, assist physicians in decision‐making and reassign certain functions, such as triage or screening, to different team members. Physicians should understand how and why task‐specific performance thresholds (for ML models) and benchmark cases with chain‐of‐thought (CoT) reasoning (for LLMs) should be regularly applied to AI tools using auditing processes 18 and scenario testing, 19 both aimed at identifying when tool functioning is degrading to unacceptable levels. This aligns with physician accountability for all clinical actions involving AI tools and avoiding unquestioning over‐reliance on such tools. Adhering to guidance on safe tool use will minimise misuse and consequent liability for harm from inputting data errors, misinterpreting output displays or clicking wrong options. 20
Patient interaction with AI tools
Using AI tools that reconfigure the conduct of clinical encounters and directly inform clinical decisions introduces a third member into the traditional physician‐patient dyad. How much trust patients will have in both the AI tool and the physician using it may depend on how well the latter can explain what the tool is, how it works and why it should be used. It is impossible to render highly complex ‘black box’ models fully explainable, 21 and no consensus currently exists on whether or to what extent AI tools should be inherently explainable. 22 The compromise is for tools to report key features and rationales for their outputs that physicians can comprehend and communicate and for physicians to acquire a basic understanding of tool operations. Rather than tool explainability, patients are more concerned about tool accuracy and data privacy protections 23 and desire tools that enable physicians to spend more time interacting directly with them in shared decision‐making. 24 More research needs to define patient preferences for how AI tools should be used according to the context of individual encounters.
Professionalism and AI tools
Physicians must be involved from the outset in developing AI tools that are fit for clinical purposes and learn from the pitfalls of poorly designed EMRs. This requires physicians to recognise data and informatics professionals as partners in teams within healthcare organisations and to collaborate with them in designing, evaluating, implementing and monitoring AI tools that address important clinical problems amenable to AI solutions. Tool‐related changes in professional roles, responsibilities, training and work practices must be acknowledged and worked through collaboratively at the organisational level in achieving the full benefit of AI tools. Physicians will also need to be aware of system factors (such as infrastructure and resource availability, access to training data, cybersecurity and financial sustainability) that either promote or inhibit the adoption of effective AI tools and be prepared to articulate and advocate for the necessary reforms in avoiding lost opportunities to benefit from AI innovation. 25 The increasing personal use of smartphone AI applications, outside the scrutiny of healthcare organisations, also brings professional obligations regarding appropriate and cybersafe use. Finally, physicians should try to remain up to date in their understanding of rapidly evolving AI methods and tools that hold the promise of improving care while maintaining awareness of prevailing regulatory standards applied by the Therapeutic Goods Administration (available at: Is my software regulated?) and knowing which AI tools require or do not require regulatory approval (e.g. decision support tools vs ambient scribes).
Methods for acquiring AI competencies
Foundational knowledge can be imparted from academic sources using primers, 7 , 8 webinars, mini‐courses, micro‐credentialling and training modules, many available online. Qualifications in digital health should include training in AI and for those seeking a career path in clinical AI, formal degree courses are on offer from some Australian universities (see Australian digital health course directory – Digital Health CRC). These all aim to build stronger interdisciplinary partnerships with data and computer scientists and informatics personnel.
Critical appraisal skills can be acquired by consulting AI‐specific checklists, 12 , 13 , 26 guidelines and research reporting standards 27 , 28 and incorporating these into journal clubs, topic reviews and quiz sessions, while noting standardised appraisal guides for LLMs are yet to be developed. 29
Expertise in using AI tools can be developed using simulation exercises within sandbox environments using tool prototypes. 30 These activities also allow physicians to suggest changes to the computer interface or output visualisations that enhance user experience, help foresee tool effects on clinical workflows and provide opportunities to inform regulatory approaches to AI tools. 31
In using LLMs, physicians will need to practise prompt engineering using simulations (or ‘prompt‐a‐thons’) 32 and scenario exercises, 33 iterating their prompts and working with colleagues in learning which tactics produce the most precise and informative outputs, and to understand the limitations and errors of LLMs.
Having to report incidents of adverse events and near misses from using AI tools and determining if these result from inherent defects in the tool or misuse by users will help raise physician skill in optimising both tool design and human‐AI interactions. Having to develop threshold or benchmark standards for assessing tool impacts on diagnostic accuracy, operational efficiency or other outcomes requires physicians to understand tool performance metrics and what constitutes meaningful improvements in AI‐assisted care compared to current practice.
The use of AI tools in clinical consultations will require patient consent procedures that acknowledge the limits of an individual physician's depth of knowledge about particular tools. Physicians will need to collect patient feedback on their experiences of AI‐enabled encounters during the early phases of tool deployment in ensuring patient autonomy and sanctity of physician‐patient relationships.
Leadership courses, supervisor development workshops and congresses of the Royal Australasian College of Physicians (RACP) could regularly feature sessions on AI. The RACP should consider establishing a ‘community of AI practice’ that promotes ongoing and systematically recorded discussions and sharing of insights about physicians' use of AI tools, complemented by AI articles authored by RACP members and published in Internal Medicine Journal (IMJ) (Table 3). Specialty societies could consider establishing special interest groups in AI tasked with imparting knowledge and skills related to specialty‐specific AI tools to their members. Similar to sister institutions overseas, 34 , 35 the RACP and its affiliated specialties need to formulate position statements on their approach to AI. These should emphasise optimisation of patient outcomes allied with synergistic human‐AI collaboration and ongoing vigilance and verification of AI tool outputs by appropriately trained physicians.
Table 3.
Bibliography of articles on artificial intelligence published in IMJ
|
|
|
|
|
|
|
|
|
Conclusion
Not every physician needs to become an AI expert, but all will need to know how to scrutinise and use AI technologies effectively within their areas of practice. This non‐exhaustive list of core competencies in AI is proposed as a starting point for further discussion, and which needs formally to consider consumer wishes and sentiment – a limitation of the current work, although some of us (IS, CS and CMS) have engaged extensively with consumer groups in previous AI‐related projects. Such engagement is an area for future research and must feature adequate minority group representation. Additional competency domains and subcompetencies may be required for different specialties and as AI tools and systems evolve over time and are applied to more aspects of care. Incorporating AI competency training into current, already crowded curricula and professional development programmes will be challenging but necessary. Development and validation of methods for assessing these competencies among practising physicians will also be required.
Acknowledgements
The authors are all members of the RACP Digital Health Advisory Group. The views expressed in this article are solely those of the authors and should not be viewed as official policy or opinion of the RACP or any of its committees, faculties, or affiliated societies. Open access publishing facilitated by The University of Queensland, as part of the Wiley ‐ The University of Queensland agreement via the Council of Australian University Librarians.
Funding: None.
Conflict of interest: None.
References
- 1. Karalis VD. The integration of artificial intelligence into clinical practice. Appl Biosci 2024; 3: 14–44. [Google Scholar]
- 2. Janssen AB, Kavisha S, Johnson A, Marinic A, Teede H, Shaw T. Implementation of artificial intelligence applications in Australian healthcare organisations: environmental scan findings. Stud Health Technol Inform 2024; 310: 1136–1140. [DOI] [PubMed] [Google Scholar]
- 3. Scott IA, Carter SM, Coiera E. Exploring stakeholder attitudes towards AI in clinical practice. BMJ Health Care Inform 2021; 28: e100450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Caliskan SA, Demir K, Karaca O. Artificial intelligence in medical education curriculum: an e‐Delphi study for competencies. PLoS One 2022; 17: e0271872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Russell RG, Lovett Novak L, Patel M, Garvey KV, Craig KJT, Jackson GP et al. Competencies for the use of artificial intelligence‐based tools by health care professionals. Acad Med 2023; 98: 348–356. [DOI] [PubMed] [Google Scholar]
- 6. Liaw W, Kueper JK, Lin S, Bazemore A, Kakadiaris I. Competencies for the use of artificial intelligence in primary care. Ann Fam Med 2022; 20: 559–563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Scott IA. Demystifying machine learning: a primer for physicians. Intern Med J 2021; 51: 1388–1400. [DOI] [PubMed] [Google Scholar]
- 8. Scott IA, Zuccon G. The new paradigm in machine learning ‐ foundation models, large language models and beyond: a primer for physicians. Intern Med J 2024; 54: 705–715. [DOI] [PubMed] [Google Scholar]
- 9. van der Vegt AH, Scott IA, Dermawan K, Schnetler RJ, Kalke VR, Lane PJ. Implementation frameworks for end‐to‐end clinical AI: derivation of the SALIENT framework. J Am Med Inform Assoc 2023; 30: 1503–1515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Lyell D, Coiera E. Automation bias and verification complexity: a systematic review. J Am Med Inform Assoc 2017; 24: 423–431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Johnson SLJ. Artificial intelligence in health care: the challenge of effective regulation. J Leg Med 2022; 42: 75–86. [DOI] [PubMed] [Google Scholar]
- 12. Scott IA, van der Vegt A, Lane P, McPhail S, Magrabi F. Achieving large‐scale clinician adoption of AI‐enabled decision support. BMJ Health Care Inform 2024; 31: e100971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Scott I, Carter S, Coiera E. Clinician checklist for assessing suitability of machine learning applications in healthcare. BMJ Health Care Inform 2021; 28: e100251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Sendak MP, Gao M, Brajer N, Balu S. Presenting machine learning model information to clinical end users with model facts labels. NPJ Digit Med 2020; 3: 41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Queensland Government . Foundational artificial intelligence risk assessment (FAIRA) framework. 2024. Accessed 20/12/24 at: Foundational artificial intelligence risk assessment framework | For government | Queensland Government.
- 16. Nagendran M, Chen Y, Lovejoy CA, Gordon AC, Komorowski M, Harvey H et al. Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies. BMJ 2020; 368: m689. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Doyen S, Dadario NB. 12 plagues of AI in healthcare: a practical guide to current issues with using machine learning in a medical context. Front Digit Health 2022; 4: 765406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Liu X, Glocker B, McCradden MM et al. The medical model audit. Lancet Digit Health 2022; 4: e384–e397. [DOI] [PubMed] [Google Scholar]
- 19. Tam TYC, Sivarajkumar S, Kapoor S, Stolyar AV, Polanska K, McCarthy KR et al. A framework for human evaluation of large language models in healthcare derived from literature review. NPJ Digit Med 2024; 7: 258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Lyell D, Wang Y, Coiera E, Magrabi F. More than algorithms: an analysis of safety events involving ML‐enabled medical devices reported to the FDA. J Am Med Inform Assoc 2023; 30: 1227–1236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Ghassemi M, Oakden‐Rayner L, Beam AL. The false hope of current approaches to explainable artificial intelligence in health care. Lancet Digit Health 2021; 3: e745–e750. [DOI] [PubMed] [Google Scholar]
- 22. Abgrall G, Holder AL, Chelly Dagdia Z et al. Should AI models be explainable to clinicians? Crit Care 2024; 28: 301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. van der Veer SN, Riste L, Cheraghi‐Sohi S, Phipps DL, Tully MP, Bozentko K et al. Trading off accuracy and explainability in AI decision‐making: findings from 2 citizens' juries. J Am Med Inform Assoc 2021; 28: 2128–2138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Lorenzini G, Arbelaez Ossa L, Shaw DM, Elger BS. Artifcial intelligence and the doctor–patient relationship: expanding the paradigm of shared decision making. Bioethics 2023; 37: 424–429. [DOI] [PubMed] [Google Scholar]
- 25. Nair M, Svedberg P, Larsson I, Nygren JM. A comprehensive overview of barriers and strategies for AI implementation in healthcare: mixed‐method design. PLoS One 2024; 19: e0305949. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Vollmer S, Mateen BA, Bohner G, Király FJ, Ghani R, Jonsson P et al. Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness. BMJ 2020; 368: l6927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Collins GS, Moons KGM, Dhiman P, Riley RD, Beam AL, Van Calster B et al. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ 2024; 385: e078378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Liu X, Cruz Rivera S, Moher D, Calvert MJ, Denniston AK, Ashrafian H et al. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT‐AI extension. Lancet Digit Health 2020; 2: e537–e548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Bedi S, Liu Y, Orr‐Ewing L, Dash D, Koyejo S, Callahan A et al. Testing and evaluation of health care applications of large language models: a systematic review. JAMA 2025; 333: 319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Lyell D, Lustig A, Denyer K, Vedantam S, Magrabi F. Using clinical simulation to evaluate AI‐enabled decision support. Stud Health Technol Inform 2024; 310: 299–303. [DOI] [PubMed] [Google Scholar]
- 31. O'Driscoll F, O'Brien N, Guo C, Prime M, Darzi A, Ghafur S. Clinical simulation in the regulation of software as a medical device: an eDelphi study. JMIR Form Res 2024; 8: e56241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Small WR, Malhotra K, Major VJ, Wiesenfeld B, Lewis M, Grover H et al. The first generative AI prompt‐a‐thon in healthcare: a novel approach to workforce engagement with a private instance of ChatGPT. PLoS Digit Health 2024; 3: e0000394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Mesko B. Prompt engineering as an important emerging skill for medical professionals: tutorial. J Med Internet Res 2023; 25: e50638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Daneshvar N, Pandita D, Erickson S, Snyder Sulmasy L, DeCamp M, ACP Medical Informatics Committee and the Ethics, Professionalism and Human Rights Committee . Artificial intelligence in the provision of health care: an American College of Physicians policy position paper. Ann Intern Med 2024; 177: 964–967. [DOI] [PubMed] [Google Scholar]
- 35. Royal College of Physicians Position Statement on AI . Accessed 26/12/24 at: Artificial intelligence (AI) in health | RCP London.
