Skip to main content
Lippincott Open Access logoLink to Lippincott Open Access
. 2024 Jul 2;111(1):1654–1655. doi: 10.1097/JS9.0000000000001898

GPT-4o is more like a real person: potentials in surgical oncology

Zaijie Sun a, Jia Yang b, Nan Zhang c,*, Haiyang Wu d,*, Cheng Li e,f,*
PMCID: PMC11745773  PMID: 38954671

Dear Editor,

In today’s rapidly advancing field of modern medicine, Generative Artificial Intelligence (GAI) is increasingly becoming a vital tool across various medical domains due to its powerful data processing and analytical capabilities. The potential applications of GAI are particularly promising in the complex and challenging branch of oncological surgery. Oncological surgery encompasses the diagnosis, surgical treatment, and postoperative management of cancer, each step requiring precise and efficient decision support. However, traditional diagnostic and therapeutic methods often rely on oncological surgeons’ experience and limited medical records, which, to some extent, constrains the quality and efficiency of healthcare services. GAI, especially natural language processing models such as ChatGPT, can provide significant support in interpreting medical data, patient consultation, and surgical planning through large-scale data training and deep learning techniques. ChatGPT is capable of processing vast amounts of medical literature and clinical data, generating treatment recommendations that adhere to medical standards, and assisting physicians in making more accurate decisions. Numerous clinical studies have already evaluated the application of ChatGPT versions GPT-3.5 and GPT-4 in oncological surgery1,2. However, the GPT-4o version, released just a month ago, has garnered widespread attention for its superior performance3,4. In this study, we will explore several applications of GPT-4o in surgical oncology by highlighting its advancements with the comparison of previous models such as ChatGPT-3.5 and ChatGPT-4.

In the context of oncological surgery, GPT-4o’s real-time video dialogue and emotion recognition capabilities hold significant potential and practical value. For instance, during preoperative consultations, patients often experience nervousness and anxiety, which can affect their emotions as well as their understanding and acceptance of surgical and treatment plans. GPT-4o facilitates real-time communication with patients and employs its emotion recognition feature to detect their emotional states. During the GPT-4o launch event, the host expressed feeling somewhat nervous and began to breathe rapidly. GPT-4o detected the change in his breathing, sensed his anxiety, and provided guidance to calm his nerves. This simple scenario highlights the strength of GPT-4o’s real-time video dialogue and emotion recognition capabilities. In future clinical care settings, GPT-4o could function as a virtual assistant, offering personalized and empathetic explanations and responses by identifying patients’ emotional states. For example, when a patient exhibits anxiety during a preoperative consultation, GPT-4o is able to provide immediate reassurance, explain the surgical process and expected outcomes, and help alleviate the patient’s anxiety. Additionally, by monitoring patients’ emotional changes, GPT-4o can promptly inform physicians, enabling them to better understand the patient’s psychological state and adjust their communication strategies accordingly.

In addition, one of the most significant advancements of GPT-4o over its predecessors is its exceptional multimodal processing capability. Unlike ChatGPT-3.5 and ChatGPT-4, which primarily focus on text input, GPT-4o seamlessly integrates text, audio, and image data5. This multimodal ability enables GPT-4o to achieve broader and deeper applications across various fields, with particularly notable performance in precision medicine. In this domain, GPT-4o combines genomic data with clinical and imaging data to offer unprecedented diagnostic and treatment solutions. It could analyze genetic mutations alongside clinical history and imaging data, providing a comprehensive diagnostic perspective. For instance, in breast cancer cases, traditional treatments often rely on limited genetic and clinical information. In contrast, GPT-4o comprehensively analyzes the genomic data, hormone receptor status, tumor size, location, and spread. This allows oncologists to develop highly personalized treatment plans based on the unique genetic and molecular characteristics of each tumor, recommending the most effective combinations of surgery, chemotherapy, and targeted therapy. This integrated data analysis capability enables GPT-4o to provide more precise and personalized treatment plans. Additionally, GPT-4o’s multimodal processing could monitor patients’ responses in real time during treatment, dynamically adjusting treatment plans through regular analysis of new imaging data and clinical reports. This real-time feedback mechanism enhances treatment precision and significantly reduces side effects.

Moreover, GPT-4o has achieved significant advancements in handling non-English languages, establishing itself as a valuable tool in diverse and multilingual healthcare environments. Oncological surgery typically involves complex medical terminology and detailed medical records, where precise communication is crucial for ensuring high-quality care. However, in practice, patients and healthcare providers may not share a common language, especially in multiethnic and multilingual regions. This is where GPT-4’s multilingual processing capabilities become particularly important. GPT-4o can efficiently translate medical documents, including medical records, surgical reports, and treatment plans. Accurate translation of these documents is vital for the treatment of cancer patients. Through GPT-4o, medical institutions could ensure accurate translation between different languages, preventing misunderstandings or errors due to language barriers, thereby ensuring patients receive the correct treatment. During medical consultations, if there is a language barrier between the patient and the doctor, communication efficiency and accuracy are compromised. With GPT-4o’s real-time interpretation capabilities, doctors and patients can communicate seamlessly, allowing patients to accurately describe their conditions and doctors to promptly provide professional advice and treatment plans. Additionally, GPT-4o’s robust language processing capabilities have brought revolutionary changes to academic exchanges in the field of oncological surgery. International oncological surgery conferences and publications are typically conducted in English, posing a language barrier for researchers from non-English-speaking countries. GPT-4o may assist in translating academic papers, making research findings from non-English-speaking countries more accessible and recognized by the international community, thus promoting global academic exchange and collaboration.

In conclusion, GPT-4o offers numerous advancements over previous models like ChatGPT-3.5 and ChatGPT-4, making it a powerful tool in the field of surgical oncology. Here, we provide additional insights into the pivotal role of GPT-4o, enhancing the existing discussion. We believe that the release of GPT-4o will catalyze an increase in clinical studies investigating its applications in the diagnosis and treatment of various cancers5,6. However, it is crucial to address ethical considerations and establish comprehensive guidelines to fully leverage the benefits of this groundbreaking technology.

Ethical approval

This study does not include any individual-level data and thus does not require any ethical approval.

Source of funding

This study is supported by China Postdoctoral Science Foundation (2022M720385) and Beijing JST Research Funding (YGQ-202313).

Author contribution

Z.S., J.Y., and N.Z.: conceptualization, methodology, data curation, formal analysis, resources, investigation, and writing – original draft; H.W. and C.L.: conceptualization, methodology, formal analysis, resources, investigation, and writing – review and editing.

Conflicts of interest disclosure

The authors declare no conflicts of interest.

Research registration unique identifying number (UIN)

  1. Name of the registry: not applicable.

  2. Unique identifying number or registration ID: not applicable.

  3. Hyperlink to your specific registration (must be publicly accessible and will be checked): not applicable.

Guarantor

Nan Zhang, Haiyang Wu, and Cheng Li.

Data availability statement

The data underlying this article will be shared by the corresponding author on reasonable request.

Footnotes

Zaijie Sun, Jia Yang, and Nan Zhang contributed equally to the study.

Sponsorships or competing interests that may be relevant to content are disclosed at the end of this article.

Contributor Information

Zaijie Sun, Email: gukeszj@163.com.

Jia Yang, Email: yj19860720@tmu.edu.cn.

Nan Zhang, Email: nancynancy1992@163.com.

Haiyang Wu, Email: wuhaiyang2021@tmu.edu.cn.

Cheng Li, Email: lichengcharite@gmail.com.

References

  • 1.Deng L, Wang T, Yangzhang, et al. Evaluation of large language models in breast cancer clinical scenarios: a comparative analysis based on ChatGPT-3.5, ChatGPT-4.0, and Claude2. Int J Surg 2024;110:1941–1950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Cheng K, Wu H, Li C. ChatGPT/GPT-4: enabling a new era of surgical oncology. Int J Surg 2023;109:2549–2550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Zhang N, Sun Z, Xie Y, et al. The latest version ChatGPT powered by GPT-4o: what will it bring to the medical field? Int J Surg 2024;Jun 10[Epub ahead of print]. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Zhu N, Zhang N, Shao Q, et al. OpenAI’s GPT-4o in surgical oncology: revolutionary advances in generative artificial intelligence. Eur J Cancer 2024;206:114132. [DOI] [PubMed] [Google Scholar]
  • 5.Temsah MH, Jamal A, Alhasan K, et al. Transforming virtual healthcare: the potentials of ChatGPT-4omni in telemedicine. Cureus 2024;16:e61377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sonoda Y, Kurokawa R, Nakamura Y, et al. Diagnostic performances of GPT-4o, Claude 3 Opus, and Gemini 1.5 Pro in “Diagnosis Please” cases. medRxiv 2024. 10.1007/s11604-024-01619-y [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data underlying this article will be shared by the corresponding author on reasonable request.


Articles from International Journal of Surgery (London, England) are provided here courtesy of Wolters Kluwer Health

RESOURCES