Radiomics analysis for the early diagnosis of common sexually transmitted infections and skin lesions

Jiajun Sun; Zhen Yu; Yingping Li; Janet M Towns; Lin Zhang; Jason J Ong; Zongyuan Ge; Christopher K Fairley; Lei Zhang

doi:10.1371/journal.pdig.0000926

. 2025 Jul 23;4(7):e0000926. doi: 10.1371/journal.pdig.0000926

Radiomics analysis for the early diagnosis of common sexually transmitted infections and skin lesions

Jiajun Sun ^1,², Zhen Yu ^3,^4,^*, Yingping Li ⁵, Janet M Towns ^1,², Lin Zhang ^6,⁷, Jason J Ong ^1,², Zongyuan Ge ^3,⁴, Christopher K Fairley ^1,², Lei Zhang ^1,^2,^8,^9,^*

Editor: Hisham Al-Obaidi¹⁰

¹Melbourne Sexual Health Centre, Alfred Health, Melbourne, Victoria, Australia

²School of Translational Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Clayton, Victoria, Australia

³AIM for Health Lab, Monash University, Clayton, Victoria, Australia

⁴Faculty of IT, Monash University, Clayton, Victoria, Australia

⁵School of Artificial Intelligence, Xidian University, Xi’an, China

⁶Suzhou Industrial Park Monash Research Institute of Science and Technology, Suzhou, China

⁷School of Public Health and Preventative Medicine, School of Medicine, Nursing and Health Sciences, Monash University, Clayton, Victoria, Australia

⁸Phase I Clinical Trial Research Ward, The Second Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China

⁹China-Australia Joint Research Center for Infectious Diseases, School of Public Health, Xi’an Jiaotong University Health Science Center, Xi’an, China

¹⁰University of Reading Reading School of Pharmacy, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND

The authors have declared that no competing interests exist.

^✉

* E-mail: Zhen.Yu@monash.edu (ZU); lei.zhang1@monash.edu (LZ)

Roles

Jiajun Sun: Data curation, Formal analysis, Investigation, Project administration, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

Zhen Yu: Methodology, Visualization, Writing – review & editing

Yingping Li: Methodology, Validation, Visualization, Writing – original draft

Janet M Towns: Investigation, Writing – review & editing

Lin Zhang: Supervision, Writing – review & editing

Jason J Ong: Funding acquisition, Resources, Supervision, Writing – review & editing

Zongyuan Ge: Supervision, Writing – review & editing

Christopher K Fairley: Funding acquisition, Resources, Supervision, Writing – review & editing

Lei Zhang: Investigation, Methodology, Supervision, Visualization, Writing – review & editing

Hisham Al-Obaidi: Editor

PMCID: PMC12286352 PMID: 40700364

Abstract

Early identification of sexually transmitted infection (STI) symptoms can prevent subsequent complications and improve STI control. We analysed 597 images from STIAtlas and categorised the images into four typical STIs and two skin lesions by the anatomical sites of infections. We first applied nine image filters and 11 machine-learning image classifiers to the images. We then extracted radiomics features from the filtered images and trained them with 99 models that combined image filters and classifiers. Model performance was evaluated by area under curve (AUC) and permutation importance. When the information of infection sites was unspecified, a combined Gradient-Boosted Decision Trees (GBDT) classifier and Laplacian of Gaussian (LoG) filter model achieved the best overall performance with an average AUC of 0.681 (95% CI 0.628-0.734). This model predicted best for lichen sclerosus (AUC = 0.768, 0.740-0.796). The incorporation of infection site information led to a substantial improvement in the model’s performance, with 22.3% improvement for anal infections (AUC = 0.833, 0.687-0.979) and 3.8% for skin infections (AUC = 0.707, 0.608-0.806). Lesion texture and statistical radiomics features were the most predictive for STIs. Combining machine learning and radiomics techniques is an effective method to categorise skin lesions associated with STIs clinically.

Author summary

We developed an artificial intelligence tool that can help identify sexually transmitted infections (STIs) from photographs of skin lesions. Using machine learning and a technique called radiomics—which extracts detailed information about texture and shape from medical images—we analysed 597 images from the STI Atlas database covering four common STIs and two skin conditions. Our approach combines computer algorithms with radiomics to automatically detect features in skin images that might indicate specific infections. We found that when we included information about where on the body the infection appeared (genitals, anus, or other skin areas), our tool’s accuracy improved significantly. The biggest improvement was for anal infections, where accuracy increased by over 22%. This technology could be particularly valuable in areas with limited access to healthcare specialists, allowing people to take photographs with their smartphones for preliminary assessment. While not intended to replace clinical diagnosis, our tool could help people decide whether they need urgent medical attention. This aligns with global health efforts to improve early detection and treatment of sexually transmitted infections, potentially reducing transmission and complications in communities worldwide.

Introduction

Sexually transmitted infections (STIs) pose a major public health challenge. According to the World Health Organization (WHO), an estimated 374 million new infections with trichomoniasis, chlamydia, gonorrhoea, and syphilis occurred globally in 2020 [1]. Furthermore, the prevalence of STIs has been steadily increasing in recent years. For example, there was a 72.5% increase in the number of trichomoniasis cases between 1990 and 2019 which rose from 205.4 million to 354.5 million. Similarly, syphilis cases rose by 60% to 14.1 million from 8.8 million during the same period [2]. In 2020, out of 7.1 million new syphilis cases, 1 million occurred among pregnant women aged 15–49 years old [3]. Pregnant women living with HIV and with concomitant STIs face a twofold higher risk of preterm delivery [4], and untreated syphilis during pregnancy carries a 25% risk of stillbirth. Furthermore, the vertical transmission rate of untreated syphilis from mother to child in the third trimester of pregnancy can be as high as 60–100% [5]. In addressing the concerning rise of STIs, the WHO has set ambitious targets to promote the end of STIs by 2030 [6].

To achieve the goal of ending the spread of STIs, the WHO has developed specific strategies for the prevention and early detection of STIs. The WHO strategies on HIV and sexually transmitted infections [6] all highlight the importance of early diagnosis for reducing transmission and the impact of STIs on individuals and communities. Specifically, the WHO required that by 2030, the annual coverage of syphilis and gonorrhoea screening among key populations exceeds 90%. The WHO advocates for the implementation of targeted strategies aimed at early detection and treatment, emphasising the importance of timely interventions to prevent complications and transmission. The WHO advocates for the implementation of the “foster innovations for impact” strategy, prioritizing the combination of the latest technologies to establish screening tools for the early detection and treatment of STIs.

The current model for STI care where individuals attend a health care service depends on them accurately recognising symptoms of STIs and there being adequate and accessible health services [7–9]. To overcome these challenges, recent research has explored the potential of artificial intelligence (AI) as a tool assisting individuals in determining if they need urgent STI care or not thereby facilitating urgent presentations for those with STIs and avoiding unnecessary presentations for those without STIs [10]. Machine learning methods consistently outperform traditional multivariate logistic regression in the prediction of infection risk for HIV/STIs based on clinical records [11]. MySTIRisk uses self-reported information from individuals attending an STI clinic in an AI-based risk assessment tool capable of predicting syphilis with an AUC of 0.84 [12,13]. Perhaps the greatest role for AI is in Low- or Middle-Income Countries (LMICs), where trained clinicians and medical resources are scarce, and traditional healthcare infrastructure is limited [14,15]. The introduction of AI-assisted tools could revolutionise STI care by bringing it directly to individuals’ mobile phones.

A further tool that could potentially improve the accuracy of AI for STI diagnosis is radiomics [16]. AI using radiomics was initially developed for tumour detection, where it demonstrated high accuracy. However, AI tools are often perceived as black boxes, lacking clinically persuasive interpretability [17]. But radiomics is less of a black box, radiomics extracts clinically significant information, such as texture and shape details, from medical images [18]. The advantages of radiomics tools have meant that they have been rapidly applied to medical image processing such as computed tomography (CT), magnetic resonance imaging (MRI) and positron emission tomography (PET) to facilitate a more accurate diagnosis [19]. However, we believe the application of radiomics for STI detection is a relatively unexplored area.

The other factor that could potentially improve the AI diagnosis of STIs is the anatomical site of the lesion. As far as we know, none of the existing studies investigate STI lesions in multiple sites. We hypothesise that the inclusion of infection site information in the models may substantially improve the performance of the model. This variability of images at different anatomical locations may be a key factor that influences the radiomics-based AI models for STI detection. For instance, early syphilis infection presents distinct clinical manifestations depending on the body area affected [20]. On the trunk and limbs, it typically appears as a macular or papulosquamous eruption, characterised by flat or slightly raised lesions. In contrast, within the genito-anal region, the infection often manifests as confluent nodules of condyloma latum, which are larger, raised lesions that tend to merge together [21]. In research on STI skin diagnosis tools, this factor has not been adequately investigated or analysed.

We aim to develop an AI-assisted testing tool for common STIs by integrating machine learning and radiomics approaches. The Melbourne Sexual Health Centre (MSHC), a public sexual health clinic in Melbourne, Australia, has collected a substantial database of images, and it has been publicly available at STI Atlas [22]. This exploratory study may facilitate earlier detection of several types of STIs which could lead to skin lesions, aiding in the prevention of further transmission and contributing significantly to public health efforts.

Methods

We developed models for the diagnosis of skin lesions from four common sexually transmitted infections and two skin lesions. We built these models by combining radiomics technology and deep learning classifiers. Before building models, we collected STI images from STI ATLAS (https://stiatlas.org/), which supplied skin images with infection types and body sites. All diagnoses for the cases in the STI Atlas dataset have been confirmed with serological and/or molecular testing, serving as the gold standard for STI diagnosis. To ensure the reliability of manual segmentation, two STI specialists independently segmented these images, with inter-observer agreement evaluated by a third research member. Additionally, one specialist re-evaluated the same subset after a one-month interval to assess intra-observer consistency. In cases of discrepancies, the specialists discussed and reached a consensus to finalise the segmentation. Then, we used the radiomics technique to extract radiomics features and selected low-correlation features based on the Pearson correlation coefficient. Finally, we trained and compared several deep-learning classifiers. The structure of our model is shown in S1 Fig. The models we developed could identify common STIs and can be applied to self-diagnosis.

STIs images collection

We trained our model on STI ATLAS. It provides royalty-free, high-quality images of STIs as an educational resource. We extracted 945 of the images and infection site information on the website and saved it as an original dataset. JT, JO and CKF are STI specialists with extensive clinical experience. Before proceeding, JO and CKF each conducted a thorough review of all images to confirm both the presence of an STI and the absence of identifying information. Images with different opinions were removed. Following the initial review, CKF, JO, and LZ collaborated to re-evaluate the remaining images, summarising the depicted body sites and types of sexually transmitted infections. This resulted in the identification of 21 distinct infection types across seven different body sites. Our initial dataset included images of diverse STIs, but some were too uncommon (less than 10 cases each). After excluding these, we had a final dataset of 597 images for analysis.

Manual lesion segmentation for the region of interest

Regions of interest (ROIs) refer to manually labelled polygonal regions on the image. Radiomics techniques could extract extra radiomics features from each ROI. The accuracy of the labelled regions has a crucial impact on the model accuracy. We utilised the LabelMe tool [23] for ROI labelling. To ensure consistency and reliability, JJS initially trained JT and CKF on the use of the LabelMe tool, emphasising the importance of expanding segmentation areas beyond the immediate infection to capture subtle changes like colour variations. JT and CKF then independently completed the manual segmentation, with JJS overseeing quality control, documenting progress, and harmonising any discrepancies through regular meetings.

Images derived by filters

We applied nine built-in filters provided by pyradiomics [24], an open-source Python package for the extraction of radiomics features from medical imaging, to generate derived images from the original skin images. Applying image filters significantly enhanced key features in the original images, particularly texture and boundary details, allowing for clearer differentiation between infected and normal areas. These image filters included the original grey filter, gaussian laplace (LoG) filter, gradient filter, square filter, square root filter, logarithm filter, exponential filter, two-dimensional local binary pattern (LBP2D) and wavelet filters targeting specific edge features (four wavelet sub-filters). We applied each of these nine filters to the original images, generating the derived images shown in S2 Fig. These filtered images then served as inputs for radiomics feature extraction.

Radiomics feature extraction and selection

From each ROI in the derived images, we extracted 102 radiomics features using the Python package pyradiomics. To comprehensively analyse the infection areas, we categorised the extracted 102 radiomics features into four groups. The first group encompassed statistical features, incorporating 18 first-order features such as the range of grey values. The second group focused on shape features, including nine 2D shape-based (shape2D) features. The third group involved texture features, comprising 24 grayscale co-occurrence matrix (GLCM) features, 16 grayscale run length matrix (GLRLM) features, 16 grayscale size region matrix (GLSZM) features, and five neighbouring grey tone difference matrix (NGTDM) features. The final group centred on voxel dependence features, including 14 grayscale dependence matrix (GLDM) features. Through meticulous comparisons of filter performance, we pinpointed the most effective filter for each prediction classifier.

We filtered radiomics features based on their correlation to ensure robust and accurate analysis. Using the Pearson correlation coefficient, we identified and eliminated features exhibiting strong dependencies with others. We considered features with absolute correlation values exceeding 0.8 as highly correlated and likely to introduce redundancy or instability into prediction models. Consequently, we initially identified pairs of such highly correlated features and then calculated the sum of the absolute correlation coefficients between each of these pairs and all other remaining features. Subsequently, we removed the feature from each correlated pair with the highest sum of correlations with the other features. By removing strong correlation features, we mitigated their potential adverse effects on prediction accuracy and ensured the model’s focus on independent, informative features.

We standardised the radiomics features prior to inputting them into the machine learning models. This was crucial to guarantee unbiased training and to ensure each feature contributed optimally. The raw values of these features varied significantly, a factor that could potentially bias the model’s learning process. Features with broader value ranges might overshadow those with smaller ranges during training, even though the latter could be equally or more informative. To rectify this, we employed the standard scaling method, transforming each feature to have a mean of 0 and a variance of 1. This method ensures that no single feature dominates or gets ignored due to its original scale. Every radiomics feature contributed meaningfully to the model’s learning and prediction, resulting in more reliable and unbiased results.

Classification model development and interpretation

We conducted a comprehensive evaluation using 11 widely used machine learning classifiers from the PyTorch library [25]. Each classifier was trained on nine image filter-derived images, resulting in a total of 99 models. We further refined the dataset by splitting it based on the infection site, creating dedicated models for each site. Given the limited number of infection cases, we opted for a 5-fold cross-validation strategy for robust performance assessment (S3 Fig). This involved dividing the images into five folds and sequentially training the classifiers on four folds while testing on the remaining fold. To address the class imbalance in our dataset and ensure robust model evaluation, we employed Stratified ShuffleSplit cross-validator for cross-validation. This approach maintains the percentage of samples for each class across the splits while allowing for random sampling with replacement. Due to the stratified nature of the sampling and the need to maintain balanced class proportions, some samples may be selected multiple times across different folds. Each iteration yielded an AUC score, which was then averaged to provide a more reliable performance metric. True/false positive and negative results were further visualised using confusion matrix diagrams.

We used the permutation importance method to evaluate the significance of radiomics features on top models and understand how it makes its predictions. The permutation importance method measured the radiomics feature’s importance by shuffling its values across all cases and observing the change in model performance. We repeated the shuffling process 100 times for each radiomics feature and calculated the average decrease in AUC. Higher drops indicated a greater influence on the model’s performance, revealing the most critical features for its predictions. This analysis revealed the most influential radiomics features in the models, providing valuable insights into its decision-making process and ultimately boosting understanding of the model’s interpretability.

Ethics statement

Permission to conduct the project was submitted and approved by the Ethics Committee of the Alfred Hospital (Ethics Project No. 191/23). The images in this study are from STI Atlas and do not contain any identifiable information. This study does not include factors necessitating patient consent.

Results

Description of image data

We focused on four STIs and two skin lesions across three key body sites: genitals, anus, and other skin areas. “Genitals” referred to external genitalia, including the penis, scrotum, vulva, and perineum. “Anus” specifically denoted the perianal region. “Other skin” referred to areas excluding the genitals and anus, covering other body regions. These prevalent conditions captured in our dataset included sexually transmitted infections that caused skin lesions, such as early syphilis (143 cases), herpes (128 cases), warts (127 cases), and molluscum contagiosum (66 cases), as well as general skin lesions, including lichen sclerosus (145 cases) and tinea (61 cases). S1 Table provides a description of our dataset, detailing the distribution of infections across the sites with herpes being documented in 36 genital and 25 other skin cases, lichen sclerosus in 127 genital and 18 other skin cases, molluscum contagiosum in 12 genital and 54 other skin cases, early syphilis in 45 genital, 12 anal, and 86 other skin cases, tinea exclusively in 61 skin cases, and warts in 54 genital, 15 anal, and 52 other skin cases.

We categorized the images into three groups based on the infection site: genitals (274 images), skin (296 images), and anus (26 images), as detailed in S1 Table. The performance of the models for unspecified infection sites is detailed in S2–S5 Tables. S6–S9 Tables presents a comprehensive performance comparison of all models across the three infection sites. The confusion matrix in Fig 1 represents the aggregated results across all cross-validation folds, rather than predictions on unique samples. This methodology was chosen to provide a comprehensive view of the model’s performance across different data splits while maintaining class balance. Some samples may appear multiple times in the final confusion matrix, but this stratified sampling strategy is essential for reliable performance evaluation when dealing with imbalanced datasets.

Models performance when infection sites were unspecified

Fig 2A demonstrated the model performance for four STIs and two skin lesions when infection sites were unspecified. The combined GBDT with LoG filter performed best, with an average AUC of 0.681 (95% CI, 0.628-0.734) on the test dataset. With this best-performed model shown in Fig 3, the prediction of lichen sclerosus showed the highest AUC of 0.768 (95% CI, 0.74-0.796). Followed by molluscum contagiosum, with an AUC of 0.751 (95% CI, 0.701-0.801). Warts ranked third, with an AUC of 0.691 (95% CI, 0.656-0.726).

The confusion matrix for the highest AUC models when infection sites were unspecified is shown in Fig 1A. The model correctly identified 16 out of 60 cases (26.7%) with herpes, 103 out of 145 cases (71.0%) with lichen sclerosus, 36 out of 65 cases (55.4%) with molluscum contagiosum, 65 out of 145 cases (44.8%) with early syphilis, 18 out of 60 cases (30.0%) with tinea, and 64 out of 125 cases (51.2%) with warts. Overall, the accuracy of the model with unspecific sites is 50.3%.

The top 10 radiomics features, as determined by permutation importance for the leading models in Fig 4A, were ranked from most to least significant. Maximum was the most important feature for the unspecific sites model. Randomly disrupting it caused the model AUC to decrease by 0.023 (95% CI 0.022-0.024). And the majority of the top 10 radiomics features are statistical features and textural features. This suggested that the best model was primarily relying on the statistics of pixels and texture of the lesions to make predictions.

Models performance on genital conditions

As shown in Fig 2B, for infections on the genitals, the combined MLP classifier with the exponential filter showed the best performance, with an average AUC of 0.647 (95% CI, 0.553-0.741) on the test dataset. As shown in Fig 3, the genitals model was slightly more effective at predicting lichen sclerosus (AUC, 0.774, 0.78% improved) and warts (AUC, 0.739, 6.95% improved) than when no specific infection site was considered. As shown in Fig 4B, the major axis length was the most important feature for the genitals model. Randomly disrupting it caused the model AUC to decrease by 0.075 (95%CI 0.07-0.08).

The confusion matrix with genitals site information in 1B showed that the model correctly identified 6 out of 35 cases (17.1%) with herpes, 99 out of 130 cases (76.1%) with lichen sclerosus, 2 out of 10 cases (20.0%) with molluscum contagiosum, 15 out of 45 cases (33.3%) with early syphilis, and 33 out of 55 cases (60.0%) with warts. Overall, the accuracy of the model with genitals site information is 56.4%.

Models performance on anus infections

Regarding infections on the anus shown in Fig 2C, the combined Gaussian Naive Bayes with the square root filter was the most effective, with an average AUC of 0.833 (95% CI, 0.687 - 0.979). Compared with models without infected body site specification in Fig 3, the AUC made a substantial increase of 0.152 (22.3%), reflecting a notable improvement in the model’s predictive accuracy for early syphilis and warts infections. As shown in Fig 4C, cluster shade was the most important feature of the anus model. Randomly disrupting it caused the model AUC to decrease by 0.098 (95% CI 0.084-0.11).

The confusion matrix in Fig 1C with anus site information showed that the model correctly identified 11 out of 15 cases (73.3%) with early syphilis, and 14 out of 15 cases (93.3%) with warts. Overall, the accuracy of the model with anus site information is 83.3%.

Models performance on other skin infections

In the case of infections on the skin shown in Fig 2D, the combined Logistic Regression with the LoG filter demonstrated superior performance, with an average AUC of 0.707 (95% CI, 0.608-0.806). When compared with models that did not specify infected body sites in Fig 3, the AUC increased by 0.026 (3.82%), indicating a significant enhancement in the model’s ability to predict skin infections accurately. As shown in Fig 4D, zone percentage was the most important feature for skin models. Randomly disrupting it caused the model AUC to decrease by 0.078 (95%CI 0.073-0.083).

The confusion matrix with skin in Fig 1D showed that the model correctly identified 12 out of 25 cases (48.0%) with herpes, 11 out of 20 cases (55.0%) with lichen sclerosus, 31 out of 55 cases (56.4%) with molluscum contagiosum, 38 out of 85 cases (44.7%) with early syphilis, 37 out of 60 cases (61.7%) with tinea, and 24 out of 55 cases (43.6%) with warts. Overall, the accuracy of the model with skin site information is 51.0%.

Models comparison and radiomics feature interpretability

Fig 3 illustrates an enhancement in the model’s predictive accuracy for most infections when the site of infection is taken into account. The prediction of early syphilis at the anus site showed the most substantial improvement. In terms of feature importance in Fig 4, texture features ranked highest, followed by statistical features. For two out of the four infection sites, texture features emerged as the most crucial, while for the genitals site, shape features were predominant. Additionally, statistical features were identified as the second most significant at three sites. This pattern suggests that the radiomics tool predominantly relies on the texture of the STI skin lesions for making predictions.

Discussion

Our study integrated machine learning and radiomics approaches to identify key features of STIs and skin lesions based on a sizable database of skin images. The ridge classifier, when paired with a LoG filter, showed superior performance, especially for non-specific sites. Integrating information about the infection site into the model markedly enhanced its accuracy, with notable improvements for anal infections. The AUC for the prediction of early syphilis infection in the peri-anal region increased significantly by 37.5% (AUC improved from 0.606 to 0.833). To our knowledge, this is the first study employing radiomics for STI diagnosis from skin images. We employed permutation importance to pinpoint texture and statistical features are very important in identifying STIs in all infection sites.

Our findings underscore the potential of radiomics technology in self-testing for sexually transmitted infections. We propose an innovative approach that synergizes machine learning with radiomics, fostering an AI-driven, convenient self-testing method using skin images. This strategy significantly boosts accuracy and efficiency, bypassing invasive procedures and facilitating accessible testing via smart devices. This is particularly beneficial for remote or underserved communities with limited access to specialists. Enhancing the image dataset and integrating clinical data could further refine these AI self-diagnostic tools, increasing their precision. Compared to conventional healthcare interactions, our tool offers digital healthcare solutions that support private, anonymous, and secure early diagnosis. This advancement aligns with timely and effective sexual health services, contributing to the WHO’s global STI control efforts [1].

Our study found that including infection sites as the models’ parameter could help to improve the classifier accuracy when predicting STIs. By incorporating body site information into the model, we were able to achieve a significant improvement in accuracy. This finding has important implications for the development of clinical decision support systems for STI diagnosis. These results suggest that considering body site information can significantly improve the accuracy of the model for predicting STIs. Specifically, the model’s accuracy for predicting lichen sclerosus, early syphilis and warts increased significantly when body site information was considered. The most significant increase in predictive performance was observed for early syphilis, with an average increase in AUC of 30.0%. This is likely because body site information can provide additional clues about the type of STI that is present. For example, lichen sclerosus primarily affects the genital area and around the anus [26,27], early syphilis often leads to lesions on skin [28], while warts are more likely to occur on the anus [29]. Other clinical data such as information on symptoms presentation and sexual behavioural patterns of the individuals may also improve the model’s accuracy [30,31].

Our study found the texture and statistical features emerged as highly sensitive to skin lesions, demonstrating a positive correlation with accurate STI classification. The predictive model on the genitals site emphasizes the importance of shape characteristics, revealing that different STIs occurring at the site exhibit unique shape characteristics. This suggests they hold valuable information for discriminating between STI and non-STI cases. The high importance of these features in our machine learning models implies they capture crucial differences in the texture or structure of medical images, potentially reflecting underlying STI pathology. Analysing feature importance within these models can therefore offer reasonable explanations for the skin-related symptoms observed in STIs. Ultimately, high-importance radiomics features shed light on the morphological characteristics of STI sites, providing valuable insights for both diagnosis and understanding of disease progression.

Our study also has several limitations. Firstly, the types of infections covered in our dataset are limited. We only focused on a small range of infection types commonly captured in our clinic. Secondly, there was an imbalance in the number of images representing each infection type within our dataset. We captured as many skin images as we could, but infections in some body sites are relatively rare. This imbalance contributed to the discrepancy between sensitivity and specificity in our model. Moreover, within each infection category, the available image count is constrained, potentially affecting the model’s ability to discern subtle nuances associated with specific infections. Furthermore, the accuracy of our models is contingent upon the manual annotation of infection regions. Relying on hand-annotated data introduces a degree of subjectivity and potential bias, which may influence the model’s performance, mainly when applied to diverse and real-world clinical scenarios. In our forthcoming efforts, we aim to address this limitation by delving into the development of automated annotation methods for infection sites. By exploring techniques for unbiased and high-precision region annotation, we anticipate enhancing the accuracy and reliability of our models.

In conclusion, our study demonstrates that combining machine learning with radiomics can effectively utilise skin images for early STI diagnosis. Incorporating infected body site information into models significantly improves predictive accuracy.

Supporting information

S1 Fig. The structure of the typical STI prediction models.

(DOCX)

pdig.0000926.s001.docx^{(350.4KB, docx)}

S2 Fig. Nine different filters.

(A) Original grey filter, (B) Gaussian Laplace (LoG) filter, (C) gradient filter, (D) square filter, (E) square root filter, (F) logarithm filter, (G) exponential filter,(H) two dimension local binary pattern (LBP2D), (I) HH Wavelet filter (extract diagonal features), (J) HL Wavelet filter (extract vertical features), (K) LH Wavelet filter (extract horizontal features), (L) LL Wavelet filter (the approximate image).

(DOCX)

pdig.0000926.s002.docx^{(483.9KB, docx)}

S3 Fig. Methods of dividing images into training and testing data.

(DOCX)

pdig.0000926.s003.docx^{(94.7KB, docx)}

S1 Table. General information about the STIs images dataset.

(DOCX)

pdig.0000926.s004.docx^{(15.8KB, docx)}

S2 Table. AUC results for ten classifiers with nine filters on unspecific sites.

(DOCX)

pdig.0000926.s005.docx^{(30.8KB, docx)}

S3 Table. Accuracy results for ten classifiers with nine filters on unspecific sites.

(DOCX)

pdig.0000926.s006.docx^{(30.7KB, docx)}

S4 Table. Specificity results for ten classifiers with nine filters on unspecific sites.

(DOCX)

pdig.0000926.s007.docx^{(30.8KB, docx)}

S5 Table. Sensitivity results for ten classifiers with nine filters on unspecific sites.

(DOCX)

pdig.0000926.s008.docx^{(30.7KB, docx)}

S6 Table. AUC results of the classifiers with three infection body sites.

(DOCX)

pdig.0000926.s009.docx^{(57.4KB, docx)}

S7 Table. Accuracy results of the classifiers with three infection body sites.

(DOCX)

pdig.0000926.s010.docx^{(58.6KB, docx)}

S8 Table. Specificity results of the classifiers with three infection body sites.

(DOCX)

pdig.0000926.s011.docx^{(60.3KB, docx)}

S9 Table. Sensitivity results of the classifiers with three infection body sites.

(DOCX)

pdig.0000926.s012.docx^{(58.5KB, docx)}

Data Availability

All data in this study is public available on https://stiatlas.org/.

Funding Statement

The author(s) received no specific funding for this work.

References

1.World Health Organization. Sexually transmitted infections (STIs) [cited 2023 1 December]. Available from: https://www.who.int/news-room/fact-sheets/detail/sexually-transmitted-infections-(stis) [Google Scholar]
2.Du M, Yan W, Jing W, Qin C, Liu Q, Liu M, et al. Increasing incidence rates of sexually transmitted infections from 2010 to 2019: an analysis of temporal trends by geographical regions and age groups from the 2019 Global Burden of Disease Study. BMC Infect Dis. 2022;22(1):574. doi: 10.1186/s12879-022-07544-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
3.WHO. Global progress report on HIV, viral hepatitis and sexually transmitted infections. World Health Organization; 2021. [Google Scholar]
4.Burnett E, Loucks TL, Lindsay M. Perinatal outcomes in HIV positive pregnant women with concomitant sexually transmitted infections. Infect Dis Obstet Gynecol. 2015;2015:508482. doi: 10.1155/2015/508482 [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Sankaran D, Partridge E, Lakshminrusimha S. Congenital Syphilis-An Illustrative Review. Children (Basel). 2023;10(8):1310. doi: 10.3390/children10081310 [DOI] [PMC free article] [PubMed] [Google Scholar]
6.World Health Organization. Global health sector strategies on, respectively, HIV, viral hepatitis and sexually transmitted infections for the period 2022-2030. 2022. [cited 2025 03/25]. Available from: https://www.who.int/publications/i/item/9789240053779 [Google Scholar]
7.Jenkins WD, Williams LD, Pearson WS. Sexually Transmitted Infection Epidemiology and Care in Rural Areas: A Narrative Review. Sex Transm Dis. 2021;48(12):e236–40. doi: 10.1097/OLQ.0000000000001512 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Gruber AG, Pereira PMB, de Souza Goldim MP, Cassol MEG, Portela WF, de Bitencourt RM. Sexually transmitted infections in indigenous communities of the Alto Rio Solim es. Brazilian J Sexually Transmit Dis. 2021;33. [Google Scholar]
9.Okoboi S, Castelnuovo B, Moore DM, Musaazi J, Kambugu A, Birungi J, et al. Incidence rate of sexually transmitted infections among HIV infected patients on long-term ART in an urban and a rural clinic in Uganda. BMC Public Health. 2019;19:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Thomford NE, Bope CD, Agamah FE, Dzobo K, Owusu Ateko R, Chimusa E, et al. Implementing Artificial Intelligence and Digital Health in Resource-Limited Settings? Top 10 Lessons We Learned in Congenital Heart Defects and Cardiology. OMICS. 2020;24(5):264–77. doi: 10.1089/omi.2019.0142 [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Bao Y, Medland NA, Fairley CK, Wu J, Shang X, Chow EPF, et al. Predicting the diagnosis of HIV and sexually transmitted infections among men who have sex with men using machine learning approaches. J Infect. 2021;82(1):48–59. doi: 10.1016/j.jinf.2020.11.007 [DOI] [PubMed] [Google Scholar]
12.Xu X, Yu Z, Ge Z, Chow EPF, Bao Y, Ong JJ, et al. Web-Based Risk Prediction Tool for an Individual’s Risk of HIV and Sexually Transmitted Infections Using Machine Learning Algorithms: Development and External Validation Study. J Med Internet Res. 2022;24(8):e37850. doi: 10.2196/37850 [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Latt PM, Soe NN, Xu X, Ong JJ, Chow EPF, Fairley CK, et al. Identifying Individuals at High Risk for HIV and Sexually Transmitted Infections With an Artificial Intelligence-Based Risk Assessment Tool. Open Forum Infect Dis. 2024;11(3):ofae011. doi: 10.1093/ofid/ofae011 [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Mollura DJ, Culp MP, Pollack E, Battino G, Scheel JR, Mango VL, et al. Artificial Intelligence in Low- and Middle-Income Countries: Innovating Global Health Radiology. Radiology. 2020;297(3):513–20. doi: 10.1148/radiol.2020201434 [DOI] [PubMed] [Google Scholar]
15.Williams D, Hornung H, Nadimpalli A, Peery A. Deep Learning and its Application for Healthcare Delivery in Low and Middle Income Countries. Front Artif Intell. 2021;4:553987. doi: 10.3389/frai.2021.553987 [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Papadimitroulas P, Brocki L, Christopher Chung N, Marchadour W, Vermet F, Gaubert L, et al. Artificial intelligence: Deep learning in oncological radiomics and challenges of interpretability and data harmonization. Phys Med. 2021;83:108–21. doi: 10.1016/j.ejmp.2021.03.009 [DOI] [PubMed] [Google Scholar]
17.Zhang R, Wei Y, Shi F, Ren J, Zhou Q, Li W, et al. The diagnostic and prognostic value of radiomics and deep learning technologies for patients with solid pulmonary nodules in chest CT images. BMC Cancer. 2022;22(1):1118. doi: 10.1186/s12885-022-10224-z [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Koçak B. Key concepts, common pitfalls, and best practices in artificial intelligence and machine learning: focus on radiomics. Diagn Interv Radiol. 2022;28(5):450–62. doi: 10.5152/dir.2022.211297 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Rogers W, Thulasi Seetha S, Refaee TAG, Lieverse RIY, Granzier RWY, Ibrahim A, et al. Radiomics: from qualitative to quantitative imaging. Br J Radiol. 2020;93(1108):20190948. doi: 10.1259/bjr.20190948 [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Cantisani C, Rega F, Ambrosio L, Grieco T, Kiss N, Meznerics FA, et al. Syphilis, the Great Imitator-Clinical and Dermoscopic Features of a Rare Presentation of Secondary Syphilis. Int J Environ Res Public Health. 2023;20(2):1339. doi: 10.3390/ijerph20021339 [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Balagula Y, Mattei PL, Wisco OJ, Erdag G, Chien AL. The great imitator revisited: the spectrum of atypical cutaneous manifestations of secondary syphilis. Int J Dermatol. 2014;53(12):1434–41. doi: 10.1111/ijd.12518 [DOI] [PubMed] [Google Scholar]
22.Morton A, Bradshaw C, Fairley C, Lee D, Henzell H, Williams H, et al. STIATLAS [cited 2024 1 Jan]. Available from: https://stiatlas.org/ [Google Scholar]
23.Wada K. Labelme: Image Polygonal Annotation with Python. Available from: https://github.com/wkentaro/labelme [Google Scholar]
24.van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017;77(21):e104–7. doi: 10.1158/0008-5472.CAN-17-0339 [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Paszke A, Gross S, Chintala S, Chanan G, Yang E, DeVito Z, et al. Automatic differentiation in pytorch. 2017.
26.Arif T, Fatima R, Sami M. Extragenital lichen sclerosus: A comprehensive review. Australas J Dermatol. 2022;63(4):452–62. doi: 10.1111/ajd.13890 [DOI] [PubMed] [Google Scholar]
27.Chi C-C, Kirtschig G, Baldo M, Brackenbury F, Lewis F, Wojnarowska F. Topical interventions for genital lichen sclerosus. Cochrane Database Syst Rev. 2011;2011(12):CD008240. doi: 10.1002/14651858.CD008240.pub2 [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Çakmak SK, Tamer E, Karadağ AS, Waugh M. Syphilis: A great imitator. Clin Dermatol. 2019;37(3):182–91. doi: 10.1016/j.clindermatol.2019.01.007 [DOI] [PubMed] [Google Scholar]
29.Wilson M, Wilson PJK. Genital Warts. In: Close Encounters of the Microbial Kind. 2021. doi: 10.1007/978-3-030-56978-5_30 [DOI] [Google Scholar]
30.Soe NN, Latt PM, Yu Z, Lee D, Kim C-M, Tran D, et al. Clinical features-based machine learning models to separate sexually transmitted infections from other skin diagnoses. J Infect. 2024;88(4):106128. doi: 10.1016/j.jinf.2024.106128 [DOI] [PubMed] [Google Scholar]
31.Latt PM, Soe NN, Xu X, Ong JJ, Chow EPF, Fairley CK, et al. Identifying Individuals at High Risk for HIV and Sexually Transmitted Infections With an Artificial Intelligence-Based Risk Assessment Tool. Open Forum Infect Dis. 2024;11(3):ofae011. doi: 10.1093/ofid/ofae011 [DOI] [PMC free article] [PubMed] [Google Scholar]

PLOS Digit Health. doi: 10.1371/journal.pdig.0000926.r001

Decision Letter 0

Hisham Al-Obaidi, Fiona Kolbinger

PDIG-D-24-00497Radiomics Analysis for the Early Diagnosis of Common Sexually Transmitted Infections and Skin LesionsPLOS Digital Health Dear Dr. Sun, Thank you for submitting your manuscript to PLOS Digital Health. After careful consideration, we feel that it has merit but does not fully meet PLOS Digital Health's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript within 60 days May 12 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at digitalhealth@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pdig/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers '. This file does not need to include responses to any formatting updates and technical items listed in the 'Journal Requirements' section below.* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes '.* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript '. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. We look forward to receiving your revised manuscript. Kind regards, Hisham Al-Obaidi, PHDAcademic EditorPLOS Digital Health Hisham Al-ObaidiAcademic EditorPLOS Digital Health Leo Anthony CeliEditor-in-ChiefPLOS Digital Healthorcid.org/0000-0001-6712-6626 Journal Requirements:

1. Please provide separate figure files in .tif or .eps format.

For more information about figure files please see our guidelines:

https://journals.plos.org/digitalhealth/s/figures

https://journals.plos.org/digitalhealth/s/figures#loc-file-requirements

Additional Editor Comments (if provided): [Note: HTML markup is below. Please do not edit.] Reviewers' Comments: Reviewer's Responses to Questions

Comments to the Author

1. Does this manuscript meet PLOS Digital Health’s publication criteria ? Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe methodologically and ethically rigorous research with conclusions that are appropriately drawn based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available (please refer to the Data Availability Statement at the start of the manuscript PDF file)?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception. The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS Digital Health does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Comment: Radiomics Analysis for the Early Diagnosis of Common Sexually Transmitted Infections and Skin Lesions

1. Line 14-126 The authors asked specialists with extensive clinical experience to confirm the presence of an STI. It seems there is a lack of golden standard to determine STI. It is not sure how reliable about the subjective clinical judgment. The authors need to address this.

2. Line 114-115. The authors used two STI specialists to make manual segmentation to label the infection area. Is there any inter-observer and intra-observer studies to validate the reliability of the segmentation method? If there is any discrepancies among the specialists, what will be done to make the final decision?

3. The study needs to use accuracy, specificity and sensitivity to illustrate the results. I am afraid that since there are imbalance data in each category of skin lesion, this will lead to result with high sensitivity and low specificity. The authors should consider this.

Reviewer #2: The use of radiomics analysis is fairly new in medicine and apart from use in radiology and oncology, it's use is novice in genital dermatology and sexual health medicine. This is another 'tool' (Line 81) which used in conjunction with self-reported risk (presumably symptoms) of STI (Line 78-79) would allow an individual recognising 'signs' and directing them to health-services for confirmation through laboratory techniques and lead to a 'cure'. This would enhance a better 'self diagnosis' that individuals may use "Dr Google" for. It would also enhance a working diagnosis (or differential) of many primary care physicians and nurses, who may not have encountered genital ulcer infections such as syphilis, as the incidence remains low in Australia.

Line 111

4 common STIs and 2 skin lesions - the lesions of concern are reported in 210-212 and perhaps this could have been made clearer in the Methods. The only common STI discussed in Line 48-49 that seems relevant in this study, is syphilis; whereas the author(s) states that the mots global prevalent STIs are trichomiasis, gonococcal and chlamydial infection. The title is mis-leading in that the study looked at infections of genital lesion / ulcers / genital dermatosis.

Line 136

LabelMe tool - there is no reference as to what this 'tool' is - was it developed as part of this study; and if so, has it been validated / standardised. Need more information / reference to this 'tool'

Line 97 - 101 and 323

Has syphilis been diagnostically 'misclassified' as 'secondary syphilis'?

Was radiomics analysis able to define between a primary syphilitic chancre (particularly in the genital / anus) to secondary manifestations (such as Condylomata lata, mucous patches) with / without a 'skin' rash? If radiomics analysis was unable to distinguish these, I suggest using the term "Early Syphilis" versus "Secondary syphilis"

Line 214 / 218 - site of photograph; Table S1, S2, S3

The sites of the photographs need better clarity - what sites are 'genitals' ? (name sites) as well as "Skin" - e.g. keratinised skin versus 'non-keratinised"

References that need attention:

Line 366 and 377

Reference 1 and Reference 6 need to be clearly written per acaedemic standards. Was the source the World Health Organisation (WHO); and it would be preferably followed by a link to a website and when this was last accessed.

Line 420

Reference 22 - this reference is not sufficiently provided as is. Can this reference be linked to a webpage and last accessed please?

Figure 1

This figure took me some time to decipher and is another way of showing AUCs; perhaps could be clearer in terms of graphics for readers.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

Do you want your identity to be public for this peer review? If you choose “no”, your identity will remain anonymous but your review may still be made public.

For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: Yes: Professor Fuk Hay Tang

Reviewer #2: Yes: David Lee

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. If there are other versions of figure files still present in your submission file inventory at resubmission, please replace them with the PACE-processed versions. Reproducibility: To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLOS Digit Health. 2025 Jul 23;4(7):e0000926. doi: 10.1371/journal.pdig.0000926.r002

Author response to Decision Letter 1

10 Apr 2025

Attachment

Submitted filename: 0.Point-by-Point Response to Reviewers.docx

pdig.0000926.s014.docx^{(264.5KB, docx)}

PLOS Digit Health. doi: 10.1371/journal.pdig.0000926.r003

Decision Letter 1

Hisham Al-Obaidi, Fiona Kolbinger

Radiomics Analysis for the Early Diagnosis of Common Sexually Transmitted Infections and Skin Lesions

PDIG-D-24-00497R1

Dear Mr. Sun,

We are pleased to inform you that your manuscript 'Radiomics Analysis for the Early Diagnosis of Common Sexually Transmitted Infections and Skin Lesions' has been provisionally accepted for publication in PLOS Digital Health.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow-up email from a member of our team.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they'll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact digitalhealth@plos.org.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Digital Health.

Best regards,

Hisham Al-Obaidi, PHD

Academic Editor

PLOS Digital Health

***********************************************************

Additional Editor Comments (if provided):

The authors have adequately addressed all concerns, improved the manuscript's clarity, and enhanced the figures and references. I am pleased to recommend acceptance of this manuscript for publication in PLOS Digital Health.

Reviewer Comments (if any, and for reference):

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. The structure of the typical STI prediction models.

(DOCX)

pdig.0000926.s001.docx^{(350.4KB, docx)}

S2 Fig. Nine different filters.

(DOCX)

pdig.0000926.s002.docx^{(483.9KB, docx)}

S3 Fig. Methods of dividing images into training and testing data.

(DOCX)

pdig.0000926.s003.docx^{(94.7KB, docx)}

S1 Table. General information about the STIs images dataset.

(DOCX)

pdig.0000926.s004.docx^{(15.8KB, docx)}

S2 Table. AUC results for ten classifiers with nine filters on unspecific sites.

(DOCX)

pdig.0000926.s005.docx^{(30.8KB, docx)}

S3 Table. Accuracy results for ten classifiers with nine filters on unspecific sites.

(DOCX)

pdig.0000926.s006.docx^{(30.7KB, docx)}

S4 Table. Specificity results for ten classifiers with nine filters on unspecific sites.

(DOCX)

pdig.0000926.s007.docx^{(30.8KB, docx)}

S5 Table. Sensitivity results for ten classifiers with nine filters on unspecific sites.

(DOCX)

pdig.0000926.s008.docx^{(30.7KB, docx)}

S6 Table. AUC results of the classifiers with three infection body sites.

(DOCX)

pdig.0000926.s009.docx^{(57.4KB, docx)}

S7 Table. Accuracy results of the classifiers with three infection body sites.

(DOCX)

pdig.0000926.s010.docx^{(58.6KB, docx)}

S8 Table. Specificity results of the classifiers with three infection body sites.

(DOCX)

pdig.0000926.s011.docx^{(60.3KB, docx)}

S9 Table. Sensitivity results of the classifiers with three infection body sites.

(DOCX)

pdig.0000926.s012.docx^{(58.5KB, docx)}

Attachment

Submitted filename: 0.Point-by-Point Response to Reviewers.docx

pdig.0000926.s014.docx^{(264.5KB, docx)}

Data Availability Statement

All data in this study is public available on https://stiatlas.org/.

[pdig.0000926.ref001] 1.World Health Organization. Sexually transmitted infections (STIs) [cited 2023 1 December]. Available from: https://www.who.int/news-room/fact-sheets/detail/sexually-transmitted-infections-(stis) [Google Scholar]

[pdig.0000926.ref002] 2.Du M, Yan W, Jing W, Qin C, Liu Q, Liu M, et al. Increasing incidence rates of sexually transmitted infections from 2010 to 2019: an analysis of temporal trends by geographical regions and age groups from the 2019 Global Burden of Disease Study. BMC Infect Dis. 2022;22(1):574. doi: 10.1186/s12879-022-07544-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref003] 3.WHO. Global progress report on HIV, viral hepatitis and sexually transmitted infections. World Health Organization; 2021. [Google Scholar]

[pdig.0000926.ref004] 4.Burnett E, Loucks TL, Lindsay M. Perinatal outcomes in HIV positive pregnant women with concomitant sexually transmitted infections. Infect Dis Obstet Gynecol. 2015;2015:508482. doi: 10.1155/2015/508482 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref005] 5.Sankaran D, Partridge E, Lakshminrusimha S. Congenital Syphilis-An Illustrative Review. Children (Basel). 2023;10(8):1310. doi: 10.3390/children10081310 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref006] 6.World Health Organization. Global health sector strategies on, respectively, HIV, viral hepatitis and sexually transmitted infections for the period 2022-2030. 2022. [cited 2025 03/25]. Available from: https://www.who.int/publications/i/item/9789240053779 [Google Scholar]

[pdig.0000926.ref007] 7.Jenkins WD, Williams LD, Pearson WS. Sexually Transmitted Infection Epidemiology and Care in Rural Areas: A Narrative Review. Sex Transm Dis. 2021;48(12):e236–40. doi: 10.1097/OLQ.0000000000001512 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref008] 8.Gruber AG, Pereira PMB, de Souza Goldim MP, Cassol MEG, Portela WF, de Bitencourt RM. Sexually transmitted infections in indigenous communities of the Alto Rio Solim es. Brazilian J Sexually Transmit Dis. 2021;33. [Google Scholar]

[pdig.0000926.ref009] 9.Okoboi S, Castelnuovo B, Moore DM, Musaazi J, Kambugu A, Birungi J, et al. Incidence rate of sexually transmitted infections among HIV infected patients on long-term ART in an urban and a rural clinic in Uganda. BMC Public Health. 2019;19:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref010] 10.Thomford NE, Bope CD, Agamah FE, Dzobo K, Owusu Ateko R, Chimusa E, et al. Implementing Artificial Intelligence and Digital Health in Resource-Limited Settings? Top 10 Lessons We Learned in Congenital Heart Defects and Cardiology. OMICS. 2020;24(5):264–77. doi: 10.1089/omi.2019.0142 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref011] 11.Bao Y, Medland NA, Fairley CK, Wu J, Shang X, Chow EPF, et al. Predicting the diagnosis of HIV and sexually transmitted infections among men who have sex with men using machine learning approaches. J Infect. 2021;82(1):48–59. doi: 10.1016/j.jinf.2020.11.007 [DOI] [PubMed] [Google Scholar]

[pdig.0000926.ref012] 12.Xu X, Yu Z, Ge Z, Chow EPF, Bao Y, Ong JJ, et al. Web-Based Risk Prediction Tool for an Individual’s Risk of HIV and Sexually Transmitted Infections Using Machine Learning Algorithms: Development and External Validation Study. J Med Internet Res. 2022;24(8):e37850. doi: 10.2196/37850 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref013] 13.Latt PM, Soe NN, Xu X, Ong JJ, Chow EPF, Fairley CK, et al. Identifying Individuals at High Risk for HIV and Sexually Transmitted Infections With an Artificial Intelligence-Based Risk Assessment Tool. Open Forum Infect Dis. 2024;11(3):ofae011. doi: 10.1093/ofid/ofae011 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref014] 14.Mollura DJ, Culp MP, Pollack E, Battino G, Scheel JR, Mango VL, et al. Artificial Intelligence in Low- and Middle-Income Countries: Innovating Global Health Radiology. Radiology. 2020;297(3):513–20. doi: 10.1148/radiol.2020201434 [DOI] [PubMed] [Google Scholar]

[pdig.0000926.ref015] 15.Williams D, Hornung H, Nadimpalli A, Peery A. Deep Learning and its Application for Healthcare Delivery in Low and Middle Income Countries. Front Artif Intell. 2021;4:553987. doi: 10.3389/frai.2021.553987 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref016] 16.Papadimitroulas P, Brocki L, Christopher Chung N, Marchadour W, Vermet F, Gaubert L, et al. Artificial intelligence: Deep learning in oncological radiomics and challenges of interpretability and data harmonization. Phys Med. 2021;83:108–21. doi: 10.1016/j.ejmp.2021.03.009 [DOI] [PubMed] [Google Scholar]

[pdig.0000926.ref017] 17.Zhang R, Wei Y, Shi F, Ren J, Zhou Q, Li W, et al. The diagnostic and prognostic value of radiomics and deep learning technologies for patients with solid pulmonary nodules in chest CT images. BMC Cancer. 2022;22(1):1118. doi: 10.1186/s12885-022-10224-z [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref018] 18.Koçak B. Key concepts, common pitfalls, and best practices in artificial intelligence and machine learning: focus on radiomics. Diagn Interv Radiol. 2022;28(5):450–62. doi: 10.5152/dir.2022.211297 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref019] 19.Rogers W, Thulasi Seetha S, Refaee TAG, Lieverse RIY, Granzier RWY, Ibrahim A, et al. Radiomics: from qualitative to quantitative imaging. Br J Radiol. 2020;93(1108):20190948. doi: 10.1259/bjr.20190948 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref020] 20.Cantisani C, Rega F, Ambrosio L, Grieco T, Kiss N, Meznerics FA, et al. Syphilis, the Great Imitator-Clinical and Dermoscopic Features of a Rare Presentation of Secondary Syphilis. Int J Environ Res Public Health. 2023;20(2):1339. doi: 10.3390/ijerph20021339 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref021] 21.Balagula Y, Mattei PL, Wisco OJ, Erdag G, Chien AL. The great imitator revisited: the spectrum of atypical cutaneous manifestations of secondary syphilis. Int J Dermatol. 2014;53(12):1434–41. doi: 10.1111/ijd.12518 [DOI] [PubMed] [Google Scholar]

[pdig.0000926.ref022] 22.Morton A, Bradshaw C, Fairley C, Lee D, Henzell H, Williams H, et al. STIATLAS [cited 2024 1 Jan]. Available from: https://stiatlas.org/ [Google Scholar]

[pdig.0000926.ref023] 23.Wada K. Labelme: Image Polygonal Annotation with Python. Available from: https://github.com/wkentaro/labelme [Google Scholar]

[pdig.0000926.ref024] 24.van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017;77(21):e104–7. doi: 10.1158/0008-5472.CAN-17-0339 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref025] 25.Paszke A, Gross S, Chintala S, Chanan G, Yang E, DeVito Z, et al. Automatic differentiation in pytorch. 2017.

[pdig.0000926.ref026] 26.Arif T, Fatima R, Sami M. Extragenital lichen sclerosus: A comprehensive review. Australas J Dermatol. 2022;63(4):452–62. doi: 10.1111/ajd.13890 [DOI] [PubMed] [Google Scholar]

[pdig.0000926.ref027] 27.Chi C-C, Kirtschig G, Baldo M, Brackenbury F, Lewis F, Wojnarowska F. Topical interventions for genital lichen sclerosus. Cochrane Database Syst Rev. 2011;2011(12):CD008240. doi: 10.1002/14651858.CD008240.pub2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000926.ref028] 28.Çakmak SK, Tamer E, Karadağ AS, Waugh M. Syphilis: A great imitator. Clin Dermatol. 2019;37(3):182–91. doi: 10.1016/j.clindermatol.2019.01.007 [DOI] [PubMed] [Google Scholar]

[pdig.0000926.ref029] 29.Wilson M, Wilson PJK. Genital Warts. In: Close Encounters of the Microbial Kind. 2021. doi: 10.1007/978-3-030-56978-5_30 [DOI] [Google Scholar]

[pdig.0000926.ref030] 30.Soe NN, Latt PM, Yu Z, Lee D, Kim C-M, Tran D, et al. Clinical features-based machine learning models to separate sexually transmitted infections from other skin diagnoses. J Infect. 2024;88(4):106128. doi: 10.1016/j.jinf.2024.106128 [DOI] [PubMed] [Google Scholar]

[pdig.0000926.ref031] 31.Latt PM, Soe NN, Xu X, Ong JJ, Chow EPF, Fairley CK, et al. Identifying Individuals at High Risk for HIV and Sexually Transmitted Infections With an Artificial Intelligence-Based Risk Assessment Tool. Open Forum Infect Dis. 2024;11(3):ofae011. doi: 10.1093/ofid/ofae011 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Radiomics analysis for the early diagnosis of common sexually transmitted infections and skin lesions

Jiajun Sun

Zhen Yu

Yingping Li

Janet M Towns

Lin Zhang

Jason J Ong

Zongyuan Ge

Christopher K Fairley

Lei Zhang

Roles

Abstract

Author summary

Introduction

Methods

STIs images collection

Manual lesion segmentation for the region of interest

Images derived by filters

Radiomics feature extraction and selection

Classification model development and interpretation

Ethics statement

Results

Description of image data

Fig 1. Confusion matrices for the models predicted results on the testing dataset on (a) unspecific sites; (b) genitals; (c) anus; (d) other skin.

Models performance when infection sites were unspecified

Fig 2. Models’ AUC (×102) with nine images filter and eleven classifiers on (a) unspecific sites; (b) genitals; (c) anus; (d) other skin.

Fig 3. Comparison of the AUC of the best infection prediction model for infections on unspecific sites, genitals, anus and other skin.

Fig 4. The top rank 10 radiomics features in the best models using a permutation importance analysis.

Models performance on genital conditions

Models performance on anus infections

Models performance on other skin infections

Models comparison and radiomics feature interpretability

Discussion

Supporting information

Data Availability

Funding Statement

References

Decision Letter 0

Hisham Al-Obaidi

Fiona Kolbinger

Roles

Author response to Decision Letter 1

Decision Letter 1

Hisham Al-Obaidi

Fiona Kolbinger

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Fig 2. Models’ AUC (×10²) with nine images filter and eleven classifiers on (a) unspecific sites; (b) genitals; (c) anus; (d) other skin.