Skip to main content
PLOS One logoLink to PLOS One
. 2025 Jun 2;20(6):e0323440. doi: 10.1371/journal.pone.0323440

A novel spectral analysis-based grading system for gastrointestinal activity

Çağlar Cengizler 1,*
Editor: Mohammad Amin Fraiwan2
PMCID: PMC12129166  PMID: 40455773

Abstract

Intestinal sounds, primarily generated by the movement of digested gas and liquids during peristalsis, are acoustic signals that provide valuable insights into intestinal functioning. Traditionally, doctors have relied on stethoscopes to assess the degree of gastrointestinal activity. Recent advancements in computer-aided technologies and electronic stethoscopes have enhanced the understanding and analysis of these sounds. Studies utilizing advanced techniques like deep learning and convolutional neural networks have shown promise in analyzing bowel sounds. Nevertheless, the reliance on personal judgment and the need for large labeled datasets limit the broader applicability of these methods. This study introduces an innovative, unsupervised grading system to objectively evaluate gastrointestinal motility by analyzing bowel sounds through spectral feature analysis. This system offers a practical alternative to traditional listening techniques or complex models. It computes an activity score for digital audio using a cost-effective numerical grading method to assist doctors in quantifying gastrointestinal motility. The method’s reliability, validated by Spearman’s rank correlation, confirms its accuracy in assessing activity levels and highlights its potential as a reliable and practical tool for supporting objective medical assessments of bowel activity.

1 Introduction

Intestinal sounds are acoustic projections from the internals created while the intestine performs its natural functions. Bowel sounds are heard in the abdominal area as rumbling or gurgling [1]. The primary source of these sounds is the movement of digested food, gas, and liquids during intestinal peristalsis [2]. These sounds are directly linked to bowel movements and would reflect valuable information about the intensity of activity and function of the intestine. Therefore, it might be possible to suggest that bowel sounds can be considered a vital sign similar to the sound of a heart [3]. The literature shows that the ability to diagnose changes in the characteristics of gastrointestinal sounds is expanding the perspective on human intestinal physiology [4]. At this point, the stethoscope is the primary tool for listening to and assessing sounds in gastrointestinal activity [5]. With the advancement of technology, electronic stethoscopes and computer-aided systems have also been used. These modern tools make understanding the crucial details of intestinal sounds easier. Many studies show great potential for computer-aided analysis, which could significantly improve practitioners’ ability to distinguish between different bowel conditions [6]. For instance, in one study, Kaneshiro et al. demonstrated the efficiency of bowel sounds in predicting the development of postoperative ileus. They have collected the sounds with a microphone that adheres to the abdominal wall [7]. In another study, gastrointestinal motility was similarly assessed by real-time bowel sound detection. Authors have indicated that their signal power-based approach is promising for detecting the activity [8]. It should be noted that advanced machine learning has been widely used in recent studies to provide robust discrimination ability. In one study, authors have adopted a deep learning approach to propose a smartphone-based instrument that can discriminate bowel sounds [9]. Similarly, another study adapted convolutional neural networks to classify different types of bowel sounds with accurate time stamps. Authors have reported an accuracy of 91.06% [10]. Additionally, Zhang et al. proposed a deep learning mechanism for segmenting bowel sound events, which reportedly outperformed similar approaches. They have implemented a loss function to reduce the sensitivity of their model to thresholds [11]. Long short-term memory is also used to discriminate bowel sounds and noise with MFCC features. Liu et al. reported a classification accuracy of 92.56% with this approach [1]. In addition to deep learning-based methods, some studies have adopted different supervised machine-learning approaches to identify bowel activity frames. For instance, Kölle et al. have implemented support vector machines to discriminate the power distribution over the frequency spectrum [12].

Previous studies have highlighted the necessity of automated bowel sound analysis to support clinical decision-making. While many studies have succeeded using supervised methods, these approaches mainly focus on classification tasks and depend heavily on large labeled datasets. There remains a significant demand for systems capable of providing objective and practical assessments of gastrointestinal activity in clinical settings.

This study addresses this gap by proposing an objective grading system for quantifying the amount of activity based on characteristic changes in the frequency spectrum associated with increased bowel activity. Introduced novel methodology extracts and analyses spectral features to provide a low-cost numerical grading system. It aims to simplify and improve the assessment of bowel activity, particularly in clinical settings where experts may be required to know the gastrointestinal activity levels.

To achieve this, the system calculates the membership of equal-length segments to non-activity regions using several criteria. Multiple features were extracted for evaluation; however, traditional event-based segmentation was not applied. This approach does not require the segmentation of activity regions. The introduced novel methodology aims to simplify and enhance the assessment of bowel activity, allowing for better quantification and insight into bowel activity levels.

2 Materials and methods

2.1 Data set

In this study, an original data set created by Ficek et al. was utilized (accessed on 23/02/2024). They employed a specialized contact microphone to capture bowel sounds from 19 subjects. A total of 1487 WAV files (44.1kHz, 24bit, two seconds each) from the original data set were analyzed in the study. A 12.5-millisecond Hamming window was chosen to capture rapid changes in bowel sounds. The main goal of this selection was to provide a dynamic assessment of intestinal activity, as the original data authors noted that a window around 10 ms was optimal for their spectral analysis [13]. In addition to the WAV recordings, the authors have provided labels indicating the presence of bowel sounds for each segment, which were used to calculate the percentage of bowel activity [13]. While these annotations are generally reliable, minor inconsistencies, such as end times exceeding the total signal duration, were observed in some cases. These discrepancies are likely due to manual labeling or slight rounding errors during segmentation. It is important to note that 323 recordings were marked as non-activity, meaning they had no bowel sounds and were used for objective grading. Time domain representations of two different WAV samples belonging to different activity groups are shown in Fig 1.

Fig 1. Representation of a waveform belonging to the non-activity group (top) and high activity group (bottom).

Fig 1

Additionally, raw bowel sounds from the Health Education Media Library, provided by the Arthur Labatt Family School of Nursing, were utilized for further validation and testing (accessed on 16/08/2024) [14]. The utilized audio was initially sampled at 4kHz and recorded from 4 quadrants of the abdominal region. A total of 8 audio recordings were used, each divided into 83 WAV files after resampling to 44.1kHz. Each WAV file was 2 seconds long. The activity percentage of each WAV file was calculated using empirically placed labels indicating the activity regions, which served as the ground truth. Eight WAV files were detected to contain zero activity and were used as control data. It should be noted that before analysis, all samples from both data sets were initially preprocessed using a bandpass filter with 100 Hz and 1000 Hz cut-off frequencies to suppress irrelevant noise. All data used in this study were anonymized by the original authors before publication and made publicly available under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).

2.2 Spectral feature analysis

In this study, all sound segments were characterized using 11 spectral features, and the amount of intestinal activity was graded based on variations in these features. A Hamming window with a 6.25-millisecond overlap was applied along each segment, recalculating the features for each window. The standard deviation of these features across the entire signal was then computed to capture variability and reflect the continuous nature of bowel sounds, providing a holistic assessment of gastrointestinal activity. It should be noted that traditional event-based segmentation was not performed. Instead, each audio recording was divided into equal-length segments of 2 seconds, ensuring consistency in the analysis. The proposed grading system evaluates gastrointestinal motility by analyzing spectral features extracted from these short time windows of bowel sound recordings. Each time window is processed using an unsupervised similarity measure, it assigns a numerical motility score based on its resemblance to a reference non-activity group, consisting of recordings with no bowel activity. The final motility score reflects the similarity between the test window and the control group, providing an objective and quantifiable measure of gastrointestinal function. This system has potential clinical applications in activity monitoring and motility assessment. The experimental setup evaluated six grading metrics to determine the most effective approach: Euclidean Distance, Silhouette Score, Cosine Similarity, KNN Cluster Grade, Cluster Membership Strength, and Gaussian Membership. The workflow of the proposed system is illustrated in Fig 2.

Fig 2. Flowchart of the proposed system illustrating the grading process based on spectral feature analysis.

Fig 2

The feature extraction stage was conducted using the MATLAB framework. Eleven spectral features were extracted from each segment utilizing MATLAB’s ‘audioFeatureExtractor’ function. The specific details of the extracted features are provided below.

Spectral Centroid: It indicates the centroid of the spectrum, which strongly correlates with the brightness of a sound [15]. The existence of more high-frequency components, likely originating from sharp bowel activities such as rapid muscle contractions, would increase the value of the spectral centroid.

Spectral Crest: This feature measures the peakiness of the spectrum. Accordingly, a lower spectral crest may indicate more noise [16]. A higher crest indicates a dominant frequency in bowel sounds originating from specific types of bowel activity, like peristalsis.

Spectral Decrease: It would be defined as the rate of spectral amplitude decreases across the spectrum [16]. Softer or rumbling bowel sounds would cause energy to be concentrated in the lower frequencies, resulting in a higher spectral decrease.

Spectral Entropy: It measures the complexity of the spectrum. High spectral entropy indicates a more complex or noisy spectrum, while low entropy indicates a purer tone. Moreover, this feature also indicates the amount of information that the signal carries [17]. Irregular contractions in bowel sounds would result in higher entropy, whereas regular, rhythmic sounds would lead to lower entropy.

Spectral Flatness: This feature measures how closely a signal resembles either a pure tone or a noise-like sound [18]. A low spectral flatness would indicate increased tonality and a higher presence of tonal components in bowel sounds.

Spectral Kurtosis: Spectral kurtosis is a measure that correlates with the shape of the spectral distribution. It evaluates how much a signal’s spectrum distribution deviates from a normal distribution, indicating signal impulsiveness [19]. Accordingly, a low kurtosis value would indicate more evenly distributed spectral energy.

Spectral Roll-off Point: This is the frequency below which a certain percentage (commonly 85% to 95%) of the total spectral energy is contained. It can be used to measure the spectral shape [15]. A deeper or more resonant bowel may increase the roll-off point value.

Spectral Skewness: This feature measures the asymmetry of the spectral shape. Accordingly, it indicates a symmetric structure around the centroid [16]. Deeper bowel sounds with some higher-pitched components would be characterized by positive skewness.

Spectral Slope: This feature reflects the amount of decreasing of the spectral amplitude, which would be helpful to measure the tilt of the spectral envelope [20]. Typical low-pitched, rumbling bowel sounds likely to cause a characteristic steep spectral slope, indicating that the sound’s energy drops off quickly as frequency increases.

Spectral Spread: It is a measure for indicating the spectral variance. It measures the width of spectral distribution, which changes with the amount spread out of the spectrum [16]. A wider spread could indicate the complexity and richness of the sound, reflecting a broad range of frequencies.

Harmonic Ratio: This metric measures the energy ratio in the periodic (harmonic) components to the total energy of the sound. It is particularly variable in the unvoiced (noisy) parts of a signal [21]. Accordingly, regular, rhythmic contractions or vibrations would enhance the tonal quality of the sound, thereby increasing the harmonic ratio.

2.3 Membership based evaluation

The study aimed to assess the amount of activity in each segment with respect to its resemblance to the non-activity (control) cluster. Accordingly, several membership metrics were calculated and compared to reveal the functionality of the approach. All examined approach’s functionality are explained below.

2.3.1 Euclidean distance.

Euclidean distance is a common metric for measuring the straight-line distance between two points in a multi-dimensional space. Euclidean distance can effectively evaluate an observation’s resemblance to a particular cluster in data analysis or machine learning. For each feature, Euclidean distance di between the observation and the centroid (mean) of the control features was calculated by:

di=j=1n(xijx¯j)2 (1)

where xij represents the observation, x¯j represents the centroid, and n is the number of features.

2.3.2 Silhouette score.

The Silhouette score is a distance metric between two observation clusters [22]. Accordingly, it was aimed to calculate as a measure to evaluate how well each sound segment fits into the non-activity group by using mean intra-cluster and mean nearest-cluster distances. Silhouette score si is defined by:

s=bamax(a,b) (2)

Where a is the average distance from the test observation to other observations in the same cluster, and b is the minimum average distance from the test observation to other observations in a different cluster. This study uses MATLAB’s ’evalclusters’ function to compute the Silhouette score. It should be noted that a single observation was accepted as a singleton cluster, and its individual Silhouette score was calculated based on its distance to the observations belonging to another cluster. Accordingly, the scores of all other observations, which are part of the larger cluster, are also calculated and contribute to the average. A higher Silhouette score in this context indicates that the observation is more similar to the non-activity group, indicating a lower activity level.

2.3.3 Cosine similarity.

Cosine similarity grades the similarity of two vectors, based on the cosine of the angle between two vectors [23]. It would be particularly effective in high-dimensional spaces, where Euclidean distances can cause increased computational demand. This study uses cosine similarity to assess how closely an observation aligns with the control cluster regarding directionality and orientation in the vector space. Cosine Similarity ci was calculated by:

ci=xi·x¯xi·x¯ (3)

where xi represents an individual observation vector and xi is the Euclidean norm of xi, and x¯ is the Euclidean norm of x¯. Thus, if an observation is oriented in a similar direction to the control cluster, it receives a higher cosine similarity score.

2.3.4 K-nearest neighbors (KNN) cluster grade.

This measure was calculated by identification of the k closest members of the control cluster to a particular observation [24]. Accordingly, overall grading was based on local rather than global structure. The metric, denoted as g, is determined by:

g=1ki=1kd(observation,neighbori) (4)

where d(observation,neighbori) calculates the Euclidean distance between the observation and its ith nearest neighbour within the cluster. Here, k is the number of closest neighbors, which allows us to adjust how we judge whether an observation fits well with the control cluster. Accordingly, a smaller average distance means the observation is closer to the cluster’s center, indicating a better fit. It should be noted that the parameter k was accepted as a fine-tuning hyper-parameter and determined through empirical testing to find the optimal number.

2.3.5 Cluster membership strength.

The Cluster Membership Strength depends on the average distance to all other control observations. Accordingly, the metric computation was based on the mean of the Euclidean distance between the observation in question and each member of the control cluster. Unlike the K-Nearest Neighbors (KNN) Cluster Grade, which considers only the k nearest members, the CMS evaluates the observation’s relationship with all control observations. This metric was calculated as the inverse of the average Euclidean distance between the observation and each control cluster member. The CMS is defined as follows:

CMS=11Ni=1Nd(observation,memberi)+eps (5)

Here, d(observation,memberi) computes the Euclidean distance between the observation and the ith member of the control cluster, and N represents the total number of members in the control cluster. A minimal value (eps) is added to the average distance to prevent division by zero. This approach grades lower average distance with higher membership scores, which indicates a stronger association with the control cluster.

2.3.6 Gaussian membership.

Gaussian membership is one of the fuzzy membership functions. It grades the degree of resemblance of an observation to a particular cluster based on the assumption that data in each cluster follows a Gaussian distribution. Accordingly, Gaussian membership m was calculated based on a Gaussian distribution model, which represents the probability of the test observation belonging to the control cluster:

m=exp((xix¯)22σ2) (6)

where σ represents the standard deviation of the Gaussian distribution [25]. The output is between 0 and 1, while a higher score indicates a higher degree of membership to the control cluster.

2.4 Evaluation of the performance

Performance evaluation was based on an objective comparison between the ground truth and the final scores of each metric. All test recordings were analyzed and ranked according to their scores to determine which metrics most closely align with the actual rankings of bowel sound activity. Performance evaluation was thus completed by comparing the actual sequence of observations (based on expert assessments) against the calculated sequence derived from each grading method. This comparison aimed to quantify the extent to which each grading method accurately reflects the actual sequence of observations in terms of their similarity to the control cluster. Spearman’s rank correlation coefficient was calculated for objective comparison. It is a statistical measure of the degree of association between two ranked variables (order of the observations in our case). This non-parametric measure could be used efficiently as a metric even without a linear relationship between the variables [26]. The Spearman’s rank correlation coefficient, denoted as ρ is computed as follows:

ρ=16di2n(n21) (7)

Where di is the difference between the ranks of each observation in the actual order and the calculated order (n is the number of observations). The ranks for the actual and calculated orders were determined based on the magnitude of the measurements from each grading method, with tied ranks accounted for in cases of identical values. The coefficient values range from -1 to 1, where 1 indicates a perfect positive correlation, -1 indicates a perfect negative correlation, and 0 indicates no correlation. A higher positive value of ρ was accepted as an indicator of the grading method’s effectiveness in accurately ranking observations according to their similarity to the control cluster, thus validating the method’s performance.

2.4.1 Feature selection.

A greedy Forward Selection (GFS) approach was implemented for feature selection in the study. The GFS method starts with an empty set of features and iteratively adds features that maximize the model’s performance, one at a time, until no further improvements can be achieved [27]. The contribution of remaining features was measured individually with each iteration by temporarily including them in the feature set and assessing the model’s performance. The measurement was based on comparing the orders of the observations graded with Spearman’s rank correlation, reflecting the predictive power and relevance of the included features.

3 Results

The experimental set-up was implemented to evaluate the effectiveness of the membership-based bowel activity scoring approach. Each membership criterion was examined using test observations, with the previously marked activity labels accepted as ground truth. The amount of activity was initially calculated for each WAV file, which was then sorted based on the level of activity they contained. The actual order of observations reflects the ranking of sound segments according to the amount of bowel activity present. Performance grading was completed by comparing the actual and resulting orders of the test observations. Fig 3 illustrates the distribution of bowel activity percentages for both datasets. The horizontal axis represents the percentage of detected bowel activity in each segment, while the vertical axis corresponds to the frequency of occurrences. Dataset 1 consists of 1487 samples, whereas Dataset 2 includes 83 samples. The two distributions highlight potential variations in activity levels across datasets.

Fig 3. Histograms showing the distribution of activity percentages for data set 1 (left) and data set 2 (right).

Fig 3

Two different data sets were analyzed during the study. Initially, three subsets were formed with the data set 1. Selection and grouping criteria were activity percent. Subset properties are presented in Table 1. It should be noted that control observations are the recordings labeled as zero activity silence segments for both data sets.

Table 1. Properties of subsets used for performance evaluation.

Subset Activity Percent Number of Test Segments Number of Control Segments
Subset 1 >%30 53 323
Subset 2 >%40 30 323
Subset 3 >%50 16 323

A greedy feature selection process is performed with each membership criterion to reveal the best-resulting feature combinations and scores. Each feature combination was initially z-scored before any further analysis. Results are presented in Table 2.

Table 2. The best resulting feature combinations are presented for all similarity criteria.

Subset 1
Membership Criteria Best Score Spectral Feature Combination
Euclidean Distance 0.461 Skewness, Slope
Silhouette Score 0.435 Crest, Kurtosis, Skewness, Slope, Harmonic Ratio
Cosine Similarity 0.441 Crest, Entropy, Flatness, Slope, Harmonic Ratio
KNN Score 0.360 Harmonic Ratio
Cluster Membership 0.466 Crest, Skewness
Gaussian Membership 0.461 Skewness, Slope
Subset 2
Membership Criteria Best Score Spectral Feature Combination
Euclidean Distance 0.463 Skewness
Silhouette Score 0.594 Crest, Kurtosis, Skewness
Cosine Similarity 0.690 Centroid, Kurtosis, Slope
KNN Score 0.726 Centroid, Entropy, Kurtosis, Slope
Cluster Membership 0.673 Crest, Kurtosis, Skewness, Slope
Gaussian Membership 0.463 Skewness
Subset 3
Membership Criteria Best Score Spectral Feature Combination
Euclidean Distance 0.544 Crest, Entropy, Skewness
Silhouette Score 0.688 Centroid, Crest
Cosine Similarity 0.721 Centroid, Crest, Slope, Harmonic Ratio
KNN Score 0.874 Entropy, Slope
Cluster Membership 0.803 Centroid, Slope
Gaussian Membership 0.544 Crest, Entropy, Skewness

Table 2 indicates that KNN-based scoring is the most effective sorting criterion for all subsets. Accordingly, KNN was additionally examined with all observations between 10% and 50% bowel activity to evaluate its performance at different activity levels. The results are plotted in Fig 4.

Fig 4. The effect of increasing bowel activity percent on grading capability of KNN based membership measure is shown.

Fig 4

To further analyze the effectiveness of membership-based scoring, Fig 5 compares the sorting accuracy of the three best membership criteria. Each segment was color-coded based on its bowel activity level to represent the sorting performance visually. The ground truth represents the actual order of the segments, allowing a direct comparison with the predicted rankings.

Fig 5. Visual demonstration of the top three criteria’s scoring ability, with segments color-coded by bowel activity level.

Fig 5

The ground truth indicates the actual sequence of segments.

In addition to the analysis with Data set 1, WAV samples from Data set 2 (processed with identical windowing and preprocessing parameters) were also utilized to validate the potential of the presented approach. The scoring ability of the top three criteria was evaluated across different bowel activity levels using the recordings. For each activity level, higher activity percentages were similarly compared with zero-activity samples. Greedy feature selection was conducted at varying activity levels. The findings are illustrated in Fig 6, where the horizontal axis represents the percentage of bowel activity, and the vertical axis shows the Spearman’s rank correlation score.

Fig 6. Validation of the activity grading performance of the top three criteria across different bowel activity levels using data set 2.

Fig 6

The results indicate that different criteria exhibit varying levels of effectiveness in detecting bowel activity. Cosine similarity demonstrated a moderate relationship with bowel activity, showing improved performance at higher activity levels. In contrast, KNN-based and cluster membership grading provided more stable and reliable correlations across different activity levels. Skewness and slope were consistently selected as the most relevant for activity grading among the extracted features.

It should be noted that each test observation is compared with the control cluster individually in this study. Accordingly, membership criteria based on the comparison of two groups of observations are calculated with a single observation versus the control group or as a singleton cluster over the control cluster. If one data set’s control group (non-activity) has more variance or internal diversity, it could affect how test observations are scored. During the experiments, it was observed that the grading performance of cosine similarity and silhouette score metrics could be skewed because of the distribution of the control observations. Typically, a higher cosine and silhouette score would indicate more resemblance to the non-activity control group, with lower scores likely pointing to more activity. However, when sorted in descending order, these criteria produced better Spearman rank correlation results (a higher score means more activity) for one data set. In contrast, the opposite sorting order (lower score means more activity) improved results for another data set. Despite the performance differences, a consistent sorting approach was maintained across data sets to ensure uniformity in the analysis.

Furthermore, a comparative evaluation was also performed to demonstrate the proposed method’s performance relative to existing approaches. Accordingly, the second data set was first stratified and split into training and test sets to ensure that the distribution of the activity percent was consistent across both sets. Consequently, observations with zero activity were excluded from the stratified sets and reserved as a control group. The unsupervised method was applied directly to the test set, where it calculated the membership of each observation to a non-activity group based on the spectral features.

For comparison, linear regression, Random Forest, Support Vector Regression (SVR), and K-Nearest Neighbors (KNN) learning models are implemented. Prior to model training, the spectral features were standardized using z-scoring. A greedy feature selection process was similarly performed to identify the optimum features for each model. The results of the experiment are presented in Table 3.

Table 3. Results of the proposed unsupervised method compared with various supervised approaches using the second data set.

Proposed Approach
Criteria Best Score Selected Features
Cosine Similarity 0.4182 Roll-off Point
KNN Grade 0.6429 Slope
Cluster Membership 0.6453 Slope
Supervised Methods
Criteria Best Score Selected Features
Linear Regression 0.6363 Slope
Random Forest 0.5680 Slope, Centroid
SVR 0.6363 Slope
KNN 0.5753 Slope

4 Discussion

This study proposed an alternative unsupervised grading approach for objectively evaluating bowel activity. Accordingly, spectral features characterizing the activity were extracted to simplify and effectively assess bowel activity without extensive segmentation. Results showcase significant promise compared to traditional supervised methods. Previous studies underline that exploiting the advantages of alternative potential spectral features would be necessary to increase the current models’ performance [2]. One contribution of this study is the identifying the most effective basic spectral features that vary with increased bowel activity. Accordingly, the proposed cluster membership-based scoring system was evaluated with two independent data sets to examine features with related to bowel activity. Extracted features from both data sets were evaluated via the implemented greedy feature selection scheme. The results indicate that the proposed method is promising; however, it should be noted that applying our system in a clinical setting would require further validation with larger, more diverse data sets to ensure its efficiency.

In this study, instead of discriminating and labeling intestinal activity regions from recordings, the segments of the recordings were directly processed, and the amount of activity sound they contained was scored. Thus, digital recordings were not subjected to direct classification but were objectively scored by comparison with a group of observations that did not contain activity. This numerical approach could be used as an objective and accurate criterion rather than listening to the intestine with a stethoscope for judgment. It has been reported in the literature that neural network-based classification would require a large amount of data for better generalization ability. Moreover, such a mechanism’s detection sensitivity would be critically affected by noise [1]. Since no discrimination has been made to assign sound segments to a class in this study, it can be considered that the proposed approach could partially overcome some of the challenges (such as over-fitting or sensitivity to noise) that conventional trained machine learning approaches suffer from.

Dividing the audio recordings into equal-length segments creates a consistent basis for comparing different recordings, improving feature extraction reliability and intestinal activity grading. Additionally, using fixed-length segments and predefined spectral features reduces computational load, potentially enabling quicker analysis. These improvements make the method particularly suited for continuous monitoring, where accuracy and speed are critical. In wearable technology scenarios, segmentation might be unavoidable; however, a continuous fixed time interval analysis approach can capture the overall characteristics of bowel sounds while ensuring robust feature representation without biases introduced by incorrect segmentation. Therefore, this approach offers a plausible methodology for consistently monitoring and evaluating gastrointestinal motility.

One critical factor for the proposed grading system is the necessity of a standardized recording environment. Even in noisy environments, if the statistical distribution of noise remains consistent, the cluster-based grading approach can still effectively assess the activity level. However, unexpected noise sources and substandard audio recordings may introduce variability, underscoring the importance of a relatively quiet environment and stable recording conditions. This becomes particularly crucial in wearable and portable device applications, where environmental noise fluctuations could significantly affect grading performance.

It should be noted that, in this study, distinguishing different bowel activities was not the primary aim. While the current method does not specifically distinguish between different gastrointestinal conditions such as irritable bowel syndrome (IBS) or bowel obstruction, future work could extend the approach to identify specific spectral features associated with different disorders using more tailored analysis windows and spectral features.

Additionally, in its current form, the proposed grading system functions as a feature extractor in cases where bowel abnormalities are associated with increased or decreased activity levels. Although the presented methodology is not designed to differentiate between specific gastrointestinal disorders, it still has the potential to support the detection of various pathologies, such as post-operative ileus. Furthermore, the proposed method could be integrated into deep neural networks in future studies, where it may serve as a building block within a novel diagnostic architecture. This integration could enhance the diagnostic capabilities of deep learning models by incorporating the grading system as an intermediate feature extraction layer.

Results with both data sets can be seen as an indication that KNN and cluster membership based scoring might be more effective at distinguishing between varying levels of bowel activity, even in cases where the total amount of activity changes. The diversity in the best-performing spectral feature combinations across different criteria and subsets may have originated from the complex nature of bowel sounds. This underscores the necessity of a nuanced approach to their analysis. Moreover, the imbalance between the number of observations in the test and control groups, particularly when test observations are treated as singleton clusters, can cause metrics like cosine similarity and silhouette score to disproportionately reflect alignment with the larger control group, leading to higher scores that may better correlate with activity levels in one data set but not in another. Additionally, differences in the internal diversity and distribution of observations within the control group across data sets could further explain the variation in results.

Table 2 underscores the effectiveness of various membership criteria across different subsets of bowel activity. The KNN Score was identified as the most effective sorting criterion across all subsets from Data Set 1, particularly in Subset 3, where it achieved the best score of 0.874 using spectral entropy and slope features. This observation is further supported by Fig 5, which provides a visual comparison of the sorting accuracy of the top three membership criteria, highlighting KNN’s superior performance.

To further validate the grading performance across different bowel activity levels, Fig 6 presents the results obtained using Data Set 2. The findings indicate that the effectiveness of membership-based scoring varies with activity levels. While Cosine Similarity exhibited a moderate relationship with bowel activity, its performance improved at higher activity levels. In contrast, KNN-based and cluster membership grading showed more stable and consistent correlations across all levels. These results suggest that some criteria are more reliable in distinguishing activity levels and that selecting an appropriate method based on the expected range of activity could enhance grading accuracy.

Additionally, results with data set 2 also highlight the significance of the slope feature, which was consistently selected as the optimal feature by both supervised and unsupervised methods. The discrepancies between these methods suggest that while the slope is dominant, other spectral features, such as the roll-off point and centroid, may also play significant roles depending on the model used. The proposed unsupervised approach achieved the highest Spearman Rank Correlation of 0.645, slightly outperforming the supervised methods with the limited training data. This suggests that the proposed method is robust across different data sets and has the potential for effectively ranking activity levels. However, the variations in performance across different methods and data sets, as reflected in the differences among the supervised methods, highlight the importance of optimizing the feature space.

It should be noted that the Greedy Feature Selection (GFS) process used in this study is not part of the final application but rather an experimental setup to determine the most relevant features for grading gastrointestinal activity. Once the optimal feature set is identified, there is no need to repeat feature selection in a real-world implementation, minimizing any risk of overfitting in the proposed method. Future studies could explore larger and more diverse datasets with additional spectral features to refine the feature space further. Additionally, alternative feature selection algorithms could be investigated to enhance the robustness of feature selection and improve generalization.

In this study, it was observed that an increased amount of bowel activity causes the segments to differ more from the non-activity group. Accordingly, sound segments with increased bowel activity were scored more accurately (Table 2, Fig 4). Furthermore, the improvement in best scores from Subset 1 to Subset 3 of data set 1, especially noticeable in the Cosine Similarity and Cluster Membership criteria performance, suggests that the grading system’s accuracy increases with higher percentages of bowel activity. This trend could have significant implications for diagnosing conditions with increased gastrointestinal motility. A limitation observed is the fluctuation in the effectiveness of different membership criteria across data with varying levels of activity and distribution. The results demonstrate that the best-performing membership criteria can accurately score most segments. However, future studies should focus on improving the proposed approach by integrating additional spectral features and refining the data collection process to enhance diagnostic precision. The timing and methodology of the audio recordings might critically affect performance. In the experiments with the second dataset, non-activity segments were collected from different WAV samples recorded from different quadrants, which may have influenced the grading performance of the proposed scheme. Using the most significant features and a well-matched control group would enhance the accuracy of the grading activity. A standardized approach would also likely improve accuracy. This study highlights the potential of the evaluated spectral features and the variance approach when applied with supervised methods.

However, further investigation is needed to understand how different sorting and feature selection criteria might impact the results, particularly in diverse datasets. Since the proposed approach has not yet been clinically tested, a prospective clinical study on a larger patient population could help validate its effectiveness. Such a study could also explore the interaction between additional spectral features and the influence of model-specific characteristics, refining the results and improving generalizability.

5 Conclusions

This study proposes an objective evaluation method for assessing the amount of gastrointestinal activity. The presented method does not need a training stage with limited observations; instead, it utilizes several non-activity recordings for numerical comparison. Instead of detecting activity regions within the sound recordings to assess the amount and existence of gastrointestinal motility, fixed-duration whole sound segments were evaluated without any regional classification. It has been revealed that the standard deviation of basic spectral features may be effective descriptors for increased activity. Thus, using this method with wearable technology for continuous monitoring could be a practical way to evaluate motility. Therefore, the presented novel approach would contribute to automating diagnostic processes, offering a cost-effective solution with broad implications for gastrointestinal health monitoring.

Supporting information

S1 Table. Spectral features extracted from Dataset 1.

(CSV)

pone.0323440.s001.csv (308.4KB, csv)
S2 Table. Spectral features extracted from Dataset 2.

(CSV)

pone.0323440.s002.csv (16.8KB, csv)

Acknowledgments

In this study, ChatGPT-4o was used for language editing and grammatical corrections during the preparation of the manuscript. The authors have carefully reviewed all AI-assisted content and take full responsibility for its accuracy.

Data Availability

The first dataset is hosted on Kaggle and can be accessed at https://www.kaggle.com/datasets/robertnowak/bowel-sounds (accessed on 23/02/2024), under the CC BY-NC 4.0 License. The second dataset is publicly available through the Health Education Media Library at the University of Western Ontario, accessible at https://ir.lib.uwo.ca/clinicalskills_abdominalexam/ (accessed on 16/08/2024), under the Creative Commons Attribution-Noncommercial 4.0 License (CC BY-NC 4.0). The dataset was created by the Arthur Labatt Family School of Nursing and includes audio recordings of bowel sounds. All spectral feature tables used in the analysis have been uploaded as Supporting Information files.

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.Liu J, Yin Y, Jiang H, Kan H, Zhang Z, Chen P, et al. Bowel sound detection based on MFCC feature and LSTM neural network. 2018 IEEE biomedical circuits and systems conference (BioCAS). IEEE. 2018. p. 1–4. doi: 10.1109/biocas.2018.8584723 [DOI] [Google Scholar]
  • 2.Sitaula C, He J, Priyadarshi A, Tracy M, Kavehei O, Hinder M, et al. Neonatal bowel sound detection using convolutional neural network and laplace hidden semi-markov model. IEEE/ACM Trans Audio Speech Lang Process. 2022;30:1853–64. doi: 10.1109/taslp.2022.3178225 [DOI] [Google Scholar]
  • 3.Nowak J, Nowak R, Radzikowski K, Grulkowski I, Walkowiak J. Automated bowel sound analysis: an overview. Sensors. 2021;21(16):5294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gu Y, Lim HJ, Moser MAJ. How useful are bowel sounds in assessing the abdomen?. Dig Surg. 2010;27(5):422–6. doi: 10.1159/000319372 [DOI] [PubMed] [Google Scholar]
  • 5.Allwood G, Du X, Webberley KM, Osseiran A, Marshall BJ. Advances in acoustic signal processing techniques for enhanced bowel sound analysis. IEEE Rev Biomed Eng. 2019;12:240–53. doi: 10.1109/RBME.2018.2874037 [DOI] [PubMed] [Google Scholar]
  • 6.Inderjeeth A-J, Webberley KM, Muir J, Marshall BJ. The potential of computerised analysis of bowel sounds for diagnosis of gastrointestinal conditions: A systematic review. Syst Rev. 2018;7(1):124. doi: 10.1186/s13643-018-0789-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kaneshiro M, Kaiser W, Pourmorady J, Fleshner P, Russell M, Zaghiyan K, et al. Postoperative gastrointestinal telemetry with an acoustic biosensor predicts ileus vs. uneventful GI recovery. J Gastrointest Surg. 2016;20(1):132–9; discussion 139. doi: 10.1007/s11605-015-2956-3 [DOI] [PubMed] [Google Scholar]
  • 8.Deniz Ulusar U, Canpolat M, Yaprak M, Kazanir S, Ogunc G. Real-time monitoring for recovery of gastrointestinal tract motility detection after abdominal surgery. 2013 7th international conference on application of information and communication technologies. IEEE. 2013. p. 1–4. doi: 10.1109/icaict.2013.6722654 [DOI] [Google Scholar]
  • 9.Kutsumi Y, Kanegawa N, Zeida M, Matsubara H, Murayama N. Automated bowel sound and motility analysis with CNN using a smartphone. Sensors (Basel). 2022;23(1):407. doi: 10.3390/s23010407 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wang N, Testa A, Marshall BJ. Development of a bowel sound detector adapted to demonstrate the effect of food intake. Biomed Eng Online. 2022;21(1):1. doi: 10.1186/s12938-021-00969-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Zhang M, Cui X, Zhao L, He X, Shi Y, Yang J, et al. A lightweight deep learning based bowel sounds segmentation algorithm for gastrointestinal (GI) monitoring. Eng Appl Artif Intell. 2024;127:107446. doi: 10.1016/j.engappai.2023.107446 [DOI] [Google Scholar]
  • 12.Kolle K, Fougner AL, Ellingsen R, Carlsen SM, Stavdahl O. Feasibility of Early Meal Detection Based on Abdominal Sound. IEEE J Transl Eng Health Med. 2019;7:3300212. doi: 10.1109/JTEHM.2019.2940218 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Ficek J, Radzikowski K, Nowak JK, Yoshie O, Walkowiak J, Nowak R. Analysis of Gastrointestinal Acoustic Activity Using Deep Neural Networks. Sensors (Basel). 2021;21(22):7602. doi: 10.3390/s21227602 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Arthur Labatt Family School of Nursing. Abdominal exam: Clinical skills audio collection. 2021. Available from: https://ir.lib.uwo.ca/clinicalskills_abdominalexam/.
  • 15.Li T, Ogihara M, Li Q. A comparative study on content-based music genre classification. Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval. ACM; 2003. p. 282–9. doi: 10.1145/860435.860487 [DOI] [Google Scholar]
  • 16.Peeters G. A large set of audio features for sound description (similarity and classification) in the CUIDADO project. CUIDADO Ist Project Report. 2004;54(0):1–25. [Google Scholar]
  • 17.Misra H, Ikbal S, Bourlard H, Hermansky H. Spectral entropy based feature for robust ASR. 2004 IEEE international conference on acoustics, speech, and signal processing. IEEE. p. I–193. [Google Scholar]
  • 18.Johnston JD. Transform coding of audio signals using perceptual noise criteria. IEEE J Select Areas Commun. 1988;6(2):314–23. doi: 10.1109/49.608 [DOI] [Google Scholar]
  • 19.Wang Y, Xiang J, Markert R, Liang M. Spectral kurtosis for fault detection, diagnosis and prognostics of rotating machines: A review with applications. Mech Syst Signal Proc. 2016;66–7:679–98. doi: 10.1016/j.ymssp.2015.04.039 [DOI] [Google Scholar]
  • 20.Lerch A. An introduction to audio content analysis: Applications in signal processing and music informatics. Wiley-IEEE Press; 2012. [Google Scholar]
  • 21.Kim H, Moreau N, Sikora T. Mpeg-7 audio and beyond: Audio content indexing and retrieval. John Wiley & Sons; 2006. [Google Scholar]
  • 22.Ogbuabor G, Ugwoke F. Clustering algorithm for a healthcare dataset using silhouette score value. Int J Comput Sci Inf Technol. 2018;10(2):27–37. [Google Scholar]
  • 23.Xia P, Zhang L, Li F. Learning similarity with cosine similarity ensemble. Information Sciences. 2015;307:39–52. doi: 10.1016/j.ins.2015.02.024 [DOI] [Google Scholar]
  • 24.Gallego A-J, Calvo-Zaragoza J, Valero-Mas JJ, Rico-Juan JR. Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation. Pattern Recogn. 2018;74:531–43. doi: 10.1016/j.patcog.2017.09.038 [DOI] [Google Scholar]
  • 25.Kulkarni UV, Shinde SV. Neuro-fuzzy classifier based on the Gaussian membership function. 2013 fourth international conference on computing, communications and networking technologies (ICCCNT). IEEE; 2013. p. 1–7. doi: 10.1109/icccnt.2013.6726629 [DOI] [Google Scholar]
  • 26.Hauke J, Kossowski T. Comparison of values of Pearson’s and Spearman’s correlation coefficients on the same sets of data. Quaestiones Geographicae. 2011;30(2):87–93. [Google Scholar]
  • 27.Farahat AK, Ghodsi A, Kamel MS. Efficient greedy feature selection for unsupervised learning. Knowl Inf Syst. 2012;35(2):285–310. doi: 10.1007/s10115-012-0538-1 [DOI] [Google Scholar]

Decision Letter 0

Mohammad Amin Fraiwan

26 Feb 2025

PONE-D-25-03395A novel spectral analysis-based grading system for gastrointestinal activityPLOS ONE

Dear Dr. cengizler,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Apr 12 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Mohammad Amin Fraiwan

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1.  Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf  and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: No

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The paper presents a novel and practical spectral analysis-based grading system for objectively evaluating gastrointestinal activity, addressing a significant gap in traditional assessment methods. By utilizing an unsupervised approach to analyze bowel sounds, the study reduces reliance on subjective interpretation and extensive labeled datasets, making it more accessible for clinical use. The method demonstrates robust performance across multiple datasets, validated through statistical measures such as Spearman's rank correlation, highlighting its potential to enhance diagnostic accuracy and facilitating continuous monitoring of gastrointestinal health.

Limitations and Review Questions

1. The accuracy of the grading system may be influenced by the quality of audio recordings, which could introduce variability. What measures can be taken to standardize audio recording conditions to minimize variability in the results?

2. The method does not differentiate between various gastrointestinal disorders, limiting its diagnostic utility. How could the authors modify their approach to allow for the identification of specific gastrointestinal conditions based on spectral features?

3. The feature selection process may lead to overfitting, particularly in smaller datasets, affecting the robustness of the model. What strategies can be implemented to ensure that the feature selection process remains robust and prevents overfitting in diverse datasets?

4. While the system shows promise, it has not been tested in real clinical settings, which is essential for practical application. What steps are necessary for the authors to undertake clinical validation of their grading system to ensure its effectiveness in real-world scenarios?

Reviewer #2: In this study, the authors tried to present an unsupervised grading system to evaluate gastrointestinal motility by analyzing spectral feature of bowel sounds. The work is interesting. However, a lot of issues need to be addressed before publication:

1. Which output gives the information of gastrointestinal motility evaluation? please make this clear using figures or block diagram.

2.Figure 3 is not clear. How did you get these two figures? What is the difference between two data sets? What is the value of the horizontal axis?

3. The description and discussion for figure 4 and figure 5 is missing.

4. Figure 6 is not clear. The description and discussion is also missing.

5. English writing need to be improved for clarity.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Asaad Ahmed

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jun 2;20(6):e0323440. doi: 10.1371/journal.pone.0323440.r003

Author response to Decision Letter 1


24 Mar 2025

Dear Reviewers,

Thank you for the opportunity to submit a revised draft of our manuscript. We greatly appreciate the time and effort that the reviewers dedicated to providing thorough and constructive feedback and insightful comments. We have carefully considered all the reviewer comments and made the necessary revisions to address each of their concerns.

All revisions are highlighted within the manuscript. Below are our responses to the specific reviewer comments:

Reviewer #1:

1. The accuracy of the grading system may be influenced by the quality of audio recordings, which could introduce variability. What measures can be taken to standardize audio recording conditions to minimize variability in the results?

Thanks to the reviewer for raising this important questions. A new paragraph has been added to the Discussion section to address this issue. This update indicates the importance of a standardized recording environment, explains how the system handles consistent noise, and highlights potential challenges from unexpected noise sources. It also underscores the need for stable recording conditions, especially in wearable and portable device applications.

2. The method does not differentiate between various gastrointestinal disorders, limiting its diagnostic utility. How could the authors modify their approach to allow for the identification of specific gastrointestinal conditions based on spectral features?

We thank the reviewer for this valuable comment. To clarify the distinction between the proposed objective grading approach and a classifier for specific gastrointestinal disorders, a new statement has been added to the Discussion section.

Additionally, as the reviewer has kindly pointed out, the current approach does not differentiate between specific gastrointestinal conditions such as IBS or bowel obstruction. However, we acknowledge its potential to support the detection of activity-related pathologies, such as post-operative ileus.

We have also noted that future research could explore the integration of the proposed system into deep learning models, where it may serve as a feature extraction module within more advanced diagnostic frameworks.

3. The feature selection process may lead to overfitting, particularly in smaller datasets, affecting the robustness of the model. What strategies can be implemented to ensure that the feature selection process remains robust and prevents overfitting in diverse datasets?

We sincerely thank the reviewer for this valuable comment. The Greedy Feature Selection (GFS) process in our study is not intended for real-world applications but rather serves as an experimental setup to identify the most relevant features for grading gastrointestinal activity. Once the optimal feature set is determined, there is no need to perform feature selection repeatedly, which minimizes any risk of overfitting in the proposed method. To clarify this point, we have updated the Discussion section. We have also noted that future studies could explore larger datasets and alternative feature selection methods to further improve robustness.

4. While the system shows promise, it has not been tested in real clinical settings, which is essential for practical application. What steps are necessary for the authors to undertake clinical validation of their grading system to ensure its effectiveness in real-world scenarios?

While our study demonstrates the effectiveness of the proposed grading system, it has not yet been clinically tested. We acknowledge the importance of clinical validation and have added a statement in the Discussion section addressing this as the reviewer kindly asked. Specifically, we suggest that a prospective clinical study on a larger patient population could be conducted to validate the system’s effectiveness. Such a study could also explore the interaction between additional spectral features and model-specific characteristics, further refining the results and improving generalizability.

Reviewer #2:

1. Which output gives the information of gastrointestinal motility evaluation? Please make this clear using figures or block diagram.

Thanks to the reviewer for this important suggestion. To improve clarity, we have revised the manuscript by describing the grading process in the Spectral Feature Analysis subsection. Additionally, we have updated the corresponding figure to illustrate the overall workflow of the grading system. The revised figure is now included as Figure 2 in the manuscript.

2. Figure 3 is not clear. How did you get these two figures? What is the difference between two data sets? What is the value of the horizontal axis?

Thanks to the reviewer for valuable feedback. We have revised the Results section to clarify the differences between the two datasets. We have also noted that the horizontal axis represents bowel activity percentages, while the vertical axis shows their frequency.

3. The description and discussion for figure 4 and figure 5 is missing.

Thanks to the reviewer for this important feedback. The descriptions of Figures 4 and 5 have been revised for better clarity as suggested. Additionally, the incorrect figure reference (Figure 1 instead of Figure 4) has been corrected in the Discussion section. The Discussion now clearly describes Figures 4 and 5, explicitly linking Figure 5 to the evaluation of different membership criteria and highlighting KNN’s performance.

4. Figure 6 is not clear. The description and discussion is also missing.

We sincerely thank the reviewer for this valuable feedback. We have revised the results section to clarify the description of Figure 6. Also, we have explicitly stated that Figure 6 illustrates the grading performance of the top three membership criteria across different bowel activity levels using Data Set 2. The axes have been clearly defined, and the findings have been explained to highlight the trends observed. Additionally, a new paragraph has been added to the Discussion section to further elaborate on the implications of Figure 6.

5. English writing need to be improved for clarity.

Thanks to the reviewer. A general English revision is performed for language refinement.

Attachment

Submitted filename: Response to reviewers.docx

pone.0323440.s003.docx (16.6KB, docx)

Decision Letter 1

Mohammad Amin Fraiwan

9 Apr 2025

A novel spectral analysis-based grading system for gastrointestinal activity

PONE-D-25-03395R1

Dear Dr. cengizler,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Mohammad Amin Fraiwan

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: thanks, all are done

there are not any additional comments.

the paper is reviewed by authors

well, done

Reviewer #2: The authors have addressed all issues that I suggested. It can be accepted for publication in current form.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Ahmed A. A. Gad-Elrab

Reviewer #2: Yes: Yuee Li

**********

Acceptance letter

Mohammad Amin Fraiwan

PONE-D-25-03395R1

PLOS ONE

Dear Dr. Cengizler,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Mohammad Amin Fraiwan

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Spectral features extracted from Dataset 1.

    (CSV)

    pone.0323440.s001.csv (308.4KB, csv)
    S2 Table. Spectral features extracted from Dataset 2.

    (CSV)

    pone.0323440.s002.csv (16.8KB, csv)
    Attachment

    Submitted filename: Response to reviewers.docx

    pone.0323440.s003.docx (16.6KB, docx)

    Data Availability Statement

    The first dataset is hosted on Kaggle and can be accessed at https://www.kaggle.com/datasets/robertnowak/bowel-sounds (accessed on 23/02/2024), under the CC BY-NC 4.0 License. The second dataset is publicly available through the Health Education Media Library at the University of Western Ontario, accessible at https://ir.lib.uwo.ca/clinicalskills_abdominalexam/ (accessed on 16/08/2024), under the Creative Commons Attribution-Noncommercial 4.0 License (CC BY-NC 4.0). The dataset was created by the Arthur Labatt Family School of Nursing and includes audio recordings of bowel sounds. All spectral feature tables used in the analysis have been uploaded as Supporting Information files.


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES