Skip to main content
PLOS One logoLink to PLOS One
. 2026 Mar 27;21(3):e0345753. doi: 10.1371/journal.pone.0345753

Assessing the impact of human error factors on railway accident severity: Evidence from accident investigation reports in Korea

Changhun Kim 1, Jun Lee 1,*
Editor: Qing-Chang Lu2
PMCID: PMC13028362  PMID: 41894360

Abstract

This study investigates the role of human error factors in shaping the severity of railway accidents. Using a structured coding scheme to transform qualitative accident investigation reports into quantitative variables, the analysis reveals that deficiencies in managerial oversight, shortcomings in maintenance practices, and failures in equipment and system reliability are consistently associated with higher accident costs. These findings underscore the organizational and technical dimensions of human error as critical factors linked to accident severity, rather than merely front line worker mistakes. At the modeling level, the explanatory variables collectively account for a meaningful portion of the variation in accident costs, indicating that such outcomes are not purely random but systematically related to underlying human and organizational conditions. By identifying the relative importance of managerial, maintenance, and equipment-related deficiencies, this study provides empirical evidence for prioritizing these factors in railway safety management. The results highlight the need to strengthen organizational accountability, improve maintenance regimes, and ensure the reliability of technical systems as essential strategies for mitigating high-consequence accidents. These insights contribute to a deeper understanding of the organizational and systemic correlates of accident severity and can inform targeted interventions aimed at enhancing overall railway safety.

1. Introduction

Railway systems are widely recognized as complex socio-technical infrastructures in which safety emerges from the interaction of technical components, human operators, and organizational structures [1]. Despite continuous advances in automation, digital signaling, and system engineering, human and organizational factors remain persistent sources of risk in railway operations [2]. According to the National Safety Council (NSC) [3], between 2015 and 2024, railway incidents in the United States resulted in an annual average of approximately 3,800–4,300 cases when fatal and non-fatal events among railway employees are combined (Fig 1). This sustained level of workforce casualties indicates that risks related to human and organizational factors remain substantial.

Fig 1. Employee railroad incidents in the United States (2015–2024).

Fig 1

Source: NSC Injury Facts (2024).

Railway employees—including operators, signal staff, maintenance workers, and construction personnel—are routinely exposed to high-risk environments [45], where even relatively minor human errors can escalate into severe incidents [6]. In addition to injuries and fatalities, such accidents are associated with service disruptions and economic losses [7], making workforce safety an important empirical issue in railway systems.

Recent studies have shown that railway accidents are rarely the result of a single technical failure and are more often associated with combinations of human error, organizational deficiencies, and management-related shortcomings [89]. Previous work has further emphasized the importance of human factors in safety-critical signaling operations [10]. In this context, official accident investigation reports are widely used as primary information sources because they document accident circumstances, causal chains, and contributory factors in detail.

Although research on railway safety and human factors has expanded, much of the existing literature remains qualitative or case-based. Only limited efforts have been made to transform narrative accident investigation reports into structured quantitative datasets suitable for large-sample statistical analysis. One example is a previous study that demonstrates the feasibility of extracting structured risk factors from railway accident reports [11]. Nevertheless, investigation reports continue to be used far more often for qualitative analysis and safety recommendations than for quantitative modeling.

Moreover, most existing studies focus on accident occurrence or accident causation mechanisms rather than on accident severity measured in economic terms, such as accident cost and material damage. Accident severity expressed in financial terms represents an important dimension of railway safety impact; yet, only a small number of studies have examined severity-related outcomes in the railway context. Among them, one study provides one of the few empirical analyses linking accident characteristics to financial losses [12]. Recent review work also indicates that large-sample, quantitative, report-based analyses focusing on accident severity remain relatively rare [13].

Against this background, the purpose of this study is not to validate or benchmark a classification scheme itself, but to use a structured coding scheme as an analytical tool to examine which types of human and organizational errors are more strongly associated with accident cost severity. This study focuses on workforce-related railway accidents and transforms qualitative findings from official accident investigation reports into a structured data set for statistical analysis. By doing so, it provides empirical evidence on the relative importance of different categories of human and organizational deficiencies in relation to accident severity and contributes to a more systematic, data-driven understanding of high-consequence railway accidents.

2. Literature review

2.1 Human error factors among railway workers

Human error has long been recognized as a dominant cause of accidents across high-risk sectors, yet systematic research focusing specifically on railway employees remains limited. Early railway safety research tended to emphasize technical causes, including mechanical breakdowns or signaling failures [1415], while human and organizational deficiencies were treated as secondary considerations.

However, empirical evidence from multiple national contexts underscores the central role of human and organizational deficiencies in railway safety. Analyses of U.S. incidents have repeatedly identified inadequate supervision as a contributing factor [1617]. In Australia, human error has been confirmed as a causal factor in official investigation reports [18], while in South Africa, high accident rates have been linked to weak supervision and a negative safety culture [19]. Formal inquiries into major rail accidents have also highlighted communication breakdowns and hierarchical “authority gradients” among workers as causal contributors [20]. In addition, the U.S. Federal Railroad Administration has reported that accident and injury rates in railroad yards significantly exceed industry averages, prompting research into workforce safety issues such as safety climate and training [21]. Regulatory oversight itself has also been noted as an important influence on employee safety outcomes [22]. Collectively, these findings demonstrate that supervisory practices, organizational culture, and human factors remain fundamental drivers of accident risk across diverse railway systems.

A growing body of research has also explored the influence of safety culture and organizational attitudes toward risk. Comparative studies indicate that organizations with proactive reporting systems, continuous safety education, and participatory decision-making processes tend to have lower accident rates [23]. Conversely, rigid hierarchical communication structures and punitive approaches to error reporting may exacerbate risks by discouraging disclosure of near misses [24]. Despite these insights, the literature on railway workers is fragmented and often qualitative, offering descriptive accounts of accidents without robust quantification of the relative importance of different human error categories [25]. This lack of systematic frameworks for classification and measurement underscores the need for more rigorous, data-driven approaches to evaluating human error factors in railway operations.

2.2 Accident report data and analytical frameworks

Accident investigation reports are widely regarded as one of the most informative yet underutilized data sources for safety research [2627]. They typically provide detailed accounts of accident circumstances, contributing factors, and outcomes, but the narrative and semi-structured nature of these documents often makes them difficult to analyze systematically. In other transportation sectors, researchers have addressed this limitation by transforming raw reports into structured datasets, which in turn have enabled large-scale statistical analysis [28]. For example, in aviation safety studies, narrative reports of crew errors have been systematically classified into error categories [29], while in road safety, police crash reports have been reorganized to capture human, environmental, and technical factors [30]. These efforts have revealed recurring causal patterns, such as driver distraction in road crashes and communication failures in aviation incidents [31].

In the railway sector, however, the use of accident reports has been more limited. Existing studies have often remained qualitative or descriptive, emphasizing case studies that highlight maintenance deficiencies, aging equipment, or procedural violations without translating them into variables suitable for quantitative evaluation [32]. A small number of studies have attempted to apply structured approaches, but these have rarely extended beyond basic categorization or summary statistics [33]. Compared with the more developed practices in aviation and road safety, railway research still lacks systematic frameworks for coding and analyzing the human error content embedded in accident reports [34].

This absence of consistent methods for structuring and analyzing the human error content of railway reports illustrates the current limitations of the field. While accident investigation documents provide rich qualitative descriptions, they have seldom been transformed into formats that allow systematic evaluation of workforce-related error factors.

2.3 Research gap and contributions

Despite growing recognition of the importance of human error in railway safety, existing research has remained largely descriptive and has not developed systematic frameworks for classifying or quantifying worker-related factors. At the same time, although accident investigation reports provide rich qualitative accounts of incidents, their potential as a data source for quantitative modeling has not been fully realized. These limitations have restricted the ability of prior studies to determine which categories of human error most strongly influence accident outcomes.

This study seeks to address these gaps in several ways. First, it shifts the focus of railway safety research toward the workforce, providing an evidence-based perspective on employee safety that has often been overlooked. Second, it consolidates diverse human error factors into a structured framework, enabling both systematic classification and empirical analysis. Third, by transforming narrative accident reports into a structured database and applying quantitative modeling techniques, the study demonstrates how the relative influence of different error categories on accident severity can be rigorously assessed. Finally, the findings provide a basis for advancing evidence-informed approaches to railway workforce safety. By identifying the relative influence of different error categories, the study supports the design of comprehensive strategies that may include organizational, technical, and human-centered measures, with potential applications in supervisory systems, maintenance planning, and workforce health management.

3. Data

3.1 Data source and case selection

The primary data set for this study consists of official railway accident investigation reports issued by the Aviation and Railway Accident Investigation Board (ARAIB) of the Republic of Korea [35]. These reports are mandated under the Act on the Investigation of Aviation and Railway Accidents, which establishes a standardized and independent investigation process oriented toward safety improvement rather than fault attribution. We collected and reviewed all railway accident investigation reports published between 2011 and 2025, thereby ensuring more than a decade of data produced under a consistent institutional and legal framework.

Under Article 4 of the Act, the Investigation Board is responsible for collecting and analyzing accident-related information, determining causes, and issuing safety recommendations. The scope of reportable accidents is defined in Article 2 and includes events such as collisions, derailments, operationally disruptive fires, accidents involving three or more casualties, and cases with property damage exceeding 50 million KRW. Importantly, an accident is subject to investigation if it satisfies any one of these criteria, rather than all of them simultaneously. Accordingly, the investigation reports cover a broad range of accidents defined by both the nature and the severity of the event under a unified legal framework.

Each report follows a standardized structure comprising a factual section and an analytical section. The factual section documents the event chronology, casualties and damages, personnel and operational information, and technical data on rolling stock, track, electrical, and signaling systems. The analytical section integrates these elements to evaluate human performance, infrastructure conditions, and organizational procedures. The final report concludes with an official determination of causes and safety recommendations. Fig 2 presents examples of accident evidence included in these reports, providing visual documentation that supports the assessment of both causal factors and damage severity.

Fig 2. Examples of accident evidence from investigation reports.

Fig 2

(Left) Signal equipment room terminal block before and after the accident. (Right) Condition of the accident train at the time of the incident. Source: Aviation and Railway Accident Investigation Board (ARAIB), Republic of Korea. Images reproduced for illustrative purposes.

The dependent variable in this study, accident cost, is constructed directly from the official damage assessment reported in Section 1.2 (Damage Information) of the factual section of each report, which consists of Section 1.2.1 (Casualties), Section 1.2.2 (Property Damage), and Section 1.2.3 (Other Damage). Because casualties are reported only in terms of the number of deaths and injuries and are not monetized, the cost measure used in this study is defined as the sum of property damage and other damage. Property damage includes losses related to track, rolling stock, signaling systems, and structures and facilities, while other damage covers compensation and economic losses associated with service disruptions, such as line closures and train delays. These amounts are officially determined and finalized by the Investigation Board through on-site investigations, technical testing, and deliberative procedures, and are published in the final investigation reports.

Because the same legal framework, reporting format, and assessment procedures have been applied consistently throughout the sample period, the accident cost measure provides a reasonable degree of temporal comparability across years, although it should be understood as an official ex post damage assessment rather than an accounting-based valuation.

With respect to sample construction, a total of 143 railway accident investigation reports were published by ARAIB during the study period. Among these, 105 cases were included in the final analytical sample, while 38 cases were excluded based on explicit exclusion criteria. First, cases in which no accident cost was reported or could be identified were excluded, such as incidents involving derailments without any physical damage or measurable property loss. Second, duplicate reports referring to the same accident (e.g., re-investigations or revised versions of previously published reports) were removed to avoid double counting. Third, unresolved cases in which the accident causes were not conclusively determined at the time of reporting were excluded. As a result, the final sample consists of 105 accidents for which both reliable cost information and sufficiently clear causal analyses were available for subsequent coding and quantitative analysis.

With respect to explanatory variables, each report specifies both the primary cause of the accident and a set of contributory factors. This standardized structure provides a consistent basis for identifying and coding human, technical, and organizational error factors, ensuring transparent and methodologically consistent mapping from qualitative investigation findings to predefined error categories.

3.2 Conceptual framework and coding scheme

To transform the qualitative content of accident investigation reports into analyzable data, this study adopts a structured classification framework for human and organizational error factors. Particular attention is directed to the sections describing official accident causes and contributory factors, as these elements are formally identified by investigators and therefore provide a consistent and authoritative basis for identifying human error variables.

Based on prior studies in railway safety and human factors, as well as a review of established human error classification frameworks across safety-critical domains, contributory factors are classified into eight categories (C1–C8). These categories represent recurring human and organizational deficiencies documented in accident investigations, including supervision and management, training and education, maintenance and inspection, equipment and emergency systems, safety regulation and compliance, communication, decision-making and competence, and physical and health conditions.

The conceptual foundations of each category and their correspondence with existing human reliability analysis frameworks are summarized in Table 1, which provides the theoretical rationale supporting the validity of the C1–C8 classification scheme. By grounding the classification in both prior literature and the institutional logic of accident investigation practice, this framework is intended to capture the major dimensions of human and organizational contributions to railway accidents in a systematic and reproducible manner.

Table 1. Basis for defining categories (C1–C8) from prior human error classifications.

Human Error HFACS (Aviation domain) RSSB (Rail domain) HPEP (Nuclear domain)
C1 Unsafe Supervision Competence Management
(Interpretive)
Management Oversight
C2 Organizational Influences (training elements) Knowledge, Skills, Abilities emphasis (domain focus) Implicit in Performance Evaluations
C3 Preconditions
(technical environment)
Infrastructure/System condition (Interpretive) Task Environment
(technical focus)
C4 Preconditions
(physical environment)
Task/Workplace system factors (domain focus) Human-System Interface
C5 Organizational Influences (process) Procedures and guidance domain (Interpretive) Implicit Regulatory/
Procedural Evaluation
C6 Preconditions
(group factors)
Communications
(domain focus)
Communications/ Coordination & Control
C7 Unsafe Acts
(decision/policy level)
Decision and competency
(Interpretive)
Decision-related measures in performance Evaluation
C8 Preconditions
(physiological)

Workload/Fatigue domain focus (domain focus)
Fitness/Health aspects in performance Evaluation

Note. The coding categories (C1–C8) were developed based on and conceptually aligned with established human factors frameworks, including HFACS [36], the RSSB human factors guidance [37], and the HPEP framework [3839].

3.3 Coding procedure and reliability

For empirical coding, each category was operationalized using observable indicators derived from investigation reports. Table 2 presents representative phrases and investigation findings that were used to assign a value of “1” to each category, thereby clarifying how abstract human factor concepts were translated into report-based coding decisions. These indicators were not treated as an exhaustive checklist but served as guiding criteria to ensure consistent interpretation across cases. When at least one indicator corresponding to a category was explicitly identified as a cause or contributory element in the report, the category was coded as present (1); otherwise, it was coded as absent (0).

Table 2. Operational definitions and report-based evidence for coding categories (C1–C8).

Code Description Representative Investigation Findings
C1 Deficiencies in Supervisory and Managerial Control of Safety-Critical Activities The station operations manager handled train entry without complying with operating regulations, resulting in derailment
C2 Inadequate Organizational Training and Competency Development Insufficient training on operational rules and driving regulations
C3 Deficiencies in Maintenance, Inspection, or Technical System Integrity Track gauge and cross-level measurements showed that the cross-level in a curved section exceeded the allowable limit
C4 Inadequacies in Equipment, Emergency Systems, or Human-System Interfaces The transition curve was constructed shorter (20m) than the design standard (30m) due to topographical constraints
C5 Noncompliance With or Circumvention of Safety Regulations and Procedures The train was operated with a load exceeding the allowable limit
C6 Failures in Communication, Coordination, or Information Transfer The dispatcher provided incorrect speed (45 km/h) information to the rescue train (25 km/h)
C7 Deficient Operational Decision-Making or Task-Related Competence After the initial accident, the local controller failed to display a stop signal
C8 Performance Degradation Due to Fatigue or Physical and Health Conditions The train driver ignored the stop signal due to drowsiness and reduced alertness

To enhance coding reliability, the reports were independently reviewed and coded by two members of the research team in separate coding rounds. Inter-rater agreement was assessed using Cohen's kappa, which indicated a high level of consistency between coders (mean κ = 0.88, range: 0.62–0.98; see Table 3).

Table 3. Inter-rater agreement for human error categories (Cohen’s kappa).

Category C1 C2 C3 C4 C5 C6 C7 C8 Mean Range
Cohen`s κ 0.981 0.621 0.943 0.961 0.935 0.918 0.854 0.852 0.883 0.621-0.981

Note. κ ≥ 0.80 indicates almost perfect agreement, and 0.60 ≤ κ < 0.80 indicates substantial agreement.

Discrepancies were subsequently examined through joint discussion, and final coding decisions were determined through consensus. In addition, an external railway safety expert reviewed the coding scheme and a subset of representative case assignments to validate the conceptual consistency of the coding logic. Cases that remained ambiguous after the initial comparison—particularly those involving borderline distinctions between training and competency deficiencies (C2) and adjacent categories such as supervisory control (C1) or operational decision-making (C7)—were flagged and re-evaluated through joint discussion with expert consultation until final coding decisions were reached by consensus. The full coding scheme with category definitions and illustrative examples is provided in Appendix A in S1 File.

Although binary coding may simplify the complex nature of human error mechanisms, the level of detail available in official investigation reports does not allow reliable quantitative measurement of the relative severity or contribution of individual factors. Given this limitation, an all-or-nothing coding scheme was considered the most appropriate and objective approach for identifying the presence of contributory human and organizational factors across cases. This strategy is consistent with prior accident analysis studies that rely on investigation reports as primary data sources and aim to capture factor occurrence rather than magnitude.

The overall coding workflow is illustrated in Fig 3, which depicts the sequential procedure by which qualitative descriptions are reviewed, highlighted, assigned to predefined categories, and subsequently encoded into binary variables. In addition to showing the conversion process, Fig 3 also illustrates how coded variables are compiled into a structured database, enabling consistent storage, retrieval, and comparative analysis of accident cases. By representing each human error factor as a standardized indicator grounded in both theoretical frameworks (Table 1) and report-based evidence (Table 2), the resulting data set supports transparent and reproducible analysis and provides a consistent basis for applying regression-based methods to evaluate the association between human error factors and accident severity.

Fig 3. Workflow for extracting, categorizing, and coding accident report data.

Fig 3

3.4 Data description

The data set used in this study consists of 105 train accident cases compiled from official investigation reports. Each case is treated as a single observation, and the primary dependent variable is accident cost, expressed in its original monetary value prior to any transformation. Accident costs exhibit substantial variation across cases, capturing the wide range of financial consequences associated with train accidents.

Table 4 presents the descriptive statistics of the dependent variable together with the eight independent indicators of human error. The results indicate that supervisory and management-related deficiencies are the most frequently observed factor, while health-related conditions appear only rarely. Overall, the data set captures both high-cost accidents with significant human error involvement and lower-cost events with fewer contributory factors, thereby providing a comprehensive empirical basis for subsequent statistical analysis.

Table 4. Descriptive statistics of variables (N = 105).

Variable Description Mean
(Prevalence)
Std. Dev. Min – Max
Cost Accident cost (KRW, original value) 144.46M 250.62M 0.17M – 1593.00M
C1 Deficiencies in Supervisory and Managerial Control of Safety-Critical Activities (0/1) 0.43 0.50 0 - 1
C2 Inadequate Organizational Training and Competency Development (0/1) 0.08 0.27 0 - 1
C3 Deficiencies in Maintenance, Inspection, or Technical System Integrity (0/1) 0.48 0.50 0 - 1
C4 Inadequacies in Equipment, Emergency Systems, or Human-System Interfaces (0/1) 0.42 0.50 0 - 1
C5 Noncompliance With or Circumvention of Safety Regulations and Procedures (0/1) 0.32 0.47 0 - 1
C6 Failures in Communication, Coordination, or Information Transfer (0/1) 0.12 0.33 0 - 1
C7 Deficient Operational Decision-Making or Task-Related Competence (0/1) 0.24 0.43 0 - 1
C8 Performance Degradation Due to Fatigue or Physical and Health Conditions (0/1) 0.03 0.17 0 - 1

4. Methodology

4.1 Log transformation of the dependent variable

The dependent variable of this study is accident cost, initially expressed in its original monetary values. Preliminary inspection reveals that the raw distribution of costs is heavily right-skewed, with a small number of accidents accounting for extremely high losses. As shown in Fig 4, the raw distribution deviates substantially from normality.

Fig 4. Distribution of accident cost before and after log transformation.

Fig 4

This distributional pattern highlights the need for modeling approaches that can accommodate strictly positive and highly skewed outcomes. Accordingly, in the main analysis, this study adopts a Gamma generalized linear model (GLM) with a log link, which is specifically designed for such data structures.

For comparison and robustness checks, a log-linear specification is also considered. To this end, the natural logarithm of accident cost is applied. This transformation compresses the scale of extreme values and results in a distribution that more closely approximates normality, as illustrated in the right panel of Fig 4.

The improvement is further supported by the Q–Q plots presented in Fig 5, which compare the distributions before and after transformation. The log-transformed variable aligns more closely with the theoretical normal line, with only minor deviations in the tails. These results indicate that the log transformation provides a reasonable approximation for alternative log-linear specifications, which are used as supplementary analyses alongside the baseline Gamma GLM.

Fig 5. Q–Q plots of accident cost before and after log transformation.

Fig 5

4.2 Multicollinearity and co-occurrence diagnostics

Before conducting the regression analyses, potential dependence among the explanatory variables was carefully examined to assess the risk of multicollinearity and related identification problems. As a first step, we evaluated linear multicollinearity using the Variance Inflation Factor (VIF) [40]. VIF values indicate how much the variance of an estimated coefficient is inflated due to linear dependence among regressors. Common rules of thumb suggest that values exceeding 10 indicate severe multicollinearity, while more conservative thresholds such as 5 are often applied in safety-related research [41].

The VIF results for the eight human error variables (C1–C8) all fall well below the conservative threshold of 5, indicating that no serious linear multicollinearity is present. This confirms the suitability of including all variables simultaneously in the regression models without the need for dimensionality reduction or variable aggregation. The detailed VIF results are reported in Table 5.

Table 5. VIF results for human error variables (C1–C8).

Variable VIF Tolerance Variable VIF Tolerance
C1 1.06 0.94 C5 1.31 0.77
C2 1.27 0.79 C6 1.27 0.79
C3 1.10 0.90 C7 1.31 0.76
C4 1.31 0.90 C8 1.10 0.91

Because the explanatory variables are binary indicators, linear collinearity diagnostics alone may not fully capture their joint occurrence structure. We therefore additionally examined pairwise co-occurrence patterns among C1–C8 using phi correlation coefficients. Fig 6 presents both a heat-map and the corresponding correlation matrix for the pairwise phi correlations among the eight human error categories. The figure shows that all off-diagonal associations are modest in magnitude, with all absolute values remaining below 0.35, and no block structure or clustering pattern is observed.

Fig 6. Pairwise phi correlations among C1–C8 shown as a heatmap and a correlation matrix.

Fig 6

Taken together, these diagnostics suggest that the human error categories are not strongly dependent on one another, either in a linear sense or in terms of joint occurrence patterns. This diagnostic evidence indicates that any lack of statistical significance observed in subsequent regression models is unlikely to be primarily attributable to masking effects due to multicollinearity or systematic co-occurrence among the explanatory variables.

4.3 Model performance comparison

To determine an appropriate modeling framework for accident cost severity, this study adopts a two-step specification strategy that explicitly accounts for both the distributional characteristics of the dependent variable and the robustness of the estimation results.

First, to assess the appropriate distributional form for accident cost, a Gamma generalized linear model (GLM) with a log link is compared with a log-linear OLS model. The Gamma GLM is estimated on accident cost in levels under a Gamma likelihood, whereas the log-linear OLS model is estimated on ln(COST) assuming normally distributed errors. Because these two models rely on different likelihood functions and are defined on different scales of the dependent variable, their goodness-of-fit statistics are not strictly comparable in a formal sense, and any direct comparison of likelihood-based criteria should therefore be interpreted with caution.

Nevertheless, likelihood-based measures such as the log-likelihood, AIC, and deviance are reported to verify that the Gamma specification fits the data in a numerically stable and well-behaved manner (Table 6). The results show that the Gamma GLM converges properly and yields finite and stable fit statistics, indicating that the model does not suffer from numerical instability or obvious misspecification.

Table 6. Comparison of distributional specifications for accident cost.

Model DV Dist. LogLik AIC BIC RMSE MAE
Gamma GLM Cost Gamma −2045.43 4110.86 −229.30 2.34e8 1.46e8
OLS_Log (ln Cost) ln Cost Normal −203.09 424.17 448.06 2.61e8 1.27e8

To complement this assessment, we additionally examine predictive performance on the original cost scale using the root mean squared error (RMSE) and the mean absolute error (MAE). Although this comparison is not fully symmetric due to the retransformation required for the log-linear OLS predictions, these measures provide a practical benchmark for model performance. As reported in Table 6, the Gamma GLM achieves a lower RMSE (2.34e8) than the log-linear OLS model (2.61e8), whereas the log-linear OLS model yields a lower MAE (1.27e8) than the Gamma GLM (1.46e8). Thus, the two models exhibit broadly comparable predictive performance, with no decisive dominance of either specification in terms of overall prediction error.

Taken together, these results indicate that the choice of the Gamma GLM is not driven primarily by superior predictive accuracy, but rather by its theoretical and empirical suitability for modeling a positive, right-skewed, and highly heterogeneous cost variable. Given the strong skewness and heteroskedastic nature of accident cost data, the Gamma GLM with a log link provides a more appropriate distributional framework for the dependent variable and is therefore adopted as the baseline specification in this study.

Second, within the log-linear framework, OLS, Ridge, and Lasso regressions are compared as robustness checks. All three models use ln(COST) as the dependent variable, but Ridge and Lasso impose L2 and L1 regularization, respectively, with the regularization parameter selected by cross-validation. Because regularized estimators do not have a standard definition of R² and are not estimated by maximum likelihood in a directly comparable way, model performance is evaluated using in-sample and cross-validated prediction errors on the log scale, such as RMSE and MAE (Table 7).

Table 7. Robustness checks using regularized log-linear models.

Model DV RMSE(log scale) MAE (log scale) CV RMSE (log scale)
OLS ln Cost 1.673 1.362 1.841
Ridge ln Cost 1.729 1.373 1.790
Lasso ln Cost 1.818 1.426 1.801

As reported in Table 7, the three models exhibit very similar predictive performance. OLS achieves the lowest in-sample RMSE (1.673) and MAE (1.362), while Ridge yields a slightly lower cross-validated RMSE (1.790) than OLS (1.841) and Lasso (1.801). However, the differences across models are small, indicating that the results are not sensitive to the choice of regularization and are not driven by overfitting or multicollinearity within the log-linear framework.

Taken together, these results support the use of the Gamma GLM with a log link as the baseline specification. This choice is primarily motivated by the strictly positive and highly right-skewed nature of accident cost and is further reinforced by the satisfactory and stable goodness-of-fit of the Gamma specification. The log-linear models are retained as supplementary specifications for robustness checks.

The baseline Gamma GLM specification can be expressed as follows:

E(Costi)=exp(β0+β1X1i+β2X2i++βkXki) (1)

Notes on the model specification:

Costᵢ: accident cost for observation i (dependent variable).

X₁, …, Xk: explanatory variables included in the model.

βj: estimated regression coefficients.

This formulation ensures strictly positive fitted values and allows the variance of accident cost to increase with its mean, which is consistent with the empirical properties of the data. It therefore provides a theoretically appropriate and empirically robust framework for examining the factors associated with accident cost severity.

5. Results

Following the comparative evaluation of alternative specifications, the Gamma generalized linear model (GLM) with a log link was adopted as the baseline model for accident cost analysis. This choice is motivated by the strictly positive and highly right-skewed nature of accident cost and is supported by the stable goodness-of-fit of the Gamma specification. Log-linear OLS and its regularized variants are retained as supplementary models for robustness checks.

The estimation results of the baseline Gamma GLM are reported in Table 8. The dependent variable is accident cost in levels, and eight explanatory variables representing human error categories (C1–C8) are included in the model. The table reports coefficient estimates, standard errors, z-statistics, and p-values. Because a log link is used, the coefficients can be interpreted in multiplicative terms: exp(β) represents the proportional change in expected accident cost associated with a one-unit change in the explanatory variable.

Table 8. Baseline regression results (Gamma GLM).

Variable Coefficient Std. Error z-Statistic p-Value
Constant 17.851 0.313 57.117 0.000
C1 (Supervisory and Managerial Control) 0.898 0.282 3.185 0.001***
C2 (Training and Competency Development) 0.674 0.575 1.171 0.242
C3 (Maintenance and Inspection) 0.529 0.285 1.854 0.064
C4 (Equipment and Emergency Systems) 0.295 0.289 1.021 0.307
C5 (Safety Regulation Noncompliance) −0.292 0.331 −0.882 0.378
C6 (Communication Failures) 0.979 0.462 2.118 0.034*
C7 (Decision-Making and Task Competence) −0.161 0.364 −0.441 0.659
C8 (Fatigue and Health Conditions) −2.347 0.854 −2.749 0.006**

Note: Statistical significance is evaluated at the 0.05 (*), 0.01 (**) and 0.001(***) levels.

Among the explanatory variables, C1 (Supervisory and Managerial Control) and C6 (Communication and Coordination Failures) exhibit positive and statistically significant coefficients, indicating that accidents involving organizational and coordination failures are associated with substantially higher accident costs. In particular, the estimated coefficients imply that the presence of C1 is associated with roughly a twofold increase in the expected accident cost, while the presence of C6 is associated with an increase of approximately 2.5 to 3 times. By contrast, C8 (Fatigue and Physical/Health Conditions) shows a statistically significant negative coefficient.

Importantly, the negative coefficient on C8 should not be interpreted as implying that physical or health-related problems reduce accident risk. Rather, conditional on other technical and organizational factors, accidents primarily attributed to such individual-level conditions tend to involve smaller material and operational losses. In contrast, accidents driven by organizational, technical, and systemic failures are much more likely to escalate into large-scale, high-cost events. This interpretation is also consistent with the fact that the cost measure used in this study does not monetize human injuries and therefore primarily reflects material and operational damage.

Other factors, including C2 (Training and Competency Development), C3 (Maintenance and Inspection), C4 (Equipment and Emergency Systems), C5 (Safety Regulation Noncompliance), and C7 (Decision-Making and Task Competence), do not reach conventional levels of statistical significance in the Gamma GLM, suggesting that their effects on accident cost are either weaker or more context-dependent once organizational and systemic factors are taken into account.

Robustness checks based on the log-linear OLS, Ridge, and Lasso specifications yield qualitatively similar patterns in terms of coefficient signs and relative importance, confirming that the main conclusions do not depend on the specific estimation method.

Overall, these results provide consistent evidence that accident cost severity is more strongly associated with organizational and systemic deficiencies rather than with individual-level conditions alone, reinforcing the view that high-cost accidents in complex railway systems are rooted in higher-level structural and organizational weaknesses.

6. Discussion and conclusion

This study examined the role of human error factors in shaping the severity of train accidents using 105 official investigation reports. Using a structured coding scheme as an analytical tool to transform qualitative investigation findings into quantitative variables, the analysis provides consistent evidence that organizational and technical deficiencies—particularly supervisory failures and coordination problems—are most strongly associated with higher accident costs. These results reinforce the view that accident severity is not merely a consequence of front-line worker mistakes, but is deeply rooted in higher-level systemic and organizational structures.

An especially important insight from the empirical results is the contrast between organizational/systemic factors and individual-level conditions. While organizational and coordination failures are associated with substantially higher accident costs, accidents primarily attributed to individual physical or health conditions tend to involve significantly smaller material and operational losses. This pattern should not be interpreted as implying that individual-level problems are unimportant for safety. Rather, it suggests that accidents rooted in organizational and systemic failures are far more likely to escalate into large-scale and high-cost events, whereas individual-level failures more often correspond to relatively localized and less costly incidents. This interpretation is also consistent with the fact that the cost measure used in this study does not monetize human injuries and therefore primarily reflects material and operational damage.

The findings carry both theoretical and practical implications. Theoretically, the study demonstrates how qualitative accident reports can be systematically converted into structured variables, not for the purpose of validating the classification scheme itself, but to enable comparative and reproducible empirical analysis of different types of human and organizational errors within sociotechnical systems. By bridging narrative data with statistical modeling, this research enhances methodological rigor and expands the scope of railway safety studies beyond descriptive or anecdotal approaches. Practically, the results highlight that effective safety strategies must extend beyond front-line worker performance to encompass organizational oversight, coordination mechanisms, preventive maintenance, and infrastructure reliability. For regulators and railway operators, this implies that investments in supervisory systems, organizational monitoring, and preventive maintenance regimes are likely to yield higher returns in terms of accident loss reduction than measures aimed solely at training or penalizing individual workers.

A key methodological contribution of this study lies in its integration of narrative-based accident investigation reports with quantitative modeling techniques. This approach not only strengthens analytical robustness but also illustrates a transferable analytical procedure that can be applied to other high-risk sectors such as aviation, maritime transport, and nuclear energy. By demonstrating how qualitative accident narratives can be converted into structured data for empirical testing, the study opens up new opportunities for comparative cross-industry analyses and broader applications of human error quantification.

These results should be interpreted in light of several limitations inherent in the institutional and data environment of railway accident investigation. The data set is limited to officially investigated accidents, reflecting the institutional context in which serious accidents are legally required to be investigated and disclosed, whereas minor incidents are treated as internal safety materials and are not publicly accessible. As a result, the data set provides near-comprehensive coverage of severe accidents, but does not represent the full universe of all railway incidents. This implies a selection effect toward high-severity events, and the estimated relationships should therefore be interpreted as associations conditional on serious accidents, rather than as population-average effects across the entire spectrum of incidents.

With respect to accident costs, although formal guidelines for cost estimation exist, the reconstruction of costs from investigation reports inevitably involves a degree of investigator judgment, which introduces measurement noise. From a statistical perspective, this primarily affects the precision and stability of coefficient estimates rather than their qualitative direction, implying that the reported magnitudes should be interpreted as approximate associations rather than exact causal effect sizes. As the number of observations increases in future studies, such idiosyncratic variation is expected to be partially averaged out.

Finally, the binary coding of human and organizational factors reflects the structure of the current accident investigation system itself, which records causal factors mainly in terms of their presence or absence rather than their intensity or degree. This institutional constraint makes it difficult to model continuous gradients of risk and implies that the estimated effects capture the impact of the existence of a given factor, not the marginal effect of its severity. Consequently, the analysis likely understates the underlying complexity and heterogeneity of human and organizational error mechanisms.

In conclusion, within the scope of officially investigated serious accidents, organizational and technical deficiencies remain the most critical factors associated with accident severity despite advances in railway technology and regulation. By systematically transforming accident reports into structured evidence and applying appropriate statistical models, this study provides a foundation for evidence-based strategies aimed at reducing high-consequence accidents.

The findings highlight the importance of shifting the focus of railway safety management from reactive, individual-centered interventions toward proactive, systemic improvements. Taken together, the results should be read as providing robust evidence about the direction and relative importance of systemic factors, rather than as precise estimates of their population-wide causal magnitudes or as universally generalizable effects across all types of railway incidents. Ultimately, addressing organizational and technical weaknesses is likely to deliver more sustainable safety outcomes and to strengthen the long-term resilience of railway transport systems.

Supporting information

S1 File. Appendix.

(DOCX)

pone.0345753.s001.docx (17.9KB, docx)

Data Availability

All data and code files are available from the Zenodo repository at https://doi.org/10.5281/zenodo.18310486.

Funding Statement

This research was supported by a Korean Agency for Infrastructure Technology Advancement (KAIA) grant funded by the Ministry of Land, Infrastructure and Transport (grant no. RS-2023-00239464).” We also confirm in the revised Funding Statement that the funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Topcu TG, Triantis K, Roets B. Estimation of the workload boundary in socio-technical infrastructure management systems: the case of Belgian railroads. Eur J Oper Res. 2019;277(2):702–15. doi: 10.1016/j.ejor.2019.04.009 [DOI] [Google Scholar]
  • 2.Eylem T, Faily S, Dogan H, Freer M. Human factors and cyber-security risks on the railway: the critical role played by signalling operations. Inf Comput Secur. 2024;32(1):73–90. doi: 10.1108/ICS-05-2023-0078 [DOI] [Google Scholar]
  • 3.National Safety Council. Railroad deaths and injuries. Injury facts: home and community safety topics; 2025 [cited 2025 Sept 15]. Available from: https://injuryfacts.nsc.org/home-and-community/safety-topics/railroad-deaths-and-injuries
  • 4.Morgan JI, Abbott R, Furness P, Ramsay J. UK rail workers’ perceptions of accident risk factors: an exploratory study. Saf Sci. 2016;89:57–66. doi: 10.1016/j.ergon.2016.08.003 [DOI] [Google Scholar]
  • 5.Fan J, Smith AP. A preliminary review of fatigue among rail staff. Front Psychol. 2018;9:634. doi: 10.3389/fpsyg.2018.00634 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Carra S, Monica L, Vignali G. The critical role of human factor in safety of complex high-risk working environments. Chem Eng Trans. 2020;82:109–14. doi: 10.3303/CET2082024 [DOI] [Google Scholar]
  • 7.Ercegovac P, Stojić G, Tanackov I, Sremac S. Application of statistical analysis for risk estimate of railway accidents and traffic incidents at level crossings. ENTRENOVA. 2022;8(1):225–38. doi: 10.54820/entrenova-2022-0021 [DOI] [Google Scholar]
  • 8.Bhuiyan MR, Ibtihal SA, Nokshi KN. Involvement of human and organizational factors in railway accidents: application of the human factors analysis and classification system and Bayesian network. Transp Res Rec. 2023;2677(6):505–18. doi: 10.1177/03611981231156932 [DOI] [Google Scholar]
  • 9.Gawlak K. Analysis and assessment of the human factor as a cause of occurrence of selected railway accidents and incidents. Open Eng. 2023;13(1):20220106. doi: 10.1515/eng-2022-0398 [DOI] [Google Scholar]
  • 10.Thron E, Faily S, Dogan H, Freer M. Human factors and cyber-security risks on the railway: the critical role played by signalling operations. Inf Comput Secur. 2024;32(2):236–63. doi: 10.1108/ICS-05-2023-0078 [DOI] [Google Scholar]
  • 11.Hua L, Zheng W, Gao S. Extraction and analysis of risk factors from Chinese railway accident reports. 2019 IEEE Intelligent Transportation Systems Conference (ITSC). IEEE; 2019. p. 3297–302. doi: 10.1109/itsc.2019.8917094 [DOI] [Google Scholar]
  • 12.Dhingra N, Bridgelall R, Lu P, Szmerekovsky JG, Bhardwaj B. Ranking risk factors in financial losses from railroad incidents: a machine learning approach. Transp Res Rec. 2022;2676(9):266–78. doi: 10.1177/03611981221133085 [DOI] [Google Scholar]
  • 13.Nardo MD, Amato S, Dubbioso GR, Gallab M, Upadhyay A, Vaiola F. Railway safety analysis: trends and challenges. Proceeding of the 33rd European Safety and Reliability Conference (ESREL 2023). Research Publishing; 2023. p. 3236–43. doi: 10.3850/978-981-18-8071-1_p033-cd [DOI] [Google Scholar]
  • 14.Pope N. Dickens's The Signalman and information problems in the railway age. Technol Cult. 2001;42(3):436–61. doi: 10.1353/tech.2001.0133 [DOI] [Google Scholar]
  • 15.Tang L. Reliability assessments of railway signaling systems: a comparison and evaluation of approaches [Master's thesis]. Norwegian University of Science and Technology; 2015. Available from: https://ntnuopen.ntnu.no/ntnu-xmlui/handle/11250/2352940 [Google Scholar]
  • 16.Reinach S, Viale A. Application of a human error framework to conduct train accident/incident investigations. Accid Anal Prev. 2006;38(2):396–406. doi: 10.1016/j.aap.2005.10.013 [DOI] [PubMed] [Google Scholar]
  • 17.Zhang Z, Turla T, Liu X. Analysis of human-factor-caused freight train accidents in the United States. J Transp Saf Secur. 2019;12(9):1157–86. doi: 10.1080/19439962.2019.1697774 [DOI] [Google Scholar]
  • 18.Baysari MT, McIntosh AS, Wilson JR. Understanding the human factors contribution to railway accidents and incidents in Australia. Accid Anal Prev. 2008;40(5):1750–7. doi: 10.1016/j.aap.2008.06.013 [DOI] [PubMed] [Google Scholar]
  • 19.Tau S. Analyzing the impact that lack of supervision has on safety culture and accident rates as a proactive approach to curbing the South African railway industry's high incident occurrence rate. In: Goossens R, editor. Adv Soc Occup Ergon. Springer; 2017. doi: 10.1007/978-3-319-41688-5_17 [DOI] [Google Scholar]
  • 20.Luva B, Naweed A. Authority gradients between team workers in the rail environment: a critical research gap. Theor Issues Ergon Sci. 2021;23(2):155–81. doi: 10.1080/1463922x.2021.1881653 [DOI] [Google Scholar]
  • 21.Reinach SJ, Gertler JB. Railroad yard safety: perspectives from labor and management. Proc Hum Factors Ergon Soc Ann Meet. 2002;46(13):1201–4. doi: 10.1177/154193120204601341 [DOI] [Google Scholar]
  • 22.Mendoza CM. Government rule compliance, safety, and the influence of regulation on railroad trainmen [Doctoral dissertation]. Northcentral University. ProQuest Dissertations Publishing; 2017. Available from: https://search.proquest.com/openview/d322493dd3702bc9d37d1a94d695a94b [Google Scholar]
  • 23.Vredenburgh AG. Organizational safety: which management practices are most effective in reducing employee injury rates? J Safety Res. 2002;33(2):259–76. doi: 10.1016/S0022-4375(02)00016-6 [DOI] [PubMed] [Google Scholar]
  • 24.Armstrong G. Commentary: the effect of patient safety culture on nurses’ near-miss reporting intention: the moderating role of perceived severity of near misses. J Res Nurs. 2021;26(1–2):17–8. doi: 10.1177/1744987120979756 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Thommesen J, Andersen H. Human error probabilities (HEPs) for generic tasks and performance shaping factors (PSFs) selected for railway operations. Proc Eur Saf Reliab Conf (ESREL 2012); 2012. [Google Scholar]
  • 26.Macedo JB, Ramos PMS, Maior CBS, Moura MJC, Lins ID, Vilela RFT. Identifying low-quality patterns in accident reports from textual data. Int J Occup Saf Ergon. 2023;29(3):1088–100. doi: 10.1080/10803548.2022.2111847 [DOI] [PubMed] [Google Scholar]
  • 27.Schlesinger D. Sources of transportation accident information. 2016 Joint Rail Conference. ASME; 2016. doi: 10.1115/jrc2016-5836 [DOI] [Google Scholar]
  • 28.Miyamoto A, Bendarkar MV, Mavris DN. Natural language processing of aviation safety reports to identify inefficient operational patterns. Aerospace. 2022;9(8):450. doi: 10.3390/aerospace9080450 [DOI] [Google Scholar]
  • 29.Sarter NB, Alexander HM. Error types and related error detection mechanisms in the aviation domain. Int J Aviat Psychol. 2000;10(2):189–206. doi: 10.1207/S15327108IJAP1002_5 [DOI] [Google Scholar]
  • 30.Nawaz M, Qayyum TI. Development of road crash report form for Punjab traffic police. Int J Curr Res Acad Rev. 2014;2(11):72–87. Available from: http://www.ijcrar.com/vol-2-11/Maryam%20Nawaz%20and%20TanvirIqbal%20Qayyum.pdf [Google Scholar]
  • 31.Stutts JC, Reinfurt DW, Rodgman E. The role of driver distraction in crashes. Proc Assoc Adv Automot Med. 2001;45:287–301. [PubMed] [Google Scholar]
  • 32.Wright LB. The analysis of UK railway accidents and incidents: a comparison of their causal patterns [Doctoral dissertation]. University of Strathclyde; 2002. Available from: https://stax.strath.ac.uk/concern/theses/ws859f716 [Google Scholar]
  • 33.Jeong RG, Kim GY, Kim BH. Study on the improvement of accident classification system for human-error management in electrical railways. Proc Int Conf Electr Mach Syst (ICEMS 2011); 2011. doi: 10.1109/ICEMS.2011.6073549 [DOI] [Google Scholar]
  • 34.Hammerl M, Vanderhaegen F. Human factors in the railway system safety analysis process. In: Vanderhaegen F, editor. Advances in human factors and ergonomics. CRC Press; 2009. p. 107–16. doi: 10.1201/b12742-11 [DOI] [Google Scholar]
  • 35.Aviation and Railway Accident Investigation Board. Aviation and Railway Accident Investigation Board official website. Republic of Korea: Ministry of Land, Infrastructure and Transport; 2025. Available from: https://araib.molit.go.kr/intro.do [Google Scholar]
  • 36.Shappell SA, Wiegmann DA. The human factors analysis and classification system (HFACS). FAA Civil Aerospace Medical Institute Technical Report DOT/FAA/AM-00/7; 2000 [cited 2025 Sept 15]. Available from: https://commons.erau.edu/publication/737/
  • 37.Rail Safety and Standards Board. Understanding human factors: a guide for the railway industry; n.d. Available from: https://www.rssb.co.uk/safety-and-health/improving-safety-health-and-wellbeing/understanding-human-factors
  • 38.International Atomic Energy Agency. Method of psychological root cause analysis of human factors (IAEA-TECDOC-1756); n.d. Available from: https://www-pub.iaea.org/MTCD/Publications/PDF/TE-1756_web.pdf
  • 39.U.S. Nuclear Regulatory Commission. The human performance evaluation process (HPEP); 2011. Available from: https://www.nrc.gov/docs/ML0209/ML020930054.pdf
  • 40.Miles J. Tolerance and variance inflation factor. In: Wiley StatsRef: Statistics Reference Online; 2014. doi: 10.1002/9781118445112.stat06593 [DOI] [Google Scholar]
  • 41.Salas R, Hallowell M. Predictive validity of safety leading indicators. J Constr Eng Manag. 2016;142(10):04016052. doi: 10.1061/(ASCE)CO.1943-7862.0001167 [DOI] [Google Scholar]

Decision Letter 0

Qing-Chang Lu

2 Jan 2026

Dear Dr. Lee,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Feb 16 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

  • A letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols ..

We look forward to receiving your revised manuscript.

Kind regards,

Qing-Chang Lu

Academic Editor

PLOS One

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS One has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. Thank you for uploading your study's underlying data set. Unfortunately, the repository you have noted in your Data Availability statement does not qualify as an acceptable data repository according to PLOS's standards.

At this time, please upload the minimal data set necessary to replicate your study's findings to a stable, public repository (such as figshare or Dryad) and provide us with the relevant URLs, DOIs, or accession numbers that may be used to access these data. For a list of recommended repositories and additional information on PLOS standards for data deposition, please see https://journals.plos.org/plosone/s/recommended-repositories ..

4. When completing the data availability statement of the submission form, you indicated that you will make your data available on acceptance. We strongly recommend all authors decide on a data sharing plan before acceptance, as the process can be lengthy and hold up publication timelines. Please note that, though access restrictions are acceptable now, your entire data will need to be made freely accessible if your manuscript is accepted for publication. This policy applies to all data except where public deposition would breach compliance with the protocol approved by your research ethics board. If you are unable to adhere to our open data policy, please kindly revise your statement to explain your reasoning and we will seek the editor's input on an exemption. Please be assured that, once you have provided your new statement, the assessment of your exemption will not hold up the peer review process.

5. Thank you for stating the following financial disclosure:

The author(s) received no specific funding for this work.

At this time, please address the following queries:

a) Please clarify the sources of funding (financial or material support) for your study. List the grants or organizations that supported your study, including funding received from your institution.

b) State what role the funders took in the study. If the funders had no role in your study, please state: “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.”

c) If any authors received a salary from any of your funders, please state which authors and which funders.

d) If you did not receive any funding for this study, please state: “The authors received no specific funding for this work.”

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

6. Thank you for stating the following in the Acknowledgments Section of your manuscript:

This research was supported by a Korean Agency for Infrastructure Technology Advancement (KAIA) grant funded by the Ministry of Land, Infrastructure and Transport (grant no. RS-2023-00239464).

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

The author(s) received no specific funding for this work.

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

7. If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: Partly

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.-->

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: No

**********

Reviewer #1: The paper addresses a significant and relevant issue in railway safety: the need for a systematic framework to analyze human error from accident reports. The objective is clear, and the proposed approach has the potential to make a valuable contribution to the field. However, several aspects of the manuscript require clarification and strengthening before it can be considered for publication.

1.While the introduction effectively identifies a historical gap in the literature (citing sources from 2002, 2009, and 2011), it would be more compelling if the authors could demonstrate that this gap persists today. I would recommend including a brief discussion of more recent literature.

2. The methodology section mentions that investigation reports were selected based on “incidents of significant safety and operational relevance.” This is too vague. The authors should provide precise inclusion and exclusion criteria.

3.The paper describes a “structured extraction and coding procedure” but provides no details on its implementation.

4. The reference list requires careful proofreading.

5.The snippets suggest the paper’s contribution is a new coding framework. It is unclear if the paper goes on to apply this framework and present findings from the analysis .

Reviewer #2: The manuscript addresses an important and understudied aspect of railway safety: the contribution of human error factors to the severity of railway accidents. The use of a decade of ARAIB investigation reports and the attempt to systematically code contributory factors represent valuable contributions.

However, several methodological issues require attention before the manuscript can be considered for publication.

Below I provide detailed comments aligned with the assessment questions.

1. Technical Soundness

Your approach - extracting structured variables from narrative accident investigation reports and modeling accident costs using a log-linear OLS model - is conceptually appropriate and of potential interest for safety research. However, the technical soundness of the study is only partially demonstrated.

Model assumptions are not validated.

The manuscript does not include regression diagnostics such as residual plots, normality tests, tests for heteroscedasticity, or influence analysis. Without these, the validity of the OLS model cannot be confirmed.

Model explanatory power is weak.

The adjusted R2 (ca 0.08) indicates that the predictors explain only a small fraction of variance. While this does not invalidate the analysis, it requires explicit discussion.

Potential omitted variable bias.

Important covariates such as accident type, operational environment, temporal trends, rolling stock characteristics, and contextual factors are not included in the model. This omission may bias coefficient estimates and weaken interpretative claims.

Binary coding oversimplifies complex causal factors.

Contributory factors are coded as 0/1, which does not reflect severity or degree of contribution and may flatten the nuances present in the investigation reports.

Given these issues, the conclusions must be framed more cautiously, emphasizing associations rather than causal implications.

2. Statistical Analysis

The statistical strategy-log transformation, VIF analysis, comparison of alternative models-is a good start. However, further rigor is required.

The absence of diagnostic tests prevents evaluation of OLS assumptions.

No robustness checks (e.g., alternative specifications, sensitivity analyses, cross-validation) are presented.

The comparison with Poisson and Negative Binomial models could be clarified, as pseudo-R2 measures are not directly comparable to adjusted R2 in continuous-outcome models.

The coding procedure would benefit from an assessment of inter-coder reliability or clearer coding rules.

Strengthening the statistical validation would significantly improve the paper.

3. Major Conceptual Issue: “Causation” vs “Severity”

The manuscript repeatedly refers to “accident causation” in the title, abstract, and narrative.

However, the dependent variable used in the statistical model is accident cost, which measures severity rather than causation.

Severity and causation are conceptually distinct: a factor can cause an accident without increasing its cost, and vice versa.

I recommend that the authors:

revise the title, abstract, and framing to reflect that the study analyzes severity (cost impacts), not causation;

avoid causal language (e.g., “determinants”, “drivers”);

consider a more accurate title such as:

“Assessing the Impact of Human Error Factors on Railway Accident Severity…”

This adjustment would substantially improve conceptual clarity and align the paper with its actual analytical content.

4. Additional Suggestions for Improvement

Expand the limitations section.

You should explicitly acknowledge:

- the limited generalizability of officially investigated accidents,

- potential biases in investigator-defined contributory factors,

- limitations arising from binary coding,

- the low explanatory power of the model.

Provide concrete examples of coding.

Including short excerpts from investigation reports illustrating how specific statements were mapped to categories C1 - C8 would greatly enhance transparency.

Consider additional covariates in future models.

Even a few basic controls (accident type, operator, year, infrastructure type) would improve model robustness.

Interpret findings with greater caution.

Some statements imply causality ("key determinants", "drivers"), whereas the study design supports associations only. Rephrasing would improve accuracy.

5. Ethical and Editorial Issue: Inconsistent Funding Disclosure

There is an inconsistency in the funding information that should be corrected.

In the submission metadata, the authors state that “the authors received no specific funding for this work”.

However, in the Acknowledgements section of the manuscript, the authors report support from the Korean Agency for Infrastructure Technology Advancement (KAIA), grant RS-2023-00239464.

This discrepancy should be corrected to ensure compliance with PLOS funding disclosure requirements. The funding source should be declared consistently in both the metadata and the manuscript.

6. Overall Evaluation

This is a promising manuscript on an important topic, and the dataset represents a valuable resource. However, methodological and analytical limitations need to be addressed before the results can be considered robust. Strengthening the statistical validation, expanding the limitations, and clarifying aspects of the coding scheme will enhance both the credibility and the contribution of the study.

Reviewer #3: The peer-review report and the language-editing corrections have been provided and submitted in a separate file. Please refer to the attached document for the full set of reviewer comments as well as detailed, line-by-line English language revisions and consistency fixes (spelling, punctuation, and terminology).

**********

what does this mean? ). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy .-->

Reviewer #1: No

Reviewer #2: Yes: Lorenzo FedeleLorenzo Fedele

Reviewer #3: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

To ensure your figures meet our technical requirements, please review our figure guidelines: https://journals.plos.org/plosone/s/figures

You may also use PLOS’s free figure tool, NAAS, to help you prepare publication quality figures: https://journals.plos.org/plosone/s/figures#loc-tools-for-figure-preparation.

NAAS will assess whether your figures meet our technical requirements by comparing each figure against our figure specifications.

Attachment

Submitted filename: PLOS_ONE_Review_Report.pdf

pone.0345753.s002.pdf (225.1KB, pdf)
PLoS One. 2026 Mar 27;21(3):e0345753. doi: 10.1371/journal.pone.0345753.r002

Author response to Decision Letter 1


28 Jan 2026

We have carefully revised the manuscript in accordance with all comments and requests from the Academic Editor and the reviewers, and we believe that all issues raised in the decision letter have now been fully addressed. Please refer to the accompanying “Response to Reviewers” document for a detailed, point-by-point explanation of how each comment was handled.

Attachment

Submitted filename: Response_to_Reviewers.docx

pone.0345753.s004.docx (41.5KB, docx)

Decision Letter 1

Qing-Chang Lu

10 Mar 2026

Assessing the impact of human error factors on railway accident severity: Evidence from accident investigation reports in Korea

PONE-D-25-51103R1

Dear Dr. Lee,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager®  and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact  and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support ..

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Qing-Chang Lu

Academic Editor

PLOS One

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions??>

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.-->

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #2: Yes

**********

Reviewer #1: The authors have well addressed all my comments, I appreciate the authors' efforts to improve the manuscript.

Reviewer #2: The authors have adequately addressed all the.

The manuscript has substantially improved in terms of clarity, methodological rigor, statistical analysis, and transparency.

I therefore consider the manuscript suitable for publication in its current form.

**********

what does this mean? ). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy .-->

Reviewer #1: Yes: Gen LiGen Li

Reviewer #2: Yes: Lorenzo FedeleLorenzo Fedele

**********

Acceptance letter

Qing-Chang Lu

PONE-D-25-51103R1

PLOS One

Dear Dr. Lee,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS One. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Qing-Chang Lu

Academic Editor

PLOS One

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Appendix.

    (DOCX)

    pone.0345753.s001.docx (17.9KB, docx)
    Attachment

    Submitted filename: PLOS_ONE_Review_Report.pdf

    pone.0345753.s002.pdf (225.1KB, pdf)
    Attachment

    Submitted filename: Response_to_Reviewers.docx

    pone.0345753.s004.docx (41.5KB, docx)

    Data Availability Statement

    All data and code files are available from the Zenodo repository at https://doi.org/10.5281/zenodo.18310486.


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES