Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2025 Nov 27;15:44281. doi: 10.1038/s41598-025-27812-1

Multifidelity digital twin for real-time monitoring of structural dynamics in aquaculture net cages

Eirini Katsidoniotaki 1,, Biao Su 2, Eleni Kelasidi 2, Themistoklis P Sapsis 1,
PMCID: PMC12722713  PMID: 41309898

Abstract

As the global population grows, ensuring sustainable food production has become critical. Marine aquaculture provides a sustainable and scalable source of protein; however, its continued expansion requires the development of novel technologies that enable remote management and autonomous operations. Digital twin technology emerges as a transformative tool for realizing this goal, yet its adoption remains limited. Fish net cages—flexible, floating structures—are critical but vulnerable components of aquaculture systems. Exposed to harsh and dynamic marine conditions, they experience substantial hydrodynamic loads that can cause structural damage leading to fish escapes, environmental impacts, and financial losses. We propose a multifidelity surrogate modeling framework for integration into a digital twin that enables real-time monitoring of net cage structural dynamics under stochastic marine conditions. At the core of the framework lies the nonlinear autoregressive Gaussian process method, which captures complex, nonlinear cross-correlations between models of varying fidelity. It combines low-fidelity simulation data with a limited set of high-fidelity field sensor measurements, which, although accurate, are costly and spatially sparse. The framework was validated at the SINTEF ACE fish farm in Norway, where the digital twin assimilates online metocean data to accurately predict net cage displacements and mooring line loads, closely matching field measurements. This approach is especially valuable in data-scarce environments, offering rapid predictions and real-time structural representation. Beyond monitoring, the developed digital twin enables proactive assessment of structural integrity and supports remote operations with unmanned underwater vehicles. Finally, we compare Gaussian processes and graph convolutional networks for predicting net cage deformation, demonstrating the superior ability of the latter to capture in complex structural behaviors.

Keywords: Multifidelity surrogate modeling, NARGP, Digital twin, Aquaculture net pen, Real-time monitoring, Graph convolutional networks

Subject terms: Statistics, Computational science

Introduction

As the global population grows and the effects of climate change escalate, concerns about future food security and human nutrition increase, creating an urgent demand to enhance food production13. Fish and fishery products have become vital in diets worldwide, particularly in developing countries4, but unsustainable fishing practices threaten natural fish populations5. Marine aquaculture emerges as a vital solution3, offering a sustainable protein source that meets the rising global food demands while serving as a response to climate change. Fish farms have lower carbon emissions6 compared to other animal proteins and support a diverse, resilient food system. Sustainable growth of aquaculture requires further reducing the carbon footprint of fish farms through strategic farm placement, efficient feed use, minimizing food waste and fish mortality rates, and the adoption of novel technologies to enhance autonomy in challenging fish farm operations. As the industry matures and operations move offshore to harsh and uncertain environments7, advancing real-time monitoring, operational planning, and decision-making practices is essential for ensuring efficiency, safety, cost-effectiveness, and sustainability8,9.

Digital twins have great success in various sectors, yet its adoption in aquaculture has been limited, partly because it is a comparatively young industry9. Digital twin technology can significantly advance aquaculture by enabling fish farm remote monitoring, autonomous operations, management, and predictive analytics8,9. Integrating real-time observations from in-situ monitoring equipment alongside realistic models within a digital twin framework can provide more detailed and accurate insights into the dynamics of fish farms. Implementing a digital twin typically involves several components8,9, such as assessing fish health and behavior10, tracking environmental conditions11, and monitoring the structural dynamics of fish net cages12. Our study primarily focuses on the latter component.

Fish net cages are flexible floating structures anchored to the seabed, serving as critical yet vulnerable components of aquaculture farms. Exposed to the harsh and dynamic marine environment, they experience significant loads and are at risk of damage that can lead to fish escapes, resulting in negative environmental impacts and substantial financial losses13. Influenced by waves and currents, the net cages are prone to large motions and deformations14 due to their elastic structure. These deformations reduce the available space for fish, causing decreased oxygen levels and increased stress15, which negatively impacts fish growth and survival, leading to higher mortality rates and significant food waste. The development of a digital twin capable to real-time monitor the net cage dynamics can serve as an early warning system for fish farm operators, helping to prevent significant reductions in cage volume. Additionally, a digital twin is essential for assessing the structural integrity and preventing potential catastrophic damages, while it can be used to facilitate the autonomous inspection and repair operations with unmanned underwater vehicles (UUVs) in fish farms. For example, when autonomously controlling a remotely operated vehicle (ROV) within the fish cage to assess potential damage, prior knowledge of the cage’s shape is crucial for effective path planning and collision avoidance with the net16.

Unlike more rigid marine structures such as floating wind turbine foundations or vessels, the elastic and complex geometry of aquaculture net cages–composed of millions of slender twines–poses unique challenges in predicting their dynamics17. Modeling these structures is a multifaceted task that involves solving sub-problems: one focuses on the interaction between the fluid and a rigid moored-floating structure, and the other addresses the flexible porous net. Accurately depicting the deformation of fish net cages requires detailed information on the displacements of each twine, necessitating extensive data and computational effort. While high-fidelity numerical models like CFD simulations provide accurate estimations of structural dynamics17, they demand vast computational resources, making them unsuitable for real-time forecasting or rapid hindcasting tasks in digital twins. Lower fidelity models1821 are more accessible and less computationally demanding but often fail to capture the real system accurately, particularly under non-linear effects from extreme waves and fluid-structure interactions, thus limiting their integration into digital twins. Conversely, field sensor measurements offer the most accurate depiction of net cage dynamics but are constrained by high sensor costs and the risk of data transmission loss12. Additionally, sensors typically provide localized information, resulting in significant spatial gaps or high sparsity. These challenges underscore the need for innovative approaches to effectively monitor structural dynamics in real-time.

Current research is focused on innovative models for precise real-time monitoring of net cage dynamics that can be integrated into digital twins. Su et al.14 introduced a real-time numerical simulation model that uses in-situ, real-time positioning sensor data to dynamically adjust model inputs–i.e., the magnitude and direction of the current–based on the difference between simulated and actual net positions. Building on this, Su et al. further developed their model into a physics-based digital twin12 that incorporates in-situ sensor data to accurately simulate real-time cage responses, net deformations, and mooring loads. Although successfully tested at a full-scale aquaculture site, this model faces challenges such as high computational costs and potential inaccuracies due to inconsistent or faulty sensor data caused by harsh environmental conditions, hardware limitations, and unexpected interference, making it less suited for real-time applications. These issues highlight the need for digital twins using data-driven and machine learning (ML)-based surrogate models22,23.

ML-based surrogate models for predicting the dynamic behavior of fish net cages remain an under-explored research area. Some studies utilized artificial neural networks (ANNs) for net damage detection2426, yet research on predicting the structural dynamics of aquaculture net pens remains sparse. Zhao et al.27 developed a backpropagation ANN with one hidden layer, using training data from LF numerical simulations, to correlate ocean waves with net cage structural responses, including maximum tension in mooring lines, minimum effective volume ratio of the cage, and maximum stress on the floating collar. Although their model outputs the volume ratio, it does not capture the precise topology of the deformed net under wave excitation. To the best of our knowledge, no surrogate models have been developed to predict the entire net pen deformation, which can be beneficial for preventing catastrophic events and supporting autonomous fish farm operations. This underscores the need for more advanced ML-based surrogate models, built on high-fidelity data, that can accurately predict the complex dynamics of flexible net pens. To support effective real-time monitoring, these models should be capable of integrating into on-the-fly digital twins, which means their computational demands need to be limited.

Creating surrogate models that accurately represent complex systems often encounters the challenge of acquiring high-fidelity, application-specific data. Multifidelity data assimilation28 involves methods that merge information about the same underlying truth obtained through multiple models or observation operators at different fidelity levels, resulting in a model that best represents the system’s state. Among these methods, optimal interpolation is a widely used data assimilation technique; however, it may exhibit large discrepancies with the truth29. To this end, multifidelity surrogate modeling30 aims to leverage machine learning methods with computationally inexpensive low-fidelity data and scarce high-fidelity data to accurately predict the quantities of interest. Despite their limitations, low-fidelity data can reveal system trends and patterns, providing valuable information to complement the limited HF data used for model training. Multifidelity methods3036 for constructing surrogate models aim to achieve effective data fusion from various fidelity levels, enabling strong generalization performance of data-driven models in regions where high-fidelity data are limited. This approach is particularly beneficial for various digital twin applications3740, enabling accurate, real-time system representation.

We propose a framework specifically designed for seamless integration into digital twins, enabling real-time monitoring and visualization of aquaculture net pen structural dynamics under dynamically evolving marine conditions. The framework maps site-specific environmental inputs–such as wave and current conditions–to structural responses of the net cages, including displacements and mooring line loads, offering a realistic representation of its physical behavior. Accurately capturing this behavior typically requires high-fidelity data, such as sensor measurements or other validated sources that closely reflect the physical system. However, in marine applications, such data are often scarce due to restricted access, harsh environmental conditions, and the high cost of deployment and maintenance. To address these limitations, the proposed framework incorporates a multifidelity modeling approach that combines broadly available, lower-fidelity simulation data with limited high-fidelity observations. While the low-fidelity data may lack precision, they are more readily accessible; the high-fidelity data, though sparse, serve as critical anchors for ensuring physical accuracy. At the core of the framework is a Nonlinear Autoregressive Gaussian Process (NARGP) surrogate model32, which captures complex, nonlinear, and space-dependent correlations between different fidelity levels. While NARGP has shown strong performance in benchmark problems, its application in real-world systems has remained largely unexplored–this study demonstrates its effectiveness in a full-scale, operational environment.

We validated the proposed framework using data from the SINTEF ACE fish farm, a full-scale facility in Norway for testing innovative aquaculture technologies. The low-fidelity data were generated through numerical simulations19, which model the dynamic response of net cages to environmental exchitation. High-fidelity data were obtained from field measurements collected over a two-month period (January–March 2020). Although the high-fidelity dataset spans only a limited timeframe, it captures a particularly harsh winter season characterized by strong currents and wave loads–conditions under which nonlinear structural responses are most pronounced. As discussed by the co-authors in a previous study12, this dataset likely captures biofouling effects, which are known to significantly alter the net’s hydrodynamic behavior and have been identified as a key factor contributing to the failure of numerical simulations to accurately reproduce observed net deformations. Our results show that the model successfully learns to map low-fidelity simulation outputs to realistic structural responses, even in this highly nonlinear regime. This highlights the framework’s capacity to capture complex dynamics. While broader data coverage would enhance model robustness, the current results establish a strong proof of concept. Moreover, the modular and lightweight architecture of the framework ensures that it can be efficiently retrained as additional data-including from future seasons or other sites–become available.

Due to its reduced algorithmic complexity and low computational cost, the proposed framework enables rapid predictions that are essential for real-time representation of the actual state of the system. This makes it well-suited for deployment in dynamic marine environments where fast decision-making is critical. In the operational phase, the digital twin continuously assimilates sensor data to estimate the current structural state of the net and detect anomalies in real time. It supports autonomous operations by enabling ROV path planning and forecasting short-term dynamics under changing environmental conditions. This capability was recently demonstrated in a full-scale industrial setting, where the same framework was successfully integrated into a closed-loop ROV path planning system to estimate the dynamic net shape in real time and avoid collisions during autonomous inspections16. Beyond real-time operations, the framework also supports broader lifecycle applications of digital twins in aquaculture. In the design phase, the digital twin serves as a virtual testbed for simulating alternative net cage and mooring system configurations, validating low-fidelity models, and optimizing design choices under diverse environmental conditions. This allows for early-stage identification of inefficiencies, structural weaknesses, and environmental impacts-offering a cost-effective means to iterate and refine designs before physical implementation, akin to digital twin-based smart manufacturing design workflows in Industry 4.0 and 5.0 contexts. During the configuration and reconfiguration phase, the digital twin can simulate alternative system layouts and mooring configurations under site-specific marine conditions, enabling rapid validation before implementation. In addition to semi-physical performance testing, the framework can support sensor placement optimization, evaluate risk under failure scenarios, and forecast environmental impacts, providing a practical tool for informed and adaptive farm reconfiguration. Figure 1 illustrates the full digital twin architecture, showcasing the multifidelity integration pipeline and its relevance across design, configuration, and operational phases. This framework significantly enhances efficiency, automation, safety, and cost savings. A key strength lies in its ability to operate with the existing sensing infrastructure already deployed at most aquaculture sites, minimizing the need for new instrumentation and associated costs.

Fig. 1.

Fig. 1

Real-time monitoring and remote fish farm management using digital twin technology. The structural response of a net cage to dynamic marine environments is modeled using data from varying fidelity sources, including low-fidelity numerical simulations and high-fidelity field sensor measurements. A digital twin of the physical net cage is developed using the proposed multifidelity framework, central to which is the Nonlinear Autoregressive Gaussian Process (NARGP) method32. This method synergistically combines low-fidelity model realizations with a limited set of high-fidelity observations. The nonlinear autoregressive scheme effectively learns complex nonlinear cross-correlations between datasets of differing fidelity, which is essential for practical applications. The multifidelity digital twin accurately predicts the structural response under dynamic marine conditions, closely mirroring real behavior. The digital twin facilitates remote fish farm management, serving as an early warning system to prevent catastrophic damage, informing decision-making, and supporting autonomous operations such as ROV path planning for net inspection and repair, net cleaning, and equipment installation.

In addition to the proposed multifidelity framework, we explore alternative machine learning methods for constructing surrogate models capable of predicting the full deformation of the net pen under interaction with waves and currents. In particular, we assess the effectiveness of Graph Convolutional Networks (GCNs)41,42. While GCNs have been increasingly applied across life sciences, physics, and materials research, our study highlights their potential for accurately capturing the complex spatial dependencies inherent in the structural dynamics of flexible marine structures43.

Results

Our objective is to develop a digital-twin framework that enables real-time monitoring and visualization of fish net cage dynamics under stochastic marine conditions, ensuring operational efficiency, structural safety, and environmental sustainability. At the core of this framework lies machine-learned surrogate modeling, which establishes the mapping between the marine environmental inputs and the dynamic response of the net cage. Accurate prediction of real-world behavior requires comprehensive, high-quality data; however such data is often limited or expensive to obtain. When only a small number of high-fidelity observations—such as direct sensor measurements—are available, constructing reliable models becomes challenging. Consequently, practitioners frequently rely on simplified numerical models that provide lower-fidelity but more accessible data, creating a gap between simulated outcomes and actual field observations. To enable real-time operation, the digital twin must be both data-efficient and computationally lightweight. Therefore, incorporating reduced-complexity surrogate models is essential to achieve rapid training and execution while maintaining predictive accuracy.

We propose a multifidelity framework that integrates data from sources of varying fidelity to learn the system dynamics and produce outputs that closely replicate real-world observations. At the core of this framework lies the NARGP method32, which refines low-fidelity numerical simulation outputs to align more accurately with high-fidelity sensor measurements. This is achieved by learning the nonlinear correlations between these two data fidelities through data-efficient multifidelity information fusion algorithms built upon standard Gaussian process (GP) regression. A detailed description of the NARGP formulation is provided in the Methods section. In the present case study, the model is implemented with two fidelity levels, though the framework is readily scalable to incorporate additional fidelity layers if needed. The trained NARGP model is embedded within the digital twin, following the workflow illustrated in Fig. 1. It receives real-time metocean data, where the first GP model, trained on low-fidelity numerical simulations, predicts the net cage dynamics equivalent to the simulation results. These low-fidelity predictions, together with the metocean inputs, are then passed to the recursive GP surrogate which has been trained using the posterior mean from the low-fidelity model and a limited set of high-fidelity measurements. This recursive model yields predictions that accurately capture the true net cage response observed in the field.

This multifidelity approach enables the construction of accurate and computationally efficient surrogates without relying exclusively on costly high-fidelity simulations. By exploiting the learned correlations across fidelity levels, the framework substantially reduces data and computational requirements while maintaining high predictive accuracy. Consequently, it extends the applicability of digital twin technology to large-scale or data-scarce aquaculture systems, where acquiring extensive high-fidelity measurements is often impractical.

Case study for SINTEF ACE fish farm

The multifidelity framework for real-time monitoring of structural dynamics is developed and validated using data from the SINTEF ACE facility–a full-scale research farm dedicated to the development and testing of innovative aquaculture technologies. All data used in this study were collected from the Buholmen site of SINTEF ACE, which hosts several operational fish cages. The analyzed cage has a diameter of 50 m, a depth of 31 m, a solidity ratio of 0.21, and it is secured to the mooring frame via 12 bridle (mooring) lines. As detailed in the SINTEF report44, environmental conditions at the Buholmen site have been characterized through continuous monitoring from January 2020 to October 2021. The average significant wave height is 0.47 m, with 84% of waves remaining below 0.8 m and a maximum recorded height of 1.84 m. The current profile is primarily bidirectional along the northeast–southwest axis, with current speeds remaining below 0.3 m/s for 80% of the time and exceeding 0.5 m/s only 2% of the time.

Data collection

See Fig. 2.

Fig. 2.

Fig. 2

Net cage numerical representation and field sensor configuration. (a) Schematic of the fish-net cage showing its main structural components: floating collar, bottom ring, net pen, and mooring lines. (b) Numerical discretization of the net cage in the FhSim environment into 321 nodes across 10 layers, with the first 32 nodes forming the uppermost layer and node 321 positioned at the base connecting all nodes of the tenth layer. (c) The simulated net cage under combined wave and current excitations. (d) Top view of the fish farm layout highlighting cage number 11, which accommodates the sensors. A buoy located 400 m away measures waves and currents. The main flow direction is indicated. (e) Close-up of net cage 11 showing sensor placement: five load shackles measuring mooring line tension and three depth sensors tracking net displacement. (f) Side view of the net cage illustrating the vertical distribution of depth sensors along the structure.

Low-fidelity data

FhSim is a mathematical modeling and numerical simulation software tailored for marine applications, providing time-domain representations of complex system dynamics. In this study, we employ FhSim to simulate the dynamic response of a full-scale aquaculture net cage subjected to a wide range of marine conditions. To generate a comprehensive dataset for training surrogate models, we performed simulations across 1000 distinct sea states. These conditions span current velocities from 0 to 1 m/s, significant wave height from 0 to 3 m, peak wave period from 0 to 8.66 s, and wave/current directions uniformly distributed over 0–360 degrees–covering a range that exceeds the typical conditions observed at the Buholmen site. As shown in Fig. 2a, the simulated cage replicates the geometry of the real structure. While the physical net consists of thousands of twines, numerical simplification is required for computational tractability. The cage is therefore discretized into 321 representative nodes (Fig. 2b). Each simulation models a 30-min interaction between waves, currents and the net cage, producing the x, y, and z displacements of all nodes (Fig. 2c) as well as the load profiles on the 12 mooring lines. From each simulation, we extract time-averaged values of these quantities to build a reduced dataset that characterizes the steady-state structural deformation of the net cage and loads under each sea condition.

FhSim has previously been used by Su et al.12 to simulate the dynamic response of net cages at the same fish farm studied here. In their work, they compared FhSim simulation results with real measurement data from the SINTEF ACE facility, which are also used as high-fidelity data in the present study. Their findings revealed that the direct FhSim output consistently underestimated net deformations. This discrepancy was partially attributed to biofouling effects that were captured in the field measurements, leading to increased net solidity and altered hydrodynamic behavior–phenomena not explicitly modeled in the numerical simulations. These findings reinforce our classification of FhSim as a low-fidelity model in this study and highlight the importance of incorporating high-fidelity data that reflect complex, real-world phenomena such as biofouling.

Additionally, despite being faster than high-fidelity numerical solvers (e.g. CFD simulations), FhSim simulations are not suitable for real-time deployment. On our hardware setup (56 CPU cores, 2.2 GHz, 128 GB RAM), each 30-min simulation required approximately 10 min of computational time, followed by an additional 20 min for processing and storing time-series outputs. These constraints reinforce the need for surrogate models capable of delivering rapid, accurate predictions for digital twin applications.

High-fidelity data

Field sensor measurements represent the high-fidelity data used in this study, capturing the actual dynamic response of the system. The monitoring campaign was conducted over a period of slightly less than two months, from January 18 to March 5, 2020-a time frame representative of relatively harsh winter marine conditions. As shown in Fig. 2d, a metocean buoy located approximately 400 meters from the fish farm was equipped with sensors to monitor environmental conditions, specifically recording incoming wave and current parameters. During this period, significant wave heights ranged from 0.3 to 1.6 m and the wave direction between 155 and 190 degrees (reflecting south-east to south-west directions). Current velocities ranged from 0.15 to 0.45 m/s, with directions between 60 and 126 degrees (north-east to south-east directions). Compared to long-term site conditions44, which are generally moderate with wave heights below 0.8 m for 84% of the time and current speeds under 0.3 m/s for 80% of the time, the short-term monitoring period (January 18–March 5, 2020) captured more energetic conditions. Wave heights reached up to 1.6 m and current speeds up to 0.45 m/s, representing some of the higher load scenarios observed at the site.

Instrumentation was deployed on cage number 11 at the SINTEF fish farm (Fig. 2d). Five load shackles (#1–5 in Fig. 2e) were installed on the upstream side, where the prevailing water current originated, between the collar and the mooring lines to measure the mooring loads. Additionally, three depth sensors were positioned at key locations within the net pen: at 7 m deep, at 15 m depth at the boundary of the cylindrical section, and at 31 m at the bottom of the net (#1–3 in Fig. 2e and f). These sensors measured vertical displacements and provided critical data on the net cage deformation.

Data collection was performed using a wireless sensor network, capturing measurements at 2-second intervals for high temporal resolution. Continuous data were obtained from the metocean buoy and the three depth sensors throughout the entire monitoring period. In contrast, load shackle data were available only for two isolated intervals: 12 h on January 23 and 24 h on February 22, illustrating the potential for intermittent data loss in field deployments. Further details on the sensor setup and measurement campaign can be found in Su et al.12.

Digital twin workflow

Figure 3a illustrates the digital twin workflow developed for the SINTEF ACE fish farm. A metocean buoy equipped with sensors provides real-time measurements of current speed and direction which serve as inputs to the NARGP multifidelity surrogate model. Initially, the low-fidelity GP models use the current data to predict the loads on the mooring lines and the overall deformation of the net cage under these conditions. To estimate the net cage displacement at specific locations, the user defines these locations (or nodes) on the net cage and obtains their vertical displacement. The low-fidelity predictions of mooring line force and node vertical displacement, along with the measured current speed and direction, are then fed into high-fidelity GP models. These models refine the low-fidelity predictions, providing outputs that closely match the real structural dynamics measured by load shackles and depth sensors. This workflow is completed in a few seconds due to the algorithmic simplicity and low computational cost of the NARGP method. The proposed digital twin enables real-time monitoring of net cage structural dynamics, supports autonomous operations, and informs decision-making without relying on real-time sensors from the net cage, as is currently the case12. This approach reduces the need for extensive sensor deployment, which is often expensive and unreliable due to issues like data loss.

Fig. 3.

Fig. 3

Multifidelity digital twin framework. (a) Workflow of the digital twin integrating multifidelity Gaussian Process (GP) models. Real-time measurements of current speed and direction from the metocean buoy are used as inputs to low-fidelity GP models that predict mooring line loads and net cage deformations. The low-fidelity predictions—represented by the GP posterior means—together with marine current measurements are subsequently passed to high-fidelity GP models, which refine these predictions to closely match load shackle and depth sensor observations. (b) Training of the low-fidelity GP models using data generated from FhSim simulations. One GP model maps marine current speed and direction to mooring line loads, while another predicts net cage deformations after dimensionality reduction via Principal Component Analysis (PCA). The PCA coefficients are predicted by the GP model and later reconstructed to obtain the net deformation field, enabling efficient surrogates that substitute full FhSim simulations. (c) Training of the high-fidelity GP models using multifidelity data. Limited field sensor measurements collected between January 18 and March 5, 2020—including current measurements from the metocean buoy, mooring loads from load shackles, and net displacements from depth sensors—are employed. The output of the low-fidelity model combined with the marine current measurements, serve as inputs to the high-fidelity models which yield refined predictions that accurately reproduce the measured mooring loads and vertical displacements at sensor locations.

Figure 3b shows the training stage of the low-fidelity GP surrogate models using data from FhSim simulations, described in the “Data Collection/Low-fidelity” section. Using the standard GP method41, we build surrogate models to map current speed and direction to the mooring line load. To create the surrogate model that maps ocean currents to net cage deformation, dimension reduction using PCA is required as a preliminary step, as detailed in the “Methods/Net cage displacement data dimension reduction using PCA” section. This is necessary due to the extensive data describing net cage deformation (x, y, z displacements for each of the 321 nodes). From the PCA dimension reduction step, we found that three PCA coefficients were sufficient and this allowed us to develop a standard GP model to map current characteristics to these PCA coefficients. Once the predicted coefficients are obtained, a reconstruction stage follows to determine the net cage deformation under the incoming measured current, as described in the “Methods/Machine learning functional relationships between current and PCA coefficients” section. Having the entire geometry of the deformed net cage allows us to extract the vertical displacement at the locations where the depth sensors are placed. The low-fidelity GP models developed at this stage can fully substitute FhSim simulations, with their accuracy well-documented by Katsidoniotaki et al.47. This ensures that if the load shackles and depth sensors change position or more sensors are added in the future, retraining the low-fidelity GP models will not be necessary.

Figure 3c outlines the steps for training the high-fidelity GP models following the NARGP method, as described in the “Methods/NARGP” section. The high-fidelity training data consist of field sensor measurements, including current measurements from the metocean buoy, mooring loads from load shackles, and net cage vertical displacements from depth sensors, as detailed in the “Data Collection/High-Fidelity” section. The current measurements are input into the low-fidelity GP models, and the predicted GP posterior mean, combined with the current measurements, serve as inputs for the high-fidelity models. The high-fidelity GP models are trained on a limited number of datasets obtained from the period January 18 to March 5, 2020, to predict quantities of interest that match the load shackle and depth sensor measurements. Specifically, while the depth sensor measurements cover the entire period, only a small number of datasets corresponding to a few minutes are used to train the model. For the load shackle measurements, available only for 36 h, datasets corresponding to a few seconds are used for training. This approach demonstrates how, with limited high-fidelity datasets, we can develop a multifidelity model that provides predictions representative of real observations.

Predicting net cage dynamics

Mooring line loads. Figure 4 showcases the proposed multifidelity framework’s proficiency in predicting mooring line loads under current conditions, as measured by the metocean buoy, and compares these predictions with field measurements obtained from load shackles. The figure presents predictions for load shackles #1 (top row), #2 (middle row), and #5 (bottom row), with similar patterns observed for other mooring lines. Validation was conducted using previously unseen data from the entire 36-hour load shackle measurement period, excluding the training datasets. The 2D histograms in Fig. 4 are scatter plots with a color scale depicting the distribution and density of prediction accuracy from the low-fidelity and multifidelity GP models. The color scale indicates the log count density of data points, with lighter colors representing areas with a higher concentration of data points. Proximity to the dashed diagonal line (Inline graphic) signifies prediction accuracy, while the red line represents the best-fit trend. The scatter plots for the low-fidelity GP model (Fig. 4a, d, g) reveal a broader spread and a significant deviation of the red best-fit line from the dashed diagonal line, indicating less accurate predictions. Conversely, the multifidelity GP model plots (Fig. 4b, e, h) exhibit a higher concentration of lighter colors near the dashed line, with the red best-fit line aligning more closely with the diagonal, indicating improved prediction accuracy. The color gradient helps identify trends and outliers; consistent, lighter colors along the dashed line suggest the model captures the overall trend well, whereas significant deviations indicate less accurate predictions. The Mean Absolute Error (MAE) quantifies prediction accuracy, calculated as the average of the absolute differences between the predicted and actual load measurements. The multifidelity model demonstrates a significant improvement in MAE values compared to the low-fidelity model. Figure 4 c, f, i provides a temporal view, displaying predictions over time based on the validation data from the 36-hour measurement period. The low-fidelity GP posterior mean deviates notably from the actual measurements, whereas the multifidelity GP posterior aligns closely with the real observations. The temporal plots reveal an oscillatory behavior in the load shackle measurements not fully captured by the multifidelity model, explaining some deviations from the dashed line in the middle column’s scatter plot. Nonetheless, the multifidelity model accurately follows the overall trend of the observations.

Fig. 4.

Fig. 4

Multifidelity framework predictions for mooring line loads. Scatter plots (a), (d), (g) compare the low-fidelity GP posterior mean predictions with load shackle measurements. The color scale represents the logarithmic density of data points, with lighter colors indicating higher concentrations. The dashed diagonal line denotes perfect prediction accuracy; the red line represents the best-fit trend. Low-fidelity GP predictions have broad spread and the red line presents noticeable deviation from the diagonal, indicating reduced accuracy of the low-fidelity GP model. Scatter plots (b), (e), (h) show the multifidelity GP model predictions against sensor measurements. The higher concentration of lighter points near the diagonal and the close alignment of the red line with it demonstrate the improved accuracy achieved through the multifidelity framework. Plots (c), (f), (i) display time-series predictions over a 36-hour period on test data. The low-fidelity GP posterior mean shows significant deviation from the measured loads, whereas the multifidelity GP predictions closely follow the observed trends, accurately capturing the temporal evolution despite minor deviations in oscillatory behavior. The Mean Absolute Error (MAE) values confirm a substantial improvement in predictive accuracy with the multifidelity model compared to the low-fidelity baseline.

Vertical displacement at specific points within the net cage. Figure 5 illustrates the predictive performance of the multifidelity framework in estimating net cage displacement at three distinct locations, as measured by depth sensors #1 (top row), #2 (middle row), and #3 (bottom row). The validation dataset spans from January 18 to March 5, 2020, excluding the training periods. Figure 5a, d, g presents scatter plots contrasting the predictions of the low-fidelity GP model with actual sensor measurements. The broad dispersion of data points and the notable deviation of the red best-fit line from the dashed diagonal line underscore the model’s limited accuracy. In contrast, Fig. 5b, e, h displays scatter plots for the multifidelity GP model. Here, data points cluster more closely around the dashed line, and the red best-fit line aligns more precisely with the diagonal, indicating enhanced predictive accuracy. Despite the inherent complexities and dynamic motion of the flexible net cage, the multifidelity model’s predictions, though still dispersed, show improved accuracy, as evidenced by the color scale: darker points represent outliers, while lighter points closer to the dashed line signify higher accuracy. The MAE values annotated on these plots quantitatively highlight the multifidelity model’s superiority over the low-fidelity model. Figure 5c, f, i provides a temporal comparison of predicted and actual net cage displacements over a specific period (January 18 to January 27, 2020), juxtaposing the low-fidelity GP and multifidelity GP posterior means against the actual sensor data. The low-fidelity GP model (blue line) exhibits substantial deviations from the actual measurements (green line), particularly for depth sensors #1 and #3. Conversely, the multifidelity GP model (red line) significantly mitigates these discrepancies, closely mirroring the actual measurements and capturing the underlying trend with greater fidelity.

Fig. 5.

Fig. 5

Multifidelity framework predictions for net cage displacement. Scatter plots (a), (d), (g) compare the low-fidelity GP model prediction with depth sensor measurements. The broad dispersion of data points and the deviation of the red best-fit line from the dashed diagonal line highlight the limited predictive accuracy of the low-fidelity model. The color scale represents the logarithmic density of data points, with lighter shades indicating higher concentrations. Scatter plots (b), (e), (h) display the multifidelity GP model predictions. Data points cluster more closely around the diagonal line, and the red best-fit line aligns with it, indicating a marked improvement in predictive accuracy. Despite the inherent complexity and dynamic motion of the flexible net cage, the multifidelity framework delivers significantly enhanced performance. Darker points represent outliers, while lighter regions correspond to higher accuracy. The MAE values annotated on these plots quantitatively highlight the multifidelity model’s superiority over the low-fidelity model. Plots (c), (f), (i) show the temporal comparison of predicted and actual net cage displacements over a specific period (January 18 to January 27, 2020). The low-fidelity GP model (blue line) exhibits substantial deviations from the actual measurements (green line), particularly for depth sensors #1 and #3. The multifidelity GP model (red line) significantly mitigates these discrepancies, closely mirroring the actual measurements and capturing the underlying trend with greater fidelity.

Flexible net cage deformation via surrogate modeling

Fish farm net cages are flexible structures that tend to follow rather than resist water motions, therefore they present significant deformations. These deformations can impact fish survival and the structural integrity of the cages, with any failure potentially leading to fish escapes. This underscores the importance of monitoring net cage deformations to maintain a comprehensive understanding of fish farm conditions. Due to their inherent flexibility, monitoring the entire deformation of net cages using available sensors is challenging. Depth sensors can be mounted strategically on the net structure thereby obtaining a 3D position of these points in the net, which in turn are used to extrapolate the full net structure using mathematical models9. A similar approach was implemented by Su et al.12 for the development of a digital twin for real-time monitoring of aquaculture net cage systems, where depth sensor data were assimilated into FhSim simulations to represent the actual net cage system. Input properties for the simulations were adapted to ensure the numerical simulation outputs fit the sensor-measured values. However, this approach depends on sensors transmitting position information, with sensors being expensive and prone to data transmission loss. Additionally, it requires substantial computational time to process the data and calculate the correct net cage deformation. These reasons make this approach less suitable for real-time monitoring and immediate assessment of the system’s state.

Our study previously introduced the multifidelity digital twin, which accurately reflects the real net cage response to the dynamic marine environment. As a result, this digital twin eliminates the reliance on real-time data measured by sensors placed in the cage. Additionally, as illustrated in Fig. 3b, we developed surrogate models, using the standard GP method with a prior PCA, that map the net cage deformation with the currents. The accuracy of these models, previously presented by Katsidoniotaki et al.47, suggests that they can replace FhSim simulations and significantly reduce computational costs in digital twin applications. However, surrogate models inherently introduce some degree of prediction error, depending on the method used to build them. This error can impact autonomous operations or decision-making where good accuracy is required. In this study, we compare the GP-PCA surrogate model with a surrogate model built using GCNs to predict net cage deformation topology under varying sea conditions. We chose the GCN since the net cage resembles graph-structured data. The cage is discretized into 321 nodes, which are connected in the manner shown in Fig. 2b. The GCN receives node features and the topology of their connections and can predict the displacement of each node under dynamic marine conditions. In the section “Methods/Graph convolutional networks for net cage deformation,” further details about the GCN model are provided. Figure 6 compares the net pen deformations predicted by the GCN model (in red) with those obtained from high-fidelity FhSim simulations (in black), under two representative sea conditions: mild (test case 801) and harsh (test case 301).

Fig. 6.

Fig. 6

Predicted net pen deformations using Graph Convolutional Networks (GCNs) are compared against numerical simulations from FhSim under two representative current excitation scenarios: harsh conditions (test case 301) and mild conditions (test case 801). Black lines indicate the numerical solution, while red lines represent the GCN predictions.

To evaluate the prediction error of each method, we consider the example of autonomous navigation of Unmanned Underwater Vehicles (UUVs), which require real-time information about the surrounding environment where they navigate. During autonomous operations, a UUV such as a Remotely Operated Vehicle (ROV) should maintain a distance of at least 1 meter from the net cage to ensure collision-free motion. Therefore, it is imperative to keep the prediction error regarding net cage deformation within acceptable limits for safe operations. The MAE metric was employed to quantify the accuracy of net cage deformation predictions across three spatial directions over a range of test indices. As depicted in Fig. 7, the majority of data points for both methods lie below the red line at MAE = 1 meter, indicating a satisfactory level of accuracy for most test indices. Notably, higher MAE values are observed in test cases involving higher current velocities, where the net cage exhibits larger deformations, highlighting the challenges in modeling more extreme conditions. As summarized in the final subplot in Fig. 7, the GP-PCA model demonstrates higher MAE values, suggesting lower prediction accuracy compared to the GCN model. The wider spread of the GP-PCA error bars in the x and z dimensions notably suggests greater variability in its predictions. This greater error in the GP-PCA model may be attributed to dual sources of error: the PCA dimension reduction, which entails some loss of information, and the inherent errors in the GP model itself.

Fig. 7.

Fig. 7

Machine-learned surrogate models for predicting net cage deformation. Evaluation of prediction accuracy for net pen deformations using the Mean Absolute Error (MAE) metric, comparing the predicted and simulated displacements of each node in the x, y, and z directions. Two surrogate modeling approaches are assessed: (i) a Gaussian Process (GP) surrogate model with prior dimensionality reduction via Principal Component Analysis (PCA), and (ii) a Graph Convolutional Network (GCN). The dashed horizontal line at Inline graphic indicates a critical threshold beyond which prediction errors may adversely affect downstream tasks such as unmanned underwater vehicle (UUV) control. The models are evaluated across 100 test scenarios. The final subplot summarizes the average MAE for each axis, with error bars denoting variability across the test set.

Discussion

In this study, we present a framework that leverages multifidelity surrogate modeling for integration into a digital twin designed to enable real-time monitoring of structural dynamics in aquaculture systems. Unlike traditional digital twins that rely on physics-based simulations–requiring substantial computational resources and thus being suboptimal for real-time use–our approach provides several key advantages. The proposed framework operates at a fraction of the computational cost during both training and deployment, enabling on-the-fly predictions and rapid updates. A major challenge in real-world digital twin implementations is the limited availability and access to real-world data required for accurate surrogate modeling. By seamlessly integrating low-fidelity numerical simulations with high-fidelity field sensor measurements, our framework bridges the gap between modeled and observed dynamics. This integration enhances the predictive accuracy and reliability, making the system particularly suitable for real-time monitoring where sensor data is scarce or costly to obtain.

The NARGP method, which forms the core of our framework, is part of a broader class of multifidelity information-fusion techniques. Its nonlinear autoregressive structure enables effective learning of complex, nonlinear correlations between data sources of varying fidelity. While NARGP has demonstrated strong performance in benchmark studies, its application to full-scale, real-world systems has been limited. Here, we demonstrate its practical effectiveness by rapidly and efficiently correcting low-fidelity model outputs to align with field observations, achieving high prediction accuracy at low computational cost.

The framework was successfully validated using data from the SINTEF ACE fish farm, combining low-fidelity numerical simulation results with high-fidelity sensor measurements. For specific locations along the net pen, discrepancies between simulated and measured displacements were observed—primarily due to the complex, nonlinear hydrodynamic interactions between the flexible net and the surrounding wave–current environment, which are difficult to fully capture numerically. The NARGP method effectively compensates for these limitations, substantially reducing prediction errors and yielding results that closely match field measurements. Our framework was developed and validated using high-fidelity data collected from existing sensors over a two-month period during the winter season, which is typically characterized by intensified wave activity, stronger currents, and the presence of biofouling effects. These conditions are especially relevant for assessing the model robustness under the nonlinear dynamics most critical for structural integrity and failure prediction. Although the dataset does not yet encompass the full range of seasonal variability, the modular design of the framework allows straightforward retraining and expansion as new high-fidelity data become available-whether from additional seasons, years, or locations. This flexibility enables continuous refinement and adaptation of the digital twin to evolving environmental conditions, including long-term changes such as increased storm frequency or biofouling progression. The primary objective of this study is to demonstrate and validate the methodology, highlighting the efficacy of the multifidelity approach in combining sparse, high-cost sensor data with abundant low-fidelity simulations. Rather than constructing a fully seasonal model, we establish a proof of concept under harsh environmental conditions where modeling accuracy is most demanding. The demonstrated performance under such challenging scenarios provides a solid foundation for future extensions toward broader temporal coverage and long-term operational deployment.

Adoption of digital twin technologies in aquaculture requires not only accurate modeling but also practical considerations, such as cost, ease of deployment, and operational simplicity. Current monitoring practices rely heavily on manual ROV inspections, diver operations, and vessel support. These approaches are labor-intensive, weather-dependent, and costly, requiring skilled pilots and coordination across teams. These approaches are largely reactive rather than predictive and become increasingly unsustainable as farms move to more exposed offshore sites. In contrast, our framework reduces these barriers by utilizing existing sensor infrastructure and minimizing the need for additional instrumentation or manual interventions. The core machine learning model operates passively in the background, requiring no specialized tuning during operation; users interact only with interpretable outputs, such as net deformation fields or anomaly alerts. This framework has been validated in an operational setting at a full-scale fish farm, where our digital twin was integrated with an autonomous ROV for real-time shape estimation of the flexible net pen, enabling safe, closed-loop path planning under dynamic marine conditions45. This advancement supports autonomous operations in fish farms, such as inspection of nets and mooring lines for damages, irregularities, biofouling conditions, as well as net cleaning46. The proposed digital twin is therefore not only technically robust but also aligned with industry workflows, offering significant reductions in costs and operational risk compared with traditional monitoring systems.

Deformation of aquaculture net pens due to wave-current interaction plays a critical role in fish welfare, structural integrity, and autonomous operation safety. Thus, developing efficient and accurate models for predicting these deformations is a core objective of this work. Here, we adopted a GPR-PCA approach to construct the surrogate model capable of predicting average net deformation across diverse marine conditions. As described in our previous work47, the GPR-PCA approach reconstructs net pen deformations with accuracy comparable to full-scale numerical simulations, while drastically reducing computational cost. Dimensionality reduction via PCA allows efficient handling of high-dimensional data describing net displacements, though it may smooth out localized deformation features in highly dynamic regions. To assess these limits, we further explored a GCN approach, which naturally operates on the graph-structured representation of the cage, capturing spatial relationships through localized message passing. This enables the model to capture fine-scale spatial features–particularly in highly dynamic regions such as the lower layers of the net–where precise shape estimation is critical. While the GCN achieves higher prediction accuracy, the GP-PCA surrogate was chosen as the operational model due to its direct compatibility with the NARGP framework, which is built upon Gaussian Process regression. The GCN model is included as a benchmark to illustrate the potential of graph-based learning for structural dynamics and to motivate future exploration of graph neural operators in this domain.

In conclusion, this study introduces a machine learning–enabled digital twin framework for marine aquaculture built on multifidelity data assimilation, advancing predictive modeling of complex marine structures. While the current focus is on net cage deformation and mooring line loads, the underlying framework is general and applicable to other dynamic variables of interest. The digital twin offers tangible benefits across the full system lifecycle. In the design phase, it enables virtual prototyping, model validation, and early-stage performance assessment, facilitating cost-effective planning and structural optimization under variable marine conditions. During system configuration and reconfiguration, it supports semi-physical simulations for mooring layout design, sensor placement, and operational optimization without requiring full physical trials. In the operational phase, the framework delivers real-time monitoring, anomaly detection, and supports autonomous ROV-based inspections and net cleaning, as demonstrated in a full-scale offshore deployment. Collectively, these advancements are essential for the industry’s expansion to meet rising food demands and secure sustainable food supply.

Methods

Nonlinear autoregressive GP regression (NARGP)

The nonlinear autoregressive GP regression denoted by NARGP32 is a class of multifidelity nonlinear information fusion algorithms that enables accurate inference of quantities of interest by synergistically combining realizations of low-fidelity models with a small set of high-fidelity observations, capable of learning complex nonlinear and space-dependent cross-correlations between models of variable fidelity. Our research adopts the NARGP method in response to the observed nonlinear dynamics between our low and high-fidelity datasets.

We have s levels of information sources (fidelities) producing outputs Inline graphic, at locations Inline graphic, with Inline graphic. In our application we have two fidelity levels, Inline graphic, and two input parameters, Inline graphic, i.e. current velocity and direction. We can organize the available datasets by increasing fidelity as Inline graphic, while the datasets have nested structure, i.e. Inline graphic. This assumption implies that the training inputs of the higher fidelity level need to be a subset of the training inputs of the lower fidelity level. In NARGP32 method, the nonlinear autoregressive scheme reads as:

graphic file with name d33e841.gif 1

where Inline graphic is the GP modeling the data at fidelity level t, the Inline graphic is the GP posterior from the previous inference level Inline graphic, and the function Inline graphic is modeled as a Gaussian prior, Inline graphic, with a covariance kernel that composes as

graphic file with name d33e870.gif 2

The structure of the kernel Inline graphic reveals the effect of the deep representation encoded in Eq. 1. The Inline graphic projects the lower fidelity posterior Inline graphic onto a Inline graphic-dimensional latent manifold, that jointly relates the input space and the outputs of the lower fidelity level to the output of the higher fidelity model, from which we can infer a smooth mapping that recovers the high-fidelity response Inline graphic. This allows to capture general nonlinear, non-functional and space-dependent cross-correlations between low- and high-fidelity data. In Eq. 2, Inline graphic, Inline graphic, and Inline graphic are valid covariance functions and Inline graphic denote their hyperparamters, which can be easily learnt from the data Inline graphic via the maximum likelihood estimation procedure followed by the standard GP41using the kernel Inline graphic. This approach requires the estimation of (Inline graphic) hyperparameters assuming that all kernels account for directional anisotropy in each dimension using automatic relevance determination (ARD) weights. The kernel functions are chosen to have the squared exponential form with ARD weights, i.e.

graphic file with name d33e936.gif 3

where Inline graphic is a variance parameters and Inline graphic are the ARD weights corresponding to fidelity level t. These weights allow for a continues ’blend’ of the contributions of each individual dimension in Inline graphic as well as the posterior predictions of the previous fidelity level Inline graphic, and they are learnt directly from the data when inferring Inline graphic.

The first level of the proposed recursive scheme corresponds to a standard GP regression problem trained on the lowest fidelity data Inline graphic, and therefore, the predictive posterior distribution is defined by standard’s Gaussian mean and covariance using the kernel function Inline graphic. Figure 3 provides a summary of the NARGP workflow as implemented in this study.

Net cage displacement data dimension reduction using PCA

PCA is a statistical technique used in data analysis for dimensionality reduction. It identifies the directions (or principal components) that maximize the variance in a dataset, thereby preserving essential information with fewer variables. In our study, we apply PCA to the dataset obtained from FhSim numerical simulations, which provide the Inline graphic, Inline graphic, Inline graphic displacements of the nodes into which the net cage is discretized, where Inline graphic. We have this data for each of the Inline graphic environmental conditions examined. Specifically, for each environmental condition, FhSim simulates the net cage interaction with waves and currents over a 30-min period, outputting the Inline graphic, Inline graphic, Inline graphic node displacements. The 30-min simulation time for all the Inline graphic environmental conditions corresponds to a total of Inline graphic time steps. All the available data from these simulations are integrated into the matrix Inline graphic.

graphic file with name d33e1028.gif

Using this data matrix, we perform eigenanalysis to calculate the eigenvectors, Inline graphic, and eigenvalues,Inline graphic, respectively.

graphic file with name d33e1040.gif 4

The eigenvalues and their corresponding eigenvectors are sorted in descending order of the eigenvalues. The eigenvector with the highest eigenvalue is the first principal component of the dataset and accounts for the highest variance, with each subsequent component capturing progressively less variance than its predecessor. We determine the number of principal components to retain based on the cumulative explained variance ratio, which translates into how many components k we should keep out of the c total components to better describe the data. This sum yields the total proportion of the dataset’s variance that is accounted for by the first k principal components.

graphic file with name d33e1055.gif 5

A common approach is to choose the smallest possible number of principal components, k, that explain a substantial portion of the variance. However, the choice of k is based on a predetermined threshold which reflects the desired level of variance explanation: Inline graphic. In our case, setting the threshold at 93% results in the retention of k = 3 principal components.

Next, we project the original data, Inline graphic, onto the selected principal components to transform them into the new feature subspace, by multiplying the original matrix by the first k selected eigenvectors, to get the Inline graphic matrix of the transformed data which can be referred as PCA coefficients.

graphic file with name d33e1086.gif 6

The resulting Inline graphic matrix consists of 3 rows, each corresponding to one of the principal components, and Inline graphic columns each one corresponds to a time step state of the net cage deformation. Since we are interested in estimating the average shape of the net cage deformation for each of the Inline graphic scenarios, we estimate the mean principle component for each scenario, resulting in a matrix Inline graphic, which can be referred as mean PCA coefficients. This matrix has Inline graphic rows and Inline graphic columns.

graphic file with name d33e1116.gif

To reconstruct the average net cage deformation for each of the total Inline graphic environmental conditions, we multiple the eigenvector Inline graphic with the principal components.

graphic file with name d33e1129.gif 7

The matrix that shows the average net cage displacement for each of the examined environmental conditions is:

graphic file with name d33e1134.gif

Machine learning functional relationships between current and PCA coefficients

To develop the low-fidelity component of the NARG methodology, which predicts net cage deformation that corresponds to FhSim solution, we create a standard GP model to learn the functional relationship between current speed and the net cage deformation. Specifically, the GP model maps the current characteristics to the PCA coefficients, Inline graphic, Inline graphic, Inline graphic. Then, we utilize the inner sum of Equation 7 (Inline graphic) to reconstruct the net cage deformation for an individual environmental condition. In other words, we obtain the predicted Inline graphic, Inline graphic, Inline graphic, where Inline graphic. Similar approaches–incorporating PCA for dimensionality reduction and the development of surrogate models using machine learning methods–have been widely applied across various disciplines due to their efficacy in feature extraction and data simplification37,4851. In our case, performing the dimension reduction step helps reduce the complexity of the data describing the net cage deformation, making it easier to use machine learning methods to create the mapping.

Graph convolutional networks for net cage deformation

Graph neural networks (GNNs) are mathematical models that can learn functions over graphs and are a learning approach for building predictive models on graph-structured data. Graphs differ from regular data in that they have a structure that neural networks must respect. Among the various types of GNNs, the graph convolutional networks (GCNs) have emerged as the most broadly applied model. GCNs are innovative due to their ability generalize the operation of convolutional neural networks to graphs, enabling the network to learn from the graph’s topology and node features to make predictions.

Representing net cage graph. The net cage of our application can be viewed as graph-structured data, discretized into 321 nodes.We build a GCN model for the net cage based on the work of Kipf and Welling42. The net cage can be viewed as a graph G consisting of Inline graphic nodes connected through the edges, where Inline graphic and Inline graphic represent nodes i and j. The edge features are represented by the adjacency matrix Inline graphic which encodes connections between the N nodes, represents the graph connectivity which is a square matrix where each element Inline graphic specifies the presence or absence of an edge from node i to node j in the graph. In other words, a non-zero element Inline graphic implies a connection from node i to node j, and a zero indicates no direct connection. In our net cage application, the adjacent matrix Inline graphic captures the node connection shown in Fig. 2b; the net cage consists of 10 layers, each with 32 nodes arranged in a circular pattern. Nodes within each layer are interconnected to form a circle, and each node is vertically connected to the node directly below it in the subsequent layer. This creates a consistent connection pattern between layers. Additionally, the final node (node 321) is connected to all nodes in the 10th layer, providing a central connection point. The Inline graphic is the node feature matrix with F being the number of the feature assigned to each node. In our case, the Inline graphic contains the features for each node which in our case are the Inline graphic, Inline graphic, Inline graphic coordinates of each node at rest and the sea conditions (i.e. current speed and direction). The graph is supposed to predict the matrix Inline graphic, where O is the number of output features per node. The matrix Inline graphic gives the x, y, z coordinates of each node under the interaction of the cage with the sea condition.

Message aggregation and update. The GCN layers are based on the formulation by Kipf and Welling (2017)42. At each layer, the graph convolution takes both the adjacency matrix A and the node features from the previous layer Inline graphic and outputs the node features for the next layer Inline graphic, with Inline graphic and Inline graphic are the node features dimensions for layers l and Inline graphic, while Inline graphic the input node feature matrix, following the formulation:

graphic file with name d33e1340.gif 8

where Inline graphic is the activation function, Inline graphic is the adjacency matrix with added self-connections represented by the identity matrix Inline graphic, Inline graphic is the diagonal degree matrix of Inline graphic, and Inline graphic is a trainable weight matrix for layer Inline graphic. In our application, the data passes through three graph convolution layers. Each layer updates the node features by aggregating information from its neighbor nodes according to the graph structure. The final output represents the predicted node features which are the predicted displacements (x, y, z) for each node. Having several layers enables to further refine the node features by aggregating information from the graph structure and learning mode complex patterns. The deformation of net-cage nodes is a highly non-linear process influenced, the ability of Swish activation function to introduce non-linearities is particularly beneficial, which is defined as:

graphic file with name d33e1384.gif 9

Model training. A notable point in our model is the implementation of a custom loss function that assigns different weights to the errors of each node by emphasizing regions of the net-cage that are more prone to displacement. At the node weights initialization stage nodes within layers 6 to 10, which are observed to undergo significant deformation, are assigned increased weights, with the highest weight allocated to the 321st node. The loss function computes the mean squared error (MSE) for each nodes prediction:

graphic file with name d33e1391.gif 10

given Inline graphic: Predicted displacements (matrix of shape Inline graphic), Inline graphic: True displacements (matrix of shape Inline graphic), Inline graphic: Node weights (vector of length N). For the training process we use backpropagation to compute gradients and gradient descent optimiser to update the parameters for each layer.

Acknowledgements

This work was supported by the Knut and Alice Wallenberg Foundation Postdoctoral Scholarship Program at MIT - KAW 2022.0334, the Office of Naval Research (Grant no N00014-21-1-2357), by RACE internal funding (SINTEF Ocean AS) and by the Research Council of Norway (RCN) project: CHANGE (no. 313737). The experimental data utilized in this paper for validation of the framework were obtained from RACE internal funding and FLEXAQUA (no. 284361) project funded from RCN.

Author contributions

All authors conceived the idea. E.Kat. and T.P.S developed the methodology for the multifidelity digital twin framework. E.Kat. analysed the data, developed the code and implemented the framework. B.S. conducted the FhSim simulations. B.S. and E.Kel. provided the field sensor measurements. E.Kat. and T.P.S. analysed the results. B.S. and E.Kel. suggested applications for the digital twin’s practical implementation. E.Kat. wrote the original manuscript, with all authors contributing to editing and reviewing the manuscript.

Data availability

Code will become available to a dedicated depository after the paper is accepted. Correspondence and requests for code materials should be addressed to Eirini Katsidoniotaki or Themistoklis P. Sapsis. The data that support the findings of this study are available from SINTEF Ocean AS but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of SINTEF Ocean AS.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Eirini Katsidoniotaki, Email: eirka289@mit.edu.

Themistoklis P. Sapsis, Email: sapsis@mit.edu

References

  • 1.Bergenas, J. The other global food crisis. Foreign Policy. Available online: https://foreignpolicy.com/2023/09/19/global-food-crisis-fishing-blue-foods-conflict-water-ocean-climate-resources/, accessed on [May 28, 2025] (2023).
  • 2.United Nations. The World’s Food Supply is Made Insecure by Climate Change. Available online: https://www.un.org/en/academic-impact/worlds-food-supply-made-insecure-climate-change/, accessed on [May 28, 2025].
  • 3.World Economic Forum. Aquaculture: why the world needs a new wave of food production. Available online: https://www.weforum.org/agenda/2022/01/aquaculture-agriculture-food-systems/, accessed on [May 28, 2025] (2022).
  • 4.Han, K., Leem, K., Choi, Y. R. & Chung, K. What drives a country’s fish consumption? Market growth phase and the causal relations among fish consumption, production and income growth. Fish. Res.254, 106435. 10.1016/j.fishres.2022.106435 (2022). [Google Scholar]
  • 5.World Wildlife Fund. Overfishing. Available online: https://www.worldwildlife.org/threats/overfishing/, accessed on [May 28, 2025].
  • 6.Gephart, J. A. et al. Environmental performance of blue foods. Nature597(7876), 360–365. 10.1038/s41586-021-03889-2 (2021). [DOI] [PubMed] [Google Scholar]
  • 7.Bjelland, H. V., Føre, M., Lader, P., Kristiansen, D., Holmen, I. M., Fredheim, A., ... & Schjølberg, I. Exposed aquaculture in Norway. In OCEANS 2015 - MTS/IEEE Washington (pp. 1–10). IEEE. 10.23919/OCEANS.2015.7404486 (2015).
  • 8.Ubina, N. A. et al. Digital twin-based intelligent fish farming with Artificial Intelligence Internet of Things (AIoT). Smart Agric. Technol.5, 100285. 10.1016/j.atech.2023.100285 (2023).
  • 9.Føre, M. et al. Digital Twins in intensive aquaculture-challenges, opportunities and future prospects. Comput. Electron. Agric.218, 108676. 10.1016/j.compag.2024.108676 (2024). [Google Scholar]
  • 10.Saad, A., Su, B. & Bjørnson, F. O. A web-based platform for efficient and robust simulation of aquaculture systems using integrated intelligent agents. Procedia Comput. Sci.225, 4560–4569. 10.1016/j.procs.2023.10.454 (2023). [Google Scholar]
  • 11.Huan, J., Li, H., Li, M. & Chen, B. Prediction of dissolved oxygen in aquaculture based on gradient boosting decision tree and long short-term memory network: A study of Chang Zhou fishery demonstration base, China. Comput. Electron. Agric.175, 105530. 10.1016/j.compag.2020.105530 (2020). [Google Scholar]
  • 12.Su, B. et al. Towards a holistic digital twin solution for real-time monitoring of aquaculture net cage systems. Marine Struct.91, 103469. 10.1016/j.marstruc.2023.103469 (2023). [Google Scholar]
  • 13.Føre, H. M. & Thorvaldsen, T. Causal analysis of escape of Atlantic salmon and rainbow trout from Norwegian fish farms during 2010–2018. Aquaculture532, 736002. 10.1016/j.aquaculture.2020.736002 (2021). [Google Scholar]
  • 14.Su, B. et al. An integrated approach for monitoring structural deformation of aquaculture net cages. Ocean Eng.219, 108424. 10.1016/j.oceaneng.2020.108424 (2021). [Google Scholar]
  • 15.Turnbull, J., Bell, A., Adams, C., Bron, J. & Huntingford, F. Stocking density and welfare of cage farmed Atlantic salmon: Application of a multivariate analysis. Aquaculture243(1–4), 121–132. 10.1016/j.aquaculture.2004.09.022 (2005). [Google Scholar]
  • 16.Amundsen, H. B., Xanthidis, M., Føre, M., Ohrem, S. J., & Kelasidi, E. Aquaculture field robotics: Applications, lessons learned and future prospects. arXiv preprint arXiv:2404.12995 (2024).
  • 17.Martin, T., Tsarau, A. & Bihs, H. A numerical framework for modelling the dynamics of open ocean aquaculture structures in viscous fluids. Appl. Ocean Res.106, 102410. 10.1016/j.apor.2020.102410 (2021). [Google Scholar]
  • 18.Su, B., Reite, K.-J., Føre, M., Aarsæther, K. G., Alver, M. O., Endresen, P. C., Kristiansen, D., Haugen, J., Caharija, W., & Tsarau, A. A multipurpose framework for modelling and simulation of marine aquaculture systems. In Proceedings of the International Conference on Offshore Mechanics and Arctic Engineering (Vol. 58837, pp. V006T05A002). American Society of Mechanical Engineers (2019).
  • 19.Reite, K.-J., Føre, M., Aarsæther, K. G., Jensen, J., Rundtop, P., Kyllingstad, L. T., Endresen, P. C., Kristiansen, D., Johansen, V., & Fredheim, A. FhSim–Time domain simulation of marine systems. In Proceedings of the International Conference on Offshore Mechanics and Arctic Engineering (Vol. 45509, pp. V08AT06A014). American Society of Mechanical Engineers (2014).
  • 20.Kristiansen, T. & Faltinsen, O. M. Modelling of current loads on aquaculture net cages. J. Fluids Struct.34, 218–235. 10.1016/j.jfluidstructs.2012.04.001 (2012). [Google Scholar]
  • 21.Huang, C.-C., Tang, H.-J. & Liu, J.-Y. Dynamical analysis of net cage structures for marine aquaculture: Numerical simulation and model testing. Aquacult. Eng.35(3), 258–270. 10.1016/j.aquaeng.2006.03.003 (2006). [Google Scholar]
  • 22.Liu, X., Han, X., Wang, H. & Liu, G. A modelling and updating approach of digital twin based on surrogate model to rapidly evaluate product performance. Int. J. Adv. Manuf. Technol.129(11), 5059–5074. 10.1007/s00170-023-12646-w (2023). [Google Scholar]
  • 23.Chakraborty, S., Adhikari, S. & Ganguli, R. The role of surrogate models in the development of digital twins of dynamic systems. Appl. Math. Model.90, 662–681. 10.1016/j.apm.2020.09.037 (2021). [Google Scholar]
  • 24.Bi, C.-W. et al. An efficient artificial neural network model to predict the structural failure of high-density polyethylene offshore net cages in typhoon waves. Ocean Eng.196, 106793. 10.1016/j.oceaneng.2019.106793 (2020). [Google Scholar]
  • 25.Zhao, Y.-P., Lian, L., Bi, C.-W. & Xu, Z. Digital twin for rapid damage detection of a fixed net panel in the sea. Comput. Electron. Agric.200, 107247. 10.1016/j.compag.2022.107247 (2022). [Google Scholar]
  • 26.Zhang, Z., Gui, F., Qu, X. & Feng, D. Netting damage detection for marine aquaculture facilities based on improved mask R-CNN. J. Mar. Sci. Eng.10(7), 996. 10.3390/jmse10070996 (2022). [Google Scholar]
  • 27.Zhao, Y.-P., Bi, C.-W., Sun, X.-X. & Dong, G.-H. A prediction on structural stress and deformation of fish cage in waves using machine-learning method. Aquacult. Eng.85, 15–21. 10.1016/j.aquaeng.2019.01.003 (2019). [Google Scholar]
  • 28.Popov, A. A., & Sandu, A. Multifidelity data assimilation for physical systems. In Data Assimilation for Atmospheric, Oceanic and Hydrologic Applications (Vol. IV, pp. 43–67). Springer. 10.1007/978-3-030-77722-7_2 (2022).
  • 29.Babaee, H., Bastidas, C., Defilippo, M., Chryssostomidis, C., & Karniadakis, G. E. A multifidelity framework and uncertainty quantification for sea surface temperature in the Massachusetts and Cape Cod Bays. Earth Space Sci., 7(2), e2019EA000954 (2020). 10.1029/2019EA000954.
  • 30.Conti, P. et al. Multi-fidelity reduced-order surrogate modelling. Proceedings R. Soc. A480(2283), 20230655. 10.1098/rspa.2023.0655 (2024). [Google Scholar]
  • 31.Kennedy, M. C. & O’Hagan, A. Predicting the output from a complex computer code when fast approximations are available. Biometrika87(1), 1–13. 10.1093/biomet/87.1.1 (2000). [Google Scholar]
  • 32.Perdikaris, P., Raissi, M., Damianou, A., Lawrence, N. D. & Karniadakis, G. E. Nonlinear information fusion algorithms for data-efficient multi-fidelity modelling. Proc. R. Soc. A: Math. Phys. Eng. Sci.473(2198), 20160751. 10.1098/rspa.2016.0751 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Cutajar, K., Pullin, M., Damianou, A., Lawrence, N., & González, J. Deep Gaussian processes for multi-fidelity modeling. arXiv preprint arXiv:1903.07320 (2019).
  • 34.Meng, X. & Karniadakis, G. E. A composite neural network that learns from multi-fidelity data: Application to function approximation and inverse PDE problems. J. Comput. Phys.401, 109020. 10.1016/j.jcp.2019.109020 (2020). [Google Scholar]
  • 35.Meng, X., Babaee, H. & Karniadakis, G. E. Multi-fidelity Bayesian neural networks: Algorithms and applications. J. Comput. Phys.438, 110361. 10.1016/j.jcp.2021.110361 (2021). [Google Scholar]
  • 36.Zhang, L., Wu, Y., Jiang, P., Choi, S.-K. & Zhou, Q. A multi-fidelity surrogate modeling approach for incorporating multiple non-hierarchical low-fidelity data. Adv. Eng. Inform.51, 101430. 10.1016/j.aei.2021.101430 (2022). [Google Scholar]
  • 37.Champenois, B. & Sapsis, T. Machine learning framework for the real-time reconstruction of regional 4D ocean temperature fields from historical reanalysis data and real-time satellite and buoy surface measurements. Physica D459, 134026. 10.1016/j.physd.2023.134026 (2024). [Google Scholar]
  • 38.Lai, X. et al. Digital twin-based structural health monitoring by combining measurement and computational data: An aircraft wing example. J. Manuf. Syst.69, 76–90. 10.1016/j.jmsy.2023.06.006 (2023). [Google Scholar]
  • 39.Desai, A. S., Navaneeth, N., Adhikari, S. & Chakraborty, S. Enhanced multi-fidelity modeling for digital twin and uncertainty quantification. Probab. Eng. Mech.74, 103525. 10.1016/j.probengmech.2023.103525 (2023). [Google Scholar]
  • 40.Chetan, M., Yao, S. & Griffith, D. T. Multi-fidelity digital twin structural model for a sub-scale downwind wind turbine rotor blade. Wind Energy24(12), 1368–1387. 10.1002/we.2636 (2021). [Google Scholar]
  • 41.Rasmussen, C. E., & Williams, C. K. I. Gaussian Processes for Machine Learning (Vol. 1). Springer (2006).
  • 42.Kipf, T. N., & Welling, M. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).
  • 43.Li, X., Xu, L., Guo, H. & Yang, L. Application of graph convolutional neural networks combined with single-model decision-making fusion neural networks in structural damage recognition. Sensors23(23), 9327. 10.3390/s23239327 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Føre, H. M., Thorvaldsen, T., Folkedal, O., Holmen, I., Lona, E., Grøtli, E. I., ... & Klebert, P. Safe, robust, and productive exposed aquaculture operations–Challenges and solutions. SINTEF Ocean AS (2024).
  • 45.Amundsen, H. B., Katsidoniotaki, E., Føre, M. & Kelasidi, E. Aquaculture robotics: Adaptive path planning through real-time estimation of the shape of flexible net pens (IEEE, IEEE Transactions on Field Robotics, 2025). 10.1109/TFR.2025.3560245. [Google Scholar]
  • 46.Kelasidi, E., & Svendsen, E. Robotics for sea-based fish farming. In Encyclopedia of Smart Agriculture Technologies (pp. 1–20). Springer. 10.1007/978-3-031-24861-0_202 (2023).
  • 47.Katsidoniotaki, E., Su, B., Kelasidi, E., & Sapsis, T. Integrating machine learning for real-time structural monitoring of net cages. In Proceedings of the 34th International Ocean and Polar Engineering Conference (ISOPE 2024) (2024).
  • 48.Rudy, S. H. & Sapsis, T. P. Prediction of intermittent fluctuations from surface pressure measurements on a turbulent airfoil. AIAA J.60(7), 4174–4190. 10.2514/1.J061163 (2022). [Google Scholar]
  • 49.Guth, S. & Sapsis, T. P. Wave episode based Gaussian process regression for extreme event statistics in ship dynamics: Between the Scylla of Karhunen-Loève convergence and the Charybdis of transient features. Ocean Eng.266, 112633. 10.1016/j.oceaneng.2022.112633 (2022). [Google Scholar]
  • 50.Katsidoniotaki, E., Guth, S., Göteman, M. & Sapsis, T. P. Reduced order modeling of wave energy systems via sequential Bayesian experimental design and machine learning. Appl. Ocean Res.155, 104439. 10.1016/j.apor.2025.104439 (2025). [Google Scholar]
  • 51.Guth, S., Katsidoniotaki, E. & Sapsis, T. P. Statistical modeling of fully nonlinear hydrodynamic loads on offshore wind turbine monopile foundations using wave episodes and targeted CFD simulations through active sampling. Wind Energy27(1), 75–100. 10.1002/we.2880 (2024). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Code will become available to a dedicated depository after the paper is accepted. Correspondence and requests for code materials should be addressed to Eirini Katsidoniotaki or Themistoklis P. Sapsis. The data that support the findings of this study are available from SINTEF Ocean AS but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of SINTEF Ocean AS.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES