Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2026 Apr 8:2026.04.05.716563. [Version 1] doi: 10.64898/2026.04.05.716563

Composite Biofidelity: Addressing Metric Degeneracy in Biomechanical Model Validation and Machine Learning Loss Design

Amruta Koshe, Ehsan Sobhani-Tehrani, Kian Jalaleddini, Hamid Motallebzadeh
PMCID: PMC13081880  PMID: 41993423

Abstract

Spectral similarity is often judged with a single metric such as RMSE, yet this can be misleading: physically different errors can produce similar scores. This is a critical limitation for computational biomechanics, where spectral agreement underpins both model validation and machine-learning loss design. Here, we develop a multi-metric framework for objective spectral biofidelity and test whether it better captures meaningful disagreement across complex frequency-domain responses.

We evaluated 12 complementary similarity metrics, including CORA and ISO/TS 18571, using controlled spectral perturbations that mimic common real-world deviations such as resonance shifts, localized spikes, and broadband tilts. We then applied the framework to an SBI-tuned finite-element middle-ear model to assess convergence with training dataset size and robustness to measurement noise across repeated stochastic runs.

No single metric performed reliably across all distortion types. Shape-based metrics tracked resonance morphology but could miss vertical scaling, whereas MaxError remained important for narrowband anomalies that smoother metrics underweighted. CORA and ISO 18571 did not consistently outperform simpler metrics. Rank aggregation using Borda count provided a robust consensus across metrics, enabling objective identification of training-data saturation and noise thresholds beyond which similarity rankings became unstable.

These results show that spectral biofidelity cannot be reduced to a single norm. A multi-metric consensus provides a clearer and more physically meaningful basis for comparing experimental and simulated spectra, and offers a more defensible foundation for data-fidelity terms in physics-informed and simulation-based machine learning.

Full Text

The Full Text of this preprint is available as a PDF (1.3 MB). The Web version will be available soon.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES