Skip to main content
PLOS One logoLink to PLOS One
. 2024 Mar 28;19(3):e0301211. doi: 10.1371/journal.pone.0301211

Research on local sound field intensity control technique in metasurface based on deep neural networks

Huanlong Zhao 1, Qiang Lv 1,*, Zhen Huang 1, Wei Chen 1, Guoqiang Hao 1
Editor: Yuan Zhang2
PMCID: PMC10977772  PMID: 38547089

Abstract

The use of tunable metasurface technology to realize the underwater tracking function of submarines, which is one of the hotspots and difficulties in submarine design. The structure-to-sound-field metasurface design approach is a highly iterative process based on trial and error. The process is cumbersome and inefficient. Therefore, an inverse design method was proposed based on parallel deep neural networks. The method took the global and local target sound field feature information as input and the metasurface physical structure parameters as output. The deep neural network was trained using a kernel loss function based on a radial basis kernel function, which established an inverse mapping relationship between the desired sound field to the metasurface physical structure parameters. Finally, the sound field intensity modulation at a localized target range was achieved. The results indicated that within the regulated target range, this method achieved an average prediction error of less than 5 dB for 92.9% of the sample data.

1. Introduction

Acoustic super-surface materials, utilizing the technology of modifying material properties based on the generalized Snell’s law, possess exceptional acoustic control abilities at subwavelength scales. Therefore, they have emerged as research hotspots in areas such as acoustic lenses, noise reduction, and acoustic stealth [14]. Pentamode acoustic metasurface can further compensate for the narrow-band limitations of traditional metasurfaces [5,6]. So far, researchers have achieved some sound field modulation works through the use of various structural designs such as Helmholtz resonators, coiled channels, mazes, and cavities [713]. Also, the researchers have utilized combinations of different materials, such as a combination of water and silicone rubber or polyurethane composites, to achieve the goal of reflective sound field modulation [14,15], other researchers use bottom-up inversion optimization algorithms to design metasurfaces [1619]. While these phase mutation methods can achieve acoustic field modulation [2025], it is noteworthy that all of the aforementioned methods have been studied based on the forward model of the sound field and finite element simulation. This research process requires a substantial amount of time from the researchers.

Considering the limitations of conventional metasurface research methods, machine learning methods have been applied to the acoustic metasurface inverse design process, which enabled the optimization of structural parameters. Some results have been achieved [2628]. Among them, Zhao et al. used a convolutional neural network model to establish a mapping of the local acoustic field to the phase gradient of the metasurface to achieve regional control of the local acoustic field [27]. Li et al. proposed a tandem neural network approach to reverse-engineer the phase of a metasurface unit such that the energy loss of an acoustic wave in the return direction is greater than 10 dB [28]. Long, Chen et al. used genetic algorithms to respectively design metasurface structures for sound absorption [26,29]. Li, Lin, et al. have respectively used machine learning for encoding metasurfaces to enable the modulation of the sound field by arranging these logical units into specific sequences [30,31]. These studies have taken advantage of the benefits of machine learning methods for model construction, which can help weaken complex physical mechanisms and reduce the need for model accuracy. This shows that the addition of deep learning is relevant to metasurface modulation techniques.

This paper introduces a novel metasurface inverse design method leveraging parallel deep neural networks (PDNN). The method respectively extracts the key information of the acoustic field and the metasurface as the input and output of the PDNN network. With the help of the kernel loss function and the constraint performance provided by the constraint network, it establishes an inverse mapping relationship between the target acoustic field and the physical structure parameters of the super-surface. Model validation show that this method can realize the regulation of local sound field intensity. This may be a novel way to achieve stealth for submarine vehicles.

2. Physical model of the metasurface local sound field

Turing the process of realizing intensity modulation of the target acoustic field, a physical model of the metasurface local sound field was used to acquire the dataset. The pentamode metasurface was chosen for sound field simulation because of its advantages of impedance matching with the ambient medium and wide frequency [32]. For a pentamode metasurface based on the generalized Snell’s law, the material density distribution ρ(x) is the decisive parameter affecting the reflected acoustic field. When the sound wave is vertically incident on the pentamode metasurface with the acoustic velocity c0, its ideal density distribution ρ(x) satisfies Eq (1) [33]:

ρ(x)=(sin(θr)x/2h+C0)ρ0,0xL (1)

where L is the length of the metasurface, C0 is the integration constant, θr is the reflection angle, ρ0 is the density of incident medium, h is the normal thickness of metasurface, and x is the position. Artificial periodic structures cannot realize a continuous material density distribution on the theoretical metasurface. To approximate this continuous distribution, the theoretical metasurface can be discretized into n cells along the length (i = 1, 2, …, n). The density of each discrete cell is characterized by the density ρi at its center position [34,35].

In this paper, we started from the idea of parametric modeling without structural constraints on the metasurface structural units. Simplified parameters were used instead of metasurface structural units. With the method of unit combination, the phase mutation was adjusted at the same time to realize the local tuning of the acoustic field. The hypersurface has a normal thickness of 0.12m and a length of 1m. In the example of a sonar-detecting submarine shown in Fig 1(A), the incident acoustic wave can be viewed as a plane wave. When the incident acoustic wave contacts the surface of the submersible, the main reflected acoustic field is adjusted from the 90° direction to the other direction through the acoustic field modulation technique. The intensity of the acoustic field in the return direction is also changed. Therefore, we established the physical model of the sound field following the approach illustrated in Fig 1(B). A plane wave incident vertically underwater is used as the background field. The metasurface covered the backing plate surface and consisted of n metasurface structural units arranged along the x-positive direction. When a plane wave is incident on the metasurface in the y-reverse direction, the reflected waves generated by the n metasurface structural units interacting with each other make up the entire reflected acoustic field. The physical structural parameters of each structural unit could be different. In this paper, the physical structural parameters of each unit were obtained according to a gradient arrangement, which satisfied the requirements of the intensity characteristics of the desired sound field distribution, set n to 25, so the length of each hypersurface structural unit is 0.04m.

Fig 1.

Fig 1

a) Schematic of the sound field model. b) Schematic of the pentamode metasurface.

3. Inverse design method based on parallel deep neural networks

3.1. Extraction of sound field features

In order to predicted the intensity of the target sound field, this paper drew on the idea of multi-scale that was to extract the global and local feature information of the sound field. The goal is the modulation of the local acoustic field, but the entire reflected acoustic field is a joint action of all metasurface structural units. In particular, the coupling relationship between individual structural units will have a significant impact on the reflected acoustic field characteristics. This coupling relationship will greatly increase the complexity of the model [36]. Therefore, when using local sound field intensity values as parallel deep neural network inputs, it was necessary to include feature information of the global sound field to constrain this prediction process.

The prediction of target sound field strength required the selection of global and local feature information. According to Fig 2, it can be seen that the main change features of the reflected sound field are concentrated in the vicinity of the main reflection angle, the wave crest and trough, so the selection of global feature information can be extracted at the main change features. The local sound field intensity information was the value of the sound field intensity within the selected tuning target. After the global and local sound field feature information was extracted, it was used as an input to the parallel deep neural network.

Fig 2. Intensity distribution of the sound field.

Fig 2

3.2. Network model building

In this paper, a parallel deep neural network based on a fully connected architecture was used to predict the physical structural parameters of metasurface structural units. The model inputs were the extracted global and local feature information. The outputs were the density ρ and the gradient value g of the first structural unit. The density distribution of the entire metasurface structural unit could be determined from the density ρ of the first structural unit and the gradient value g [32]. As shown in Fig 3, the network model consists of two sub-networks, which are a constraint network with global sound field feature information as input and a prediction network with local sound field intensity information as input. The parameter α in the figure was the weight factor, which was used to determine the weight of the constraint network in the overall model and took values in the range of [0,1]. The specific neural network topology consisted of an input layer, a hidden layer, and an output layer.

Fig 3. Structure of the parallel network model.

Fig 3

In the loss function selection, the MSE function is usually adopted as the loss function. However, the MSE loss function cannot accurately assess the nonlinear characteristics of the error and is sensitive to outliers. This problem can be solved by modifying the loss function using the radial basis kernel function. The modified loss function LKernel-MSE-Single can be written as Eq (2) [37]:

LKernelMSESingle=1Nt=1N[1exp((yty^t)2/2σ2)] (2)

where N was the number of samples and σ was the parameter of the loss function itself, yt and ŷt respectively represented true and predicted values. We set φ = (y- ŷ)2 and λ = 2σ2, where y was the true value and ŷ was the predicted value, so φ was the squared error between the true value and the predicted value. According to Eq (2) and Fig 3, the final loss function LK-MSE expression was shown in Eq (3):

LKMSE=1Nt=1N{α[1exp(φt1/λ1)]+(1α)[1exp(φt2/λ2)]} (3)

where φt1 and φt2 respectively represented the squared errors of the true and predicted values of the constraint network, the prediction network. λ1 and λ2 were respectively the number of input features for the constraint and prediction networks. The modified loss function computed the gradient to the network parameters and completed the update to the network parameters.

4. Analysis of model validation results

4.1 Dataset preparation and setup parameters

The specific composition process of the dataset was shown in Fig 4, which consisted of two parts: labels and inputs. The labeling part consisted of the first metasurface unit density ρ and gradient value g. We randomly generated 30,000 sets of first block metasurface structural unit densities ρ and gradient values g. The metasurface density distributions could be calculated from the generated data, which were inputted into a physical model of the acoustic field to derive the corresponding acoustic field intensity distributions. The input parts were global sound field feature information and local sound field intensity information, which could be obtained by feature extraction of the sound field intensity distribution. In Fig 1(A), the reflected wave will be reflected along the echo direction (y reverse direction) when there is no metasurface, so the global sound field range could be set to 0˚~180˚. The energy of the reflected wave is mainly concentrated in the direction of the echo [38]. Therefore, we set the target regulation interval as 85˚~95˚. When extracting the sound field intensity values within the tuning target range, we sampled at 0.5° equal intervals with a dimension of 1×21. The sound field in the range of 0˚~180˚ was divided into 6 intervals at equal step. The rules for extracting global sound field features were as follows:

Fig 4. The composing process of the dataset.

Fig 4

Step 1, Within each interval, select: 1. The sound field intensity values and angles of the maximum crest and two adjacent points on each side are required, 2. The sound field intensity values and angles of the minimum trough trough and two adjacent points on each side are required.

Step 2: Within the global acoustic field, the act of selection: 1. The intensity value of the main reflection angle sound field, 2. The number of maximum peaks.

The above features were selected to characterize the global feature information of the sound field with a dimension of 1×122.

The number of neurons for the neural network model was set as shown in Table 1. The relevant parameters for training the network model were set as shown in Table 2.

Table 1. The establishment of neuronal population size.

Neuronal layer Predictive network Constrained network
Input layer 21 122
Hidden layer 100、300、600、1000、500、80 724、1000、543、200、100
Output layer 2

Table 2. Parameter configuration of predictive models.

Classification Parameter name Parameter Settings
Dataset partitioning Training set 24000
Testing set 6000
Network Configuration Learning rate 0.005
Activation function LeakyReLu
Optimizer Adam
Dropout 0.1
Training Process Epoch 800
Batch size 64

4.2 Network model training

In network model training, the constraint performance of the constraint network directly affects the final network prediction results. If the constraint network weight α was too large means that the network model prediction results were more biased towards the global sound field distribution. Thus, the target sound field intensity prediction was not accurate enough. If the constraint network weight α was small means that the link between the target sound field intensity and the global sound field was weakened, and the coupling relationship between multiple structural units couldn’t be learned during the model training process. The prediction results will also be inaccurate. Therefore, a key point in realizing local sound field intensity prediction was to determine the optimal weighting of the constraint network in the overall neural network by adjusting the weighting factors.

In this paper, the value of weight factor α was discussed. Under the unchanged conditions of the aforementioned parameter settings, model training and generalization ability verification were performed with different values of α (0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, and 0.9). The variation of the loss function is shown in Fig 5. When the model convergence at different values of α was stabilized, the last 20 batches after the convergence of the network model were taken as the horizontal coordinates. Fig 5 shows that the loss function value is decreasing as the value of α increases.

Fig 5. The loss function value corresponding to different values of α.

Fig 5

The loss function value can reflect the network model performance to some extent. Generalization ability is also one of the important indexes to evaluate the performance of network model. To verify the generalization ability of the network model, we randomly selected 1000 sets of acoustic data from the dataset that were not involved in the model training. These data were then input into the network model with different values of α. Predictions of the density of metamaterial unit cells were obtained. The density prediction values obtained were used to predict the sound field intensity through finite element analysis. The average error e between these predicted values and the corresponding local sound field intensity values in the sample data was calculated. The validity of the metasurface inverse design method was verified. The average error e is calculated as follows:

e=111iN(|PP^|),i=85,86,,95 (4)

In the above equation e is the mean error, P is the true value and P(ˆ) represents the predicted value. Fig 6 illustrates that the proportion P of cases where e is less than 5 dB varies with different values of α. The optimal generalization ability was achieved at 92.9% when α = 0.4. A comparison of Figs 5 and 6 shows that the generalization ability of the network models corresponding to different values of α do not increase as the loss function decreases. Fig 7(A)–7(I) show the real and predicted sound field distributions at different α when validation parameters are the same. Fig 7(D) shows the sound field distribution under the optimal parameter α = 0.4.

Fig 6. Proportion of modulation intervals with mean error values less than 5 dB for different values of α.

Fig 6

Fig 7.

Fig 7

(a)-(i) respectively showed the real and predicted sound field distributions at α = 0.1~0.9 for a certain set of parameters.

As shown in Fig 7(A)–7(C), when the α value is 0.1, 0.2, and 0.3, it represents weaker constraint capabilities. When the model is trained with more focus on local sound field characteristics. Information on the coupling relationship between multiple structural units is missing when constructing the mapping relationship between the sound field distribution to the structural parameters, which leads to inaccurate prediction of the local sound field. Fig 7(E)–7(I) illustrated the case of high constraint capacity. The network model learns more about global features. Constructing mapping relationships between sound field distributions to structural parameters focuses more on the entire sound field, which leads to neglecting local sound field feature information.

Based on the above model validation results, the model performance is best when the weighting factor α = 0.4. Fig 8 shows the loss function variation of the network model when α = 0.4. Fig 9 demonstrates the curves of the true and predicted values of a certain group of acoustic field intensities at α = 0.4. Fig 9 shows that the predicted values of the sound field intensity are in general agreement with the trend of the real values. In the local control range (i.e., within 85˚~95˚), the error between the predicted sound field intensity values and the true values is approximately 1 dB. The specific distribution curve can be seen in Fig 10. As shown in Fig 11, among the randomly selected 1000 sets of data not involved in training, 92.9% of the sample data have an average error e of less than 5 dB in the modulation interval. The model can establish the mapping relationship from the local acoustic field to the metasurface structural parameters, which realizes the accurate prediction of the metasurface structural parameters. The modulation of the target acoustic field intensity is realized and has a good generalization ability.

Fig 8. Loss function curve at α = 0.4.

Fig 8

Fig 9. Distribution curves of true and predicted sound field intensity.

Fig 9

Fig 10. Curves of real and predicted local sound field strengths.

Fig 10

Fig 11. The average error between the true value of the local sound field intensity and the predicted value.

Fig 11

4.3 Loss function comparison model validation

This study introduced a comparative model validation between the K-MSE loss function and other commonly used loss functions which were SmoothL1, Quantile, Huber, MAE, and MSE. Their specific formulas were given as Eqs (5)–(9). Comparative model validation could demonstrate that the K-MSE loss function can help to construct a mapping relationship between the physical structure parameters and the local sound field intensity. The training parameters of the network model were set according to Tables 1 and 2, while α = 0.4. Only the loss function changed throughout the training process.

LSmoothL1={1Ni=1N0.5(yiy^i)2,|yiy^i|<11Ni=1N|yiy^i|0.5,|yiy^i|1 (5)
LQuantile={1Ni=1Nq(yiy^i),yi>y^i1Ni=1N(1q)(y^iyi),yiy^i (6)
LHuber={1Ni=1N(yiy^i)2/2,|yiy^i|<δ1Ni=1Nδ(|yiy^i|δ/2),|yiy^i|δ (7)
LMAE=1Ni=1N|yiy^i| (8)
LMSE=1Ni=1N(yiy^i)2 (9)

where yt was the true value, ŷt was the model predicted value, and N was the number of samples. In Eq (6), q was the quantile, q = 0.8. In Eq (7), δ was the hyperparameter of LHuber, δ = 0.5.

The generalization ability of the network model under the six loss functions was also verified using a randomly sampled set of 1000 data sets that were not involved in training. The performance of the network model with different loss functions is shown in Table 3. The K-MSE loss function achieves P percentage of 92.9% when e is less than 5 dB in the target control range. It also has the smallest loss function value and the fastest convergence rate, which indicates its optimal optimization performance for the prediction network. The predicted sound field distribution for the same set of randomly selected parameters is shown in Fig 12. Fig 12(A)-12(E) demonstrates that the errors in the main reflection angles of the predicted and real sound fields are within 15°. The envelope trend of the sound field intensity distribution of the predicted sound field and the real sound field are similar, but the number of peaks and valleys and the difference of the local sound field intensity between the two are large. As shown in Fig 12(F), the distribution of the model predicted sound field under the K-MSE loss function can have the same distribution trend as that of the real sound field, and the error of the sound field intensity value in the modulation range is about 1 dB. The K-MSE loss function helps the network model to accurately measure the nonlinearity of the error. The mapping relationship between the physical structure parameters to the sound field intensity is helped to be constructed.

Table 3. Comparison of loss functions.

name SmoothL1 Quantile Huber MAE MSE K-MSE
value of the loss function 1.7586 e-05 0.0017 1.7766 e-05 0.0036 6.1846 e-05 2.9612 e-06
astringent batch 622 697 531 592 492 475
P 28.2% 38.2% 59.5% 65% 69.1% 92.9%

Fig 12.

Fig 12

(a)-(f) refer to the actual sound field and predicted sound field distributions under different loss functions for the same parameter.

5. Conclusions

This paper proposed a metasurface inverse design method based on parallel deep neural networks (PDNN). The method respectively established a prediction network with local target acoustic field intensity and metasurface physical structure parameters as input and output. The global acoustic field features were used as inputs to the constraint network. The weight of the constraint network in the whole PDNN network was adjusted by adjusting the weight factor. The loss function based on the radial basis kernel function was used to train the whole network model and construct the mapping relationship from the desired sound field to the metasurface physical structure parameters. The predicted metasurface structural unit parameters could be derived from the desired acoustic field. Ultimately, the modulation of local acoustic field intensity was realized. The model validation results demonstrate that the predicted sound field intensity curve obtained by this method closely follows the variation trend of the simulated sound field intensity curve. Additionally, it achieves a percentage of 92.9% for the sample data in which the average error between the predicted sound field intensity values and the true values falls within the specified control target range of less than 5 dB.

Data Availability

We have uploaded the minimal anonymized data set necessary to replicate our study findings to a stable, public repository. This is the link where one can access that information: https://github.com/bjhkbj/dataset.git.

Funding Statement

The National Natural Science Foundation of China (Grant No. 61873101), the PetroChina Innovation Foundation (Grant No. 2020 D-5007-0305), and the Marine Defense Technology Innovation Center Innovation Fund (Grant No. JJ-2020-719-03-02) supported this study.

References

Decision Letter 0

Yuan Zhang

29 Jan 2024

PONE-D-23-30565Research on Local Sound Field Intensity Control Technique in Metasurface Based on Deep Neural NetworksPLOS ONE

Dear Dr. Lv,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised by the referees. Please note detailed comments from reviewer 2 is in an attached document.

Please submit your revised manuscript by Mar 14 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Yuan Zhang, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Did you know that depositing data in a repository is associated with up to a 25% citation advantage (https://doi.org/10.1371/journal.pone.0230416)? If you’ve not already done so, consider depositing your raw data in a repository to ensure your work is read, appreciated and cited by the largest possible audience. You’ll also earn an Accessible Data icon on your published paper if you deposit your data in any participating repository (https://plos.org/open-science/open-data/#accessible-data).

3. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

4. Thank you for stating the following financial disclosure: "The National Natural Science Foundation of China (Grant No. 61873101), the PetroChina Innovation Foundation (Grant No. 2020 D-5007-0305), and the Marine Defense Technology Innovation Center Innovation Fund (Grant No. JJ-2020-719-03-02) supported this study."

Please state what role the funders took in the study.  If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

If this statement is not correct you must amend it as needed.

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

5. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ

6. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This manuscript investigated the inverse design of acoustic metasurfaces based on a parallel deep neural network, where acoustic field features and the parameters of metasurfaces are the input and output, respectively. The work and the corresponding results are interesting and meaningful, yet the referee has several questions to be addressed by the authors.

1. In this manuscript, the metasurface is based on the profile of the mass density along the metasurface (or the constant gradient of the mass density more precisely). However, parametric modeling was conducted throughout the paper. How can this theoretical metasurface be physically realized in real applications? Do the authors have any proposal for this “theoretical metasurface” since it requires an increasing mass density while keeping the sound speed as a constant instead.

2. What is the physical dimension of the metasurface structural unit and the whole metasurface used in this manuscript? What is the thickness of the structural unit?

3. The authors should provide the necessary details of how the input acoustic fields are obtained. For example, what kind of physical model is used to get the acoustic field, what is the sound frequency, and etc. More importantly, what is the propagation distance for the sound intensity field since the absolute intensity level of the fields is used, but clearly the sound intensity level will be different at different propagation distance.

4. In Fig. 5, are the true values of the error for different alpha comparable? The error for the same alpha represents the degree of convergence of the model, but what is the physical meaning for the comparison between different alpha? The optimal choice of alpha=0.4 does not result in the minimum error.

5. How was the average error computed?

6. The language could be improved, and the symbols and punctuation mark should be kept consistent, for example C0 or c0 in Eq. (1), comma or semicolon in Lines 161 and 165, and etc.

Reviewer #2: This paper introduces a novel metasurface inverse design method leveraging parallel deep neural networks (PDNN). The method respectively extracts the key in formation of the acoustic field and the metasurface as the input and output of the PDNN network. With the help of the kernel loss function and the constraint performance provided by the constraint network, it establishes an inverse mapping relationship between the target acoustic field and the physical structure parameters of the metasurface. The method has certain design potential and may find applications in the field of underwater acoustic stealth. However, before accepting the publication of this paper, it is necessary to address the following questions.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: review.docx

pone.0301211.s001.docx (15.8KB, docx)
PLoS One. 2024 Mar 28;19(3):e0301211. doi: 10.1371/journal.pone.0301211.r002

Author response to Decision Letter 0


7 Mar 2024

Dear edit and dear reviewers,

Re: Manuscript ID: PONE-D-23-30565 and Title: Research on Local Sound Field Intensity Control Technique in Metasurface Based on Deep Neural Networks

Thank you for your letter and for the reviewers' comments concerning our manuscript entitled “Research on Local Sound Field Intensity Control Technique in Metasurface Based on Deep Neural Networks”(ID:PONE-D-23-30565). Those comments are all valuable and very helpfiul for revising and improving our paper, as well as the important guiding significance to our researches, We have studied comments carefully and have made correction which we hope meet with approval. Revised portion are marked in red in the paper. The main corrections in the paper and the responds to the reviewer's comments are as flowing:

Responds to the reviewers’s comments:

Reviewer #1: This manuscript investigated the inverse design of acoustic metasurfaces based on a parallel deep neural network, where acoustic field features and the parameters of metasurfaces are the input and output, respectively. The work and the corresponding results are interesting and meaningful, yet the referee has several questions to be addressed by the authors.

1. In this manuscript, the metasurface is based on the profile of the mass density along the metasurface (or the constant gradient of the mass density more precisely). However, parametric modeling was conducted throughout the paper. How can this theoretical metasurface be physically realized in real applications? Do the authors have any proposal for this “theoretical metasurface” since it requires an increasing mass density while keeping the sound speed as a constant instead.

Response 1: We really appreciate your professional comments on our articles. In this paper, due to the practical constraints, no actual physical implementation has been made, but the existing literature has used five-mode metasurfaces for practical applications in stealth and focusing, and for theoretical metasurfaces, it is impossible to guarantee that the sound velocity is constant, but we can use five-mode metasurfaces to avoid the influence of sound velocity.

In the theory of five-mode metasurfaces, when all sound waves are projected into the interior of the metasurface, the impedance matching of the metasurface and the medium needs to be satisfied when the perpendicular incident occurs

(1)

In Eq. (1), z is the intrinsic impedance of the metasurface, z0 is the intrinsic impedance of the medium, ρ and ρ0 are the densities of the metasurface and the incident medium, respectively, c and c0 represent the velocity of sound of the metasurface and the incident medium:

(2)

Eq. (2) illustrates that while controlling the speed of sound, the density needs to change in the opposite direction to achieve impedance matching. The density and speed of sound of natural materials generally change with the same trend, and in the case of a change in the speed of sound, there is no guarantee that the impedance will not change, and the speed of sound cannot be guaranteed to be constant. Therefore, to meet the conditions for the existence of full transmission, only artificial materials can achieve it. The ideal density distribution of the reflective five-mode metasurface is

(3)

In Eq. (3), θr is the reflection angle, L is the metasurface length, and H is the normal thickness of the metasurface.

2. What is the physical dimension of the metasurface structural unit and the whole metasurface used in this manuscript? What is the thickness of the structural unit?

Response 2: We appreciate your professional comments on our article. The focus of this article is on the prediction of the physical structure parameters of the metasurface, and there is no restriction on the internal structure of the metasurface structural units; the normal thickness of the metasurface is 0.12 m, the length is 1 m, and the length of each metasurface structural unit is 0.04 m. We have added the specific settings into the article at line 82,97 and 98. We thank you very much for your careful reading.

3. The authors should provide the necessary details of how the input acoustic fields are obtained. For example, what kind of physical model is used to get the acoustic field, what is the sound frequency, and etc. More importantly, what is the propagation distance for the sound intensity field since the absolute intensity level of the fields is used, but clearly the sound intensity level will be different at different propagation distance.

Response 3: We appreciate your professional comments on our article. The physical field of the water and the hypersurface is set as pressure acoustics, the acoustic frequency is the frequency of the incident wave, and the backing plate is set as solid mechanics; we establish the acoustic field model with the help of COMSOL simulation software, and the main work of this paper is to verify whether the deep learning modeling method can effectively realize the regulation of the local acoustic field with the help of the acoustic field model, and the results show that based on the deep learning modeling method, it can effectively realize the reverse design and thus achieve the regulation of local acoustic field of the hypersurface. The results show that based on the deep learning model method, the hypersurface inverse design can be effectively realized, and then realize the local acoustic field control. Thank you very much for your question.

4. In Fig. 5, are the true values of the error for different alpha comparable? The error for the same alpha represents the degree of convergence of the model, but what is the physical meaning for the comparison between different alpha? The optimal choice of alpha=0.4 does not result in the minimum error.

Response 4: Thank you very much for your question. The error truth values of different alpha are comparable, and only the value of alpha changes when validation with different alpha is performed. In Fig. 5, the same data set is used to validate the models built with different alpha values during validation, and the performance of the models corresponding to different alpha values can be seen to be superior or inferior based on the difference in the true value of the error; because of the coupling relationship between multiple hypersurface units, the local sound field will be affected by the whole sound field. In this paper, a parallel neural network is used, one model network extracts the global sound field feature information, and the other network extracts the local sound field feature information. In this paper, we use the alpha value can be used to adjust the constraint strength of the global sound field to the local sound field, and we introduce it into the kernel loss function and it plays a role in the model, we need to find out the optimum value of alpha to make the model performance optimal and maximize the accuracy of the prediction. We need to find the optimal alpha value to optimize the model performance and maximize the accurate prediction, we validate the role of alpha value on the model in the paper, and from the results, we can see that the appropriate alpha value has a greater impact on the accuracy of the deep learning model. Thank you for your careful reading.

5. How was the average error computed?

Response 5: Thank you for your question and I apologize for the confusion here. The formula for calculating the average error is as follows:

In the above equation e is the mean error, P is the true value and P(ˆ) represents the predicted value. We have added the formula for calculating the mean error to the article at line 205-208. Thank you for your careful reading.

6. The language could be improved, and the symbols and punctuation mark should be kept consistent, for example C0 or c0 in Eq. (1), comma or semicolon in Lines 161 and 165, and etc.

Response 6: Thank you for your question, and sorry for the trouble here. We've changed the parameters or punctuation to lines 71, 165, and 169. We are very sorry for our careless mistakes. Thanks for the reminder. We've rechecked and corrected the article to make punctuation consistent throughout the article. Thank you for your correction.

Reviewer #2: This paper introduces a novel metasurface inverse design method leveraging parallel deep neural networks (PDNN). The method respectively extracts the key in formation of the acoustic field and the metasurface as the input and output of the PDNN network. With the help of the kernel loss function and the constraint performance provided by the constraint network, it establishes an inverse mapping relationship between the target acoustic field and the physical structure parameters of the metasurface. The method has certain design potential and may find applications in the field of underwater acoustic stealth. However, before accepting the publication of this paper, it is necessary to address the following questions.

1. The state-of-the-art review in the introduction section is a bit poor. In particular, the description of metasurfaces and pentamode metamaterials is a bit too fast citing only a few papers for each mentioned field and does not give proper credit to previous literature contributions. The manipulation of scattering characteristics in the paper aligns with the concept of coding metasurfaces [Natl. Sci. Rev. 2022, 9, nwac030; Phys. Rev. Appl. 2022, 17, 034019; Small 2024, 2308349, DOI: 10.1002/smll.202308349]. The conceptual framework for the design of the pentamode metamaterial also requires richer literature support [J Acoust Soc Am 2009; 125: 839-849; J Sound Vib 2019; 443: 238–252; J Mech Phys Solids 2021; 152:104407].

Response 1: We sincerely thank you for your valuable comments. We have carefully reviewed the introductory section and added more references to metasurfaces and penta-mode metamaterials in the introduction section of the revised draft below:

1. Achromatic metasurfaces by dispersion customization for ultra-broadband acoustic beam engineering (doi: 10.1093/nsr/nwac030);

2. Broadband Coding Metasurfaces with 2-bit Manipulations (doi: https://doi.org/10.1103/PhysRevApplied.17.034019);

3. Enhanced Broadband Manipulation of Acoustic Vortex Beams Using 3-bit Coding Metasurfaces through Topological Optimization (doi: 10.1002/smll.202308349);

4. Acoustic metafluids (doi: https://doi.org/10.1121/1.3050288);

5. Convective correction of metafluid devices based on Taylor transformation (doi: https://doi.org/10.1016/j.jsv.2018.11.047);

6. Customized broadband pentamode metamaterials by topology optimization (doi: https://doi.org/10.1016/j.jmps.2021.104407)

7. Design and experimental verification of a water-like pentamode material (doi: 10.1063/1.4973924)

We have introduced the above literature in the article to complete the introduction section. Thank you for reading.

2. This paper is limited to the optimization design of equivalent parameters and does not delve into more complex pentamode metamaterial configuration designs. To enhance the completeness of the paper, it is suggested to provide potential structural instances based on the designed parameter distribution. Even referencing cases from the literature would be beneficial in demonstrating the effectiveness of the design and the rationality of the designable parameters.

Response 2: Thank you very much for this suggestion, we agree with your suggestion. The work of this paper is to use deep learning to predict the structure parameters of the metasurface, and then realize the local acoustic field control. Due to the fact that the actual design of the metasurface structure unit is long in modeling time and requires high accuracy, the current experimental conditions are difficult to meet, and the specific metasurface structure is not involved, in the follow-up work, we will apply the deep neural network to the study of different metasurface structures and a wider range of sound field control, and hope to truly realize the sound field control in the future. Thank you very much for your suggestions.

3. Since this paper does not involve the design of actual configurations and lacks experimental validation, the use of the term "experiment" in the main text is inappropriate.

Response 3: Thank you very much for your suggestion. Since there is no actual configuration involved, there is indeed an inappropriate problem with the use of "experimental" in the body, and we have modified the word "experimental" to "model validation" in the body of the article at 59,146,224,238,239,242 and 280. Thank you very much for pointing out the problem.

4. Does the scattering wave control strategy based on machine learning offer the advantage of faster and more convenient implementation? If so, does this attribute hold potential for applications in three-dimensional metasurface design and even curved metasurface design? Please provide further insights on this matter.

Response 4: Thank you for your questions. The scattered wave control strategy based on machine learning has the advantage of faster and more convenient implementation, when the machine learning model establishes the mapping relationship between input and output, the physical structure parameters of the metasurface can be obtained in a few seconds, which is much faster than the traditional way of establishing a simulation model. Machine learning has the potential to be applied in 3D metasurface design and even surface design, and it is a method of automatically learning the connection between the input data and the target data from examples from past experience. On the premise of having relevant datasets, the machine learning model can establish a high-dimensional mapping relationship between 3D/surface metasurface parameters and the sound field, and can quickly obtain the desired structural parameters or target sound field, which is helpful for high-dimensional, black-box, and time-consuming optimization problems, and reduces the time and labor cost of repeated experiments. Thank you very much for your questions.

5. This manuscript primarily focuses on the equivalent parameter design of underwater low-detectability metasurfaces. It is recommended to enhance its functionality, for instance, by comparing the reflection energy of this scattering metasurface with that of a fully reflective metasurface to evaluate the metasurface's acoustic target characteristics.

Response 5: We very much agree with you, and your suggestion provides a direction for our next research, which will further focus on the performance of metasurfaces in terms of energy. Detection and other fields play an important role, of course, not only in terms of energy, we also consider combining specific metasurface geometries to verify the performance of local control in actual scenarios, and also expect to achieve a wider range of sound field control, in the future work, we will enhance its function from including but not limited to these aspects. Thank you very much for your suggestion, which provides us with a good idea.

Attachment

Submitted filename: Response to Reviewers.docx

pone.0301211.s002.docx (33.1KB, docx)

Decision Letter 1

Yuan Zhang

12 Mar 2024

Research on Local Sound Field Intensity Control Technique in Metasurface Based on Deep Neural Networks

PONE-D-23-30565R1

Dear Dr. Lv,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at http://www.editorialmanager.com/pone/ and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Yuan Zhang, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: After a rigorous review, the authors of the paper have comprehensively addressed the concerns raised by the reviewer, and the overall writing of the paper is rigorous. Therefore, the reviewer has no further questions.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

Acceptance letter

Yuan Zhang

18 Mar 2024

PONE-D-23-30565R1

PLOS ONE

Dear Dr. Lv,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Yuan Zhang

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: review.docx

    pone.0301211.s001.docx (15.8KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0301211.s002.docx (33.1KB, docx)

    Data Availability Statement

    We have uploaded the minimal anonymized data set necessary to replicate our study findings to a stable, public repository. This is the link where one can access that information: https://github.com/bjhkbj/dataset.git.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES