Skip to main content
PLOS One logoLink to PLOS One
. 2021 Sep 20;16(9):e0257568. doi: 10.1371/journal.pone.0257568

Unifying information theory and machine learning in a model of electrode discrimination in cochlear implants

Xiao Gao 1,2,*, David Grayden 1, Mark McDonnell 3
Editor: Andreas Buechner4
PMCID: PMC8451994  PMID: 34543336

Abstract

Despite the development and success of cochlear implants over several decades, wide inter-subject variability in speech perception is reported. This suggests that cochlear implant user-dependent factors limit speech perception at the individual level. Clinical studies have demonstrated the importance of the number, placement, and insertion depths of electrodes on speech recognition abilities. However, these do not account for all inter-subject variability and to what extent these factors affect speech recognition abilities has not been studied. In this paper, an information theoretic method and machine learning technique are unified in a model to investigate the extent to which key factors limit cochlear implant electrode discrimination. The framework uses a neural network classifier to predict which electrode is stimulated for a given simulated activation pattern of the auditory nerve, and mutual information is then estimated between the actual stimulated electrode and predicted ones. We also investigate how and to what extent the choices of parameters affect the performance of the model. The advantages of this framework include i) electrode discrimination ability is quantified using information theory, ii) it provides a flexible framework that may be used to investigate the key factors that limit the performance of cochlear implant users, and iii) it provides insights for future modeling studies of other types of neural prostheses.

Introduction

People with hearing loss have greatly benefited from continuously developing cochlear implant technology. Contemporary cochlear implants restore functional hearing by directly stimulating different locations in the cochlea with electrodes [1]. The tonotopic organization of surviving nerve fibers allows cochlear implant electrodes to stimulate distinct neural populations thereby coding spectral information to electrode positions [2]. However, there is wide variability of speech perception performance between cochlear implant users [35]. The main factors that are known to limit speech recognition include i) age at implantation, age of onset of deafness, and survival rate of auditory nerve fibers [6], ii) electrode placement and insertion depth [7, 8], and iii) channel interactions between electrodes [3, 9].

Electrode discrimination has been shown to correlate with performance of speech tasks [10, 11]. It has been used as a one of the psychophysical measurements to assess pitch ranking ability [5, 1214], to investigate electrode stimulation strategies [13], to evaluate the effects of electrode placement and insertion [11, 15, 16], and to determine the number of spectral channels that can be distinguished by cochlear implant users [7]. In particular [11, 15], reported that electrode distance to the inner wall of the cochlea is a significant variable for the prediction of electrode discrimination ability, and improvements in speech recognition are expected with electrode placements closer to the inner wall [13, 17]. Showed that the Dual-Electrode strategy achieves better performance in tonotopic pitch ranking compared to single-electrode strategy. However, the results varied among cochlear implant users [12, 15, 16], so the extent to which the key factors affect the performance of cochlear implants users is still unknown.

Mathematical and computational models have been developed to study electrically evoked auditory nerve activity [1820], sound coding strategies [21], electrical current spread in the cochlea [22], electrode stimulation strategies [2325], and effects of electrode placements and insertion depth [26, 27]. The computational models are used to evaluate the effects of different electrode configurations and to study the pattern of neural excitation produced by a given electrode configuration. Psychophysical cochlear implant models which study the psychophysical responses to cochlear implant stimulation, on the other hand, have incorporated neural models to evaluate hearing performance [28]. Proposed an analytical model to predict loudness growth for different electrode configurations [29, 30]. Developed a computational model of pitch perception and discussed the contribution of place and temporal cues to pitch perception [31, 32]. Demonstrated how the spatial spread of the stimulation field impacts on speech recognition and predicted speech intelligibility. More recently [33], studied the effects of musical training on fundamental frequency discrimination.

However, to enable modelling studies of speech recognition variability at the individual level, a modelling framework is needed where the geometric information of the cochlea, the electric conduction characteristics and the neural elements are individually fitted. Our previous studies modeled the electrode-to-fiber interface and quantified the performance of electrode place discrimination from a theoretical point of view [3438]. This modeling framework was developed based on the place code theory of hearing [39], which assumes that sound frequency perception depends upon different auditory nerve fibers responding to different frequencies of sound. Accordingly, the interface between the array of electrodes and the auditory nerve fibers is conceptualized as a communication channel, where the channel input random variable is defined as a choice of electrode to stimulate, and the channel output random variable is a function of the nerve fibers that respond to the electrode stimulation. The mutual information between channel input random variables and channel output random variables is assumed to act as a proxy for electrode discrimination ability, under the assumption that increased mutual information leads to increased hearing performance of cochlear implants [40]. In later studies [41, 42], the performance of place discrimination was quantified by predicting the location of the stimulating electrode in response to an activation pattern of the fibers, and a commonly used measurement of electrode discrimination, four-interval forced-choice, was simulated by applying a discriminative classifier proposed in [43, 44]. However, the performance of cochlear implant electrode discrimination ability has only been investigated based on the statistical correlations [34, 35] or predicted using a neural network classifier [41, 42]. The methods have not been unified to address clinically relevant issues.

The goal of the current study is to provide a flexible structure that enables the investigation of the extent to which key factors limit the performance of cochlear implants model. Unifying the information theoretic method and the neural network classifier into a modeling framework enables us to quantitatively interrelate the performance of cochlear implants model to electrode discrimination ability. In particular, we aim to investigate the extent to which the key model parameters affect the modeled electrode discrimination ability. These parameters—electrode placements, insertion depth, auditory nerve fibre survival rate, width of current spread—represent the key factors that are thought to affect hearing performance with cochlear implants. Therefore, this study may provide insight into the following important and clinically relevant questions: i) how to quantitatively analyze the variability in hearing performance of cochlear implants at the individual level, ii) whether more electrodes can provide improved hearing performance, and iii) how to personalize cochlear implant configurations for an individual user.

Methods

Generation of nerve fiber activation pattern

Our previously developed model is adapted as the underlying method to generate fiber activation patterns [34, 42], which we summarize here:

  1. geometry of auditory nerve fiber and electrode locations and quantities;

  2. mechanisms of stochastic action potential generation in auditory nerve fibers;

  3. current spread from each electrode;

  4. dependence of loudness perception on overall auditory nerve activity;

We first describe the locations and distances between electrodes and auditory nerve fibers. The locations of the i–th fiber, fi, and j–th electrode, ej, are defined in three-dimensional space as [45]

fi=(Rf,icos(φf,i),Rf,isin(φf,i),hf,i), (1)
ej=(Re,jcos(φe,j),Re,jsin(φe,j),he,j), (2)

where i = 1, …, N, N is the number of fibers, j = 1, …, M, and M is the number of electrodes. In Eq (1), ϕf,i is the twirling angle of the i–th fiber and Rf,i and hf,i are the radius distance and the height of the i–th fiber, respectively. Similarly, in Eq (2), ϕe,j, Re,j, and he,j define the quantities for the j–th electrode. An example electrode-to-fiber geometry is shown in Fig 1.

Fig 1. The geometry of the electrode array and auditory nerve.

Fig 1

In this example, the total twirling angle of the cochlea is 5π radians, the twirling angle of the electrode array is 2π radians, and the distance between the electrodes and auditory nerve fibers is r = 2 mm. This figure is adapted from Figure 1 in [42].

The second component of the underlying model defines the event of a fiber producing an action potential under a memoryless assumption. Let Yi ∈ {0, 1} denote whether or not fiber i produces an action potential in response to current at fibre i, Cf,i. The probability that Yi = 1, given Cf,i is

Ps(Cf,i)0.5(1+erf(Cf,i-θi2σi)), (3)

where θi represents the current that produces an action potential with probability 0.5 and σi is a parameter that is used to measure the probability of an action potential [22, 34].

The third component defines the current level for the i–th fiber, Cf,i, as a function of the current level, Ce,j, at stimulating electrode, j, as

Ci,j=Ce,j10-0.05Adi,jl, (4)

where di,j is the distance between the i–th fiber and the j–th electrode, l is the total length of the cochlea, and A is the attenuation in electrode current [46]. The results in [46] confirm the attenuation is approximately 0.5 dB/mm for monopolar and 4 dB/mm for bipolar stimulation. In this paper, we follow [46] and use a single biphasic pulse stimulus and choose A between 0.5 dB/mm and 4 dB/mm to model the different widths of current spread. We use this simple yet effective model for action potential generation and current spread. More complex models may improve the accuracy of modeling electrode stimulation strategies [28].

The fourth component models the loudness by assigning a current level to each electrode such that stimulation of each electrode causes no more than a certain percentage of the auditory nerve fibers to generate action potentials.

A vector of the outcomes for a total of N nerve fibers is defined as the channel output in response to an electrode choice, ej,

Y=(Y1,Y2,,YN). (5)

Since Yi is the activation of the i–th fiber, Y is the total fiber activation pattern in response to an electrode choice.

Fig 2 shows three examples of activation patterns of auditory nerve fibers, Y. The electrode-cochlea geometry used is that in Fig 1, N = 1000, and A = 0.5 dB/mm. For a given stimulated electrode, most of the action potentials were generated in the fibers closest to the stimulated electrode. The activation patterns generated by the model were used to quantify the electrode discrimination ability.

Fig 2. The activation pattern of auditory nerve fibers, Y, for a stimulated electrode.

Fig 2

Three example cases are shown. The electrode-cochlea geometry is that shown in Fig 1, N = 1000, and A = 0.5 dB/mm.

Quantifying electrode discrimination ability

The method introduced by [44] is adapted to classify the fiber activation pattern. To this end, each example pattern, Y, is assigned to one of the M electrodes. The performance of electrode discrimination is then estimated by calculating the correct classification rates by simulating many repeated stimulations of each electrode using the model.

The classifier, used by [42], consists of a single hidden layer feedforward network (SLFN), where N is the number of units in the input layer (equal to the number of fibers), L is the number of units in the hidden layer, and M is the number of output layer units (equal to the number of electrodes). We also define two matrices that map the fiber activation samples (the input vectors) onto the indices of electrodes (the prediction vectors). The input weight matrix Win (of size L × N) maps the data from the N-dimensional input space to the L-dimensional hidden-layer feature space; the output weight matrix Wout (of size M × L) maps the L-dimensional hidden-layer activations to M-dimensional prediction vectors. The input and output weights are initially randomly chosen from uniform distributions on the interval [−1, 1].

In the training phase, we define a matrix, T, where each column represents an activation pattern, Y. We denote V, which is the same size as T, as the labels of each training sample. Each column in V has only one non-zero element set to 1 as the label of the corresponding training vector in T. We define Z as the prediction vectors

Z=WoutAtrain, (6)

where

Atrain=f(WinT). (7)

In Eq (7), WinT are the hidden layer activations and f(⋅) is the termwise (L × P1L × P1) activation function. The matrix Wout is found in the training phase that minimizes the mean square error between V and Z. In this paper, the absolute value function is used as activation function.

Similar to the training phase, we define T′ as the test vector, where each column represents a fiber activation pattern. The prediction vector in the test phase is defined as

Z=Woutf(WinT), (8)

where Wout was obtained in the training phase. The classification decision for testing vector i is made from Z′ by choosing the maximum element in a given column. This corresponds to making a decision of which particular electrode was stimulated in response to a given fiber spiking pattern.

We note that the structure of the network used in this paper was introduced by [47] as a variation of the Extreme Learning Machine algorithm. The weights from the input to the hidden layer neurons are randomly initialized and are fixed thereafter (i.e., they are not trained). Output weights Wout are obtained by minimizing the mean square error between V and Z. This is obtained using the closed form expression for such linear regression problems,

Wout=VAtrain(AtrainAtrain)-1. (9)

We chose this network structure because of its low training complexity.

Unifying information theory and machine learning in the framework for electrode discrimination

In [34], only mutual information was calculated to predict optimal number of electrodes, whereas in [42], only the correct classification rates are calculated to quantify electrode discrimination ability. To unify information theoretic methods and machine learning and investigate how mutual information and correct classification rates are related, in this paper, we model electrode discrimination using framework is adapted from [34, 42]. Correct classification rates were calculated between the stimulated and predicted electrodes and the mutual information between the inputs and outputs of the system were also estimated, as shown in Fig 3.

Fig 3. A full framework for estimating electrode discrimination ability.

Fig 3

The probability of the system input, PE, and output, PE, as well as the conditional probability, PE′|E, are estimated after the testing phase of the neural network classifier. Mutual information is then calculated using Eq (11).

We write a vector E ∈ {ej: j = 1, …, M} that defines the stimulated electrode and E{ej:j=1,,M} that defines the predicted electrode in response to an actual stimulated electrode. We then estimate the probability of the system input, PE, and output, PE, as well as the conditional probability, PE′|E. Thus, the mutual information between the system input and output is calculated as

I(E;E)=H(E)-H(E|E) (10)
=-j=1MPE(j)log2PE(j)+k=1MPE(k)×j=1MPE|E(j|k)log2(PE|E(j|k)). (11)

Parameter selection

Our framework provides a flexible structure that enables us to model key factors that are thought to limit the performance of cochlear implant model. The factors considered are listed in Table 1.

Table 1. Parameters in the modeling framework.

Parameters Description
N Number of surviving fibers.
A Attenuation in electrode current, which is used to model width of current spread.
C e The value of current at a stimulated electrode, which is used to model loudness perception.
α Electrode insertion depth, which is used to model different types of electrode array.
r Electrode-to-fiber distance, which together with α is used to model electrode placement.

The surviving auditory nerve fibers are chosen as nonuniformly located along the cochlea according to a model of the actual location distribution (for details, see [37]). The attenuation in current, A, is assumed to increase linearly with the distance from an electrode and the value can vary depending on the electrode stimulation strategy [48]. The interface between electrodes and auditory nerve fibers is modeled by electrode insertion depth, α, and electrode-to-fiber distance, r, which enables us to freely adjust the geometry between electrodes and auditory nerve fibers based on clinical studies.

Results

We quantify the performance of cochlear implant model by investigating the correct classification of electrode discrimination and estimating mutual information between the system inputs and outputs. The parameters and the training phase for the classifier follows the same principle used by [42]. In the test phase, for of each possible electrode location, the correct classification rate and mutual information are estimated from 400 tests.

Correct classification rate vs. mutual information

Fig 4(a) and 4(b) show the mutual information and correct classification rates, respectively, for α = {2π, 3π}. The rest of the parameters in the model are chosen as N = 3000, A = 0.5 dB/mm, and r = 2 mm. We emphasize that, in this paper, we aim to verify the method so the parameters are chosen to model a general case; the parameters can be chosen based on clinical studies to more accurately model cochlear implants of individual users. In Fig 4(a), for a given M, we see higher mutual information for α = 3π, which is consistent with findings in previous modeling studies [34, 35]. Accordingly, in Fig 4(b), we find higher correct classification rate for a longer electrode array (α = 3π). We observe a rapid increase of mutual information for very small number of electrode from 5 electrodes to around 11 electrodes; however, more than 11 electrodes only leads to a slight change in mutual information, which qualitatively agrees with the results of [42].

Fig 4. Model performance vs. the number of electrodes.

Fig 4

Two quantities are shown: (a) mutual information and (b) correct classification rate.

At the point where the peak of mutual information appears, we find that the correct electrode discrimination rate for both cases of α is around 90%. More than 11 electrodes leads to a sharp drop of the total correct classification rate. We note that the drop of correct classification rate is due to the distance between electrodes, which decreases with increasing numbers of electrodes in the array owing to the fixed length of the cochlea. It is worth emphasizing that improvement of correct classification rate for a longer electrode array is not only related to the spacing of adjacent electrodes. We have shown that, with the same spacing between adjacent electrodes, the longer electrode array leads to a better correct classification rate than a shorter electrode array [42].

In our model, correct classification rates quantify the probabilities that each electrode can be correctly discriminated, whereas mutual information is the measurement of the “information content” between a stimulated electrode and the response of auditory nerve fibers. Therefore, increased mutual information is usually associated with improved speech perception, and the maximum mutual information can be associated with the number of electrodes that leads to optimal hearing perception.

To further investigate the how mutual information is related to classification rate, we compare the correct classification rate at each electrode between an 11-electrode array and a 60-electrode array shown in Fig 5(a) and 5(b), respectively. For the two chosen electrode arrays, there is only a small difference in mutual information as seen in Fig 4(a). However, the electrode discrimination abilities are very different between the two arrays.

Fig 5. Electrode discrimination abilities for each electrode in an array.

Fig 5

(a) Correct classification rate for an 11-electrode array, (b) Correct classification rate for an 60-electrode array. (c) Confusion matrix for the 11-electrode array for twirling angle α = 2π. (d) Confusion matrix for the 60-electrode array for twirling angle α = 2π. Color bars indicate percentages for each stimulated-chosen electrode pair. We number the electrodes from the basal end to the apical end in sequence.

Fig 5(c) and 5(d) show electrode discrimination confusion matrices for the 11-electrode and 60-electrode arrays for twirling angle α = 2π, respectively. For each stimulated electrode, we observe interference between the neighboring 2 electrodes for the 11-electrode array and between the neighboring 4 to 6 electrodes in the 60-electrode array. This indicates that around 12 electrode locations (in which case, the distance between adjacent electrodes is approximately 2.5 mm) can be distinguished. This concords with the result that mutual information calculation. The mutual information for the 11-electrode and 60-electrode arrays are approximately the same, which indicates that the level of perception of 11-electrode and 60-electrode arrays are similar. More importantly, only a marginal increase of mutual information is observed for electrode arrays with more than 12 electrodes for the given parameters in the model, which indicates that the hearing perception does not significantly improve for more than 12 electrodes. This finding is consistent with clinical studies [49, 50].

Parameter sensitivity

We now model how the key parameters affect the performance of the model. The electrode placements (modeled by r), the surviving rate of auditory nerve fibers (N), the level of electrode current (Ce, which is modeled as the percentage of surviving auditory nerve fibers N that generate action potentials for a given stimulated electrode), and the electrode stimulation strategies (modeled by the attenuation of the current, A) are investigated in this parameter sensitivity test.

We choose α = 2π, N = 3000, r = 2 mm, and 10% of the surviving auditory nerve fibers generate action potentials for a given stimulated electrode as a general case for a parameter sensitivity test. In each test, we vary one parameter at a time. We use the percentage of surviving auditory nerve fibers, N, that generate action potentials for a given stimulated electrode to represent the changes of electrode current, Ce.

Fig 6 shows mutual information and correct classification rates for ranges of parameter choices. Together with the results in Fig 4, we investigate the sensitivity of the model to the parameters listed in Table 1. Our framework can clearly reflect impact upon performance with the changes of key parameters. When the change of a chosen parameter leads to a decrease of correct classification rate, a decrease of mutual information is also found. Empirical evidence [11] show that a closer electrode-inner wall distance has a narrower excitation region per electrode and, therefore, leads to better discrimination between two adjacent stimulations. This concords with our results in the first column in Fig 6, which shows that reducing r (reducing the distances of electrodes to nerve fibers) improves performance.

Fig 6. Model performance with ranges of parameter choices.

Fig 6

The top and bottom rows respectively show mutual information and the percentage of correct electrode discrimination vs. the number of electrodes. The columns from left to right show the sensitivity to r, N, Ce, and A, respectively.

We notice that only small changes are shown when varying electrode-to-fiber distance, r, in the model. This is because electrode discrimination is not the ideal test for showing how the electrode-to-fiber distance affects hearing performance. Previous studies have suggested that up to 120 spectral channels can be distinguished via simultaneous virtual channel stimulation [51, 52]. However, speech recognition ability is not significantly improved even if more than 22 individual spectral channels can be distinguished. To better investigate how electrode-to-fiber distance affects hearing performance with cochlear implants, speech recognition should be modelled in the future.

The third column in Fig 6 shows that better performance of the model is achieved by choosing the current level such that 0.2N fibers generate action potentials for a given stimulated electrode; i.e., the current level is between 53.8 to 55 dB relative to 1 μA for our given parameter set (shown in Fig 7) for an example 25-electrode array.

Fig 7. Current levels, Ce, for which 20% of auditory nerve fibers generates action potentials.

Fig 7

A 25-electrode array is used in this example.

The result provides an indication of the “optimal” current level for each electrode. If the current level is too low, there are insufficient fiber activations to provide information about which electrode was stimulated; if current level is too high, there is increased interference between adjacent electrodes.

The fourth column of Fig 6 shows that increasing A (reducing current spread) leads to increase of both mutual information and electrode discrimination ability compared to other parameters. This indicates that better performance of cochlear implant model may be achieved by narrowing the width of current spread such as using current focusing techniques. Our results qualitatively agree with clinical studies; e.g., a current focusing stimulation strategy improves speech recognition in cochlear implant users [24].

Discussion and conclusion

By combining the information theoretic methods and a machine learning, our model provides a flexible framework that can interrelate cochlear implant electrode discrimination with information transfer at the interface between the electrode array and auditory nerve fibers. Our model also enables us to investigate the extent to which key factors affect the performance of cochlear implant model at the individual level. Our results have shown good qualitative consistency with clinical studies.

The choice of classifier

A SLFN is chosen as a classifier for electrode discrimination rather than use a simpler classification method as proposed by [35]. The neural network has reduced the limitations of our previous work [35, 42], where the mutual information was based on statistical correlations and relied on an observable output distribution [34, 35]. Choosing a neural network as a classifier provides flexibility to the modeling framework so that it can be adapted to model other psychophysical hearing measurements.

The classifier in this model consists of one hidden layer since electrode discrimination is a simple task. Through trial and error, the optimal classification was found to occur with 1500 hidden layer units and the maximum mutual information occurred with 11 or 12 electrodes. Over-fitting occurred when more than 1500 hidden layer units were used [42]. The activation function in this classifier is the absolute value function; other nonlinear activation functions may also be used effectively [42, 47].

The training and testing data are the activation patterns generated for every possible electrode location, from which 80% of the simulation data was used for training and 20% was used for testing. Although only small variations of the activation patterns exist in the simulation data due to the limited numbers of electrodes, the performance of the classifier was not affected. This is because the neural network classifier only specifically assigns an activation pattern generated by a given electrodes to one of the possible electrode locations rather than act like general classifier to distinguish between activation patterns.

We emphasize that the architecture of the neural network needs to be adjusted according to the complexity of the psychophysical measurements that we aim to simulate.

Model parameters

Our improved framework successfully demonstrates the impacts of changes to key parameters, and so can provide a theoretic estimation of the performance of cochlear implant model. Thus, the model may be helpful for investigating how and to what extent key factors affect the performance of cochlear implant model for individual users. The key parameters in the model represent the key factors that have been found to influence the performance of cochlear implant model in clinical studies [3, 6, 14]: i) electrode placement can be modeled in the electrode-to-fiber geometry, ii) onset of deafness of cochlear implants users affects the number of surviving auditory nerve fibers and can be modeled by changing the number and distribution of fibers in the model, and iii) loudness judgements can be modeled by adjusting stimulation current, Ce.

The framework enables the investigation of individual components without affecting other components in the framework. In future work, there are a number of ways that the model may be modified to investigate other effects:

  1. To test parameter sensitivity, the distance between electrodes and auditory nerve, r, was chosen as a uniform distribution. The electrode-cochlea geometry can be updated to reflect actual electrode-to-fiber distances based on CT scans or other measurements [20, 53].

  2. To better understand the variability of speech recognition, it is important to model speech recognition tests along with electrode discrimination.

  3. The model in this study was built based on “place coding” [39] to study spectral information transfer from electrodes to auditory nerve fibers. A key component to include in the future will be temporal coding, which may improve the accuracy of the model and is necessary to model speech and pitch perception [30, 54].

  4. The auditory nerve activity patterns evoked by electrical stimulation and current spread are currently described using a simplified mathematical model. More sophisticated models of current flow in neural tissue [5558] can be used for a more accurate model predictions.

In this study, we have, for the first time, unified our information theoretic method with a machine learning technique to model electrode discrimination ability, and have overcome the limitations of our previous studies [35, 42]. The flexible framework can be updated to be a subject-specific model of hearing with cochlear implants. This may help with personalized cochlear implant design and provide insight to modeling studies of other types of neural prostheses.

Supporting information

S1 Data

(7Z)

Acknowledgments

We thank Dr M. M. A. Yajadda for the valuable discussion and comments on the paper.

Data Availability

All relevant data are within the manuscript and its Supporting information files.

Funding Statement

XG receives a McKenzie Fellowship from the University of Melbourne (https://sites.research.unimelb.edu.au/research-funding/researcher-development-schemes/mckenzie-fellowship). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Grayden DB, Clark GM, Copper H. Implant design and development. Cochlear implants: A practical guide Second Edition, Wiley, New York. 2006; p. 1–20. [Google Scholar]
  • 2.McDermott HJ, McKay CM, Vandali AE. A new portable sound processor for the University of Melbourne/Nucleus Limited multielectrode cochlear implant. Journal of the Acoustical Society of America. 1992;91:3367–3371. doi: 10.1121/1.402826 [DOI] [PubMed] [Google Scholar]
  • 3.Shannon RV, Fu QJ, Galvin JJI. The number of spectral channels required for speech recognition depends on the difficulty of the listening situation. Acta oto-laryngologica Supplementum. 2004;552:50–54. [DOI] [PubMed] [Google Scholar]
  • 4.Sucher CM, McDermott HJ. Pitch ranking of complex tones by normally hearing subjects and cochlear implant users. Hearing Research. 2007;230:80–87. doi: 10.1016/j.heares.2007.05.002 [DOI] [PubMed] [Google Scholar]
  • 5.Schatzer R, Vermeire K, Visser D, Krenmayr A, Kals M, Voormolen M, et al. Electric-acoustic pitch comparisons in singlesided- deaf cochlear implant users: frequency-place functions and rate pitch. Hearing Research. 2014;309:26–35. doi: 10.1016/j.heares.2013.11.003 [DOI] [PubMed] [Google Scholar]
  • 6.Blamey P, Arndt P, Bergeron F, Bredberg G, Brimacombe J, Facer G, et al. Factors affecting auditory performance of postlinguistically deaf adults using cochlear implants. Audiology and Neurotology. 1996;1:293–306. doi: 10.1159/000259212 [DOI] [PubMed] [Google Scholar]
  • 7.Landsberger DM, Mertens G, Punte AK, Heyning PVD. Perceptual changes in place of stimulation with long cochlear implant electrode arrays. Journal of the Acoustical Society of America, JASA Express Letters. 2014;135:75–82. doi: 10.1121/1.4862875 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.O’Connell BP, Cakir A, Hunter JB, Francis DO, Noble JH, Labadie RF, et al. Electrode location and angular insertion depth are predictors of audiologic outcomes in cochlear implantation. Otology & neurotology. 2016;37:1016. doi: 10.1097/MAO.0000000000001125 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Clark GM. The multi-channel cochlear implant: Multi-disciplinary development of electrical stimulation of the cochlea and the resulting clinical benefit. Hearing research. 2015;322:4–13. doi: 10.1016/j.heares.2014.08.002 [DOI] [PubMed] [Google Scholar]
  • 10.Henry BA, McKay CM, McDermott HJ, Clark GH. The relationship between speech perception and electrode discrimination in cochlear implantees. The Journal of the Acoustical Society of America. 2000;108:1269–1280. doi: 10.1121/1.1287711 [DOI] [PubMed] [Google Scholar]
  • 11.de Miguel ÁR, Argudo AA, Barreiro SAB, González JCF, Macías AR. Imaging evaluation of electrode placement and effect on electrode discrimination on different cochlear implant electrode arrays. European Archives of Oto-Rhino-Laryngology. 2018; p. 1–10. [DOI] [PubMed] [Google Scholar]
  • 12.Collins LM, Zwolan TA, O’Neill JC, Wakefield GH. Comparison of electrode discrimination, pitch ranking, and pitch scaling data in postlingually deafened adult cochlear implant subjects. Journal of the Acoustical Society of America. 1997;101:440–455. doi: 10.1121/1.417989 [DOI] [PubMed] [Google Scholar]
  • 13.Goehring JL, Neff DL, Baudhuin JL, Hughes ML. Pitch ranking, electrode discrimination, and physiological spread-of-excitation using Cochlear’s dual-electrode mode. The Journal of the Acoustical Society of America. 2014;136:715–727. doi: 10.1121/1.4884881 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kenway B, Tam YC, Vanat Z, Harris F, Gray R, Birchall J, et al. Pitch discrimination: an independent factor in cochlear implant performance outcomes. Otology & Neurotology. 2015;36:1472. doi: 10.1097/MAO.0000000000000845 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Finley CC, Skinner MW. Role of electrode placement as a contributor to variability in cochlear implant outcomes. Otology & neurotology. 2008;29:920. doi: 10.1097/MAO.0b013e318184f492 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hochmair I, Hochmair E, Nopp P, Waller M, Jolly C. Deep electrode insertion and sound coding in cochlear implants. Hearing research. 2015;322:14–23. doi: 10.1016/j.heares.2014.10.006 [DOI] [PubMed] [Google Scholar]
  • 17.Busby PA, Battmer RD, Pesch J. Electrophysiological spread of excitation and pitch perception for dual and single electrodes using the nucleus freedom cochlear implant. Ear Hearing—Ear and Hearing. 2008;29:853–864. doi: 10.1097/AUD.0b013e318181a878 [DOI] [PubMed] [Google Scholar]
  • 18.Black RC, Clark GM. Differential electrical excitation of the auditory nerve. Journal of Acoustic Society America. 1980;67:868–874. doi: 10.1121/1.383966 [DOI] [PubMed] [Google Scholar]
  • 19.Bruce IC, White MW, Irlicht LS, Leary SJO, Dynes S, Javel E, et al. A stochastic model of the electrically stimulated auditory nerve: Single-Pulse response. IEEE Transactions on Biomedical Engineering. 1999b;46:617–629. doi: 10.1109/10.764939 [DOI] [PubMed] [Google Scholar]
  • 20.Bai S, Encke J, Obando-Leitón M, Weiß R, Schäfer F, Eberharter J, et al. Electrical stimulation in the human cochlea: a computational study based on high-resolution micro-CT scans. Frontiers in Neuroscience. 2019;13:1312. doi: 10.3389/fnins.2019.01312 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Nogueira W, Rode T, Büchner A. Spectral contrast enhancement improves speech intelligibility in noise for cochlear implants. Journal of the Acoustical Society of America. 2016;139:728–739. doi: 10.1121/1.4939896 [DOI] [PubMed] [Google Scholar]
  • 22.Cohen LT. Practical model description of peripheral neural excitation in cochlear implant recipients: 4. Model developments at low pulse rates: General model and application to individuals. Hearing Research. 2009;248:15–30. doi: 10.1016/j.heares.2008.11.007 [DOI] [PubMed] [Google Scholar]
  • 23.Bierer JA, Faulkner KF. Identifying cochlear implant channels with poor electrode-neuron interface: Partial tripolar, single-channel thresholds and psychophysical tuning curves. Ear and Hearing. 2010;31:247–258. doi: 10.1097/AUD.0b013e3181c7daf4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Srinivasan AG, Padilla M, Shannon RV, Landsberger DM. Improving speech perception in noise with current focusing in cochlear implant users. Hearing Research. 2013;299:29–36. doi: 10.1016/j.heares.2013.02.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kalkman RK, Briaire JJ, Frijns JHM. Current focussing in cochlear implants: An analysis of neural recruitment in a computational model. Hearing Research. 2015;322:89–98. doi: 10.1016/j.heares.2014.12.004 [DOI] [PubMed] [Google Scholar]
  • 26.Goldwyn JH, Bierer SM, Bierer JA. Modeling the electrodeeneuron interface of cochlear implants: Effects of neural survival, electrode placement, and the partial tripolar configuration. Hearing Research. 2010;268:93–104. doi: 10.1016/j.heares.2010.05.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Snel-Bongers J, Briaire JJ, van der Veen EH, Kalkman RK, Frijns JH. Threshold levels of electrode stimulation in cochlear implants. Journal of the Association for Research in Otolaryngology. 2013;14:781–790. doi: 10.1007/s10162-013-0395-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Litvak LM, Spahr AJ, Emadi G. Loudness growth observed under partially tripolar stimulation: Model and data from cochlear implant listeners. Journal of the Acoustical Society of America. 2007;122:967–981. doi: 10.1121/1.2749414 [DOI] [PubMed] [Google Scholar]
  • 29.Saeedi NE, Blamey PJ, Burkitt AN, Grayden DB. Application of a pitch perception model to investigate the effect of stimulation field spread on the pitch ranking abilities of cochlear implant recipients. Hearing Research. 2014;316:129–137. doi: 10.1016/j.heares.2014.08.006 [DOI] [PubMed] [Google Scholar]
  • 30.Saeedi NE, Blamey PJ, Burkitt AN, Grayden DB. An integrated model of pitch perception incorporating place and temporal pitch codes with application to cochlear implant research. Hearing Research. 2017;344:135–147. doi: 10.1016/j.heares.2016.11.005 [DOI] [PubMed] [Google Scholar]
  • 31.Fredelake S, Hohmann V. Factors affecting predicted speech intelligibility with cochlear implants in an auditory model for electrical stimulation. Hearing Research. 2012;287:76–90. doi: 10.1016/j.heares.2012.03.005 [DOI] [PubMed] [Google Scholar]
  • 32.Jürgens T, Hohmann V, Büchner A, Nogueira W. The effects of electrical field spatial spread and some cognitive factors on speech-in-noise performance of individual cochlear implant users—A computer model study. PLoS ONE. 2018;13:e0193842. doi: 10.1371/journal.pone.0193842 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Bianchi F, Carney LH, Dau T, Santurette S. Effects of Musical Training and Hearing Loss on Fundamental Frequency Discrimination and Temporal Fine Structure Processing: Psychophysics and Modeling. Journal of the Association for Research in Otolaryngology. 2019;277:263–277. doi: 10.1007/s10162-018-00710-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.McDonnell MD, Burkitt AN, Grayden DB, Meffin H, Grant AJ. A channel model for inferring the optimal number of electrodes for future cochlear implants. IEEE Transactions on Information Theory. 2010;56:928–940. doi: 10.1109/TIT.2009.2037054 [DOI] [Google Scholar]
  • 35.Gao X, Grayden DB, McDonnell MD. Stochastic information transfer from cochlear implant electrodes to auditory nerve fibers. Physical Review E. 2014;90:022722. doi: 10.1103/PhysRevE.90.022722 [DOI] [PubMed] [Google Scholar]
  • 36.Gao X, Grayden DB, McDonnell MD. Information theoretic optimization of cochlear implant electrode usage probabilities. In: 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society; 2013. p. 5974–5977. [DOI] [PubMed]
  • 37.Gao X, Grayden DB, McDonnell MD. Using convex optimization to compute channel capacity in a channel model of cochlear implant stimulation. In: IEEE International Symposium on Information Theory; 2014. p. 2919–2923.
  • 38.Gao X, Grayden DB, McDonnell MD. Inferring the dynamic range of electrode current by using an information theoretic model of cochlear implant stimulation. In: IEEE Information Theory Workshop; 2014. p. 347–351.
  • 39.Mather G. Foundations of Perception. Psychology Press; 2006. [Google Scholar]
  • 40.Borst A, Theunissen FE. Information theory and neural coding. Nature Neuroscience. 1999;2:947–957. doi: 10.1038/14731 [DOI] [PubMed] [Google Scholar]
  • 41.Gao X, Grayden DB, McDonnell MD. Modeling electrode place discrimination in cochlear implants: Analysis of the influence of electrode array insertion depth. In: 7th International IEEE EMBS Neural Engineering Conference; 2015. p. 691–694.
  • 42.Gao X, Grayden DB, McDonnell MD. Modeling electrode place discrimination in cochlear implant stimulation. IEEE Transactions on Biomedical Engineering. 2017;64:2219–2229. doi: 10.1109/TBME.2016.2634461 [DOI] [PubMed] [Google Scholar]
  • 43.Huang GB, Zhu QY, Siew CK. Extreme learning machine: Theory and applications. Neurocomputing. 2006;70:489–501. doi: 10.1016/j.neucom.2005.12.126 [DOI] [Google Scholar]
  • 44.Huang GB, Zhou H, Ding X, Zhang R. Extreme learning machine for regression and multiclass classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics. 2012;42:513–529. doi: 10.1109/TSMCB.2011.2168604 [DOI] [PubMed] [Google Scholar]
  • 45.Moroz AS, McDonnell MD, Burkitt AN, Grayden DB, Meffin H. Information theoretic inference of the optimal number of electrodes for future cochlear implants using a spiral cochlea model. In: 34th Annual International Conference of the IEEE Engineering in Medicine and Biology Society; 2012. p. 2965–2968. [DOI] [PubMed]
  • 46.Bruce IC, White MW, Irlicht LS, Leary SJO, Clark GM. The effects of stochastic neural activity in a model predicting intensity perception with cochlear implants: Low-Rate stimulation. IEEE Transactions on Biomedical Engineering. 1999c;46:1393–1403. doi: 10.1109/10.804567 [DOI] [PubMed] [Google Scholar]
  • 47.McDonnell MD, Tissera MD, Vladusich T, van Schaik A, Tapson J. Fast, simple and accurate handwritten digit classification by training shallow neural network classifiers with the ‘extreme learning machine’ algorithm. PLoS ONE. 2015;10:e0134254. doi: 10.1371/journal.pone.0134254 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Bruce IC, Irlicht LS, White MW, Leary SJO, Dynes S, Javel E, et al. A stochastic model of the electrically stimulated auditory nerve: Pulse-Train response. IEEE Transactions on Biomedical Engineering. 1999a;46:630–637. doi: 10.1109/10.764939 [DOI] [PubMed] [Google Scholar]
  • 49.McKay CM, Remine MD, McDermott HJ. Loudness summation for pulsatile electrical stimulation of the cochlea: Effects of rate, electrode separation, level, and mode of stimulation. Journal of the Acoustical Society of America. 2001;110:1514–1524. doi: 10.1121/1.1394222 [DOI] [PubMed] [Google Scholar]
  • 50.Wilson BS, Dorman MF. Cochlear implants: A remarkable past and a brilliant future. Hearing Research. 2008;242:3–21. doi: 10.1016/j.heares.2008.06.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Donaldson GS, Kreft HA. Place-pitch discrimination of single- versus dual-electrode stimuli by cochlear implant users (L). Journal of the Acoustical Society of America. 2005;118:623–626. doi: 10.1121/1.1937362 [DOI] [PubMed] [Google Scholar]
  • 52.Landsberger DM, Padilla M, Srinivasan AG. Reducing current spread using current focusing in cochlear implant users. Hearing Research. 2012;284:16–24. doi: 10.1016/j.heares.2011.12.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Long CJ, Holden TA, McClelland GH, Parkinson WS, Shelton C, Kelsall DC, et al. Examining the electro-neural interface of cochlear implant users using psychophysics, CT scans, and speech understanding. Journal of the Association for Research in Otolaryngology. 2014;15:293–304. doi: 10.1007/s10162-013-0437-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Boulet J, White M, Bruce IC. Temporal considerations for stimulating spiral ganglion neurons with cochlear implants. Journal of the Association for Research in Otolaryngology. 2016;17:1–17. doi: 10.1007/s10162-015-0545-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Meffin H, Tahayori B, Grayden DB, Burkitt AN. Modeling extracellular electrical stimulation I: Derivation and interpretation of neurite equations. Journal of Neural Engineering. 2012;9:065005. doi: 10.1088/1741-2560/9/6/065005 [DOI] [PubMed] [Google Scholar]
  • 56.Tahayori B, Meffin H, Dokos S, Burkitt AN, Grayden DB. Modeling extracellular electrical stimulation II: Computational validation and numerical results. Journal of Neural Engineering. 2012;9:065006. doi: 10.1088/1741-2560/9/6/065006 [DOI] [PubMed] [Google Scholar]
  • 57.Meffin H, Tahayori B, Sergeev E, Mareels IMY, Grayden DB, Burkitt AN. Modeling extracellular electrical stimulation III: Derivation and interpretation of neural tissue equations. Journal of Neural Engineering. 2014;11:065004. doi: 10.1088/1741-2560/11/6/065004 [DOI] [PubMed] [Google Scholar]
  • 58.Tahayori B, Meffin H, Sergeev E, Mareels IMY, Burkitt AN, Grayden DB. Modeling extracellular electrical stimulation IV: Effect of the cellular composition of neural tissue on its spatio-temporal filtering properties. Journal of Neural Engineering. 2014;11:065005. doi: 10.1088/1741-2560/11/6/065005 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Andreas Buechner

25 Mar 2021

PONE-D-21-01888

Unifying information theory and machine learning in a model of electrode discrimination in cochlear implants

PLOS ONE

Dear Dr. Gao,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by May 09 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Andreas Buechner, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found athttps://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Additional Editor Comments (if provided):

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: I Don't Know

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors present a method to model the effect of several of the most important parameters on CI electrode discrimination. In this method they combine an information theory approach that computes the mutual information between input and output with a classifier that predicts the stimulated electrode from a given nerve fiber activation pattern. The method is described clearly and the manuscript is written in an intelligible fashion.

Even though both, the classifier and the information theory approach are described in their own sections, the manuscript would benefit from a more detailed description of the combination of the two methods.

In the results section, the authors could elaborate more on the discrepancies between the results. Why are there substantial differeces between the mutual information and the correct electrode classification rate for the same set of parameters?

Minor remarks:

line 33: the word order seems wrong

line 169: missing word or incorrect grammar

line 192: reference to the wrong figure (should be 4(a))

line 300: "model" instead of "mode" and "have" instead of "has"

Fig. 7: The x axis label does not match the figure description.

Reviewer #2: The manuscript „Unifying information theory and machine learning in a model of electrode discrimination in cochlear implants“ presents an original research article. The model unifying a phenomenological model of auditory nerve activity with a machine learning approach and an information theory approach is very original.

1) Lack of validation of the model. I would suggest that the authors make an effort comparing in more detail the predictions of the model with more detailed data in the literature. For instance one could compare published data on electrode discrimination with different electrode distances and compare it to the predictions of the model. But this is just an example, the authors should make an effort in looking for the exact data that can validate each of the simulations (Figures) of the model. This is a journal paper that extends previous works from the authors, so now it is an opportunity to demonstrate that this model/framework can at least reproduce qualitatively electrode discrimination tasks. For example, if you refer to virtual channel discrimination you should provide data about virtual channel discrimination and demonstrate that your model obtains the same scores as the data.

2) Some mathematical formulation isnot clear. When describing the neural network you say that you train the Wout coefficients, but what happens to the Win coefficients? Are these not trained? If not, why not?

3) Across the manuscript you use the term “performance of cochlear implants”. I would suggest to substitute these terms by “performance of cochlear implant users”.

4) You did a nice sensitivity analysis of your model for some parameters. However, what is the effect of choosing different values for parameters of your network? Size of the hidden layer? Type of activation functions, etc?

Detailed comments

Abstract

“limit performance of cochlear implants” --> “Limit performance of cochlear implant users”

“It provide insights it provide insights” --> “it provides insights”

Introduction

Third line: What do you mean here with extracellular electrodes? I agree the electrodes are extracellular but this definition seems a bit out of cochlear implant context.

Last line of the first paragraph: Here you give three references at the very end [3,6,7]. Please provide detailed reference to each of your statements i); ii) and iii). Please check if results in the literature are statistically significant for each of the factors.

First line second paragraph: “performance of cochlear implant” � “performance of cochlear implant users”. Please correct this terminology across the whole manuscript where it is used multiple times.

Second line of the second paragraph: Here you state “… electrode dissemination has been used as a primary psychophysical measurement to assess the performance cochlear implants” I disagree with this statement. As far as I know there are no studies showing a significant correlation between electrode discrimination and speech understanding performance. Otherwise please provide these references. I agree though that many researchers investigated electrode discrimination performance as you mention in the following lines, and that this measure may impact speech understanding performance. But it is not a primary measurement, for sure not used in clinical environments.

Page 2 Line 4: Rewrite as “A focus of recent studies is to investigate how …” Actually the two parts of this sentence seem to be a bit disconnected : First part is about “insertion depth” second part is about “virtual channels”. Divide the sentence into two and expand what you are trying to convey here.

Page 2 second paragraph Line 1: “Mathematical and computational model” � “Mathematical and computational models”

Page 2 second paragraph Line 2: Here you cite [15] and [16] but there are many models out there. I would suggest to extend to recent models by Kalkman et al. (2015), Nogueira et al. (2016) and Bai et al. (2019).

Page 2 second paragraph Line 6: What is a Psychophysical cochlear implant model? Please define.

Page 2 second paragraph Line 12: “[25] demonstrated …” Here you can refer to more recent models by Jürgens et al 2018 PlosOne.

Page 3 Caption of Figure 1: Specify the units after 5pi

Page 3 Equation 3: How accurate is this simple model of auditory nerve spiking? For example how accurate can you reproduce auditory nerve activity in comparison to data or to other models such as Joshi or Litvak.

Page 4 second last line of the last paragraph “predication” should be “prediction”?

Page 5 Line 1: “…we defined a vector, T, where each …” instead of a vector, isn’t T a matrix?

Page 5 second line after Equation 7. I would suggest to remove “neuron” and just use the term “activation function” to make sure that the rather does not conus the activation function with Equation 3.

Page 5 first line after equation 8: Here you state that Wout is obtained by training. What happens with Win?

Which activation function did you use in the output layer of your network. I think it is not specified.

Results

Page 6 first line: Here again use “cochlear implant users”

Page 6 last line first paragraph: “400 tests” How many for each electrode?

Page 6: “we find higher correct classification rate for a longer electrode array (alpha = 3pi). Is this improvement caused by the electrode array being longer or just by the fact that the electrode spacing is longer? In other words, you should show that a short array with increased electrode spacing results in worse performance.

Page 7: Caption of Figure 5: Define electrode index.

Page 8: First line second paragraph: “We notice that only small changes are shown when varying electrode-to fiber.. .. this is because electrode discrimination is not as sensitive as speech recognition to changes in r” This statement seems a bit strange. You should provide references and give more detail why this happens. If r is increased electric spread will increase? Why are you not observing a worsening in electrode discrimination? Maybe you can check data comparing electrode discrimination with modiolar vs lateral wall electrodes? Or electrode discrimination with positioner vs no-positioner. Your model needs more data to be validated in general, at least qualitatively.

Page 8 second paragraph last line: “previous studies have suggested 120 spectral channels..” I don’t understand the connection between this sentence and the previous ones. In you model you did not model virtual channels. Please model virtual channels or be more specific with the data that you use to validate your model.

Page 8: Last line of the second last paragraph: Over-training or Over-fitting?

Page 9: Last paragraph: I should be “to model electrode … “ In this section again use “performance of cochlear implant users“

I repeat: In general in the results section I miss more comparison to studies in humans comparing your model predictions with real data, at least qualitatively.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Sep 20;16(9):e0257568. doi: 10.1371/journal.pone.0257568.r002

Author response to Decision Letter 0


13 Jun 2021

Dear Editors,

We thank the referee for their overall positive feedback to our manuscript (PONE-D-21-01888). The discussion has been very useful as it has identified ways in which we needed to improve the descriptions in the manuscript. We describe how we have changed the paper to address the reviewers’ comments in the Response to Reviewers.

We believe the paper now has addressed the reviewers’ comments and has improved to better fit the journal requirements.

Attachment

Submitted filename: RebuttalLetter_PONE-D-21-01888_XiaoGAO.pdf

Decision Letter 1

Andreas Buechner

7 Sep 2021

Unifying information theory and machine learning in a model of electrode discrimination in cochlear implants

PONE-D-21-01888R1

Dear Dr. Gao,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Andreas Buechner, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have appropriately responded to all comments on the previous draft of their paper and have adressed all the issues raised.

Reviewer #2: the manuscript has much improved during the first review and should now be accepted - no further comments or questions from my side.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Waldo Nogueira

Acceptance letter

Andreas Buechner

10 Sep 2021

PONE-D-21-01888R1

Unifying information theory and machine learning in a model of electrode discrimination in cochlear implants

Dear Dr. Gao:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Andreas Buechner

Academic Editor

PLOS ONE


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES