Skip to main content
Entropy logoLink to Entropy
. 2023 Jun 29;25(7):994. doi: 10.3390/e25070994

Performance Baseline of Phase Transfer Entropy Methods for Detecting Animal Brain Area Interactions

Jun-Yao Zhu 1,2, Meng-Meng Li 1,2, Zhi-Heng Zhang 1,2, Gang Liu 1,2,*, Hong Wan 1,2,*
Editor: Alessandro Giuliani
PMCID: PMC10378602  PMID: 37509941

Abstract

Objective: Phase transfer entropy (TEθ) methods perform well in animal sensory–spatial associative learning. However, their advantages and disadvantages remain unclear, constraining their usage. Method: This paper proposes the performance baseline of the TEθ methods. Specifically, four TEθ methods are applied to the simulated signals generated by a neural mass model and the actual neural data from ferrets with known interaction properties to investigate the accuracy, stability, and computational complexity of the TEθ methods in identifying the directional coupling. Then, the most suitable method is selected based on the performance baseline and used on the local field potential recorded from pigeons to detect the interaction between the hippocampus (Hp) and nidopallium caudolaterale (NCL) in visual–spatial associative learning. Results: (1) This paper obtains a performance baseline table that contains the most suitable method for different scenarios. (2) The TEθ method identifies an information flow preferentially from Hp to NCL of pigeons at the θ band (4–12 Hz) in visual–spatial associative learning. Significance: These outcomes provide a reference for the TEθ methods in detecting the interactions between brain areas.

Keywords: phase transfer entropy, interaction, visual–spatial associative learning, hippocampus, nidopallium caudolaterale, pigeon

1. Introduction

Sensory–spatial associative learning is defined as the ability to integrate sensory cues (visual, olfactory, and auditory cues) and spatial locations together in memory [1,2,3]. Accumulating evidence shows that a distributed brain network supports animals in realizing this complex cognitive process [1,4,5], especially for the interaction between the hippocampus (Hp) and prefrontal cortex (PFC) [6,7], playing a key role in sensory–spatial associative learning. Avian Hp is homologous with mammalian Hp and the pigeons with Hp damage are impaired in forming an association between visual cues and spatial location [8,9], which indicates that the formation of associative memory critically depends on the integrity of Hp. As an analogue of mammalian PFC, pigeon nidopallium caudolaterale (NCL) integrates mnemonic information and task rules in order to direct behavior appropriately [10,11]. However, it is not yet understood whether there is information transfer between Hp and NCL of pigeons in sensory–spatial associative learning. Revealing the coupling between Hp and NCL not only helps to understand the mechanisms of brain areas, but also promotes the development of brain-like intelligence [12].

The interaction between Hp and NCL can be identified by phase transfer entropy (TEθ), which relates to calculating transfer entropy (TE) on the neural signals recorded from these two areas. TEθ retains the advantages of TE because it is model-free and does not require any prior knowledge on the input of the system or the target connectivity networks [13,14,15]. Meanwhile, owing to the phase relationships linked to neuronal synchronization and information flow within the interconnected brain regions [16,17,18], compared with TE on the time series that combined amplitude and phase, TEθ is more suitable for investigating coupling between brain regions.

In TEθ, the most critical assignment is to estimate the TEθ metric value. According to its standard definition, TEθ is formulated as Shannon entropy, which quantifies the difference between p(θi+1y|θix,θiy) and p(θi+1y|θix) [15,19]. Hlaváčková-Schindler focused on diverse approaches to Shannon entropy [20]. Among them, a method is proposed by Kraskov, Stögbauer, and Grassberger (KSG) [21], in which the TE metric value can be estimated by counting the samples in specific strips, rather than finding nearest neighbors in each low-dimensional space [22], so that the deviations caused by the different spatial scales in low-dimensional spaces are significantly reduced. Another widely used approach to estimate TEθ is the binning method [19]. By partitioning the state spaces of the phase time series into several bins and counting the number of points in each bin, various probability density functions (PDFs) and Shannon entropy can be calculated. Besides Shannon entropy, other techniques have also been proposed to estimate the TEθ metric value; for instance, the symbolic technique based on permutation entropy and the kernel method with the concept of Renyi’s α entropy. Staniek suggested to estimate TEθ by adopting a technique of symbolization [23]. Instead of calculating PDFs directly in traditional TEθ, a symbolic processing is carried out on the state space of phase time series [24]. With the relative frequencies of the symbols, the PDFs of the sequence of permutation index are estimated and the symbolic TEθ metric value is obtained. Panche proposed a kernel TEθ estimator that expresses TEθ as a linear combination of Renyi’s entropy [25], which is approximated by a function defined on positive definite and infinitely divisible kernel matrices. This method sidesteps the requirement of obtaining the PDFs from the phase time series [26]. However, these methods mentioned above have their own assumptions, advantages, and limitations [25,26,27], and the criteria for applying these methods are different.

Owing to performance baseline differences in TEθ methods, these methods may yield different results even for the same datasets, let alone the neural signals recorded during associative learning, where the short duration of a trial results in the small sample size and the interaction lag is unknown. Therefore, benchmarking TEθ methods on various types of data to determine their performance baseline is an urgent problem to be solved. Though the ability of the TEθ with KSG (TEKSGθ), symbolic (TEsymθ), and Renyi’s entropy (TEkαθ) estimators to detect the directional interaction has been discussed in [25], there is no uniform standard to evaluate the performance of these methods, and some key factors that may have a great effect on the estimators are not analyzed, especially for the sample size and interaction lag. The properties of the TEθ methods for suitable application are not yet well understood.

Therefore, this paper first explores the performance baseline of TEKSGθ, TEsymθ, TEkαθ, and the binning method (TEbinθ) on the simulated signals generated by a neural mass model and the actual neural data from ferrets with known interaction structures. To satisfy the validity for a non-stationary time series, we adopt the ensemble method in which multiple independent repetition trials are taken as a set to estimate TEθ, instead of estimating from an individual trial [28,29]. On the simulated data, the accuracy of the TEθ methods in identifying the directional interaction and the ability of these methods to estimate the interaction lag is analyzed. The effect of sample size and interaction lag on these methods as well as the robustness of the TEθ estimators to noise and linear mixing is also emphasized. Finally, the computational complexity of the four methods is compared. On the actual neural data, the TEθ methods are applied to public datasets to assess the performance of TEθ estimators on the actual neural signals.

Then, based on the properties, the most suitable TEθ method is applied to the local field potential signals (LFPs) recorded from pigeons to detect the interaction between Hp and NCL in visual–spatial associative learning. The contributions of this paper are as follows:

  • (1)

    By benchmarking the TEθ methods on the simulated signals and the actual neural data, we explore the performance baseline of the TEθ methods and provide a reference for the use of the TEθ methods on the actual neural signals.

  • (2)

    The most suitable TEθ method is applied to neural signals recorded from pigeons and identifies the interaction between Hp and NCL of pigeons in visual–spatial associative learning.

The remainder of this paper is organized as follows: Section 2 reviews the theoretical foundations of the TEθ methods, the neural mass model, evaluation criteria, and the actual neural data recording as well as analysis; Section 3 presents our results on the simulated signal pairs, actual neural data from ferrets, and the LFPs from pigeons; Section 4 shows the discussion; and, finally, Section 5 contains our conclusion.

2. Materials and Methods

2.1. Phase Transfer Entropy (TEθ)

For two random variables X and Y, the TE from X to Y can be defined as [30]:

TE(XY)=yt,yt1dy,xtudxp(yt,yt1dy,xtudx)logp(yt|yt1dy,xtudx)p(yt|yt1dy) (1)

where xtudx, yt1dyD×d are the time-embedded series of X and Y, respectively, and D=T(τ(d1))u, in which d,τ  N are the embedding dimension and embedding delay, respectively. T indicates the length of X and u  N represents the interaction lag between X and Y. p(·) is the PDF. Details of TE are described in [13,14,31].

In TEθ, the time series xt and yt are replaced by instantaneous phase time series θtx and θty, which are extracted from xt and yt with Hilbert or Morlet wavelet transform at frequency f. Thus, TEθ is

TEθ(XY, f)=θty,θt1y,dy,θtux,dxp(θty,θt1y,dy,θtux,dx)logp(θty|θt1y,dy,θtux,dx)p(θty|θt1y,dy) (2)

where θtux,dx and θt1y,dy are the time-embedded versions of θtx and θty.

Then, TEθ is extended to the ensemble method, in which the independent repetition trials of an experimental condition are taken as an ensemble of realizations, and various PDFs are estimated from the ensemble members. Therefore, TEθ can be written as follows [28]:

TEθ(XY,f)=θty(r),θt1y,dy(r),θtux,dx(r)p(θty(r),θt1y,dy(r),θtux,dx(r))logp(θty(r)|θt1y,dy(r),θtux,dx(r))p(θty(r)|θt1y,dy(r)) (3)

where r is the number of independent repetition trials.

2.2. TEθ Methods

2.2.1. Binning Estimator (TEbinθ)

Formula (3) can be written as the form of Shannon entropy:

TEθ(XY, f)=H(θtux,dx(r),θt1y,dy(r))H(θty(r),θtux,dx(r),θt1y,dy(r))+H(θty(r),θt1y,dy(r))H(θt1y,dy(r))  (4)
H(θtux,dx(r),θt1y,dy(r))=θtux,dx(r),θt1y,dy(r)p(θtux,dx(r),θt1y,dy(r))log(p(θtux,dx(r),θt1y,dy(r)))  (5)
H(θty(r),θtux,dx(r),θt1y,dy(r))=θty(r),θtux,dx(r),θt1y,dy(r)p(θty(r),θtux,dx(r),θt1y,dy(r))log(p(θty(r),θtux,dx(r),θt1y,dy(r)))  (6)
H(θty(r),θt1y,dy(r))=θty(r),θt1y,dy(r)p(θty(r),θt1y,dy(r))log(θty(r),θt1y,dy(r)) (7)
H(θt1y,dy(r))=θt1y,dy(r)p(θt1y,dy(r))log(θt1y,dy(r)) (8)

where H(·) is Shannon entropy.

The histogram-based method is used to estimate various PDFs in TEbinθ. First, the state spaces of all trials ((θtux,dx(r),θt1y,dy(r)), (θty(r),θtux,dx(r),θt1y,dy(r)), (θty(r),θt1y,dy(r)), and (θt1y,dy(r))) can be divided into several bins. Then, the number of points in each bin is counted. Finally, the probability value for each bin is computed by dividing the number of points in that bin by the total number of data points.

The bin width is the only parameter should be determined in the binning method. According to Scott’s choice [32], bin width can be defined as follows: h=3.5σ/N13, where N is the number of samples for θty and σ is the standard deviation for a directional variable as defined by Fisher. In TEbinθ, the embedding dimension (dx, dy) is 1.

2.2.2. KSG Method (TEKSGθ)

In the KSG estimator, the TEθ metric value can be estimated by counting the number of samples in the strip of low-dimensional spaces. The strip is defined by the kth nearest neighbors in high-dimensional space projecting to the low-dimensional spaces. TEKSGθ can be written as follows:

TEKSGθ(XY, f)=ψ(k)+ψ(nθt1y,dy(r)+1)ψ(nθty(r)θt1y,dy(r)+1)ψ(nθt1y,dy(r)θtux,dx(r)+1) (9)

where ψ is the Digamma function, ψ(x)=Γ(x)1dΓ(x)dx. · means average. nθt1y,dy(r), nθty(r)θt1y,dy(r), and nθt1y,dy(r)θtux,dx(r) are the number of samples falling into the strip of low-dimensional space θt1y,dy(r), θty(r)θt1y,dy(r), and θt1y,dy(r)θtux,dx(r), respectively. k is generally 4 [21].

In the KSG method, the Rawdgitz criterion is used to calculate the embedding dimension and embedding delay [33].

2.2.3. Symbolic Estimator (TEsymθ)

The symbolic version of TEθ is based on permutation entropy [34]. A coarse-graining (partitioning) of state space method is used, in which the phase spaces of θtx and θty are arranged in an ascending order and uniquely mapped onto one of the possible permutations. Then, the symbols are defined and, with the relative frequency of symbols, the joint and conditional probabilities of the sequence of permutation indices can be calculated. The symbolic TEθ method is applied to the ensemble repetitions:

TEsymθ(XY,f)=θ^iy(r),θ^i1y(r),θ^iux(r)p(θ^iy(r),θ^i1y(r),θ^iux(r))logp(θ^iy(r)|θ^i1y(r),θ^iux(r))p(θ^iy(r)|θ^i1y(r)) (10)

where θ^ix and θ^iy are the results of θtx and θtx symbolization.

In the process of phase space reconstruction, the C–C method is applied to calculate the embedding dimension and embedding delay [35].

2.2.4. Renyi’s α–Entropy Estimator (TEkαθ)

For two-phase time series θtx and θty, TEθ from X to Y can be expressed as the kernel-based formulation of Renyi’s α–order entropy:

TEκαθ(XY,f)=Hα(Κθt1y,dy(r),Κθtux,dx(r))Hα(Κθty(r),Κθt1y,dy(r),Κθtux,dx(r))Hα(Κθty(r),Κθt1y,dy(r))+Hα(Κθt1y,dy(r)) (11)

where Hα(A,B)=Hα(A°Btr(A°B))=11αlog(tr((A°Btr(A°B))α)) is Renyi’s α-order entropy. tr(·) stands for matrix trace. Κθty(r), Κθt1y,dy(r), and Κθtux,dx(r) are the Gram matrixes for θty(r), θt1y,dy(r), and  θtux,dx(r), respectively, and hold elements kij=κ(ai,aj), κ(ai,aj)=exp(aiaj22σ2). For Κθty(r), ai, aj are the values of θy at times i and j. In the case of matrix Κθt1y,dy(r), the vectors ai, ajd contain the space state reconstruction θy,dy of θy at times i and j, likewise for Κθtux,dx(r). · indicates Euclidean distance and σ is defined as the kernel bandwidth, which is calculated by the median of aiaj.

The Rawdgitz criterion is used to calculate the embedding dimension and embedding delay, and α = 3 is chosen for Renyi’s α entropy by experience.

For all estimators, TEθ is a biased estimation, and it may be non-zero even in the absence of an interaction between X and Y. To reduce the bias, we define differential TEθ (dTEθ) for all TEθ types:

dTEθ(XY,f)=TEθ(XY,f)TEθ(YX,f) (12)

dTEθ(XY,f) > 0 indicates information flows preferentially from X to Y and dTEθ(XY,f) < 0 indicates the reverse direction. In the case of no preferential direction of interaction, dTEθ(XY,f) = 0.

To test the statistical significance of the dTEθ value, the source variable X is shuffled to generate the surrogate data, and the dTEθ values for 200 sets of surrogate data are calculated to construct the null hypothesis distribution. The null hypothesis of the raw data can be rejected or retained by comparing the dTEθ value of the raw data to the null hypothesis distribution at the 1% level of significance.

The interaction lag δ between X and Y is a significant parameter. Here, the scanning method is used to estimate δ. When TEθ(XY,f,u) is maximal [30], u is equal to δ (Equation (13)).

δ=argmaxu(TEθ(XY,f,u)) (13)

The Matlab codes of the TEθ methods are provided in the Supplementary Materials.

2.3. Neural Mass Model (NMM)

Simulated data play an essential role in evaluating the competing methods against a “ground truth”. An NMM is used to generate the simulated signal pairs with known interaction properties [36]. The NMM simulates the connectivity between multiple regions of interest (ROIs) through long-range excitatory connections. In the NMM, the average spike density of pyramidal neurons of the presynaptic area (ZX) affects the postsynaptic area by a weight factor ω and a time delay δ (Equation (14)):

vY(t)=ωXYZX(tδ)+nY(t) (14)

where the superscripts X and Y are represented by the source and target region, respectively. n(t) is a Gaussian white noise.

Signal pairs generated by the NMM are nonlinear and have significant β (about 20–30 Hz) activity (Figure 1). By changing the interaction delay and weight factor ωXY, ωYX, the simulated signal pairs with directional interaction can be obtained. In the following analysis, the signal pairs are first filtered in a 15 to 35 Hz pass band using a finite impulse response filter (FIR) with order 15, and then Hilbert transform is used to extract the phase time series from the filtered signals.

Figure 1.

Figure 1

The simulated signal pairs generated by a neural mass model (NMM). (a) We use the NMM to produce 1 s time series of Signal 1 and Signal 2 and the instantaneous phases from 15 to 30 Hz are extracted by Hilbert transform; (b) power averaged over 100 simulations for each weight factor is plotted as a function of frequency.

To test the accuracy of the TEθ methods in detecting the directional interaction for different levels of coupling strength, ωYX is set to 0 and ωXY from 0 to 70 to simulate unidirectional coupling. The interaction lag δ is set to 20 ms. Then, the stability of these methods in the presence of noise and linear mixing is investigated. We use the method in [19] to add noise and linear mixing. The signal-to-noise ratio (SNR) is set to 30, 20, 10, 5, 0, and −5 dB, respectively, and the mixing strength varies from 0.1 to 0.5. For each case, 1000 simulated signal pairs are generated with a duration of 2 s. To investigate the impact of sample size on the performance of the competing estimators, 25, 50, 75, 100, 125, 150, 175, 200, 250, and 300 signal pairs are selected to simulate varied sample size. Finally, the interaction lag is changed from 10 to 50 ms in five steps to assess the effect of interaction lag on the TEθ methods.

2.4. Evaluation Criteria

The dTEθ value is applied to measure the coupling strength, and the false positive rate (FPR) and sensitivity are calculated to assess the accuracy of the TEθ estimators in identifying the direction. By comparing the dTEθ value of the raw data to the null hypothesis distribution, which is constructed by 200 sets of surrogate data at the significance p (0.01) level, FPR for ωXY = 0 is obtained. The sensitivity (proportion of true positive for ωXY0) is calculated as a function of ωXY and the coupling detection threshold (CDT) for a sensitivity of 0.8 is estimated by linear interpolation. The CDT value represents the smallest coupling value for which the estimators detect 80% of the directional interaction. Therefore, a low CDT value indicates that a significant interaction is detected even for weak coupling, while a high CDT value means that the coupling could be detected only for solid coupling. δ accuracy is used to indicate the accuracy of these methods for interaction lag.

2.5. The Actual Neural Signals from Ferrets

The LFPs in the PFC and primary visual cortex (V1) of a female ferret in an awake state are recorded by a single metal electrode that is accurately inserted into putative layer IV. Details of the experimental process are described in [37] and the raw data are available from http://dx.doi.org/10.5061/dryad.kk40s (accessed on 1 November 2022).

Several sessions are carried out and the LFPs from each session are separated into epochs with a length of 4.8 s. Before detecting the interactions between PFC and V1, data processing is performed to remove the epochs containing motion artifact, and the epochs with large power in the delta (0.5 to 4 Hz) band (higher than 30% of the total power in 0.5 to 50 Hz) are also rejected. Then, the LFPs are filtered in 0–20, 20–40, 40–60, 60–80, and 80–100 Hz using a two-way, zero-phase-lag FIR filter with the order defined as 3 r, where r is the ratio of the sampling rate to the low-frequency cutoff of the filter, rounded down [38]. The phase time series of each frequency band can be obtained by Hilbert transform. Finally, the TEθ methods are applied to the phase time series for identifying the interaction between PFC and V1.

To explore the performance baseline of these methods on the actual neural signals with a small and large sample size, 2 and 30 epochs are drawn randomly from each session as a set to estimate TEθ, respectively, and search the interaction lag in the range of 1 to 20 ms. This procedure is repeated 10 times and results in 10 dTEθ values for each session. We first test the dTEθ values for their significance (p < 0.05) within individual sessions against 200 surrogate datasets at each frequency band, and then use a binomial test to establish the statistical significance over recordings.

2.6. The LFP Recorded from Pigeons and Analysis

The LFPs are recorded from Hp and NCL of six pigeons while they perform a visual–spatial associative learning task in a Y maze (Figure 2a). Figure 2c shows the schematic of the spatial associative learning task. A detailed description of subjects, surgical implantation of electrodes, data acquisition, and behavioral tasks are provided in the Supplementary Materials.

Figure 2.

Figure 2

Behavioral task and experiment procedures. (a) Schematic diagram of the Y maze. Pigeons learn to start from the home area, then decide on the left arm or right arm according to the light color. A red light corresponds to the right arm and green light to left arm. S1, S2, and S3 are the infrared sensors. (b) A pigeon learning the associative task in the maze. (c) Diagram of the visual–spatial learning task. The red dashed line is the epochs to be analyzed. Because the animals enter into a critical decision-making place when the door is opened, 0.5 s after the end of the delay is considered to be a decision-making period. The inter trial interval (ITI) is 2 s, the visual cue period is 3 s, the delay period is 0.5 s, the decision-making period is 0.5 s, and the tuning period is 1 s. (d) Examples of simultaneous local field potential signals (LFPs) recorded from the hippocampus (Hp) and nidopallium caudolaterale (NCL).

Data processing is performed before analysis. First, combined with video, only the trials in which pigeons are ready to enter the next trial in the inter-trial interval (ITI) are reserved. The trials containing strong motion artifact are also removed. Then, the adaptive common average reference is used for all channels of the remaining trials to remove the spatially correlated noise [39]. θ (4–12 Hz), β (12–30 Hz), slow-γ (30–45 Hz), and fast-γ (55–80 Hz) bands of LFPs are extracted by the zero-phase-lag FIR filter with the order defined as 3 r, where r is the ratio of the sampling rate (2000 Hz) to the low-frequency cutoff of the filter, rounded down. The phase time series for each frequency band is extracted by Hilbert transform. Finally, the phase time series from ITI (the last 2 s) to the turning period (a total of 7 s) is divided into 14 non-overlapping bins, and the coupling between Hp and NCL is calculated for each bin.

For each pigeon, only one session (about 30 trials) with a correct rate of 75–80% is applied to detect the interactions in brain areas (Table 1). All trials in a session are assumed to have the equivalent brain activity. Thus, LFPs (recorded with two 16-channel microelectrode arrays from Hp and NCL) for all of the correct trials in a session are pooled together. For each frequency band and each bin, 200 signal pairs are drawn randomly to estimate TEθ and the interaction lag is searched in the range of 1 to 30 ms. This process is repeated 30 times. The significance of dTEθ value is tested against those of 200 surrogate datasets (p < 0.05) for each estimation, and then the binomial test is used to establish the statistical significance over all estimations.

Table 1.

Detailed information on pigeons, trials, correct rate, and available channels.

Pigeon Number Total Trials Correct Trial Correct Rate Available Channels (Hp/NCL)
p21 33 26 78.8% 16/16
p22 35 28 80% 16/16
p23 42 32 76.2% 16/16
p24 37 29 78.4% 15/16
p25 46 35 76.1% 15/16
p26 30 23 76.7% 15/16

3. Results

3.1. The Performance Baseline Based on the Simulated Data

3.1.1. Accuracy of the TEθ Methods for the Directional Interaction Detection

To investigate the accuracy of the competing methods for the directional interaction recognition, the simulated signal pairs with ωXY = 0, 10, 20, 30, 40, 50, 60, and 70; ωYX = 0; δ = 20 ms; and trial length = 2 s are used. For each ωXY value, 1000 simulated signal pairs are pooled together, and 100 pairs are drawn randomly as a set to estimate TEθ and search for the interaction lag δ in the range of 10 to 70 ms. The dTEθ distribution, FPR, sensitivity, and δ accuracy of the TEθ methods are computed by 200 sets of signal pairs. This procedure is repeated 20 times to result in the mean and variance of the FPR, CDT values, and δ accuracy.

As shown in Figure 3, the dTEθ values are clustered around 0 for ωXY = 0 and increase monotonically with ωXY from 0 to 70 for all methods. The dTEθ values for TEKSGθ in the range of 0 to 0.05 are lower than those of other methods. The FPR values of the TEθ methods have no significant difference and are at a low level (below 0.01), but there is a dramatic difference in the sensitivity of these estimators. The CDT values of TEbinθ and TEsymθ are around 17.5, which are significantly lower than those of other methods. The second is TEkαθ, followed by TEKSGθ. The CDT values of TEKSGθ are higher than 50, indicating that the validity of TEKSGθ in the directional coupling detection is lower than other methods. Finally, the δ accuracy of these methods is also analyzed. For TEbinθ, TEKSGθ, and TEkαθ, the δ accuracy is increased in ωXY. TEbinθ and TEkαθ have the same performance in detecting the interaction lag, followed by TEKSGθ. The δ values estimated by the TEsymθ method are clustered around 30 ms rather than 20 ms, as we set. Thus, TEsymθ cannot estimate the interaction lag correctly.

Figure 3.

Figure 3

Accuracy of the phase transfer entropy (TEθ) methods in identifying and quantifying the directional interaction. (a) The differential TEθ (dTEθ) values of these methods increase monotonically with weight factor ωXY from 0 to 70. (b) False positive rate (FPR) values of the competing methods have no significant difference and are lower than 0.01. (c) The sensitivity of the estimators is plotted against ωXY, and that of TEbinθ is stronger than that of other methods. (d) The coupling detection threshold (CDT) values of TEKSGθ are significantly larger than those of other methods. (e) The interaction lag (δ) accuracy for these estimators is improved with an increase in ωXY, and TEsymθ cannot identify the interaction lag correctly. All results (except for the interaction lag accuracy) in Figure 3, Figure 4, Figure 5, Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10 are presented as mean ± s.d. (shading or error bar) for multiple estimates.

To sum up, TEbinθ, TEsymθ, and TEkαθ can accurately detect the directional interaction with a low FPR and high sensitivity, but TEsymθ cannot identify the interaction lag correctly. Though TEKSGθ behaves well for ωXY = 0, it has poor reliability in the directional identification for ωXY > 0.

3.1.2. Stability of the TEθ Methods to the Directional Interaction Estimation

  • (1)

    Robustness of the TEθ methods to the directional interaction identification in the presence of noise and linear mixing.

Noise is inevitably introduced into the process of neural signals’ acquisition and, owing to volume conduction between adjacent brain regions, there may be linear mixing in the neural signals. To investigate whether the competing TEθ methods can reliably detect the strength and the direction of the interaction in the presence of noise and linear mixing, the simulated signal pairs (ωXY = 0, 10, 20, 30, 40, 50, 60, and 70; ωYX = 0; δ = 20 ms; trial length = 2 s) with varied SNR and linear mixing values (m) are applied for TEθ estimation.

Figure 4 shows the behavior of estimators with SNR = 30, 20, 10, 5, 0, and −5 dB, with m = 0. Low power noise (SNR = 30, 20 dB) has a little effect on the dTEθ values of the competing methods, and the dTEθ values decrease moderately for SNR = 10, 5 dB. However, for strong noise (SNR = 0, −5 dB), they reduce greatly. The FPR values of the four estimators are not affected by noise. They all remain at a low level regardless of the noise power. However, the sensitivity of these estimators is seriously influenced by noise. Especially for TEKSGθ and TEkαθ, the direction cannot be detected even for a strong interaction when SNR is −5 dB. Although TEbinθ, TEsymθ, and TEkαθ behave well for an SNR higher than 5 dB, their CDT values increase sharply when the SNR is lower than 5 dB. The δ accuracy for TEbinθ, TEKSGθ, and TEkαθ shows the same trend as that of sensitivity, which increases with ωXY from 0 to 70 and reduces with an increase in noise. The δ accuracy of TEbinθ as well as TEkαθ is significantly higher than that of other methods, followed by TEKSGθ, and TEsymθ has the worst performance in identifying the interaction lag.

Figure 4.

Figure 4

Robustness of the TEθ methods on noise. (ad) Noise reduces the dTEθ values of estimators, especially for the strong noise. (e) The FPR values of these methods remain at a low level regardless of the noise power. (f) The CDT values of the four methods are increased with the SNR changing from 30 to −5 dB. (g) The δ accuracy of these estimators shows the same trend as that of the sensitivity and reduces with an increase in noise power.

Figure 5 shows the effect of linear mixing (m = 0.1, 0.2, 0.3, and 0.5; SNR = 20 dB) on the TEθ methods. The dTEθ values of the estimators increase with ωXY from 0 to 70 even in the presence of linear mixing, and they are not affected by a low mixing strength (m = 0.1, 0.2, and 0.3), but are reduced for a strong mixing strength (m = 0.5). The FPR and the sensitivity of the competing methods with varied linear mixing strength are calculated. The FPR values of TEkαθ remain at a low level regardless of the mixing strength. Those of TEbinθ and TEKSGθ are below 0.01 for m = 0.1 and 0.2 and are higher than 0.01 when m is 0.3 and 0.5. The FPR values of TEsymθ increase monotonically with mixing strength from 0.1 to 0.5 and reach 0.38 for m = 0.5. However, the sensitivity of TEbinθ, TEKSGθ, and TEsymθ shows the opposite trend and reduces with an increase in m. The CDT values of TEbinθ and TEsymθ are below 20 for m = 0.1 and 0.2 and increase to 30 when m is 0.5. Those of TEKSGθ are lower than 50 for m = 0.1, 0.2, and 0.3, but TEKSGθ cannot detect the directional interaction when m is 0.5. Linear mixing has little effect on TEkαθ and the CDT values remain at a low level regardless of m. However, the δ accuracy of TEbinθ and TEkαθ decreases with m from 0.1 to 0.5. That of TEKSGθ presents unexpected results and improves with an increase in m. TEsymθ cannot detect the interaction lag irrespective of the effect of linear mixing (m).

Figure 5.

Figure 5

Stability of the TEθ methods in detecting the directional interaction in the presence of linear mixing. (ad) Strong linear mixing (m = 0.5) reduces the dTEθ metric values of the estimators, except for TEkαθ. (e) The FPR values of TEbinθ, TEKSGθ, and TEsymθ are affected by strong linear mixing, but those of TEkαθ remain at a low level. (f) The CDT values of the TEθ methods increase with m from 0.1 to 0.5, but those of TEkαθ are stable regardless of m. (g) Linear mixing affects the δ accuracy of the estimators.

Above all, low, realistic noise (SNR = 30, 20, 10, and 5 dB) and mixing (m = 0.1 and 0.2) have little effect on the TEθ methods, but the validity of the estimators is greatly reduced for strong noise and mixing. Compared with other methods, TEbinθ with a low false positive rate and high sensitivity performs the best for strong noise, while the robustness of TEkαθ to linear mixing is better than that of other methods. The FPR of TEsymθ is seriously influenced by linear mixing. The performance of TEKSGθ is the most unstable and the directional interaction cannot be detected for strong noise as well as linear mixing.

  • (2)

    The impact of sample size on the performance of the TEθ methods.

An accurate TEθ estimation requires enough samples. We, therefore, investigate the effect of sample size on the performance of the TEθ methods by drawing varied trials as a subset to estimate TEθ (trial number = 25, 50, 75, 100, 125, 150, 175, 200, 250, and 300; trial length = 2 s). It should be emphasized that the Gram matrices’ operation in TEkαθ requires a large amount of internal memory, so only small samples (trial number = 25, 50, 75, 100, and 125) are used in TEkαθ.

As expected, the mean values of dTEθ are not affected by sample size and scale up monotonically with ωXY from 0 to 70, but the variances reduce with an increase in sample size, which indicates that a large sample size improves the stability of the TEθ methods on the coupling strength quantification (Figure 6). The FPR values of TEbinθ, TEKSGθ, and TEkαθ remain at a low level irrespective of the sample size. However, those of TEsymθ increase with the sample size from 5 × 103 to 60 × 103 and are higher than 0.05 when the sample size is 60 × 103. A large sample capacity has a positive effect on the sensitivity of all methods and the CDT values of TEbinθ, TEsymθ, and TEkαθ below 15 for sample size = 60 × 103, indicating that these methods can detect the directional interaction even for weak coupling strength under the condition of enormous sample size. The δ accuracy of TEbinθ, TEKSGθ, and TEkαθ is also improved by the sample size, but even for large samples, TEsymθ is still unable to correctly identify the interaction lag.

Figure 6.

Figure 6

The sample size improves the effectiveness of the TEθ methods. (ad) The mean values of dTEθ for all estimators remain stable, but the variances reduce with an increase in sample size. (e) The FPR values of TEbinθ, TEKSGθ, and TEkαθ fluctuate around 0.01 for varied sample size, while those of TEsymθ are increased from 0.01 to 0.05. (f) The sample size has a positive effect on the CDT values of methods. (g) The δ accuracy of these methods improves with an increase in sample size, except for that of TEsymθ.

  • (3)

    The effect of interaction lag on the TEθ methods.

The interaction lag between brain areas is unknown. To analyze whether the validity of these estimators is affected by interaction lag, the simulated signal pairs (ωXY = 0, 10, 20, 30, 40, 50, 60, and 70; ωYX = 0; SNR = 20 dB; m = 0.1; trial length = 2 s) with varied interaction lag (δ = 10, 20, 30, 40, and 50 ms) are generated by the NMM. For each estimation, we draw 100 pairs as a set from 1000 signal pairs and scan the interaction lag in the range of 10 to 70 ms.

As shown in Figure 7, the dTEθ values of these methods display different changing characteristics. Those of TEbinθ and TEKSGθ scale up with an increase in interaction lag. While for TEsymθ and TEkαθ, the dTEθ values are at the maximum when the interaction lag is in the middle (δ = 30 ms) and decrease in both directions (δ = 10, 20, 40, and 50 ms). The FPR values of these methods are not influenced by interaction lag, and they are below 0.01. The sensitivity of TEKSGθ is improved for δ from 10 to 50 ms, and the CDT values decrease from higher than 70 to 25, indicating that TEKSGθ may be more suitable for the neural signals with large interaction lag. The sensitivity of TEsymθ is the best when δ is 30 ms and reduces on both sides. For TEbinθ and TEkαθ, the CDT values for δ = 10 ms are higher than those of other δ values. Finally, the δ accuracy of the estimators is calculated. The interaction lag has a great effect on the δ accuracy of TEbinθ and TEKSGθ. TEsymθ cannot detect the interaction lag, irrespective of the analysis lag.

Figure 7.

Figure 7

Robustness of the TEθ methods on interaction lag. (ad) The dTEθ values of these estimators show different changing characteristics for varied interaction lag. (e) The FPR values of the TEθ methods are below 0.01 for any δ value. (f) The CDT values of the TEθ methods are affected by interaction lag. (g) The δ  accuracy of these estimators has complex characteristics for varied interaction lag.

3.1.3. The Computational Complexity of the TEθ Methods

TEθ is a biased estimation and the dTEθ values are not zero even in the absence of interaction, so it is necessary to construct the null hypothesis distribution to test the significance of the dTEθ metric value. Generally, the surrogate signal pairs are shuffled 200 times to construct a null hypothesis distribution. So, the TEθ methods should be time-saving. Here, the time consumption of these TEθ estimators with varied sample sizes is compared. One complete directional interaction detection includes estimating the dTEθ value as well as scanning the interaction lag in the range of 10 to 70 ms and constructing the null hypothesis distribution with 200 surrogate signal pairs.

The results are shown in Figure 8. The time consumption of the methods is scaled up with an increase in sample size. TEbinθ consumes the least time. It takes only 20.1118 s for sample size = 5 × 103 and increases to 124.3577 when the sample capability is 60 × 103. TEsymθ also has an acceptable time consumption (167.1969 s for sample size = 5 × 103 and 524.7998 s for 60 × 103). The third is TEKSGθ (194.9923 s and 5.2436 × 103 s for sample capability = 5 × 103 and 60 × 103, respectively). TEkαθ has the highest time cost. It takes 4.2735 × 104 s for sample size = 5 × 103, which is much greater than TEbinθ. Thus, TEkαθ may not be suitable for the signal pairs with a large sample size.

Figure 8.

Figure 8

The computational complexity of the TEθ methods. They are calculated for a complete estimation, which includes scanning the interaction lag in the range of 10 to 70 ms, estimating the dTEθ value, and constructing the null hypothesis distribution.

3.2. The Performance Baseline of the TEθ Methods Based on the Actual Neural Signals

To explore the properties of the TEθ estimators on the actual neural signals, we apply the methods to the LFPs, which are recorded from PFC and V1 of a ferret in an awake state with a large and small sample size. Owing to the large time consumption of TEKSGθ and TEkαθ, only the small sample size is used on TEKSGθ and TEkαθ.

For the large sample size (30 epochs as a set to estimate TEθ), TEbinθ as well as TEsymθ detect bidirectional coupling between PFC and V1 for the ferret in an awake state, and the information flows preferentially from V1 to PFC (known as feed-forward FF) in the low-frequency band (0–20 Hz), with p < 0.001 for TEbinθ and p < 0.01 for TEsymθ, respectively, while the direction is reversed (from PFC to V1, called as feedback FB) in the γ band (40–60 Hz) with p < 0.001 (Figure 9). These results are in line with previous studies finding that there are bidirectional interactions between PFC and V1 in ferret brains in an awake state [40]. The δ values of TEbinθ for FF are clustered around 5 and 20 ms, and those of FB fluctuate around 1 ms (almost consistent with the results in [40]). However, the δ values of TEsymθ are distributed in the range of 0 to 20 ms, more dispersed than those of TEbinθ. No method can identify the coupling between PFC and V1 with a small sample size (2 epochs as a set to estimate TEθ), except for TEsymθ. An information flow predominantly from PFC to V1 at 60–80 Hz is detected by TEsymθ. However, owing to its disappearance with an increase in sample size, it may be a false positive. The outcomes on the actual neural data are in line with those on the simulated signal pairs. Sample size has a positive effect on the applicability criteria of TEθ methods, and TEbinθ performs better than other methods. TEsymθ may produce a high false positive rate, and the large time consumption of TEkαθ hinders its application in neural signals.

Figure 9.

Figure 9

Performance baseline of the TEθ methods on the actual neural data with known interaction properties. (a,b) The dTEθ values of the TEθ estimators are calculated for varied frequency bands with a large (a) and small sample size (b), respectively. ‘*’ represents a significant interaction between PFC and V1. ‘*’ above curves indicates the information flow preferentially from PFC to V1, and ‘*’ below curves indicates the interaction from V1 to PFC. ‘***’ p < 0.001, ‘**’ p < 0.01, ‘*’ p < 0.05. (c) The δ values of TEbinθ with a large sample size are clustered around 5 and 20 ms for PFC → V1 and around 1 ms for V1 → PFC. (d) The δ values of TEsymθ with a large sample size are more dispersed than those of TEbinθ.

Based on the above analysis, the performance baseline of these methods is proposed in Table 2.

Table 2.

Performance baseline table for the TEθ methods. ‘Inline graphic’ means the FPR is below 0.01 and CDT values are lower than 25. ‘Inline graphic’ represents the FPR in the range of 0.01 to 0.05 or CDT values from 25 to 40. ‘Inline graphic’ indicates the FPR is higher than 0.05 or CDT values are larger than 40.

TEbinθ TEKSGθ TEsymθ TEkαθ
Accuracy on the directional interaction graphic file with name entropy-25-00994-i004.jpg graphic file with name entropy-25-00994-i005.jpg graphic file with name entropy-25-00994-i006.jpg graphic file with name entropy-25-00994-i007.jpg
Accuracy on the interaction lag graphic file with name entropy-25-00994-i008.jpg Inline graphic graphic file with name entropy-25-00994-i010.jpg graphic file with name entropy-25-00994-i011.jpg
Robustness to noise SNR ≥ 5 dB graphic file with name entropy-25-00994-i012.jpg graphic file with name entropy-25-00994-i013.jpg graphic file with name entropy-25-00994-i014.jpg graphic file with name entropy-25-00994-i015.jpg
SNR < 5 dB Inline graphic graphic file with name entropy-25-00994-i017.jpg graphic file with name entropy-25-00994-i018.jpg graphic file with name entropy-25-00994-i019.jpg
Robustness to linear mixing m ≤ 0.3 graphic file with name entropy-25-00994-i020.jpg graphic file with name entropy-25-00994-i021.jpg graphic file with name entropy-25-00994-i022.jpg graphic file with name entropy-25-00994-i023.jpg
m > 0.3 Inline graphic graphic file with name entropy-25-00994-i025.jpg graphic file with name entropy-25-00994-i026.jpg graphic file with name entropy-25-00994-i027.jpg
Stability to the sample size Sample size ≤ 10 × 103 Inline graphic graphic file with name entropy-25-00994-i029.jpg Inline graphic Inline graphic
Sample size > 10 × 103 graphic file with name entropy-25-00994-i032.jpg Inline graphic Inline graphic graphic file with name entropy-25-00994-i035.jpg
Stability to the interaction lag Short interaction lag graphic file with name entropy-25-00994-i036.jpg graphic file with name entropy-25-00994-i037.jpg Inline graphic graphic file with name entropy-25-00994-i039.jpg
Long interaction lag graphic file with name entropy-25-00994-i040.jpg graphic file with name entropy-25-00994-i041.jpg Inline graphic graphic file with name entropy-25-00994-i043.jpg
Computational complexity graphic file with name entropy-25-00994-i044.jpg graphic file with name entropy-25-00994-i045.jpg graphic file with name entropy-25-00994-i046.jpg graphic file with name entropy-25-00994-i047.jpg

Suitable for the following: (1) TEbinθ—strong noise, large sample size, any interaction lag; (2) TEKSGθ—small sample size, long interaction lag; (3) TEsymθ—weak linear mixing, large sample size; (4) TEkαθ—strong linear mixing, small sample size, any interaction lag.

3.3. Implementing a Suitable TEθ Estimation Method on the Interaction between Hp and NCL of Pigeons in Visual–Spatial Associative Learning

TEbinθ is selected by the performance baseline table to explore the coupling between Hp and NCL of pigeons in spatial associative learning. The TEbinθ method is applied on the LFPs recorded from Hp and NCL of six pigeons (p21 to p26) while they perform a visual–spatial associative learning task. The LFPs from ITI to the turning period (a total of 7 s) are divided into 14 non-overlapping bins, then the dTEθ value is calculated, and the null hypothesis distribution is constructed for each bin.

The results are shown in Figure 10. For all pigeons, the dTEθ values in the period where the door is opened and animals enter the critical decision-making place are actually larger than those in other periods for the θ band (4–12 Hz). By comparing the dTEθ value with the null hypothesis distribution for each bin, we find there exists coupling between Hp and NCL in the decision-making period at the θ band, and the information flow is predominately from Hp to NCL. The scan method is used to detect the interaction lag between Hp and NCL, finding that different pigeons have varied interaction lag values. For pigeons p21 to p26, the interaction lag values are clustered around 27, 23, 15, 24, 11, and 29 ms, respectively (Figure 10g). The coupling between Hp and NCL during associative learning may indicate that the spatially related associative information formed in Hp is transmitted to NCL for decision-making.

Figure 10.

Figure 10

The interaction between Hp and NCL of pigeons in visual–spatial associative learning. (af) The dTEθ values from Hp to NCL for six pigeons (p21, p22, p23, p24, p25, and p26) are calculated by TEbinθ. The information flow is predominantly from Hp to NCL at the θ band in the decision-making period. The solid line is the mean value of dTEθ and the shaded area represents the standard deviation of dTEθ, ‘**’ p < 0.01, ‘*’ p < 0.05. ①, ②, ③, ④, and ⑤ indicate the ITI period (2 s), the visual cue (3 s), the delay period (0.5 s), the decision-making period (0.5 s), and the turning period (1 s), respectively. Time 0 means the end of the delay period and the point at which the animal enters the critical decision-making place. (g) The interaction lag between Hp and NCL of pigeons in visual–spatial associative learning.

4. Discussion

This paper investigates the performance baseline of four commonly used TEθ methods (TEbinθ, TEKSGθ, TEsymθ, and TEkαθ) on the simulated signals and actual neural data. The results are shown in Table 2 and, for a large sample size, TEbinθ with less time consumption, a low false positive rate, and high sensitivity performs better than other methods. TEkαθ is robust to linear mixing. It is a good choice for small samples. TEsymθ is extremely susceptible to linear mixing and cannot accurately detect the interaction lag. The performance of TEKSGθ is unstable and easily affected by interaction lag. Then, the TEbinθ method is applied to the LFPs recorded from Hp and NCL of pigeons to detect the coupling between these two brain regions while pigeons perform a visual–spatial associative learning task. TEbinθ identifies an information flow from Hp to NCL at the θ band when pigeons enter the critical decision-making place.

Recently, TEθ has become a research focus in identifying animal brain area interactions, and various TEθ methods have been proposed to estimate the TEθ metric value. However, these methods with distinct underlying mathematical assumptions or measures of dependency show various characteristics. The characterization of TEθ methods is unclear, which limits their use in the actual neural signals. This paper explores the performance baseline of TEθ methods and provide a reference for researchers when using these methods, promoting the usage of TEθ methods in neural signals.

The accuracy of TEθ estimators in identifying and quantifying the directional interaction is first analyzed in an ideal state (no noise and linear mixing). The estimators show almost the same results, except for TEKSGθ, in which the validity is significantly lower than that of other methods for weak coupling. Transfer entropy with the KSG estimator (TEKSG) is a widely used method for signal pairs combining amplitude and phase [31]. Though some researchers note it may be not applicable for phase time series because of the periodicity of the phase spectrum [20], the KSG estimator has been successfully used to estimate mutual information on phase time series [41]. In this study, TEKSGθ behaves well for a large interaction lag (δ = 50 ms). Therefore, TEKSGθ may be more suitable for analyzing the coupling with a large interaction lag.

The neural signals are corrupted by measurement noise and biological noise in their acquisition and transmission [39]. Though some measures have been taken to reduce the noise [42], we cannot take them out completely from neural signals owing to some noise being sufficiently complex. The presence of noise can mask the features of the neural signals and affect the analysis of the coupling between them. So, the TEθ methods should be robust to noise. In this study, the performance of TEθ estimators reduce with an increased in noise power, especially for TEKSGθ and TEkαθ, which cannot detect the coupling with a high noise power. This may be due to the discretization in TEbinθ and TEsymθ, making them less sensitive to noise. For low, realistic noise, TEbinθ, TEsymθ, and TEkαθ can accurately detect the directional interaction. Because of volume conduction in the brain, there may be linear mixing in adjacent brain areas. Linear mixing leads to a decrease in dTEθ values for ωXY > 0, which could result in low sensitivity, and inflates the dTEθ values, increasing the risk of false positives. These may result from the reduction of useful information in X to predict Y for ωXY > 0 and self-prediction information in Y added to X for ωXY = 0. The false positive rate of TEsymθ is increased with an increase in the mixing strength, which is significantly higher than that of the other methods. This may be related to the symbolization in TEsymθ, which makes TEsymθ more sensitive to the change in signal asymmetry. In TEsymθ, increasing the samples also improves the false positive rate, even for weak linear mixing. Therefore, we should pay more attention to the erroneous judgement when applying TEsymθ to detect the interaction between adjacent brain areas. TEkαθ provides substantial robustness to linear mixing. This phenomenon is probably due to Renyi’s entropy, instead of Shannon entropy used in TEkαθ, and a functional defined on positive definite and infinitely divisible kernel matrices is applied to approximate Renyi’s entropy, which can capture the similarity relations among signal pairs and detect slow changing features in data [43].

The sample size is also an important factor affecting the performance of the methods. Accumulating evidence shows that accurate recognition of directional interaction relies on a sufficient sample size [44]. The results in this study show that increasing the sample size effectively improves the validity of the estimators. However, expanding the sample capability also leads to a great time cost. As shown in Figure 8, the time consumption of the estimators scales up with an increase in sample size. Because of the trace operator on Gram matrix (G) to power α (tr(Gα)) in TEkαθ, it poses a great challenge in terms of both storage and computing when using TEkαθ in practice [45]. In TEbinθ, a sample binning method rather than more complex technology is used to reconstruct the time series state-space, so, compared with other methods, the time cost of TEbinθ is within an acceptable range.

The performance baseline of the competing methods is investigated on the actual neural signals, which are recorded from PFC and V1 for a ferret in an awake state. TEbinθ and TEsymθ with a large sample size identify the bidirectional interactions between PFC and V1 and, for TEbinθ, the interaction lag values are clustered around 5 and 20 ms for FF and around 1 ms for FB, consistent with the results in [40]. We also detect FF in the low frequency band (0–20 Hz) and FB in the high frequency band (40–60 Hz). Unexpectedly, these results are different from those in humans and monkeys. Studies have shown that a low frequency mediates FB, while a high frequency mediates FF, when humans and monkeys perform cognitive tasks such as memory encoding and recall [46,47,48]. This is consistent with the functional interpretation under the hierarchical predictive coding framework. According to predictive coding, the main function of FB is to provide predictions of input signals by integrating memory and expectations, which mainly operate on slow time scales. Meanwhile, FB operates on faster time scales, because FB has to respond to fast sensory inputs [49]. However, in the ferret experiment, although the animals receive the visual stimuli, they do not need to respond to the visual stimuli. Therefore, the inter-regional interactions of ferrets may be different from those of animals with task requirements. Another reason for the difference in results between ferrets and other animals may be that only data from one ferret are used in this manuscript, which is not statistically significant. In subsequent studies, we hope to use more data to reveal the neural mechanisms behind FF and FB of ferrets.

Accumulating evidence shows that the θ frequency mediates the spatial information processing and communication between Hp and PFC [7,50,51,52,53]. Studies investigating the interaction between brain areas of rodents suggest that the θ band mediates the spatially related information from Hp to PFC in odor–spatial associative tasks [54]. This pattern of the θ band modulation spatial information has also been reported in Hp and NCL of pigeons, supporting the formation of a stable route for homing [55]. In our report, an information flow preferentially from Hp to NCL at the θ band is detected by TEbinθ, indicating that, like in mammals, the spatially related information is transferred from Hp to NCL for decision-making, and avian Hp-NCL may have the same information processing and interaction mode as those of mammalian Hp-PFC in sensory–spatial associative learning.

A limitation of this paper is that TEθ is not suitable for neural activity with the learning process [28]. However, pigeons take a long time to learn the visual–spatial associative task (about 30 days). So, we assume that the neural activity in a session is stable. We look forward to further research on the application of TEθ in slow learning processes. Just one session with a 75% to 80% accuracy rate is applied to detect the interaction between Hp and NCL of pigeons in visual–spatial associative learning. In the future, we will investigate the dynamics of Hp and NCL interactions in associative learning.

5. Conclusions

In the study, we investigate the performance of four commonly used TEθ methods (TEbinθ, TEKSGθ, TEsymθ, and TEkαθ) on the simulated signal pairs and the actual neural data with known interaction properties. The results show a performance baseline table that contains the most suitable method for different scenarios. Then, TEbinθ is applied for the local field potential recorded from pigeons and detects an information flow predominantly from Hp to NCL at the θ band during the decision-making period in visual–spatial associative learning. These outcomes highlight the importance of choosing the appropriate method for detecting the interactions between brain regions and provide a reference for researchers when using these methods.

Acknowledgments

We are grateful to Kristin K. Sellers and his group members for generously sharing their unique local field potential signals from ferrets.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/e25070994/s1. The Matlab codes of the TEθ methods and pigeon experimental procedure.

Author Contributions

Conceptualization, J.-Y.Z., G.L. and H.W.; methodology, J.-Y.Z.; software, J.-Y.Z.; validation, J.-Y.Z., M.-M.L. and G.L.; formal analysis, J.-Y.Z. and G.L.; investigation, J.-Y.Z. and Z.-H.Z.; resources, J.-Y.Z.; data curation, J.-Y.Z. and Z.-H.Z.; writing—original draft preparation, J.-Y.Z.; writing—review and editing, J.-Y.Z., G.L. and H.W.; visualization, J.-Y.Z., M.-M.L. and G.L.; supervision, G.L. and H.W.; project administration, G.L. and H.W.; funding acquisition, G.L. and H.W. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Life Science Ethical Review Committee of Zhengzhou University (No SYXK 2019-0002).

Data Availability Statement

The local field potential signals recorded from pigeons are available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Funding Statement

This research was funded by the National Natural Science Foundation of China, grant number 61673353.

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Suzuki W.A. Integrating associative learning signals across the brain. Hippocampus. 2010;17:842–850. doi: 10.1002/hipo.20321. [DOI] [PubMed] [Google Scholar]
  • 2.Iwasaki S., Sasaki T., Ikegaya Y. Hippocampal beta oscillations predict mouse object-location associative memory performance. Hippocampus. 2021;31:503–511. doi: 10.1002/hipo.23311. [DOI] [PubMed] [Google Scholar]
  • 3.Trimper J.B., Galloway C.R., Jones A.C., Mandi K., Manns J.R. Gamma Oscillations in Rat Hippocampal Subregions Dentate Gyrus, CA3, CA1, and Subiculum Underlie Associative Memory Encoding. Cell Rep. 2017;21:2419. doi: 10.1016/j.celrep.2017.10.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Browning P.G., Easton A., Buckley M.J., Gaffan D. The role of prefrontal cortex in object-in-place learning in monkeys. Eur. J. Neurosci. 2005;22:3281–3291. doi: 10.1111/j.1460-9568.2005.04477.x. [DOI] [PubMed] [Google Scholar]
  • 5.Igarashi K.M., Lu L., Colgin L.L., Moser M.B., Moser E.I. Coordination of entorhinal-hippocampal ensemble activity during associative learning. Nature. 2014;510:143–147. doi: 10.1038/nature13162. [DOI] [PubMed] [Google Scholar]
  • 6.Eichenbaum H. Prefrontal–hippocampal interactions in episodic memory. Nat. Rev. Neurosci. 2017;18:547–558. doi: 10.1038/nrn.2017.74. [DOI] [PubMed] [Google Scholar]
  • 7.Kim J., Delcasso S., Lee I. Neural correlates of object-in-place learning in hippocampus and prefrontal cortex. J. Neurosci. 2011;31:16991–17006. doi: 10.1523/JNEUROSCI.2859-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Allen T.A., Fortin N.J. The evolution of episodic memory. Proc. Natl. Acad. Sci. USA. 2013;110((Suppl. 2)):10379–10386. doi: 10.1073/pnas.1301199110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Good M. The effects of hippocampal-area parahippocampalis lesions on discrimination learning in the pigeon. Behav. Brain Res. 1987;26:171–184. doi: 10.1016/0166-4328(87)90165-3. [DOI] [PubMed] [Google Scholar]
  • 10.Veit L., Pidpruzhnykova G., Nieder A. Associative learning rapidly establishes neuronal representations of upcoming behavioral choices in crows. Proc. Natl. Acad. Sci. USA. 2015;112:15208–15213. doi: 10.1073/pnas.1509760112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Dykes M., Klarer A., Porter B., Rose J., Colombo M. Neurons in the Pigeon Nidopallium Caudolaterale Display Value-Related Activity. Sci. Rep. 2018;8:5377. doi: 10.1038/s41598-018-23694-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Sendhoff B., Körner E., Sporns O., Ritter H.J., Doya K. Creating Brain-Like Intelligence: From Basic. Principles to Complex. Intelligent Systems. Springer; Berlin/Heidelberg, Germany: 2009. Creating Brain-Like Intelligence. [Google Scholar]
  • 13.Vicente R., Wibral M., Lindner M., Pipa G. Transfer entropy—A model-free measure of effective connectivity for the neurosciences. J. Comput. Neurosci. 2011;30:45–67. doi: 10.1007/s10827-010-0262-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kaiser A., Schreiber T. Information transfer in continuous processes. Phys. D Nonlinear Phenom. 2002;166:43–62. doi: 10.1016/S0167-2789(02)00432-3. [DOI] [Google Scholar]
  • 15.Schreiber T. Measuring Information Transfer. Phys. Rev. Lett. 2000;85:461–464. doi: 10.1103/PhysRevLett.85.461. [DOI] [PubMed] [Google Scholar]
  • 16.Janson N.B., Balanov A.G., Anishchenko V.S., Mcclintock P. Phase relationships between two or more interacting processes from one-dimensional time series. I. Basic theory. Phys. Rev. E. 2002;65:36211. doi: 10.1103/PhysRevE.65.036211. [DOI] [PubMed] [Google Scholar]
  • 17.Vinck M.A., Oostenveld R., Wingerden M.V., Battaglia F.P., Pennartz C.M.A. An improved index of phase-synchronization for electrophysiological data in the presence of volume-conduction, noise and sample-size bias. NeuroImage. 2011;55:1548–1565. doi: 10.1016/j.neuroimage.2011.01.055. [DOI] [PubMed] [Google Scholar]
  • 18.Ad A., Vmab C. Causal dynamics and information flow in parietaltemporal-hippocampal circuits during mental arithmetic revealed by high-temporal resolution human intracranial EEG. Cortex. 2022;147:24–40. doi: 10.1016/j.cortex.2021.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lobier M., Siebenhühner F., Palva S., Palva J.M. Phase transfer entropy: A novel phase-based measure for directed connectivity in networks coupled by oscillatory interactions. Neuroimage. 2014;85:853–872. doi: 10.1016/j.neuroimage.2013.08.056. [DOI] [PubMed] [Google Scholar]
  • 20.Hlaváčková-Schindler K., Palu M., Vejmelka M., Bhattacharya J. Causality detection based on information-theoretic approaches in time series analysis. Phys. Rep. 2007;441:1–46. doi: 10.1016/j.physrep.2006.12.004. [DOI] [Google Scholar]
  • 21.Kraskov A., Stögbauer H., Grassberger P. Estimating mutual information. Phys. Rev. E. 2004;64:066138. doi: 10.1103/PhysRevE.69.066138. [DOI] [PubMed] [Google Scholar]
  • 22.Kozachenko L.F., Leonenko N.N. A statistical estimate for the entropy of a random vector. Probl. Inf. Transm. 1987;23:9–16. [Google Scholar]
  • 23.Staniek M., Lehnertz K. Symbolic transfer entropy: Inferring directionality in biosignals. Biomed. Tech. 2009;54:323–328. doi: 10.1515/BMT.2009.040. [DOI] [PubMed] [Google Scholar]
  • 24.Zhang N., Lin A., Shang P. Symbolic Phase Transfer Entropy Method and its Application. Commun. Nonlinear Sci. Numer. Simul. 2017;51:78–88. doi: 10.1016/j.cnsns.2017.03.011. [DOI] [Google Scholar]
  • 25.De La Pava Panche I., Álvarez-Meza A., Herrera Gómez P.M., Cárdenas-Peña D., Ríos Patiño J.I., Orozco-Gutiérrez Á. Kernel-Based Phase Transfer Entropy with Enhanced Feature Relevance Analysis for Brain Computer Interfaces. Appl. Sci. 2021;11:6689. doi: 10.3390/app11156689. [DOI] [Google Scholar]
  • 26.Panche I., Alvarez-Meza A.M., Orozco-Gutierrez A. A Data-Driven Measure of Effective Connectivity Based on Renyi’s α-Entropy. Front. Neurosci. 2019;13:1277. doi: 10.3389/fnins.2019.01277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Wang H.E., Benar C.G., Quilichini P.P., Friston K.J., Jirsa V.K., Bernard C. A systematic framework for functional connectivity measures. Front. Neurosci. 2014;8:405. doi: 10.3389/fnins.2014.00405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Wollstadt P., Martinez-Zarzuela M., Vicente R., Diaz-Pernas F.J., Wibral M. Efficient transfer entropy analysis of non-stationary neural time series. PLoS ONE. 2014;9:e102833. doi: 10.1371/journal.pone.0102833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gómez-Herrero G., Wu W., Rutanen K., Soriano M., Pipa G., Vicente R. Assessing coupling dynamics from an ensemble of time series. Entropy. 2015;17:1958–1970. doi: 10.3390/e17041958. [DOI] [Google Scholar]
  • 30.Wibral M., Pampu N., Priesemann V., Siebenhuhner F., Seiwert H., Lindner M., Lizier J.T., Vicente R. Measuring information-transfer delays. PLoS ONE. 2013;8:e55809. doi: 10.1371/journal.pone.0055809. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Lindner M., Vicente R., Priesemann V., Wibral M. TRENTOOL: A Matlab open source toolbox to analyse information flow in time series data with transfer entropy. BMC Neurosci. 2011;12:119. doi: 10.1186/1471-2202-12-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Scott D.W. Multivariate Density Estimation. Ann. Math. Stat. 1997;34:191–198. doi: 10.1214/aoms/1177704254. [DOI] [Google Scholar]
  • 33.Ragwitz M., Kantz H. Markov models from data by simple nonlinear time series predictors in delay embedding spaces. Phys. Rev. E Stat. Nonlinear Soft Matter Phys. 2002;65:056201. doi: 10.1103/PhysRevE.65.056201. [DOI] [PubMed] [Google Scholar]
  • 34.Bandt C., Pompe B. Permutation Entropy: A Natural Complexity Measure for Time Series. Phys. Rev. Lett. 2002;88:174102. doi: 10.1103/PhysRevLett.88.174102. [DOI] [PubMed] [Google Scholar]
  • 35.Kim H.S., Eykholtb R., Salasc J.D. Nonlinear Dynamics, Delay Times, and Embedding Windows. Phys. D Nonlinear Phenom. 1999;127:48–60. doi: 10.1016/S0167-2789(98)00240-1. [DOI] [Google Scholar]
  • 36.Ursino M., Ricci G., Magosso E. Transfer Entropy as a Measure of Brain Connectivity: A Critical Analysis With the Help of Neural Mass Models. Front. Comput. Neurosci. 2020;14:45. doi: 10.3389/fncom.2020.00045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Sellers K.K., Bennett D.V., Hutt A., Williams J.H., Frohlich F. Awake vs. anesthetized: Layer-specific sensory processing in visual cortex and functional connectivity between cortical areas. J. Neurophysiol. 2015;113:3798–3815. doi: 10.1152/jn.00923.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Voytek B., D’Esposito M., Crone N., Knight R.T. A method for event-related phase/amplitude coupling. Neuroimage. 2013;64:416–424. doi: 10.1016/j.neuroimage.2012.09.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Liu X., Hong W., Shan L., Yan C., Li S. Adaptive common average reference for in vivo multichannel local field potentials. Biomed. Eng. Lett. 2017;7:7–15. doi: 10.1007/s13534-016-0004-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Patricia W., Sellers K.K., Lucas R., Viola P., Axel H., Flavio F., Michael W., Hilgetag C.C. Breakdown of local information processing may underlie isoflurane anesthesia effects. PLoS Comput. Biol. 2017;13:e1005511. doi: 10.1371/journal.pcbi.1005511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Andreas W., Marc D.L., Markus L., Satoru H. Time-Delayed Mutual Information of the Phase as a Measure of Functional Connectivity. PLoS ONE. 2012;7:e44633. doi: 10.1371/journal.pone.0044633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.De Cheveigné A., Nelken I. Filters: When, Why, and How (Not) to Use Them. Neuron. 2019;102:280–293. doi: 10.1016/j.neuron.2019.02.039. [DOI] [PubMed] [Google Scholar]
  • 43.Sanchez Giraldo L.G., Rao M., Principe J.C. Measures of Entropy from Data Using Infinitely Divisible Kernels. IEEE Trans. Inf. Theory. 2014;61:535–548. doi: 10.1109/TIT.2014.2370058. [DOI] [Google Scholar]
  • 44.Zhu J., Chen M., Lu J., Zhao K., Cui E., Zhang Z., Wan H. A Fast and Efficient Ensemble Transfer Entropy and Applications in Neural Signals. Entropy. 2022;24:1118. doi: 10.3390/e24081118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gong T., Dong Y., Yu S. Computationally Efficient Approximations for Matrix-based Renyi’s Entropy. IEEE Trans. Signal. Process. 2022;70:6170–6182. doi: 10.1109/TSP.2022.3233724. [DOI] [Google Scholar]
  • 46.Das A., Menon V. Replicable patterns of causal information flow between hippocampus and prefrontal cortex during spatial navigation and spatial-verbal memory formation. Cereb. Cortex. 2022;32:5343–5361. doi: 10.1093/cercor/bhac018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Das A., Menon V. Asymmetric Frequency-Specific Feedforward and Feedback Information Flow between Hippocampus and Prefrontal Cortex during Verbal Memory Encoding and Recall. J. Neurosci. 2021;41:8427–8440. doi: 10.1523/JNEUROSCI.0802-21.2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Bastos A.M., Vezoli J., Bosman C.A., Schoffelen J.M., Oostenveld R., Dowdall J.R., De Weerd P., Kennedy H., Fries P. Visual areas exert feedforward and feedback influences through distinct frequency channels. Neuron. 2015;85:390–401. doi: 10.1016/j.neuron.2014.12.018. [DOI] [PubMed] [Google Scholar]
  • 49.Cohen D., van Swinderen B., Tsuchiya N. Isoflurane Impairs Low-Frequency Feedback but Leaves High-Frequency Feedforward Connectivity Intact in the Fly Brain. eNeuro. 2018;5:e0329-17. doi: 10.1523/ENEURO.0329-17.2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Jones M.W., Wilson M.A. Theta rhythms coordinate hippocampal-prefrontal interactions in a spatial memory task. PLoS Biol. 2005;3:e402. doi: 10.1371/journal.pbio.0030402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Shin J.D., Jadhav S.P. Multiple modes of hippocampal-prefrontal interactions in memory-guided behavior. Curr. Opin. Neurobiol. 2016;40:161–169. doi: 10.1016/j.conb.2016.07.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Zielinski M.C., Shin J.D., Jadhav S.P. Coherent Coding of Spatial Position Mediated by Theta Oscillations in the Hippocampus and Prefrontal Cortex. J. Neurosci. 2019;39:4550–4565. doi: 10.1523/JNEUROSCI.0106-19.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Place R., Farovik A., Brockmann M., Eichenbaum H. Bidirectional prefrontal-hippocampal interactions support context-guided memory. Nat. Neurosci. 2016;19:992–994. doi: 10.1038/nn.4327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Symanski C.A., Bladon J.H., Kullberg E.T., Miller P., Jadhav S.P. Rhythmic coordination and ensemble dynamics in the hippocampal-prefrontal network during odor-place associative memory and decision making. Elife. 2022;11:e79545. doi: 10.7554/eLife.79545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Li M.M., Fan J.T., Cheng S.G., Yang L.F., Yang L., Wang L.F., Shang Z.G., Wan H. Enhanced Hippocampus-Nidopallium Caudolaterale Connectivity during Route Formation in Goal-Directed Spatial Learning of Pigeons. Animals. 2021;11:2003. doi: 10.3390/ani11072003. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The local field potential signals recorded from pigeons are available upon request from the corresponding author.


Articles from Entropy are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES