Skip to main content
The Journal of the Acoustical Society of America logoLink to The Journal of the Acoustical Society of America
. 2013 Jan;133(1):283–300. doi: 10.1121/1.4770245

Joint entropy of continuously differentiable ultrasonic waveforms

M S Hughes 1,a), J E McCarthy 2, J N Marsh 3, S A Wickline 3
PMCID: PMC3548839  PMID: 23297902

Abstract

This study is based on an extension of the concept of joint entropy of two random variables to continuous functions, such as backscattered ultrasound. For two continuous random variables, X and Y, the joint probability density p(x,y) is ordinarily a continuous function of x and y that takes on values in a two dimensional region of the real plane. However, in the case where X=f(t) and Y=g(t) are both continuously differentiable functions, X and Y are concentrated exclusively on a curve, γ(t)=(f(t),g(t)), in the x,y plane. This concentration can only be represented using a mathematically “singular” object such as a (Schwartz) distribution. Its use for imaging requires a coarse-graining operation, which is described in this study. Subsequently, removal of the coarse-graining parameter is accomplished using the ergodic theorem. The resulting expression for joint entropy is applied to several data sets, showing the utility of the concept for both materials characterization and detection of targeted liquid nanoparticle ultrasonic contrast agents. In all cases, the sensitivity of these techniques matches or exceeds, sometimes by a factor of two, that demonstrated in previous studies that employed signal energy or alternate entropic quantities.

INTRODUCTION

Previous studies have demonstrated the utility of several different entropies for detection of subresolution backscattering structures in both materials characterization and medical ultrasonics in situations where other approaches fail. These include detection of unresolvable near-surface defects,1, 2, 3, 4, 5 diffuse accumulation of subresolution-sized nanoparticle-based ultrasound contrast agents,6, 7, 8, 9, 10 characterization of smooth muscle pathologies,11, 12, 13 and monitoring of ultrasonically induced temperature rise in tissue.14

In the first of these studies, entropy images were applied to detect defects in advanced aerospace materials.1 In particular, detection and imaging of “near-surface” defects in graphite-epoxy composite plates was accomplished using a 5 MHz center frequency transducer.1, 2, 3, 4, 5 In this application, it is not possible to separate the front wall echo from the defect reflection. This presents a significant challenge to conventional approaches for defect detection and imaging. In fact, the case presented in the following text is nearly impossible to discern using the traditional approach. However, the joint entropy image permits visualization of the defect, its boundary, and certain substructures contained in the defect that were not visible to conventional detection methods or even the previously published entropic images.

The bulk of our investigations have focused on medical imaging. These studies have demonstrated the utility of several different entropies for detection of subresolution structures of backscattering tissue: e.g., targeted accumulation of weakly scattering perfluorocarbon nanoparticles in tissues in vivo, and pathological changes in smooth muscle structure arising from muscular dystrophy. These results have been obtained in a variety of animal disease models and in patients as shown in Table TABLE I.. Several features common to these studies deserve special emphasis at the outset:

  • (a)

    Studies have been executed using several different imaging systems (Philips HDI 5000, Philips IE33, Vevo 660, Ardent Spark) with transducer arrays covering a frequency range from 2–34 MHz.

  • (b)

    In all cases but one,11 the entropic analysis demonstrated greater sensitivity than did signal energy to subresolution features of scattering architecture. In the exceptional case,11 sensitivities were essentially equivalent.

  • (c)

    The most recent version of entropy published8, 15 is amenable to real-time implementation.

  • (d)

    All results pertaining to detection of targeted nanoparticles were obtained without use of hand-drawn regions of interest. In fact, the entire analysis chain from raw data to image construction to image-based sensitivity assessment was automated and thus objective.

  • (e)

    Image subtraction has not been required to detect changes in scattering architecture.

TABLE I.

Summary of studies based on different entropic signal receivers defined for waveform y=f(t) having density function wf(y): entropy, Hf, Renyi entropy, If(r), and its asymptotic limiting form, If,. The joint entropy Hf,g is frequently much more sensitive than any of these.

“Entropic” receiver Heuristic meaning Scattering structure Experimental system
Hf=fminfmaxwf(y)log[wf(y)]dy “Information” contained Materials characterization, Graphite/epoxy laminates (Ref. 5)
Hf=01log[f(t)]dt “Information” contained in backscattered RF f(t). Targeted nanoparticles, smooth muscle cells MDA 435 (Ref. 16), B16
VX2,MDx mice (Ref. 11)
Precancer models (Ref. 7)
Humans (Ref. 12)
If(r)=1/1rlogfminfmaxwf(y)rdy “Weighted-information” contained in backscattered RF: f(t) (note: Hf=If(1)). Targeted nanoparticles, smooth muscle cells MDA 435, B16 Precancer models (Ref. 17)
If,=log[{tk|f(tk)=0}1/|f(tk)|] Limiting form of If(r) as r2. More sensitive than Hf, If(r). Targeted nanoparticles, MDA 435 (Ref. 8), B16 (Ref. 10)
smooth muscle cells Humans
  Precancer models (Ref. 15)

All of these points suggest the clinical utility of this approach, which might proceed along lines similar to that employed in Doppler imaging systems, where the conventional B-mode image is color-coded according to the blood cell velocity to present a combined B-mode/velocity image; similarly, a B-mode/entropy image could be made as well.9

All of these investigations required acquisition of thousands of radio frequency (RF) signals, f(t). For each of these captured waveforms, viewed as a sampled version of a differentiable function, the probability density function of values was computed using the digitized values of f(t). Distributions of values (or densities of distributions of values) are typically used for analysis of random variables, and all differentiable functions are, in fact, legitimate random variables. However, in the general case, it is not possible to compute the probability density function of a random variable solely from its values. For example, to determine the probability density function for noise in an amplifier, the noise source must be sampled and the distribution of sampled values subsequently fit to an assumed functional form, e.g., Gaussian or Poisson. In our case, the knowledge that the random variable f(t) is differentiable permits direct computation of its probability density function without the need to assume and fit a model.2, 5, 7 While this can be a mathematically and computationally demanding task, previous applications of this approach to analysis of acoustic signals have repeatedly demonstrated its utility. The present study extends these results by developing joint entropy, which we will denote by Hf,g, to analyze the same type of data. This approach requires computation of the joint probability density of two (differentiable) functions: y=f(t), representing a segment of backscattered radio frequency ultrasound, and a reference signal, g(t). After carefully deriving a suitable expression for Hf,g, we show that it provides further improvements in sensitivity over those obtained in previous studies that were based on other entropies.

APPROACH

Our goal is to further improve the ability to detect weak scatterers in strongly scattering environments, as in the case where liquid nanoparticles are targeted to cancer molecular epitopes or in material defects in advanced aerospace materials. Although prior entropy measures have been proposed, we expect the current approach to demonstrate significant enhancement in detection in real-time applications that could be ported to existing imaging systems.

Background

This study is based on an extension of the concept of joint entropy of two random variables to continuous functions, such as backscattered RF, and a reference signal, such as a reflection from a weak reflector. For two continuous random variables, X, Y with joint probability density function p(x,y), the joint entropy is

H(X,Y)=p(x,y)logp(x,y)dxdy. (1)

Ordinarily p(x,y) is a continuous function of x and y that takes non-zero values in a two dimensional region of the real plane. However, in the case where X=f(t) and Y=g(t) are both continuously differentiable functions, the random variables X and Y are concentrated exclusively on a curve, γ(t)=(f(t),g(t)), in the x,y plane as shown in Fig. 1. Thus the densities of their values cannot be represented using conventional functions as was done in previous studies of only one differentiable random variable.1, 2, 3, 4, 5, 7 Instead, this concentration can only be represented using a mathematically “singular” object like a series of Dirac Delta functions placed on γ(t) with infinitesimal spaces in between them. Technically, such an object is not a function; in precise terms, p(x,y) must be represented as a Schwartz distribution.16 Its use for imaging requires that the Schwartz distribution be coarse-grained, i.e., integrated, on a fine grid of small cells as shown in Fig. 2. The outcome of this course-graining, as shown in Appendix B, is the conversion of the probability density (which is a Schwartz distribution) into a bona fide probability distribution function such that the probability of X and Y being in a particular cell, which is the generalization of p(x,y) that we seek, is given by the time that γ(t) “spends” in the cell [Eq. B28 in the following text]. This is what we would expect intuitively if we imagined a test particle traversing the curve, γ(t), with velocity v(t)=f(t)2+g(t)2, and wanted to know the probability of finding it in a particular cell. Having this result, it is possible to extract a well-defined finite component from the coarse-grained version of Eq. 1, which we will denote by Hf,g, in terms of f(t) and g(t).

Figure 1.

Figure 1

(Color online) Left panel: A plot of the curve γ(t) used to visualize the joint cumulative density function, W(x,y), of values of the functions f(t) and g(t). The probability density is non-zero only on the curve γ(t) because the random variables X=f(t), Y=g(t)are concentrated exclusively on γ(t). On the right is an graphical calculation of W(x,y) using the graphs of f(t) and g(t). The exact definition of W(x,y) is given in Eq.B1.

Figure 2.

Figure 2

Uniform partition of the xy-plane into ε×ε cells.

As with the entropies previously investigated, Hf,g is applied in a moving window analysis. However, its computation requires two input waveforms. The first, f(t), is exactly the same as in previous investigations. However, choice of the second reference waveform, g(t), constitutes an additional degree of freedom that may be exploited to increase sensitivity of the entropic analysis. Discovery of strategies for identification of the optimum reference waveform is one of the eventual goals of our research. In this study, we present the results of preliminary investigations that indicate a reflection of the imaging system's interrogating pulse from a weak reflector, such as a water-agarose interface, constitutes a good initial guess for the optimum. To summarize our findings:

  • (a)

    Hf,g analysis may be applied in the same situations as previous entropies, Hf, I(r), If,. In fact, all of the data for this study have been analyzed previously with the use of at least one of these entropies. In this study, we repeat several of these analyses using Hf,g images.

  • (b)

    Hf,g analysis is amenable to real-time implementation, as was If, analysis.

  • (c)

    In all cases studied so far, Hf,g analysis has proven to be more sensitive than the best previous results obtained by other entropies. For example, preliminary application of this new entropy to the MDA 435 data shown in Figs. 7 and 8 shows a roughly twofold increase in peak confidence ratios.

Figure 7.

Figure 7

Confidence, c, panels for ΔHf,g (left column) and ΔIf, (right column) for all groups used in our study. (A) MDA 435-implanted mice injected with αvβ3-targeted nanoparticles (N=5), (B) MDA 435-implanted mice injected with non-targeted nanoparticles (N=5), (C) MDA 435-implanted mice injected with saline (N=5), (D)–(F), same injections into N=5 tumor-free mice. The smoothing spline parameter S=1.0 in all cases. A color-to-confidence calibration bar is shown to the right of the confidence panels.

Figure 8.

Figure 8

Comparison ΔHf,g (left) and ΔIf, (right) for data acquired in a pre-cancer model (Ref. 7) like that shown in Fig. 7. While analysis at all thresholds between 0% and 100% was undertaken, we wish to focus attention on the results obtained between 75% and 100% (results from lower analysis thresholds have been “dimmed” by a translucent overlay). In both cases, a broad region of confidence values above 2 is shown. While the extent of this region is larger in the ΔIf, confidence panel, the maximum height of the ΔHf,g confidence panel is more than twice as high (16.0 vs 6.8). A color-to-confidence calibration bar is shown to the right of the confidence panels.

Conventions and calculation

To fix terminology, suppose that we are given two functions, f(t) and g(t), assumed to map [0,1] into (1,1) (i.e., |fmax|, |fmin|, |gmax|, |gmin| are all strictly less than one). We are interested in computing the joint entropy,

H(f,g)=11dx11dyw(x,y)log[w(x,y)], (2)

where w(x,y) is their joint density, defined on the domain Ω=(1,1)×(1,1). It turns out that for the types of functions f(t) and g(t) that we will consider (intuitively thought of as “well-behaved”), w(x,y) is not a real-valued function but a (Schwartz) distribution so that Eq. 2 is not well-defined.

To get around this difficulty, we employ a “coarse-grained” approximation to Eq. 2 obtained, by dividing Ω into ε×ε cells, where ε=1/N as shown in Fig. 2, and integrating w(x,y) over each cell. We denote the ε×ε cells intersecting γ(t) at initial time ti by Ci and index them by i=1,,M (M will be discussed later). For subsequent discussion, we define the distance traversed along γ(t) when it enters the ith cell by si. Figure 9 illustrates our terminology. The final step in the coarse-graining process is replacement of H(f,g) by the quantity Hf,g defined by

Hf,g=limϵ0[HΔ(f,g)ε]=limϵ0[j,kpΔ(j,k)log[pΔ(j,k)]ε], (3)

where pΔ(j,k)=0 unless γ(t) passes through the (j,k)th cell, in which case (as mentioned in the preceding text and as we shall show in the following text), pΔ(j,k) is equal to the time γ(t) spends in the (j,k)th cell.

Figure 9.

Figure 9

A diagram of the computation of time spent by γ(t) in an ε×ε cell in the xy plane.

Our ultimate goal is to obtain an expression for Hf,g in terms of f(t) and g(t) because these quantities are experimentally accessible. As is carefully shown in Appendixes AC, the following relation is true:

Hf,g1201dtmin[|f(t)|,|g(t)|]max[|f(t)|,|g(t)|]01dtlog[|max[|f(t)|,|g(t)|]|]. (4)

It is a remarkable fact that derivation of Eq. 4, which uses the ergodic theorem, also requires the fact that the set of points for which f(t)/g(t) is rational comprises a set of measure zero (in the set of real numbers). Thus the relation is essentially derived in a noncomputable setting. Nevertheless, the end result, Eq. 4, may be evaluated numerically. The full details are provided in the appendixes.

RESULTS

Noise simulation

The effect of noise on the input waveform f(t) is shown in Fig. 3. To produce this plot, Gaussian distributed noise of different amplitude was added to f(t)=e10(t0.4)2sin(4πt) and g(t)=e10(t0.4)2sin(2πt) (simulations using uniformly distributed noise have also been performed yielding essentially similar results). The range of noise levels spans 20 to 120 dB. At each noise level, 1000 waveforms f(t) and reference waveforms g(t) were created, Hf,g computed, and the average Hf,g and standard error computed. Each Hf,g is shown in the plot, with its standard error bar, which in all but one case is too small to see. We notice that for signal-to-noise ratios above 80 dB, it is possible to obtain three-digit agreement with the noise-free answer [which was obtained using a separate mathematica implementation of Eq. 4 that was also used to double check the correctness of our C programming language implementation of the same equation]. However, for signal-to-noise ratios between 40 and 60 dB, which is the relevant range for our experimental data, agreement to at most one or two digits is achievable. This limitation shows that the numerical precision of our composite integrator is more than enough for analysis of experimental data, which always contain noise.

Figure 3.

Figure 3

(Color online) Effect noise on the computation of Hf,g using Eq. 4. Each data point is the average of Hf,g computed from N=1000 different waveforms g(t)=e10(t0.4)2sin(2πt) to which Gaussian noise has been added. The signal-to-noise ratio simulated ranges from 20 to 120 dB. The reference trace f(t)=e10(t0.4)2sin(4πt) did not have noise added. Standard error bars are plotted with each point, most are too small to see on the scale of the plot.

Materials characterization: Defect detection

The first application of entropy imaging was to materials characterization;17 in particular, to detection and sizing of near-surface defects in graphite/epoxy aerospace structures.1, 2, 3, 4, 5 Figure 4 shows some previously published (top row)5 and newer results (middle and bottom row) for imaging of a near-surface “resin-rich” defect in a thin (4 mm thick) graphite/epoxy composite. The bottom left corner shows a schematic of the specimen, which was fabricated by removing a 3.81 cm diameter circular section from the second ply prior to fabrication. The specimen was then autoclaved following standard manufacturing practice. This forced excess epoxy into the circular void and created the resin-rich region. During the fabrication procedure, additional resin penetrated the inter-laminar region bordering the circular void. The result is an oval shaped defect centered on the original circular cutout.

Figure 4.

Figure 4

Five different types of images made using RF backscattered from a graphite/epoxy plate with a near-surface defect. Top row contains previously published results (Ref. 5). These were obtained using the full 512-point digitized waveform and without optimal smoothing splines. The next two rows show the results obtained using optimal smoothing splines on a 128-point subset of the full digitized waveform. A schematic of the experimental specimen is shown in the lower left corner of the figure. The Hf,g image shows the clearest delineation of defect boundaries.

This specimen was scanned on a 101 × 101 point grid using a 5 MHz broadband transducer with the backscattered RF acquired at 8-bit digital resolution and a 100 MHz sampling rate. The width of the interrogating pulse was therefore far too wide to permit gating of front-wall reflection from that of the defect, which lay within one ply (0.25 mm) of the surface. While the top row results5 were originally published as part of a larger study of six different types of near-surface defects, results in the resin-rich case were less conclusive than hoped for because, as the top row shows, the best understood images failed to detect the boundaries of the defect region. While other thermodynamic receivers used in that study5 did delineate the defect boundaries, their physical meaning was not readily apparent.

The middle and bottom rows of Fig. 4 show that this situation has been rectified by the newer signal processing algorithms described in the preceding text. One reason for the improvement is the use of optimal smoothing splines, which are used for global noise suppression.18 Additionally, only a gated portion of the backscattered RF (128 points long for this figure) was used. Gating enables exclusion of baseline regions that contain no useful data. While log[Ef] and If, images show the defect with relatively good contrast, the boundary between defect and surrounding plate “fades-out” at the “3 o'clock” position. This can be significant, for instance, if the images are to be used as input for automatic feature extraction software. On the other hand, at all points on its perimeter, the Hf,g image has greater contrast between defect boundary and surrounding “defect-free” plate (matching the clarity of the less understood images shown in earlier work (e.g., those labeled Z, CV, Eth, in Fig. 6 of that work5). We also observe a circular outline within the defect region. This is the diameter of the circular region removed from the second ply to produce the resin-rich region during the curing process used to fabricate the graphite/epoxy laminate. During that process, resin accumulated not only in the circular region but was also pushed out around its boundaries to create the oblong appearance of the defect. We note that the outline of the original circle is less evident in log[Ef] and If, images and is not visible in any of the images from the previous study.5 The reference g(t) was a reflection from a stainless steel reflector placed at the 4-in. focal plane of the transducer. Because the experimental system used for data acquisition has been verified to be stable to at least the −40 decibel level, this reference was acquired once, prior to scanning of the entire collection of graphite/epoxy specimens. We observe that this is equivalent to the use of a reference acquired from the weak reflector that we use in biological studies. The operational criterion for choice of reflector is that it roughly matches the reflection coefficient of the scatterers in the specimen trace f(t).

Figure 6.

Figure 6

Use of ΔHf,g vs time plots to prepare confidence, c, panels. Step 1: An analysis threshold is set, and Hf,g images are analyzed to produce plots of ΔHf,g; the time plot for the 96% analysis threshold is shown. Step 2: The confidence as a function of time is computed according to Eq. 6 and then converted to a color bar shown parallel to the x axis of the plot. Step 3: These color bars are stacked vertically left-to-right, in increasing order, to prepare a confidence panel showing the confidence obtained at all time points for all analysis thresholds between 0% and100%. A color-to-confidence calibration bar is shown to the right of the confidence panel.

Medical imaging: Targeted tumor imaging

Previously, we reported on application of entropic quantifies Hf and If, for detection of subtle changes in RF backscatter induced by accumulation of targeted nanoscale contrast enhancers in tumor neovasculature6, 8 or pre-cancerous tissue.7 We have used the same raw RF from these studies to prepare Hf,g images, which were then analyzed for evidence of nanoparticle accumulation. As the materials and methods for these studies are described in previous publications,8 we provide only a brief summary of data acquisition and analysis in each case and then present a comparison of the Hf,g and If, analyses.

MDA 435 tumors implanted in athymic nude mice

Human MDA 435 cancer cells were implanted in the inguinal fat pads of 15 athymic nude mice between 19 and 20 days prior to acquisition of data. Five of these animals were injected with αvβ3-targeted nanoparticles, five were injected with nontargeted nanoparticles, and five were injected with saline at a whole body dose of 1 ml/kg. In addition, 15 athymic nude mice not implanted with tumors were imaged in the same region following the same imaging protocol: five were injected with αvβ3-targeted nanoparticles, five were injected with nontargeted nanoparticles, and five were injected with saline.

RF data were acquired with a research ultrasound system (Vevo 660, Visualsonics, Toronto, Canada) at 0 through 60 min in 5-min intervals after injection. The tumor was imaged with a 35-MHz center frequency single element “wobbler” probe, and the digitized RF data corresponding to single frames were stored on a hard disk for later off-line analysis. The frames consisted of 384 lines of 2048 12-bit words acquired at a sampling rate of 200 MHz using a Gage 12400 digitizer card (connected to the analog-out and sync ports of the Vevo) in a controller PC. Each frame corresponds spatially to a region 1.5 cm wide and 0.8 cm deep. Further description of materials and methods may be found in previous publications.8 The reference trace g(t) was obtained by digitizing a reflection of the imaging system's interrogating pulse from a water-agarose interface using the same acquisition parameters. As for the materials study, the long-term stability of the system electronics has been verified better than the 40 dB level. Consequently, one reference trace was acquired prior to beginning the study and used for all subsequent Hf,g analysis.

A moving window analysis was performed on each waveform by moving a rectangular window (128 points long, 0.64 μs) in 0.08 μs steps (16 points), resulting in 121 window positions within the output data set. A smoothing spline was fit to each window. Issues surrounding signal-to-noise (SNR) ratio, SNR estimation, and robust noise suppression for these data have been discussed in a previous publication.8 The fitting routine also returned an array of first and second derivatives at the locations of any critical points in the window. The arrays were used to compute Hf,g (or If,). This produced an image for each time point in the experiment (i.e., 0, 5,…, 60 min).

The segment of RF within the boxcar, or moving window, is processed to produce a pixel value for an Hf,g (or If,) image. For each mouse used in this study, this is done using RF data acquired at 0, 5,…, 60 min post-injection to produce an image at each time point. For this study, in which the same portion of the anatomy was imaged at successive intervals, our goal was to identify and quantify the accumulation of targeted nanoparticles, which occurs preferentially at targeting sites. This suggests that segmenting the image into “targeted” and “non-targeted” regions will be required as part of the analysis.

One of the chief goals of our research has been to develop objective segmentation algorithms that do not require user selection (e.g., hand drawn regions of interest). Figure 5, displays the steps of such an algorithm graphically. For each mouse used in this study, a histogram of the image pixel values appearing over the entire time course (i.e., 0, 5, 10,…, 60 min) was constructed and normalized to obtain the probability density function (PDF) of these values and then integrated to obtain the cumulative distribution function (CDF). This is shown in the top panel of the figure. Next, pixel values corresponding to “analysis-thresholds” at the lower 2%, 4%, …, 98% of the CDF were then used to segment the images at each time point into two regions corresponding to targeted and nontargeted tissue. The figure shows the segmentation for an example analysis-threshold of 44%. The blue lines, shown in the second panel of the figure, indicate the boundary between targeted (inside the blue boundaries) and non-targeted (outside the blue boundaries) regions. Subsequently, the mean value of pixels in the targeted region were computed as a function of time post-injection. This is indicated in the bottom panel of the figure, in which the mean value of Hf,g found at each time, denoted in the figure by Hf,gi, i = 1,…, 12, is shown immediately below the thresholded image. Subsequently, the change in these mean values,

ΔHf,g=Hf,giHf,g0, (5)

is computed (ΔHf,g0=0). The analysis diagrammed in Fig. 5 was performed for all animals in all groups and the results averaged by group. We will drop the subscript i in the remaining discussion and refer only to the ΔHf,g.

Figure 5.

Figure 5

Analysis steps for Hf,g images from a single mouse. Step 1: The cumulative distribution function (CDF) of all 12 images acquired from the animal are computed. Step 2: The CDF is used to segment the image into “targeted” (inside the blue boundaries) and “non-targeted” (outside) segments according to pixel value being above or below the threshold level (44% in this case). Step 3: the mean value, Hf,gi, of Hf,g is computed for each targeted region. Step 4: These are used to calculate ΔHf,gi=Hf,giHf,g0 for each time point in the study. This procedure is repeated for all mice in each group.

ΔHf,g, corresponding to a 98% analysis-threshold is plotted in Fig. 6, in panel number one. For the purposes of detection, it is actually the ratio of the mean value to the standard error that is significant. We have plotted an example of these ratios (for the MDA 435 implanted group injected with αvβ3-targeted nanoparticles) in panel two of Fig. 6. To shorten subsequent discussion of our results, we will define this ratio (confidence) as19

c=meanstandarderror. (6)

As this panel shows, for an analysis-threshold of 98%, the mean value of ΔHf,g is at all times at least four standard deviations or more from zero and peaks at 16 standard deviations from the mean Hf,g at zero time. We have mapped each confidence value to a color scale shown along the x axis of the plot. Panel three of the figure shows the aggregate of all color maps resulting from analysis thresholds between 0% and 98%. These aggregated color maps, or confidence panels, may be used to quickly identify analysis thresholds at which accumulation of targeted nanoparticles is successfully detected by either Hf,g or If, imaging and to quantify the sensitivity of each image type.

An inter-receiver comparison is shown in Fig. 7, which is a five-dimensional presentation having three dimensions within each confidence panel (i.e., post-injection time in the vertical direction, analysis-threshold in the horizontal direction, and confidence c in the out-of-plane or color direction), another dimension for animal group (the vertical direction within the array), and a fifth dimension being the masking level for the absolute value of confidence ratio c (the horizontal direction within the array). In the left column are ΔHf,g confidence panels for all six groups used in our study as indicated in the figure caption. In the right-hand column are the corresponding ΔIf, confidence panels made using the same RF data and smoothing spline processing parameters.

In spite of the high dimensionality of the display, rapid comparison of Hf,g or If, processing is possible, as well as inter-group comparison. For instance, in both columns of the top row, we observe “extensive” (meaning contiguous, wider than three columns over at least half the height of the confidence panel) regions, corresponding to confidence values greater than four in magnitude. The remaining confidence panels (for the control groups), with one exception, have much lower confidence value magnitudes in the same range of analysis-thresholds. Comparison of the confidence panels in the left column (ΔHf,g) shows that the top confidence panel, corresponding to the MDA 435-implanted group injected with αvβ3-targeted nanoparticles, generally exhibits confidence ratios with the greatest magnitude. There is an anomalous region of high confidence ratio for the saline group C appearing at 30 min post-injection. Because no nanoparticles were injected into this group, it might be classified as a “false-positive” for the technique. Further research, aimed at understanding and eliminating this effect, e.g., by using different reference traces g(t), is underway. However, we remark that this region is contained in a range of analysis thresholds that does not overlap the range of analysis thresholds for which the confidence ratio of the targeted MDA435-implanted group (row A) is large. Consequently, it possible to define a strategy for separating the groups that eliminates false positives. Moreover, for the top row, the ΔHf,g images produce confidence values that are more than twice those obtained using ΔIf, images. Consequently, the criteria: “|c|4,” with analysis threshold between 60% and 98% for Hf,g analysis (44%–62% for If, analysis), becomes a means of distinguishing the targeted/tumor-implanted group from all others. We observe also that these criteria exhibit the presence of targeted nanoparticles accumulating within five minutes after injection. While the figure shows that we may distinguish the targeted tumor-implanted group from the control groups, the issue of single animal “diagnosis” remains open due to the small cohort of animals studied. The current study is a demonstration of feasibility and suggests the utility of conducting a larger experiment having a proper receiver operator curve analysis that would permit quantification of true positives, true negatives, false positives, and false negatives.

Transgenic K14-HPV16 mouse tumor model

The model used is the transgenic K14-HPV16 mouse that contains human papilloma virus-16 oncoproteins driven by a keratin promoter so that lesions develop in the skin, particularly in the ear. Typically the ears exhibit squamous metaplasia, a pre-cancerous condition, associated with abundant neovasculature that expresses the αvβ3 integrin. Eight transgenic mice20, 21 were treated with 1.0 mg/kg i.v. of either αvβ3-targeted nanoparticles (n = 4) or nontargeted nanoparticles (n = 4) and imaged dynamically for 1 h using a research ultrasound imager (Vevo 660 40 MHz probe) modified to store digitized RF waveforms acquired at 0, 15, 30, and 60 min. time points. Further details may be found in reference 8. All RF data were processed in the same manner as the MDA 435 data.

The results are shown in Fig. 8, which displays the three confidence panels obtained using Hf,g images on the left and If, images on the right. Broad, contiguous, regions of large confidence ratio magnitude only occur in the right side of the panels, corresponding to analysis thresholds of 78% to 98%. Consequently, we have emphasized these portions by enclosing them with dotted lines while placing a semi-transparent layer over the complementary region (which has been displayed in this manner for completeness). Focusing attention on the 78% to 98% confidence thresholds, we see that Hf,g analysis exhibits twice the sensitivity of If, analysis for the targeted group. Moreover, Hf,g analysis, at these analysis thresholds, exhibits greater separation between the targeted group and the saline control group than does If, analysis. While separation between the targeted group and the non-targeted control is greater for If, analysis, the panels show that Hf,g analysis still yields several standard deviations of separation between the two groups.

DISCUSSION AND CONCLUSIONS

In all cases, the sensitivity afforded by analysis of Hf,g images exceeds that obtained in our best previous analyses of the same data by roughly a factor of 2. The main difference between the previous approach and the current one is employment of a reference trace. The results presented in this study employed a reference, water-path only reflection from a weak reflector, which is the natural choice in many conventional ultrasonic analyses that deconvolve the transfer function of the experimental apparatus from the raw experimental data. In our case, the rationale for application of this reference is based on the relation between entropy, H(A), joint entropy, H(A,B), and conditional entropy of B given A, H(B|A) for discrete random variables A, and B, which is

H(B,A)=H(A,B)=H(A)+H(B|A). (7)

If A is taken as a reference variable and we compute H(B,A) where B is another random variable, then H(B,A)H(B,A)=H(B|A)H(B|A). Because analysis of entropy images described in the preceding text is based on differences between joint entropies, the last equation shows that they can be reinterpreted as differences between conditional entropies, i.e., heuristically at least, we are looking at the differences in information contained in backscattered RF (e.g., either B, or B) given that the tissue was interrogated with a certain reference pulse (A).

The reference trace g(t) is also a “knob” that we can “turn” to adjust the sensitivity of the image to specific attributes in the reference waveform. In our heuristic interpretation, we are examining the extent of differences between the information in the reference trace and the backscatter. In other words, we are estimating how much the information content of the interrogating ultrasonic pulse has been changed by the scattering events occurring in the tissue.

In spite of the fact that our choice of reference appears natural, it is conceivable that in other circumstances, a different choice of reference waveform might be more compelling. Thus the choice of reference is to some extent arbitrary. Our response to this fact has been to post-process the resulting Hf,g images using all analysis thresholds and to determine if there exists a rational set of criterion for differentiating the targeted from control groups. The results of the present study, as well as previous studies,8, 15 advance one strategy for doing so. The investigation of other strategies for choice of an optimum reference given specific detection requirements is the subject of ongoing research.

ACKNOWLEDGMENTS

This study was funded by NIH EB002168, HL042950, and CO-27031 and NSF DMS 0966845. The research was carried out at the Washington University Department of Mathematics and the School of Medicine.

APPENDIX A: CALCULATION OVERVIEW

In this subsection, we derive from Eq. 2 an expression for Hf,g in terms of the experimentally accessible quantities f(t) and g(t). To do this, we claim the following assertions, which will be established in subsequent sections:

  • (a)

    Claim i:pΔ(j,k)=Δti=ti+1ti if γ(t) enters the (j,k)th cell at time ti and leaves it at ti+1. Otherwise, pΔ(j,k)=0. This is Eq. B28 in the following text. Consequently, evaluation of Eq. 3 comes down to being able to compute the limiting behavior of εΔtilog[Δti] as ε0. We will see presently that what we really need to compute is the limiting behavior of εΔsilog[Δsi].

This leads us to our second assertion:

  • (b)
    Claim ii: there exists an expression for the limiting behavior of εΔsilog[Δsi] in terms of f(t) and g(t), assuming that g(t)/f(t) is irrational. This is essentially Eq. A9 in the following text. Subsequently, we derive the expression for Hf,g in terms of the experimentally accessible quantities f(t) and g(t). The next two subsections will justify claims i and ii. At this point, we observe from Fig. 9 that the distance traversed, Δsi, in any member of Ci is given by
    Δsi=Δtivi, (A1)
    where vi=v(ti), i=,M. Using Eq. A1, Eq. 3 becomes the sum
    i=1MΔtilog[Δti]=1MΔtilog[Δsivi]=i=1MΔtilog[vi]+i=1MΔtilog[Δsi]. (A2)
    The first term in the sum has the limiting behavior
    i=1MΔtilog[vi]01dtlog[v(t)], (A3)
    as ε0. It remains to evaluate the sum
    i=1MΔtilog[Δsi]. (A4)
    However, we do not know the coordinates of the entry (or exit) points of γ(t) into the ith cell, and thus it appears that we lack the necessary information to calculate either Δti or Δsi. Nevertheless, if we assume that the portion of the curve γ(t) within each ε×ε cell is a straight line segment that starts at the point (f(ti),g(ti)) and has slope
    αig(ti)/f(ti) (A5)
    so that it leaves the ε×ε cell near the point (f(ti+1),g(ti+1)), we may then continue the limiting process by sub-partitioning each cell further into εs×εs cells as shown in Fig. 10. Given this subdivision, each term in the sum, Eq. A4, must now be replaced by the sum
    n=1MSiΔti,nlog[Δsi,n], (A6)
    where the n indexes the εs×εs cells crossed by γ(t) as it traverses the larger ε×ε cell. Thus the sum appearing in Eq. A4 is replaced by a double sum
    i=1Mn=1MSiΔti,nlog[Δsi,n]. (A7)
    We further assume that the velocity, vi, is constant over the entire ε×ε square as we further subdivide it. Thus the inner sum in Eq. A6 becomes
    n=1MSΔti,nlog[Δsi,n]=n=1MS1viΔsi,nlog[Δsi,n]=1vin=1MSΔsi,nlog[Δsi,n]Si. (A8)

    While it remains true that we still do not know where γ(t) will enter each of these smaller cells, we have nevertheless made progress because if we imagine stacking all of the εs×εs cells crossed by γ(t) on top of each other, as shown in Fig. 11, we will observe that, in almost every case (we will make this statement precise presently), the entry/exit points will be uniformly distributed around the perimeter of the smaller εs×εs cell. Given this picture, it is relatively easy to see that Δsi may be obtained by summing the transit lengths of γ(t) over the smaller “stacked” εs×εs cell and that in the limit where these crossings become infinite (i.e., εs0), the sum may be obtained as the integral of the transit lengths, starting along the boundary of the stacked cell.

Figure 10.

Figure 10

(Color online) Subdivision of an ε×ε cell into finer εs×εs cells required for evaluation of Eq. A6. We assume that ε is small enough that in the larger ε×ε cell γ(t) is a straight line and v(t) is constant. Legends indicate lengths of lettered segments.

Figure 11.

Figure 11

(Color online) Mapping of subgrids onto the “stacked” εs×εs square. The shaded εs×εs sub-squares of the original ε×ε cell are all stacked on top of each other. The corresponding points where γ(t) crosses boundaries of the sub-squares are lettered by capital or small letters. Legends indicates their status as iterates of the mapping Ty(yi) or Tx(xi) [Eqs. C4, C6]. For irrational αi=g(ti)/f(ti), the ergodic theorem guarantees that as the number of sub-squares is increased, the density of labeled points on vertical and horizontal sides becomes uniform.

The almost every case in the preceding text occurs when the ratio g(t)/f(t) is irrational. The reader might anticipate this fact by considering first the case where the ratio is rational (and the lengths of the sides of the εs×εs cell is rational, which is the case with our conventions), then it is easy to see that a small set of entry/exit points will be used over and over again, i.e., the trajectory cell crossings will exhibit periodic behavior. It is also relatively easy to see that the number of these entry/exit points grows as the decimal expansion of the g(t)/f(t) grows in length. Consequently, we might anticipate that for irrational g(t)/f(t), the number of these points is infinite. The ergodic theorem tells us that this is, in fact, true and that they are also uniformly distributed around the perimeter of the εs×εs cell. Thus the entry point into the original (larger) ε×ε cell becomes irrelevant because as the εs×εs cells are made smaller, all entry values occur with equal probability. Actually, while application of the ergodic theorem eliminates that we know where γ(t) enters a cell, it also introduces a cell dependent scaling factor into our expression for Δsi. Consequently, we have traded one missing piece of information for another. However, these scaling factors group together in the course of the calculation can be eliminated from the final expression for Si [see Eq. C18]. Moreover, if f(t), g(t) are continuously differentiable over their range, then the set of points at which f(t)/g(t) is rational forms a set of measure zero, which may be ignored in any integral, so that it is only the cases where g(t)/f(t) is irrational that matter. We provide the precise details in a subsequent section, the end result is:

Si=1vin=1MSΔsi,nlog[Δsi,n]=121viΔsimin[|f(ti)|,|g(ti)|]max[|f(ti)|,|g(ti)|]+1viΔsilog[1+(min[|f(ti)|,|g(ti)|]max[|f(ti)|,|g(ti)|])2]+1viΔsilog[εs]=12Δtimin[|f(ti)|,|g(ti)|]max[|f(ti)|,|g(ti)|]+Δtilog[1+(min[|f(ti)|,|g(ti)|]max[|f(ti)|,|g(ti)|])2]+Δtilog[εs]. (A9)

Summing these Si over i, we obtain

i=1M(12Δtimin[|f(ti)|,|g(ti)|]max[|f(ti)|,|g(ti)|])+i=1MΔtilog[1+(min[|f(ti)|,|g(ti)|]max[|f(ti)|,|g(ti)|])2]+i=1MΔtilog[εs]. (A10)

The last term sums to log[εs] because the time required to traverse the curve is one. The remaining terms become integrals in the limit where ε0. Collecting all terms

HΔ(f,g)01dtlog[v(t)]1201dtmin[|f(t)|,|g(t)|]max[|f(t)|,|g(t)|]+01dtlog[1+(min[|f(ti)|,|g(ti)|]max[|f(ti)|,|g(ti)|])2]+log[εs], (A11)

as εs0. Because the form of the singularity in this limit log[εs] is independent of f(t) and g(t), it makes sense to define

Hf,g01dtlog[v(t)]1201dtmin[|f(t)|,|g(t)|]max[|f(t)|,|g(t)|]+01dtlog[1+(min[|f(ti)|,|g(ti)|]max[|f(ti)|,|g(ti)|])2]. (A12)

We observe that this expression is completely symmetric in f(t) and g(t) as expected.

Moreover, because

1+(min[|f(t)|,|g(t)|]max[|f(t)|,|g(t)|])2

is equal to

=(max[|f(t)|,|g(t)|]max[|f(t)|,|g(t)|])2+(min[|f(t)|,|g(t)|]max[|f(t)|,|g(t)|])2=min[|f(t)|,|g(t)|]2+max[|f(t)|,|g(t)|]2max[|f(t)|,|g(t)|]2=f(t)2+g(t)2max[|f(t)|,|g(t)|]2=v(t)max[|f(t)|,|g(t)|]2, (A13)

Eq. A11 becomes

Hf,g1201dtmin[|f(t)|,|g(t)|]max[|f(t)|,|g(t)|]01dtlog[|max[|f(t)|,|g(t)|]|]. (A14)

Before proceeding to the claims, we point out that in this study, we consider only the case where self-intersections of γ(t) are isolated as shown in Figs. 1, 12, and 14. Inspection of our experimental data shows that this assumption is valid in all but a negligible number of cases, which have been “masked” as described in the following text and excluded from analysis. In the case of complete overlap, where f(t)=g(t), the joint entropy Hf,g reduces effectively to Hf entropy published previously.6, 7 An intermediate case, of total overlap for part of γ(t) and isolated self-intersection on other portions can be handled by breaking up the integrals we will obtain into separate portions.

Figure 12.

Figure 12

Examples of two time-domain functions, f(t)=x, g(t)=y (left side) and the path they define, γ(t), in the x,y plane (right side). The action of the Schwartz distribution w(x,y) in a two dimensional integral is to collapse it into an equivalent one dimensional integral over time.

Figure 14.

Figure 14

(Color online) Computation of p(j,k) (left). Computation of the marginal probability p(k) (right).

APPENDIX B: JUSTIFICATION OF CLAIM i

We begin by establishing the joint density w(x,y) as the (Schwartz) distribution obtained from the cumulative joint density function, W(x,y), for x=f(t) and y=g(t), While W(x,y) is a conventional function of x and y, w(x,y) is not, so that a course-graining operation on w(x,y) using test functions must be defined. This leads to the main result of this section, Eq. B28. Although this permits computation of the course-grained joint-entropy, we must also verify consistency of this approach with that used to obtain previous entropies, e.g., Hf, If(r), and If,, which were based on the density function, wf(y) for a single function y=f(t).6, 7, 8, 12, 22 This is established by showing that the marginal density functions obtained from w(x,y) are just the single function densities employed previously. The same results are established for the course-grained marginal probabilities.

Calculating W(x,y)/xy using Schwartz distributions

Typically, w(x,y) is defined as the second partial derivative (with respect to x and y) of the joint cumulative density function W(x,y). However, for the types of random variables, we consider (assumed to be infinitely differentiable) the joint density requires further discussion since, unlike wf(x) and wg(y),7, 12, 22 it is not a regular function (and is defined in the following text as a distribution in the sense of L. Schwartz). So we must start with the joint cumulative density function, W(x,y), for f(t)=x, g(t)=y, an example of which is shown in Fig. 1. This is a regular function defined by

W(x,y)=|{t:f(t)xandg(t)y}|, (B1)

where |{·}| denotes the measure (“length”) of the set. The right side of Fig. 1 has an example calculation of W(0.5,0.75) directly from Eq. B1.

On the left side of the figure, an alternative (and eventually more useful) way of looking at W(x,y) is shown. The trajectory defined by

γ(t)=(f(t),g(t)), (B2)

parameterized by time running from 01, is shown. Also shown is a red box having lower left corner at the point (x,y)=(-1,-1) and its upper right corner at (x,y)=(0.5,0.75). The portions of the trajectory within this rectangular region are colored red, and the entry and exit times of the red portions are labeled to show their correspondence with times in the right-hand side of the figure. The sum of these times is W(0.5,0.75), so that W(x,y) measures the time that the trajectory spends in the red box.

The figure also suggests that the derivative of W(x,y) will exhibit jump discontinuities for certain values of x and y as the number of segments of γ(t) contained in the red rectangle changes discontinuously, for instance, as the height of the red rectangle is adjusted so that its top moves from below to just above the point labeled t0. This result may also be anticipated by formally differentiating the integral representation of W(x,y) given in Eq. B4. Consequently, careful definition of w(x,y) will require the use of Schwartz distributions.

If H(x) is the Heaviside function

H(x)={0x<0,1x0, (B3)

then

W(x,y)=01H(xf(t))H(yg(t))dt. (B4)

We note that the joint density w(x,y)=(2/xy)W(x,y) appears only in expressions like

I=11dx11dyϕ(x,y)w(x,y). (B5)

While w(x,y) is not a regular “point” function, we recognize that it can be defined as a Schwartz distribution. Thus we assume that we may freely interchange orders of integration and that if ϕ(x,y)Cc(Ω) [recall Ω=(1,1)×(1,1)], we obtain

I=11dx11dyϕ(x,y)w(x,y)=11dx11dyϕ(x,y)2W(x,y)xy=11dx11dy2ϕ(x,y)xyW(x,y), (B6)
I=-11dx-11dy2f(x,y)xy01dtH(x-f(t))×H(y-g(t)) (B7)
=01dt11dx11dy2ϕ(x,y)xyH(xf(t))×H(yg(t)) (B8)
=01dt11dxH(xf(t))
×[ϕ(x,y)xH(yg(t))|y=1y=111dyϕ(x,y)xyH(yg(t))], (B9)
I=01dt11dxH(xf(t))×11dyϕ(x,y)xyH(yg(t)), (B10)

where the boundary terms vanished because ϕ(x,y)Cc(Ω). Using

11dyF(x,y)yH(yz)=F(x,z), (B11)

with

F(x,y)=ϕ(x,y)x, (B12)

Eq. B10 becomes

I=-01dt-11dxH(x-f(t))ϕ(x,g(t))x. (B13)

Integrating by parts again leads to

I=01dt[ϕ(x,g(t))H(xf(t))|x=1x=111dxxH(xf(t))ϕ(x,g(t))] (B14)
=01dt11dxxH(xf(t))ϕ(x,g(t)), (B15)

where once again the boundary terms vanish because ϕ(x,y)Cc(Ω). Using Eqs. B11, B12, Eq. B15 becomes

I=01dtϕ(f(t),g(t)). (B16)

Combining this result with Eq. B5, we see that, as a Schwartz distribution, w(x,y) obeys the relation

11dx11dyϕ(x,y)w(x,y)=01dtϕ(f(t),g(t)). (B17)

This generalizes the expression for single (differentiable) random variable density function

01ϕ(y)wf(y)dy=01ϕ(f(t))dt, (B18)

which we have employed in the definition of Hf as described in previous studies.6, 7, 8, 12, 22

Where w(x,y) “lives” in terms of f(t) and g(t)

Equation B16 can be visualized using the picture shown on the right-hand panel of Fig. 12. The fact that w(x,y) is represented by a Schwartz distribution means that the “probability” is effectively concentrated on the curve γ(t). Thus the current situation differs significantly from typical applications of jointly distributed random variables, which are usually distributed continuously over two-dimensional regions of the real plane.

Marginal distributions

Consistency of the current approach with previous studies requires that the marginal density functions for w(x,y) obtained using Eq. B17 should be identical with the single (differential) random variable density functions of Eq. B18. To check this, we let ϕ(x)Cc(Ω) and compute

I=fminfmaxdxgmingmaxdyϕ(x)w(x,y)=01dtϕ(f(t))=fminfmaxdxϕ(x)wf(x). (B19)

But also

I=fminfmaxdxgmingmaxdyϕ(x)w(x,y)=fminfmaxdxϕ(x)gmingmaxdyw(x,y). (B20)

Equations B19, B20 imply

wf(x)=gmingmaxdyw(x,y), (B21)

and similarly

wg(y)=fminfmaxdxw(x,y), (B22)

as required.

Choices of ϕ(x,y) for coarse-graining

The integral equations, Eqs. B17, B18, defining w(x,y) and wf(y) are all we need to compute the coarse-grained joint probabilities, pΔ(j,k), and the associated coarse-grained marginal probabilities, pΔ(j) and pΔ(k), which we will need as intermediate quantities to obtain Hf,g.

This is accomplished by choosing an appropriate function ϕ(x,y) to use in Eq. B17 [or for the marginal probabilities: choosing an appropriate ϕ(y) to use in Eq. B18]. Let 0<ε1 and let 0<s1 be a parameter used to control the sharpness of the edges of test functions ϕ(x,y), which will be used to effect coarse-graining of the Schwartz distribution w(x,y) and the density functions wf(x), and wg(y). We partition Ω into small square cells, ε×ε in size, over which we will compute integrals of product of test functions ϕ(x,y) and w(x,y), wf(x), or wg(y). The test function will be chosen so that as s0, the coarse-graining test functions approach unit height square waves that sample w(x,y), wf(x), or wg(y) over a small nonzero region. The aim is to simulate the operation of a digitizer that is sampling a one or two-dimensional function. We begin by defining an infinitely differentiable smooth function that “turns on” at x=jε as x increases,

ϕL,j(x)={e-1/(s(x-jε))2,0,x>jε;xjε, (B23)

and a corresponding smooth function that turns off at x=(j+1)ε as x increases

ϕR,j(x)={0,e1/(s(x(j+1)ε))2,x(j+1)ε,x<(j+1)ε. (B24)

Their product defines an infinitely differentiable step function,

ϕj(x)=ϕL,j(x)ϕR,j(x), (B25)

which approaches a unit height square wave, turning on at x=jε and turning off at x=(j+1)ε, as s0. This function may be used to “sample” a function of one variable, e.g., f(t)=x. Similarly ϕk(y) is defined as an infinitely smooth precursor to a square wave that turns on at y=kε and turns off at y=(k+1)ε. Their product may be used to define the infinitely differentiable (in x and y) function

ϕj,k(x,y)=ϕj(x)ϕk(y), (B26)

which may be used to sample functions (or Schwartz distributions) of two variables at x=jε, y=. Example ϕj(x), ϕk(y), ϕj,k(x,y) [which are all Cc(Ω)] are shown in Fig. 13.

Figure 13.

Figure 13

(Color online) Examples of course graining functions, ϕj(x) (top left), ϕk(y) (top right), and ϕj,k(x,y) (bottom).

We will also allow the two indices, k associated with the f(t)=x variable and j associated with the g(t)=y variable to run from to .

Course-grained probabilities pΔ(x,y), pΔ(x), and pΔ(y)fromw(x,y), wf(x), and wg(y)

Now we have all we need to coarse-grain the densities w(x,y), wf(x) and wg(y). Suppose that γ(t) passes only once through the j,kth cell, which we will denote by Cj,k. To coarse-grain w(x,y) over Cj,k, we compute,

pΔ(j,k)=limsfminfmaxdxgmingmaxdyw(x,y)ϕj,k(x,y)=lims01dtϕj,k(f(t),g(t))=limstitfdtϕj,k(f(t),g(t))=limstitfdt=tfti, (B27)

where ϕj,k(x,y) is defined by Eq. B26. For cells that do not intersect γ(t), we define pΔ(j,k)=0 for completeness. Defining s(ti) to be the length along γ(t) between t=0 and t=ti, Eq. B27 may be rewritten as

pΔ(j,k)=tfti=Δt=s(tf)s(ti)v(ti), (B28)

where v(ti)=f(ti)2+g(ti)2 is the velocity along the curve at time ti. If more than one passage through occurs Cj,k, then all traversal times ti such that (f(ti),g(ti))Cj,k must be added, and we obtain

pΔ(j,k)={tiCi,j}s(tf)-s(ti)v(ti). (B29)

Discrete marginal probabilities: From the standard definition

The discrete marginal probabilities are defined by

pΔ(k)=jpΔ(j,k). (B30)

For the course-grained marginal probabilities, we also set pΔ(k)=0 if γ(t) does not intersect any cell with index j (Cj,kγ(t)=0,k). Similarly we define,

pΔ(j)=kpΔ(j,k). (B31)

Discrete marginal probabilities: From Eqs. B21, B22, and B25

Alternatively, we may start with Eq. B21 and define

pΔ(j)=fminfmaxdxw(x,y)ϕj(x), (B32)

where ϕj(x) is given by Eq. B25. Similarly, using Eq. B22,

pΔ(k)=gmingmaxdyw(x,y)ϕk(y), (B33)

where ϕk(y) is also given by Eq. B25 and pΔ(j), pΔ(j) are zero in the same cases described after Eqs. B30, B31. Focusing on Eq. B29, we see from Fig. 14, that the integral will reduce, in the limit where s, to the sum of times spent in the cells lying between jε and (j+1)ε, so that we again obtain

pΔ(j)=kpΔ(j,k) (B34)

and

pΔ(k)=jpΔ(j,k). (B35)

We note based on previous work that as ε0 these are asymptotic to

pΔ(j)εwf(1+jε)=εwf(x) (B36)

and

pΔ(k)εwg(-1+)=εwg(y). (B37)

These are just the coarse-grained versions of Eqs. B21, B22 and demonstrate the consistency of computing marginals and then coarse-graining with coarse-graining and then computing marginals.

APPENDIX C: JUSTIFICATION OF CLAIM ii

While Eq. B28 gives the probabilities needed to compute joint entropies in terms of time, Δti, we must express it in terms of experimentally accessible quantities, which means quantities derived from the backscattered RF, f(t), and the reference signal g(t). The first step in this process is Eq. A1, which relates Δti, to Δsi using the velocity in the cell vi=f(t)2+g(t)2, However, as discussed in the preceding text, we are still faced by an apparently unsolvable problem because we do not know where γ(t) enters the ε×ε cell. Moreover, we also must eliminate the coarse-graining parameter, ε from the final expressions for joint entropy, which is accomplished by taking ε0. In this section, we provide the details of our approach beginning with the “stacking” process used above which may be made rigorous using the ergodic theorem23 as we now describe. Thus we suppose that we are given a straight line segment, , of slope α=g(ti)/f(ti), and length Δsi, contained in a cell of dimensions ε×ε, such as that shown in Fig. 10. We wish to subdivide the cell into εs×εs sub-cells as also shown in the figure, and compute the limiting form of

Sin=1MSΔsi,nlog[Δsi,n], (C1)

where

Δsi=n=1MSΔsi,n, (C2)

and the index n keeps track of the εs×εs cells crossed as γ(t) traverses the larger ε×ε cell. We begin, as shown in Fig. 11, by imagining that we translate all the subsegments of , to the εs×εs square, as shown in the left hand side of the figure (this is equivalent to the stacking procedure described in the preceding text). The y coordinates of those line segments that start out on the y axis are shown the figure. These are given by

Ty(r)(yi)=yi+rαεsmodεs, (C3)

where Ty(r)(yi) is the rth iteration of the mapping

Ty(y)=y+αmodεs. (C4)

Similarly the x coordinates of those line segments that start out on the x axis are given by

Tx(r)(xi)=xi+r1αεsmodεs, (C5)

where Tx(r)(xi) is the rth iteration of the mapping

Tx(x)=x+1αmodεs. (C6)

Thus the physical “stacking” procedure is implemented mathematically by iterating the maps Tx(x) and Ty(y). Because g(ti)/f(ti) is irrational, we know that both of these mappings are ergodic.23 Hence we have the following relations between sums and integrals involving continuous functions ϕ(x), ψ(y) and iterates of Tx(x) and Ty(y),

r=0Msx-1ϕ(Tx(r)(xi))Msx01dxϕ(x), (C7)

as Msx and,

r=0Msy1ψ(Ty(r)(yi))Msy01dyψ(y), (C8)

as Msy. While it is possible to use Eqs. C7, C8 to obtain Eq. A9, we will not do so. Instead, it is simpler to refer back to Fig. 11 and observe that Msx is the number of times that the line segment y=αx crosses the vertical lines x=mεs, where m is a non-negative integer, and Msy is the number of times that the line segment y=αx crosses the horizontal lines x=qεs, where q is a non-negative integer. Furthermore, we observe that the ergodic theorem tells us, via Eqs. C7, C8, that as s0, the crossing points become uniformly distributed along the edges of the εs×εs square in Fig. 11. This means that the random variable y shown in Fig. 15 is uniformly distributed. Thus the variable θ, which is a linear function of y, is also uniformly distributed over its range (the interval [0,1]). We will use this implication of the ergodic theorem to obtain Eq. A9.

Figure 15.

Figure 15

The black circle is a point where γ(t) crosses the vertical line in the εs×εs sub-grid shown in Fig. (11). Because α is irrational, the values of y are uniformly distributed along the vertical side of the εs×εs square by the ergodic theorem. This also implies that the variable θ is uniformly distributed over its range, the interval [0,1], because it is a linear function of y, as shown in the figure.

Relation between Msx and Msy

Let sh be the horizontal distance traversed and sv=|α|sh be the vertical distance traversed (so Δsi=1+α2sh). Then

Msx{sh/εs,sh/εs}, (C9)

where z is the largest integer z, and z is the smallest integer z. Similarly

Msy{sv/εs,sv/εs}. (C10)

As sv=|α|sh, we get Msy|α|Msx as εs0.

Calculation of Δsi

We have

Δsi=sh1+α2=sv1+1α2 (C11)

and

sh=εsMsx (C12)

with an error of at most εs by Eq. C9. Therefore

ΔsiMsxεs1+α2=Msxε, (C13)

where we have defined the symbol ε=εs1+α2 to simplify calculations in the following text.

Calculation of Si

We are now ready to calculate

Sin=1MSΔsi,nlog[Δsi,n].

We break the calculation into two cases: |α|<1 and |α|>1.

Case |α|<1: We note that MS=Msx+Msy is the number of times that the trajectory, γ(t), crosses either a vertical or a horizontal line of the εs×εs sub-grid. (Because α is irrational, there is at most one occasion where it crosses both simultaneously; we can ignore this event as εs0.)

If there are two successive vertical crossings, then

Δsi,nlog[Δsi,n]=εs1+α2log[εs1+α2]. (C14)

As |α|<1, if this does not happen, then there is exactly one horizontal crossing in between vertical crossings at n and n+2, and

Δsi,nlog[Δsi,n]+Δsi,n+1log[Δsi,n+1]=θεs1+α2log[θεs1+α2]+(1θ)εs1+α2log[(1θ)εs1+α2] (C15)

for some 0<θ<1. By the ergodic theorem, θ will tend to a uniformly distributed random variable on [0,1] as shown in Fig. 15. So letting ε=εs1+α2, the contribution from Eq. C15 is, on average,

01[θεlog[θε]+(1-θ)εlog[(1-θ)ε]], (C16)

or

εlogεε2. (C17)

Of the Msx vertical crossings, Msy=|α|Msx of them will have a horizontal crossing “sandwiched” in between and contribute the quantity shown in Eq. C17 on average; the remainder will contribute εlog[ε]. Thus we obtain

Si=|α|Msx[εlogεε2]+(1|α|)Msx[εlogε]=Msxε{|α|[logε12]+(1||α|)logε}=Δsi{|α|2+logε}=Δsi{|α|2+log1+α2+logεs}. (C18)

Case |α|>1: If |α|>1, we repeat the calculation with x and y “switched.” The calculation proceeds exactly as before although α must be replaced by 1/α.

Calculation summary: These results may be concisely written as

Si=Δsi{[12|α|+log[1+α2]]+log[εs]|α|1,(121|α|+log[1+1α2])+log[εs]|α|1 (C19)

or, recalling that α=g(t)/f(t),

Si=12Δsimin[|f(tk)|,|g(tk)|]max[|f(tk)|,|g(tk)|]+Δsilog[1+(min[|f(tk)|,|g(tk)|]max[|f(tk)|,|g(tk)|])2]+Δsilog[εs], (C20)

which is Eq. A9. This completes the derivation of Hf,g.

References

  1. Hughes M., “Analysis of ultrasonic waveforms using Shannon entropy,” Proc.-IEEE Ultrason. Symp. 2, 1205–1209 (1992). 10.1109/ULTSYM.1992.275884 [DOI] [Google Scholar]
  2. Hughes M., “Analysis of digitized waveforms using Shannon entropy,” J. Acoust. Soc. Am. 93, 892–906 (1993). 10.1121/1.405451 [DOI] [Google Scholar]
  3. Hughes M., “NDE imaging of flaws using rapid computation of Shannon entropy,” Proc.-IEEE Ultrason. Symp. 2, 697–700 (1993). [Google Scholar]
  4. Hughes M., “Analysis of digitized waveforms using Shannon entropy. II. High-speed algorithms based on Green's functions,” J. Acoust. Soc. Am. 95, 2582–2588 (1994). 10.1121/1.409828 [DOI] [Google Scholar]
  5. Hughes M. S., Marsh J. N., Hall C. S., Savy D., Scott M. J., Allen J. S., Lacy E. K., Carradine C., Lanza G. M., and Wickline S. A., “Characterization of digital waveforms using thermodynamic analogs: Applications to detection of materials defects,” IEEE Trans. Ultrason. Ferroelectr. Freq. Control 52, 1555 – 1564 (2005). 10.1109/TUFFC.2005.1516028 [DOI] [PubMed] [Google Scholar]
  6. Hughes M., Marsh J., Woodson A., Lacey E., Carradine C., Lanza G. M., and Wickline S. A., “Characterization of digital waveforms using thermodynamic analogs: Detection of contrast targeted tissue in MDA 435 tumors implanted in athymic nude mice,” Proc.-IEEE Ultrason. Symp. 1, 373–376 (2005). [DOI] [PubMed] [Google Scholar]
  7. Hughes M. S., McCarthy J. E., Marsh J. N., Arbeit J. M., Neumann R. G., Fuhrhop R. W., Wallace K. D., Znidersic D. R., Maurizi B. N., Baldwin S. L., Lanza G. M., and Wickline S. A., “Properties of an entropy-based signal receiver with an application to ultrasonic molecular imaging,” J. Acoust. Soc. Am. 121, 3542–3557 (2007). 10.1121/1.2722050 [DOI] [PubMed] [Google Scholar]
  8. Hughes M., Marsh G. M., Lanza, S. A. Wickline, J. McCarthy, B. Wickerhauser, V. annd Maurizi, and, W. K., “Improved signal processing to detect cancer by ultrasonic molecular imaging of targeted nanoparticles,” J. Acoust. Soc. Am. 129, 3756–3767 (2011). 10.1121/1.3578459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Marsh J. N., McCarthy J. E., Wickerhauser M., Arbeit J. M., Fuhrhop R. W., Wallace K. D., Lanza G. M., Wickline S. A., and Hughes M. S., “Application of real-time calculation of a limiting form of the Renyi entropy for molecular imaging of tumors,” IEEE Trans. Ultrason. Ferroelectr. Freq. Control 57, 1890–1895 (2010). 10.1109/TUFFC.2010.1630 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Marsh J. N., Wallace K. D., Lanza G. M., Wickline S. A., Hughes J. E., and McCarthy M. S., “Application of a limiting form of the Renyi entropy for molecular imaging of tumors using a clinically relevant protocol,” Proc.-IEEE Ultrason. Symp. 1, 53–56 (2010). [Google Scholar]
  11. Wallace K. D., Marsh J., Baldwin S. L., Connolly A. M., Richard K., Lanza G. M., Wickline S. A., and Hughes M. S., “Sensitive ultrasonic delineation of steroid treatment in living dystrophic mice with energy-based and entropy-based radio frequency signal processing,” IEEE Trans. Ultrason. Ferroelectr. Freq. Control 54, 2291–2299 (2007). 10.1109/TUFFC.2007.533 [DOI] [PubMed] [Google Scholar]
  12. Hughes M., Marsh J., Wallace K., Donahue T., Connolly A., Lanza G. M., and Wickline S. A., “Sensitive ultrasonic detection of dystrophic skeletal muscle in patients with Duchenne's muscular dystrophy using an entropy-based signal receiver,” Ultrasound Med. Biol. 33, 1236–1243 (2007). 10.1016/j.ultrasmedbio.2007.02.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Hughes M., Marsh J., Agyem K., McCarthy J., Maurizi B., Wickerhauser M., Lanza W. K. D. G., and Wickline S., “Use of smoothing splines for analysis of backscattered ultrasonic waveforms: Application to monitoring of steroid treatment of dystrophic mice,” IEEE Trans. Ultrason. Ferroelectr. Freq. Control 58, 2361–2369 (2011). 10.1109/TUFFC.2011.2093 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Seip R., Ebbini E., and O'Donnell M., “Non-invasive detection of thermal effects due to highly focused ultrasonic fields,” Proc.-IEEE Ultrason. Symp. 2, 1229–1232 (1993). [Google Scholar]
  15. Hughes M. S., McCarthy J. E., Wickerhauser M., Marsh J. N., Arbeit J. M., Fuhrhop R. W., Wallace K. D., Thomas T., Smith J., Agyem K., Lanza G. M., and Wickline S. A., “Real-time calculation of a limiting form of the Renyi entropy applied to detection of subtle changes in scattering architecture,” J. Acoust. Soc. Am. 126, 2350–2358 (2009). 10.1121/1.3224714 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Schwartz L., Mathematics for the Physical Sciences (Dover, New York, 2008), pp. 1–368. [Google Scholar]
  17. Hughes M., “A comparison of Shannon entropy versus signal energy for acoustic detection of artificially induced defects in Plexiglas,” J. Acoust. Soc. Am. 91, 2272–2275 (1992). 10.1121/1.403662 [DOI] [Google Scholar]
  18. Reinsch C. H., “Smoothing by spline functions,” Num. Math. 10, 177–183 (1967). 10.1007/BF02162161 [DOI] [Google Scholar]
  19. Sackett D. L., “Why randomized controlled trials fail but needn't. II. Failure to employ physiological statistics, or the only formula a clinician-trialist is ever likely to need (or understand!),” Can. Med. Assoc. J. 165, 1226–1237 (2001). [PMC free article] [PubMed] [Google Scholar]
  20. Arbeit J. M., Riley R. R., Huey B., Porter C., Kelloff G., Lubet R., Ward J. M., and Pinkel D., “DFMO chemoprevention of epidermal carcinogenesis in k14-hpv16 transgenic mice,” Cancer Res. 59, 3610–3620 (1999). [PubMed] [Google Scholar]
  21. Arbeit J. M., Mnger K., Howley P. M., and Hanahan D., “Progressive squamous epithelial neoplasia in k14-human papillomavirus type 16 transgenic mice,” J. Virol. 68, 4358–4368 (1994). [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hughes M. S., McCarthy J. E., Marsh J. N., Arbeit J. M., Neumann R. G., Fuhrhop R. W., Wallace K. D., Thomas T., Smith J., Agyem K., Znidersic D. R., Maurizi B. N., Baldwin S. L., Lanza G. M., and Wickline S. A., “Application of Renyi entropy for ultrasonic molecular imaging,” J. Acoust. Soc. Am. 125, 3141–3145 (2009). 10.1121/1.3097489 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Katok A. and Hasselblatt B., Introduction to the Modern Theory of Dynamical Systems (Cambridge University Press, Cambridge, UK, 1995), pp. 1–804. [Google Scholar]

Articles from The Journal of the Acoustical Society of America are provided here courtesy of Acoustical Society of America

RESOURCES