Entropy. 2024 Apr 19;26(4):346. doi: 10.3390/e26040346

Evaluating the Gilbert–Varshamov Bound for Constrained Systems

Keshav Goyal 1, Han Mao Kiah 1,*
Editor: T. Aaron Gulliver
PMCID: PMC11049528  PMID: 38667900

Abstract

We revisit the well-known Gilbert–Varshamov (GV) bound for constrained systems. In 1991, Kolesnik and Krachkovsky showed that the GV bound can be determined via the solution of an optimization problem. Later, in 1992, Marcus and Roth modified the optimization problem and improved the GV bound in many instances. In this work, we provide explicit numerical procedures to solve these two optimization problems and, hence, compute the bounds. We then show that the procedures can be further simplified when we plot the respective curves. In the case where the graph presentation comprises a single state, we provide explicit formulas for both bounds.

Keywords: Gilbert–Varshamov bound, constrained codes, asymptotic rates, sliding window constrained codes

1. Introduction

From early applications in magnetic recording systems to recent applications in DNA-based data storage [1,2,3,4] and energy-harvesting [5,6,7,8,9,10], constrained codes have played a central role in enhancing reliability in many data storage and communications systems (see also [11] for an overview). Specifically, for most data storage systems, certain substrings are more prone to errors than others. Thus, by forbidding the appearance of such strings, that is, by imposing constraints on the codewords, the user is able to reduce the likelihood of error. We refer to the collection of words that satisfy the constraints as the constrained space S.

To further reduce the error probability, one can impose certain distance constraints on the codebook. In this work, we focus on the Hamming metric and consider the maximum size of a codebook whose words belong to the constrained space S and whose pairwise distance is at least of a certain value d. Specifically, we study one of the most well-known and fundamental lower bounds of this quantity—the Gilbert–Varshamov (GV) bound.

To determine the GV bound, one requires two quantities: the size of the constrained space, |S|, and the ball volume, that is, the number of words at distance at most d−1 from a “center” word. In the case where the space is unconstrained, i.e., S = {0,1}^n, the ball volume does not depend on the center. Then, the GV bound is simply |S|/V, where V is the ball volume of a center. However, for most constrained systems, the ball volume varies with the center. Nevertheless, Kolesnik and Krachkovsky showed that the GV lower bound can be generalized to |S|/(4V̄), where V̄ is the average ball volume [12]. This was further improved by Gu and Fuja to |S|/V̄ in [13] (see pp. 242–243 in [11] for additional details). In the same paper [12], they showed that the asymptotic rate of the average ball volume can be computed via an optimization problem. Later, Marcus and Roth modified the optimization problem by including an additional constraint and variable [14], and the resulting bound, which we refer to as the GV-MR bound, improves on the usual GV bound. Furthermore, in most cases, the improvement is strictly positive.

In the roughly three decades since, however, very few works have evaluated these bounds for specific constrained systems. To the best of our knowledge, in all works that numerically computed the GV bound and/or the GV-MR bound, the constrained systems of interest have at most eight states [15]. In [15], the authors wrote that “evaluation of the bound required considerable computation”, referring to the GV-MR bound.

In this paper, we revisit the optimization problems defined by Kolesnik and Krachkovsky [12] and Marcus and Roth [14] and develop a suite of explicit numerical procedures that solve these problems. In particular, to demonstrate the feasibility of our methods, we evaluated and plotted the GV and GV-MR bounds for a constrained system involving 120 states in Figure 1b.

Figure 1.


Lower bounds for optimal asymptotic code rates R(δ;S) for the class of sliding-window constrained codes.

We provide a high-level description of our approach. For both optimization problems, we first characterized the optimal solutions as roots of certain equations. Then, using the celebrated Newton–Raphson iterative procedure, we proceeded to find the roots of these equations. However, as the latter equations involved the largest eigenvalues of certain matrices, each Newton–Raphson iteration required the (partial) derivatives of these eigenvalues (in some variables). To resolve this, we made modifications to another celebrated iterative procedure, the power iteration method, and the resulting procedures computed the GV and GV-MR bounds efficiently for a specific relative distance δ. Interestingly, if we plot the bounds for 0 ≤ δ ≤ 1, the numerical procedure can be further simplified. Specifically, by exploiting certain properties of the optimal solutions, we provide procedures that use fewer Newton–Raphson iterations.

Parts of this paper were presented at the IEEE International Symposium on Information Theory (ISIT 2022) [16]. In the next section, we provide the formal definitions and state the optimization problems that compute the GV bound.

2. Preliminaries

Let Σ = {0,1} be the binary alphabet and let Σ^n denote the set of all words of length n over Σ. A labeled graph G = (V, E, L) is a finite directed graph with states V, edges E ⊆ V × V, and an edge labeling L : E → Σ^s for some s ≥ 1. Here, we write v_i →^{σ} v_j to mean that there is an edge from v_i to v_j with label σ. The labeled graph G is deterministic if, for each state, the outgoing edges have distinct labels.

A constrained system S is, then, the set of all words obtained by reading the labels of paths in a labeled graph G. We say that G is a graph presentation of S. We further denote the set of all n-length words in S by S_n. Alternatively, S_n is the set of all words obtained by reading the labels of (n/s)-length paths in G. Then, the capacity of S, denoted by Cap(S), is given by Cap(S) ≜ lim sup_{n→∞} (log|S_n|)/n. It is well known that Cap(S) corresponds to the logarithm of the largest eigenvalue of the adjacency matrix AG (see, for example, [11]). Here, AG is a (|V| × |V|)-matrix whose rows and columns are indexed by V. For each entry (u,v) ∈ V × V, we set the corresponding entry to be one if (u,v) is an edge, and zero otherwise.
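Since Cap(S) is the logarithm of the dominant eigenvalue of AG, it can be approximated with a plain power iteration. The following sketch is our own illustration (not code from the paper); the golden-mean shift used below (binary words with no substring 11) is a standard example and not one of this paper's running examples.

```python
import math

def dominant_eigenvalue(A, iters=1000):
    """Power iteration for a primitive nonnegative matrix A (pure Python)."""
    n = len(A)
    v = [1.0] * n
    lam = 1.0
    for _ in range(iters):
        w = [sum(A[i][j] * v[j] for j in range(n)) for i in range(n)]
        lam = max(w)                  # sup-norm normalization
        v = [x / lam for x in w]
    return lam

def capacity(A):
    """Cap(S) = log2 of the dominant eigenvalue of the adjacency matrix A_G."""
    return math.log2(dominant_eigenvalue(A))

# Golden-mean shift (binary words with no substring "11"):
# capacity equals log2 of the golden ratio.
A = [[1, 1],
     [1, 0]]
print(round(capacity(A), 4))  # 0.6942
```

For the small matrices in this paper, a few hundred iterations already give the eigenvalue to near machine precision, since the error decays geometrically in the ratio of the two largest eigenvalue moduli.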

Every constrained system can be presented by a deterministic graph G. Furthermore, any deterministic graph G can be transformed into a primitive deterministic graph H such that the capacity of G is the same as the capacity of the constrained system presented by some irreducible component (maximal irreducible subgraph) of H (see, for example, Marcus et al. [11]). It should be noted that a graph G is primitive if there exists a positive integer ℓ such that (AG)^ℓ is strictly positive. Therefore, we henceforth assume that our graphs are deterministic and primitive. When |V| = 1, we call this a single-state graph presentation and study these graphs in Section 5.

For x, y ∈ S, d_H(x,y) is the Hamming distance between x and y. We fix 1 ≤ d ≤ n, and a fundamental problem in coding theory is finding the largest subset C of S_n such that d_H(x,y) ≥ d for all distinct x, y ∈ C. Let A(n,d;S) denote the size of the largest such subset C.

In terms of asymptotic rates, we fix 0 ≤ δ ≤ 1, and our task is to find the highest attainable rate, denoted by R(δ;S), which is given by R(δ;S) ≜ lim sup_{n→∞} (log A(n, δn; S))/n.

2.1. Review of Gilbert–Varshamov Bound

To define the GV bound, we need to determine the total ball size. Specifically, for x ∈ S_n and 0 ≤ r ≤ n, we define V(x,r;S) ≜ |{y ∈ S_n : d_H(x,y) ≤ r}|. We further define T(n,d;S) ≜ Σ_{x∈S_n} V(x, d−1; S). Then, the GV bound, as given by Gu and Fuja [13,17], states that there exists an (n,d;S) code of size at least |S_n|^2 / T(n,d;S).

In terms of asymptotic rates, there exists a family of (n,δn;S) codes such that their rates approach

RGV(δ) = 2Cap(S) − T(δ), (1)

where T(δ) ≜ lim sup_{n→∞} (log T(n, δn; S))/n.

In this paper, our main task is to determine RGV(δ) efficiently. We observe that, since Cap(S) = T(0), it suffices to find efficient ways of determining T(δ). It turns out that T(δ) can be found via the solution of a convex optimization problem. Specifically, given a labeled graph G = (V, E, L), we define its product graph G × G = (V', E', D) as follows:

  • V' ≜ V × V.

  • For (v_i,v_j), (v_k,v_ℓ) ∈ V', and (σ_1,σ_2) ∈ Σ^s × Σ^s, we draw an edge (v_i,v_j) →^{(σ_1,σ_2)} (v_k,v_ℓ) in E' if and only if both v_i →^{σ_1} v_k and v_j →^{σ_2} v_ℓ belong to E.

  • Then, we label the edges in E' with the function D : E' → {0, 1/s, …, 1}, where D((v_i,v_j) →^{(σ_1,σ_2)} (v_k,v_ℓ)) = d_H(σ_1,σ_2)/s.

A stationary Markov chain P on a graph G = (V, E, L) is a probability distribution function P : E → [0,1] such that Σ_{e∈E} P(e) = 1 and, for any state u of G, the sum of the probabilities of the outgoing edges equals the sum of the probabilities of the incoming edges. We denote by M(G) the set of all stationary Markov chains on G. For a state u ∈ V, let E_u denote the set of outgoing edges from u in G. The state vector π^T = (π_u)_{u∈V} of a stationary Markov chain P on G is defined by π_u = Σ_{e∈E_u} P(e). The entropy rate of a stationary Markov chain is defined by

H(P) = −Σ_{u∈V} Σ_{e∈E_u} P(e) log( P(e)/π_u ).

Furthermore, T(δ) can be obtained by solving the following optimization problem [12,14]:

T(δ) = sup{ H(P) : P ∈ M(G×G), Σ_{e∈E'} P(e)D(e) ≤ δ }. (2)

To this end, we consider the dual problem of (2). Specifically, we define a (|V|^2 × |V|^2)-distance matrix TG×G(y) whose rows and columns are indexed by V'. For each entry indexed by e ∈ V' × V', we set the entry to be zero if e ∉ E', and we set it to be y^{D(e)} if e ∈ E'. Then, the dual problem can be stated in terms of the dominant eigenvalue of the matrix TG×G(y).
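As a concrete illustration (our own sketch; the (tail, head, label) edge-list format is hypothetical and s = 1), the matrix TG×G(y) can be assembled directly from the edges of G. The check at the end verifies the identity TG×G(1) = AG ⊗ AG, which is also used in the proof of Lemma 2.

```python
# Assemble T_{GxG}(y) from an edge list (tail, head, label); with s = 1 the
# label Hamming distance is 0 or 1, and the product-edge entry is y^d.
def product_distance_matrix(n, edges, y):
    T = [[0.0] * (n * n) for _ in range(n * n)]
    for (u1, v1, a) in edges:
        for (u2, v2, b) in edges:
            d = 0 if a == b else 1
            T[u1 * n + u2][v1 * n + v2] += y ** d
    return T

# Golden-mean shift presentation (illustrative; not one of the paper's examples):
# state 0 = "last bit was 0", state 1 = "last bit was 1"; forbid "11".
edges = [(0, 0, '0'), (0, 1, '1'), (1, 0, '0')]

# Sanity check: at y = 1 every edge pair contributes 1, so T(1) = A_G (x) A_G.
A = [[1, 1], [1, 0]]
kron = [[A[i][k] * A[j][l] for k in range(2) for l in range(2)]
        for i in range(2) for j in range(2)]
T1 = product_distance_matrix(2, edges, 1.0)
assert T1 == [[float(x) for x in row] for row in kron]
```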

By applying the reduction techniques from [14], we can reduce the problem size by a factor of two. Formally, in the case of s = 1, we define a ((|V|+1 choose 2) × (|V|+1 choose 2))-reduced distance matrix BG×G(y) whose rows and columns are indexed by V^(2) ≜ {(v_i,v_j) : 1 ≤ i ≤ j ≤ |V|} using the following procedure.

Two states s_1 = (v_i,v_j) and s_2 = (v_k,v_ℓ) in G × G are said to be equivalent if v_i = v_ℓ and v_j = v_k. The matrix BG×G(y) is then obtained by merging all pairs of equivalent states s_1 and s_2. That is, we add the column indexed by s_2 to the column indexed by s_1 and then remove the row and column indexed by s_2. It should be noted that it may be possible to reduce the size of the matrix BG×G(y) further. However, for ease of exposition, we do not consider this case in this work.

Following this procedure, we observe that the entries in the matrix BG×G(y) can be described by the rules in Table 1. Moreover, the dominant eigenvalue of BG×G(y) is the same as that of TG×G(y). Then, by strong duality, computing (2) is equivalent to solving the following dual problem [18,19] (see also, [20]):

T(δ) = inf{ −δ log y + log Λ(BG×G(y)) : 0 ≤ y ≤ 1 }. (3)

Here, we use Λ(M) to denote the dominant eigenvalue of a matrix M. To simplify notation further, we write Λ(y;B) ≜ Λ(BG×G(y)).

Table 1.

We set the ((v_i,v_j),(v_k,v_ℓ)) entry of the matrix BG×G(y) according to the subgraph induced by the states v_i, v_j, v_k, and v_ℓ. Here, σ̄ denotes the complement of σ.

BG×G(y) at Entry ((v_i,v_j),(v_k,v_ℓ)) | Subgraph Induced by the States {v_i,v_j,v_k,v_ℓ}
0 | (five induced subgraphs; diagrams not reproduced)
1 | (three induced subgraphs; diagrams not reproduced)
y | (three induced subgraphs; diagrams not reproduced)
2y | (one induced subgraph; diagram not reproduced)

Since the objective function in (3) is convex, it follows from standard calculus that any local minimum solution y* in the interval [0,1] is also a global minimum solution. Furthermore, y* is a zero of the first derivative of the objective function. If we consider the numerator of this derivative, then y* is a root of the function

F(y) ≜ yΛ'(y;B) − δΛ(y;B). (4)

In Corollary 1, we show that there is only one y* such that F(y*) = 0 and that F'(y) is strictly positive for all values of y. Therefore, to evaluate the GV bound for a fixed δ, it suffices to determine y*.

Later, Marcus and Roth [14] improved the GV bound (1) by considering certain subsets of the constrained space S. This entails an additional constraint in the optimization problem (2) and, correspondingly, an additional variable in the dual problem (3). Specifically, they considered certain subsets S(p) ⊆ S where each symbol in the words of S(p) appears with a certain frequency dependent on the parameter p. We describe this in more detail in Section 4.

2.2. Our Contributions

  • (A)

    In Section 3, we develop the numerical procedures to compute T(δ) for a fixed δ and, hence, determine the GV bound (1). Our procedure modifies the well-known power iteration method to compute the derivatives of Λ(y;B). After that, using these derivatives, we apply the classical Newton–Raphson method to determine the root of (4). In the same section, we also study procedures to plot the GV curve, that is, the set {(δ, RGV(δ)) : 0 ≤ δ ≤ 1}. Here, we demonstrate that the GV curve can be plotted without any Newton–Raphson iterations.

  • (B)

    In Section 4, we then develop similar power iteration methods and numerical procedures to compute the GV-MR bound. As with the GV curve, we also provide a plotting procedure that uses significantly fewer Newton–Raphson iterations.

  • (C)

    In Section 5, we provide explicit formulas for the computation of the GV bound and GV-MR bound for graph presentations that have exactly one state but multiple parallel edges.

  • (D)

    In Section 6, we validate our methods by computing the GV and the GV-MR bounds for some specific constrained systems. For comparison purposes, we also plot a simple lower bound that is obtained by using an upper estimate of the ball size. From the plots in Figure 1, Figure 2 and Figure 3, it is also clear that the GV and GV-MR bounds are significantly better. We also observe that the GV bound and GV-MR bound for subblock energy-constrained codes (SECCs) obtained through our procedures improve the GV-type bound given by Tandon et al. (Proposition 12 in [21]).

Figure 2.


Lower bounds for optimal asymptotic code rates R(δ;S) for the class of runlength limited codes.

Figure 3.


Lower bounds for optimal asymptotic code rates R(δ;S) where S is the class of (3,2)-SECCs (subblock energy-constrained codes).

3. Evaluating the Gilbert–Varshamov Bound

In this section, we first describe a numerical procedure that solves (3) and, hence, determines RGV(δ) for fixed values of δ. Then, we show that the procedure can be simplified when we compute the GV curve, that is, the set of points {(δ, RGV(δ)) : δ ∈ [0,1]}. Here, we abuse notation and use [a,b] to denote the interval {x : a ≤ x ≤ b} if a < b, and the interval {x : b ≤ x ≤ a} otherwise.

Below, we provide a formal description of our procedure to obtain the GV bound for a fixed relative distance δ.

Procedure 1 (GV bound for fixed relative distance).

Input: Adjacency matrix AG, reduced distance matrix BG×G(y), and relative minimum distance δ

Output: GV bound, that is, RGV(δ) as defined in (1)

  • (1)

    Apply the Newton–Raphson method to obtain y* such that F(y*) is approximately zero.

    • Fix the tolerance value ϵ.

    • Set t = 0 and pick an initial guess 0 ≤ y_t ≤ 1.

    • While |y_t − y_{t−1}| > ϵ,

      • Compute the next guess y_{t+1} as follows:
        y_{t+1} = y_t − F(y_t)/F'(y_t) = y_t − (y_t Λ'(y_t;B) − δ Λ(y_t;B)) / ((1−δ) Λ'(y_t;B) + y_t Λ''(y_t;B)).
      • In this step, apply the power iteration method to compute Λ(y_t;B), Λ'(y_t;B), and Λ''(y_t;B).

      • Increment t by one.

    • Set y* ← y_t.

  • (2)

    Determine RGV(δ) using y*. Specifically, compute T(δ) ← −δ log y* + log Λ(y*;B), Cap(S) ← log Λ(AG), and RGV(δ) ← 2Cap(S) − T(δ).

Throughout Section 3 and Section 4, we illustrate our numerical procedures via a running example using the class of sliding window-constrained codes (SWCCs). Formally, we fix a window length L and a window weight w, and say that a binary word satisfies the (L,w)-sliding window weight constraint if the number of ones in every L consecutive bits is at least w. We refer to the collection of words that meet this constraint as an (L,w)-SWCC constrained system. The class of SWCCs was introduced by Tandon et al. for the application of simultaneous energy and information transfer [7,10]. Later, Immink and Cai [8,9] studied encoders for this constrained system and provided a simple graph presentation that uses only (L choose w) states.

In the next example, we illustrate how the numerical procedure can be used to compute the GV bound for δ = 0.1.

Example 1.

Let L = 3 and w = 2, and consider the (3,2)-SWCC constrained system. From [8], we have the following graph presentation with states x11, 101, and 110.

(Graph presentation figure not reproduced.)

Then, the corresponding adjacency and reduced distance matrices are as follows:

AG =
[1 1 0]
[0 0 1]
[1 0 0],   BG×G(y) =
[1 2y 0 1 0 0]
[0 0  1 0 y 0]
[1 y  0 0 0 0]
[0 0  0 0 0 1]
[0 0  1 0 0 0]
[1 0  0 0 0 0].

To determine the GV bound at δ = 0.1, we first approximate the optimal point y* for which −δ log y + log Λ(y;B) is minimized.

We apply the Newton–Raphson method to find a zero of the function F(y). Now, with the initial guess y_0 = 0.3, we apply the power iteration method to determine

Λ(0.3;B) = 1.659,  Λ'(0.3;B) = 0.694,  Λ''(0.3;B) = 0.183.

Then, we compute that y_1 ≈ 0.238. Repeating the computations, we have that y_2 ≈ 0.238. Since |y_2 − y_1| is less than the tolerance value 10^−5, we set y* = 0.238. Hence, we have that T(0.1) = 0.9. Applying the power iteration method to either AG or BG×G(0), we compute the capacity of the (3,2)-SWCC constrained system to be Cap(S) = 0.551. Then, the GV bound is given by RGV(0.1) = 2(0.551) − 0.9 = 0.202.
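Example 1 can be reproduced with a short script. One deliberate simplification, flagged here: instead of the authors' modified power iteration for the derivatives Λ'(y;B) and Λ''(y;B) (Appendix A), this sketch estimates them by central finite differences on a power-iteration estimate of Λ, which is simpler but less efficient.

```python
import math

def dominant(M, iters=1500):
    """Power iteration for the dominant eigenvalue of a primitive nonnegative matrix."""
    n = len(M)
    v = [1.0] * n
    est = 1.0
    for _ in range(iters):
        w = [sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]
        est = max(w)
        v = [x / est for x in w]
    return est

def B(y):
    """Reduced distance matrix B_{GxG}(y) of the (3,2)-SWCC system (Example 1)."""
    return [[1, 2 * y, 0, 1, 0, 0],
            [0, 0,     1, 0, y, 0],
            [1, y,     0, 0, 0, 0],
            [0, 0,     0, 0, 0, 1],
            [0, 0,     1, 0, 0, 0],
            [1, 0,     0, 0, 0, 0]]

A_G = [[1, 1, 0],
       [0, 0, 1],
       [1, 0, 0]]

def gv_bound(delta, y0=0.3, h=1e-4, eps=1e-8):
    """Procedure 1, with Lambda' and Lambda'' by central finite differences."""
    y = y0
    for _ in range(50):
        L = dominant(B(y))
        Lph = dominant(B(y + h))
        Lmh = dominant(B(y - h))
        Lp = (Lph - Lmh) / (2 * h)              # ~ Lambda'(y; B)
        Lpp = (Lph - 2 * L + Lmh) / (h * h)     # ~ Lambda''(y; B)
        # Newton-Raphson step for F(y) = y*Lambda' - delta*Lambda:
        step = (y * Lp - delta * L) / ((1 - delta) * Lp + y * Lpp)
        y -= step
        if abs(step) < eps:
            break
    T = -delta * math.log2(y) + math.log2(dominant(B(y)))
    cap = math.log2(dominant(A_G))
    return 2 * cap - T
```

With δ = 0.1, this lands close to the values reported in Example 1 (y* ≈ 0.238, Cap(S) ≈ 0.551, and RGV(0.1) ≈ 0.202).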

We discuss the convergence issues arising from Procedure 1. We observe that there are two different iterative processes in Step 1, namely, (a) the power iteration method to compute the values Λ(y_t;B), Λ'(y_t;B), and Λ''(y_t;B), and (b) the Newton–Raphson method that determines the zero of F(y).

  • (a)

    We recall that Λ(y;B) is the largest eigenvalue of the reduced distance matrix BG×G(y). If we apply naive methods to compute this dominant eigenvalue, the computational complexity increases very rapidly with the matrix size. Specifically, if G has M states, then the reduced distance matrix has dimensions Θ(M^2) × Θ(M^2), and finding its characteristic polynomial takes O(M^6) time. Even then, determining the exact roots of characteristic polynomials of degree at least five is generally impossible. Therefore, we turn to numerical procedures like the ubiquitous power iteration method [22]. However, the standard power iteration method is only able to compute the dominant eigenvalue Λ(y;B). Nevertheless, we can modify the power iteration method to compute Λ'(y;B) and its higher order derivatives. In Appendix A, we demonstrate that, under certain mild assumptions, the modified power iteration method always converges. Moreover, using the sparsity of the reduced distance matrix, we have that each iteration can be completed in O(M^2) time.

  • (b)

    Next, we discuss whether we can guarantee that y_t converges to y* as t approaches infinity. Even though the Newton–Raphson method converges in all our numerical experiments, we are unable to demonstrate that it always converges for F(y). Nevertheless, we can circumvent this issue if we are interested in plotting the GV curve. Specifically, if our objective is to determine the curve {(δ, RGV(δ)) : δ ∈ [0,1]}, it turns out that we do not need to implement the Newton–Raphson iterations, and we discuss this next.
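The modified power iteration itself is deferred to Appendix A (not reproduced here). As a hedge, we note that the first derivative can also be estimated from standard eigenvalue perturbation theory: Λ'(y) = u^T B'(y) v / (u^T v), where u and v are the left and right Perron vectors of B(y). A self-contained sketch on a toy matrix (not from the paper):

```python
def perron_vector(M, iters=500):
    """Sup-norm-normalized Perron vector of a primitive nonnegative matrix."""
    n = len(M)
    v = [1.0] * n
    for _ in range(iters):
        w = [sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]
        m = max(w)
        v = [x / m for x in w]
    return v

def eigenvalue_derivative(M, dM):
    """Lambda' = u^T M' v / (u^T v), with u and v the left/right Perron vectors."""
    n = len(M)
    v = perron_vector(M)
    Mt = [[M[j][i] for j in range(n)] for i in range(n)]  # transpose gives u
    u = perron_vector(Mt)
    num = sum(u[i] * dM[i][j] * v[j] for i in range(n) for j in range(n))
    den = sum(u[i] * v[i] for i in range(n))
    return num / den

# Toy check (not from the paper): M(y) = [[y, 1], [1, 0]] has
# Lambda(y) = (y + sqrt(y^2 + 4)) / 2, so Lambda'(1) = (1 + 1/sqrt(5)) / 2.
M  = [[1.0, 1.0], [1.0, 0.0]]
dM = [[1.0, 0.0], [0.0, 0.0]]
print(round(eigenvalue_derivative(M, dM), 4))  # 0.7236
```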

We fix some constrained system S. Let us define its corresponding GV curve to be the set of points GV(S) ≜ {(δ, RGV(δ)) : δ ∈ [0,1]}. Here, we demonstrate that the GV curve can be plotted without any Newton–Raphson iterations.

To this end, we observe that, when F(y*) = 0, we have that δ = y*Λ'(y*;B)/Λ(y*;B). Hence, we abuse notation and define the function

δ(y) ≜ yΛ'(y;B)/Λ(y;B). (5)

We further define δmax ≜ δ(1) = Λ'(1;B)/Λ(1;B). In this section, we prove the following theorem.

Theorem 1.

Let G be the graph presentation for the constrained system S. If we define the function

ρGV(y) ≜ 2Cap(S) + δ(y) log y − log Λ(y;B), (6)

then the corresponding GV curve is given by

GV(S) = {(δ(y), ρGV(y)) : y ∈ [0,1]} ∪ {(δ, 0) : δ ≥ δmax}. (7)

Before we prove Theorem 1, we discuss its implications. It should be noted that, to compute δ(y) and ρGV(y), it suffices to determine Λ(y;B) and Λ'(y;B) using the modified power iteration methods described in Appendix A. In other words, no Newton–Raphson iterations are required. We also have additional computational savings, as we do not need to apply the power iteration method to compute the second derivative Λ''(y;B).

Example 2.

We continue our example and plot the GV curve for the (3,2)-SWCC constrained system in Figure 1a. Before plotting, we observe that, when y = 0, we have (δ(0), ρ(0)) = (0, 0.551) = (0, Cap(S)), as expected. When y = 1, we have δ(1) = δmax = 0.313. Indeed, both ρ(1) and RGV(δmax) are equal to zero, and we have that RGV(δ) = 0 for δ ≥ δmax.

Next, we compute a set of 100 points on the GV curve. If we apply Procedure 1 to compute RGV(δ) for 100 values of δ in the interval [0, δmax], we require 275 Newton–Raphson iterations and 6900 power iterations to find these points. In contrast, applying Theorem 1, we compute (δ(y), ρ(y)) for 100 values of y in the interval [0,1]. This does not require any Newton–Raphson iterations and involves only 2530 power iterations.
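The plotting strategy of Theorem 1 can be sketched in a few lines. The example below is our own stand-alone illustration on the golden-mean shift (not one of the paper's systems), using the unreduced product matrix TG×G(y), which shares its dominant eigenvalue with BG×G(y); Λ' is estimated by finite differences rather than by the modified power iteration.

```python
import math

def dominant(M, iters=600):
    """Power iteration for the dominant eigenvalue of a primitive nonnegative matrix."""
    n = len(M)
    v = [1.0] * n
    est = 1.0
    for _ in range(iters):
        w = [sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]
        est = max(w)
        v = [x / est for x in w]
    return est

def T(y):
    """Product distance matrix T_{GxG}(y) for the golden-mean shift (no "11")."""
    return [[1, y, y, 1],
            [1, 0, y, 0],
            [1, y, 0, 0],
            [1, 0, 0, 0]]

CAP = math.log2((1 + math.sqrt(5)) / 2)      # capacity of the golden-mean shift

def delta(y, h=1e-5):
    """delta(y) = y * Lambda'(y)/Lambda(y), cf. Eq. (5); Lambda' by central differences."""
    Lp = (dominant(T(y + h)) - dominant(T(y - h))) / (2 * h)
    return y * Lp / dominant(T(y))

def rho(y):
    """rho_GV(y) = 2 Cap(S) + delta(y) log y - log Lambda(y), cf. Eq. (6)."""
    return 2 * CAP + delta(y) * math.log2(y) - math.log2(dominant(T(y)))

# Trace the GV curve on a grid of y values -- no Newton-Raphson needed.
curve = [(delta(y), rho(y)) for y in [0.1 * k for k in range(1, 11)]]
```

Each curve point costs only a few eigenvalue evaluations and no Newton–Raphson iterations; at y = 1, the code recovers ρ(1) = 0, consistent with Lemma 2.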

To prove Theorem 1, we demonstrate the following lemmas. Our first lemma is immediate from the definitions of RGV, δ, and ρ in (1), (5), and (6), respectively.

Lemma 1.

RGV(δ(y)) = ρ(y) for all y ∈ [0,1].

The next lemma studies the behaviour of both δ and ρ as functions in y.

Lemma 2.

In terms of y, the functions δ(y) and ρ(y) are monotone increasing and monotone decreasing, respectively. Furthermore, we have that (δ(0), ρ(0)) = (0, Cap(S)), (δ(1), ρ(1)) = (δmax, 0), and RGV(δ) = 0 for δ ≥ δmax.

Proof. 

To simplify notation, we write Λ(y;B), Λ'(y;B), and Λ''(y;B) as Λ, Λ', and Λ'', respectively.

First, we show that δ'(y) is positive for 0 ≤ y < 1. Differentiating the expression in (5), we have that δ'(y) > 0 is equivalent to

Λ(Λ' + yΛ'') − y(Λ')^2 > 0. (8)

We recall that (3) is a convex minimization problem. Hence, the second order derivative of the objective function is always positive. In other words,

δ/y^2 + (Λ''Λ − (Λ')^2)/Λ^2 > 0.

Substituting δ with yΛ'/Λ and multiplying by yΛ^2, we obtain (8), as desired.

Next, we show that ρ is monotone decreasing. We recall that ρ(y) = RGV(δ(y)) = 2Cap(S) − T(δ(y)). Since T(δ) yields the asymptotic rate of the total ball size, we have that, as y increases, δ(y) increases, and so T(δ(y)) increases. Therefore, ρ(y) decreases, as desired.

Next, we show that ρ(1) = 0. When y = 1, we have from (6) that ρ(1) = 2Cap(S) − log Λ(1;B). Now, we recall that BG×G(y) shares the same dominant eigenvalue as the matrix TG×G(y) [12]. Furthermore, it can be verified that, when y = 1, TG×G(1) is the tensor product of AG with itself. That is, TG×G(1) = AG ⊗ AG. It then follows from standard linear algebra that Λ(1;B) = Λ(1;T) = Λ(AG)^2. Thus, log Λ(1;B) = 2Cap(S) and ρ(1) = 0. In this instance, we also have that T(δmax) = 2Cap(S).

Finally, for δ ≥ δmax, we have that T(δ) = T(δmax) = 2Cap(S) and, thus, RGV(δ) = 0, as required.    □

Theorem 1 is then immediate from Lemmas 1 and 2.

We have the following corollary that immediately follows from Lemma 2. This corollary then implies that y* yields the global minimum for the optimization problem.

Corollary 1.

When 0 ≤ δ ≤ δmax = Λ'(1;B)/Λ(1;B), the function F(y) ≜ yΛ'(y;B) − δΛ(y;B) has a unique zero in [0,1]. Furthermore, F'(y) is strictly positive for all y ∈ [0,1].

4. Evaluating Marcus and Roth’s Improvement of the Gilbert–Varshamov Bound

In [14], Marcus and Roth improved the GV lower bound for most constrained systems by considering subsets S(p) of S, where p is some parameter. Here, we focus on the case s = 1 and set p to be the normalized frequency of edges whose labels correspond to one. Specifically, we set S(p) ≜ {x ∈ S : wt(x) = p|x|}.

Next, let S_n(p) be the set of all words/paths of length n in S(p), and we define S(p) ≜ lim sup_{n→∞} (1/n) log|S_n(p)|.

Similar to before, we define T(p,δ) ≜ lim sup_{n→∞} (1/n) log T(n, δn; S(p)). Since S_n(p) is a subset of S_n, it follows from the usual GV argument that there exists a family of (n, δn; S) codes whose rates approach 2S(p) − T(p,δ) for all 0 ≤ p ≤ 1. Therefore, we have the following lower bound on asymptotic achievable code rates:

RMR(δ) = sup{ 2S(p) − T(p,δ) : 0 ≤ p ≤ 1 }. (9)

Now, a key result from [14] is that both S(p) and T(p,δ) can be obtained via two different convex optimization problems. For succinctness, we state the dual formulations of these optimization problems.

First, S(p) can be obtained from the following problem:

S(p) = inf{ −p log z + log Λ(CG(z)) : z ≥ 0 }. (10)

Here, CG(z) is the (|V| × |V|)-matrix whose rows and columns are indexed by V. For each entry indexed by e ∈ V × V, we set (CG(z))_e to be zero if e ∉ E, and z^{L(e)} if e ∈ E.

As before, we simplify notation by writing Λ(z;C) ≜ Λ(CG(z)). Again, following the convexity of (10), we are interested in finding the zero of the following function:

G1(z) ≜ zΛ'(z;C) − pΛ(z;C). (11)

Next, T(p,δ) can be obtained via the following optimization:

T(p,δ) = inf{ −2p log x − δ log y + log Λ(DG×G(x,y)) : x ≥ 0, 0 ≤ y ≤ 1 }. (12)

Here, DG×G(x,y) is a ((|V|+1 choose 2) × (|V|+1 choose 2))-reduced distance matrix indexed by V^(2). To define the entry of the matrix DG×G(x,y) indexed by ((v_i,v_j),(v_k,v_ℓ)), we look at the vertices v_i, v_j, v_k, and v_ℓ and follow the rules given in Table 2.

Table 2.

We set the ((v_i,v_j),(v_k,v_ℓ)) entry of the matrix DG×G(x,y) according to the subgraph induced by the states v_i, v_j, v_k, and v_ℓ.

DG×G(x,y) at Entry ((v_i,v_j),(v_k,v_ℓ)) | Subgraph Induced by the States {v_i,v_j,v_k,v_ℓ}
0 | (five induced subgraphs; diagrams not reproduced)
1 | (three induced subgraphs; diagrams not reproduced)
x^2 | (three induced subgraphs; diagrams not reproduced)
xy | (three induced subgraphs; diagrams not reproduced)
2xy | (one induced subgraph; diagram not reproduced)

Again, we write Λ(x,y;D) ≜ Λ(DG×G(x,y)). Furthermore, following the convexity of (12), we have that, if the optimal solution is attained at x and y, then

G2(x,y) ≜ xΛ_x(x,y;D) − 2pΛ(x,y;D) = 0, (13)
G3(x,y) ≜ yΛ_y(x,y;D) − δΛ(x,y;D) = 0. (14)

To this end, we consider the function Δ(x) ≜ Λ_y(x,1;D)/Λ(x,1;D) for x > 0 and set δmax ≜ sup{Δ(x) : x > 0}. As in the previous section, we develop a numerical procedure to solve the optimization problem (9). We then have the following critical observation.

Theorem 2.

For a given δ<δmax, consider the optimization problem

sup{ −2p log z + 2 log Λ(z;C) + 2p log x + δ log y − log Λ(x,y;D) : G1(z) = G2(x,y) = G3(x,y) = 0 }.

If (p*, x*, y*, z*) is an optimal solution, then x* = z*. Furthermore, if 0 ≤ p* ≤ 1, then x*, z* ≥ 0 and 0 ≤ y* ≤ 1.

Proof. 

Let λ_1, λ_2, and λ_3 be real-valued variables, and let G(p,x,y,z) denote the objective function of the optimization problem. We define L(p,x,y,z,λ_1,λ_2,λ_3) ≜ G(p,x,y,z) + λ_1 G1(z) + λ_2 G2(x,y) + λ_3 G3(x,y). Using the Lagrangian multiplier theorem, we have that ∂L/∂p = ∂L/∂x = ∂L/∂y = ∂L/∂z = 0 for any optimal solution. Solving these equations together with the constraints G1(z) = G2(x,y) = G3(x,y) = 0, we have that λ_1 = λ_2 = λ_3 = 0 and x = z for any optimal solution.

Now, when p* ∈ [0,1], using G1(z) = 0, we can write p(z) ≜ zΛ'(z;C)/Λ(z;C); let z(p) denote its inverse. Then, proceeding as in the proof of Lemma 2, we see that z(p) is monotone increasing with z(0) = 0. Therefore, z* = z(p*) ≥ 0.

Similarly, given p* and x*, we use G3(x*,y) = 0 to define δ(y) ≜ yΛ_y(x*,y;D)/Λ(x*,y;D). Again, we can proceed as in the proof of Lemma 2 to show that δ(y) is monotone increasing. Furthermore, since δ(y*) < δmax = δ(1), we have that y* ∈ [0,1].    □

Therefore, to determine RMR(δ) for any fixed δ, it suffices to find x, y, z, and p such that G1(z)=G2(x,y)=G3(x,y)=0 and x=z.

Now, the optimization in Theorem 2 does not constrain the values of p. Furthermore, for certain constrained systems, there are instances where p falls outside the interval [0,1]. In this case, instead of solving the optimization problem (9), we set p to be either zero or one, and we solve the corresponding optimization problems (10) and (12). Specifically, if we have p* < 0, then we set p* = 0 and x* = 0, while if p* > 1, then we set p* = 1 and x* = ∞. Hence, the resulting rates that we obtain are a lower bound for the GV-MR bound.

Procedure 2 (RMR(δ) for fixed δ < δmax).

Input: Matrices CG(z) and DG×G(x,y), and relative minimum distance δ

Output: RMR(δ) or RLB(δ), where RLB(δ) ≤ RMR(δ).

  • (1)
    Apply the Newton–Raphson method to obtain p*, x*, and y* such that G1(x*), G2(x*,y*), and G3(x*,y*) are approximately zero. Specifically, do the following:
    • Fix a tolerance value ϵ.
    • Set t = 0 and pick initial guesses p_t ≥ 0, x_t ≥ 0, and 0 ≤ y_t ≤ 1.
    • While |p_t − p_{t−1}| + |x_t − x_{t−1}| + |y_t − y_{t−1}| > ϵ,
      • Compute the next guess (p_{t+1}, x_{t+1}, y_{t+1}) via the Newton step
        (p_{t+1}, x_{t+1}, y_{t+1})^T = (p_t, x_t, y_t)^T − J^{−1} (G1(x_t), G2(x_t,y_t), G3(x_t,y_t))^T,
        where J is the Jacobian matrix of (G1, G2, G3) with respect to (p, x, y), evaluated at (p_t, x_t, y_t).
      • Here, apply the power iteration method to compute Λ(x_t;C), Λ'(x_t;C), Λ''(x_t;C), Λ(x_t,y_t;D), Λ_x(x_t,y_t;D), Λ_y(x_t,y_t;D), Λ_xx(x_t,y_t;D), Λ_yy(x_t,y_t;D), and Λ_xy(x_t,y_t;D).
      • Increment t by one.
    • Set p* ← p_t, x* ← x_t, y* ← y_t.
  • (2A)

    If 0 ≤ p* ≤ 1, set RMR(δ) ← 2 log Λ(x*;C) + δ log y* − log Λ(x*,y*;D).

  • (2B)

    Otherwise,

    • If p* < 0, set p* ← 0, x* ← 0, and y* ← the solution of G3(0,y) = 0.

    • If p* > 1, set p* ← 1, x* ← ∞, and y* ← the solution of G3(∞,y) = 0.

    Finally, set RLB(δ) ← 2 log Λ(x*;C) + δ log y* − log Λ(x*,y*;D).
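The three-variable Newton step in Procedure 2 can be sketched generically. The helper below is our own illustration: it solves a square system G(v) = 0 using a forward-difference Jacobian and Gaussian elimination with partial pivoting, and it is exercised on a toy system. In Procedure 2, the role of G would be played by (G1, G2, G3) in the variables (p, x, y), with the Λ-derivatives supplied by the power iteration method.

```python
def newton_system(G, v0, h=1e-6, eps=1e-10, max_iter=100):
    """Solve G(v) = 0 for a list-valued G by Newton's method.
    The Jacobian is approximated by forward finite differences."""
    v = list(v0)
    n = len(v)
    for _ in range(max_iter):
        g = G(v)
        # Finite-difference Jacobian J[i][j] ~ dG_i / dv_j.
        J = [[0.0] * n for _ in range(n)]
        for j in range(n):
            vp = list(v)
            vp[j] += h
            gp = G(vp)
            for i in range(n):
                J[i][j] = (gp[i] - g[i]) / h
        # Solve J * step = g by Gaussian elimination with partial pivoting.
        M = [J[i] + [g[i]] for i in range(n)]
        for c in range(n):
            p = max(range(c, n), key=lambda r: abs(M[r][c]))
            M[c], M[p] = M[p], M[c]
            for r in range(c + 1, n):
                f = M[r][c] / M[c][c]
                for k in range(c, n + 1):
                    M[r][k] -= f * M[c][k]
        step = [0.0] * n
        for i in range(n - 1, -1, -1):
            step[i] = (M[i][n] - sum(M[i][k] * step[k]
                                     for k in range(i + 1, n))) / M[i][i]
        v = [v[i] - step[i] for i in range(n)]
        if max(abs(s) for s in step) < eps:
            break
    return v

# Toy check: intersect the circle x^2 + y^2 = 4 with the line x = y;
# from this starting point, the root is (sqrt(2), sqrt(2)).
root = newton_system(lambda v: [v[0] ** 2 + v[1] ** 2 - 4, v[0] - v[1]],
                     [1.0, 0.5])
```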

Remark 1.

Let p* be the value computed at Step 1. When p* falls outside the interval [0,1], we set p* ∈ {0,1}, and we argued earlier that the value RLB(δ) returned at Step 2B is at most RMR(δ). Nevertheless, we conjecture that RLB(δ) = RMR(δ).

As before, we develop a plotting procedure that minimizes the use of Newton–Raphson iterations.

We note that there are three scenarios for Δ(x). If Δ(x) is monotone decreasing, then δmax = lim_{x→0} Δ(x) and we set x# = 0. If Δ(x) is monotone increasing, then δmax = lim_{x→∞} Δ(x) and we set x# = ∞. Otherwise, Δ(x) is maximized at some positive value and we set x# to be this value. Next, to obtain the GV-MR curve (see Remark 2), we iterate over x ∈ [1, x#]. It should be noted that, if y(x#) < 1 or, equivalently, δ(x#) < δmax, we obtain a lower bound on the GV-MR curve by iterating over y ∈ [y(x#), 1]. Similar to Theorem 1, we define

ρMR(x) ≜ 2 log Λ(x;C) + δ(x) log y(x) − log Λ(x, y(x); D), (15)

and

ρLB(y) ≜ 2 log Λ(x#;C) + δ(y) log y − log Λ(x#, y; D). (16)

Finally, we state the following analogue of Theorem 1.

Theorem 3.

We define δmax and x# as before. For x ∈ [1, x#], we set

p(x) ≜ xΛ'(x;C)/Λ(x;C),   y(x) ≜ the solution of G2(x,y) = 0,   δ(x) ≜ y(x)Λ_y(x, y(x); D)/Λ(x, y(x); D),

If y(x#) < 1, then, for y ∈ [y(x#), 1], we set

δ(y) ≜ yΛ_y(x#, y; D)/Λ(x#, y; D),

then, the corresponding GV-MR curve is given by

{(δ(x), ρMR(x)) : x ∈ [1, x#]} ∪ {(δ(y), ρLB(y)) : y ∈ [y(x#), 1]} ∪ {(δ, 0) : δ ≥ δmax}, (17)

where ρMR and ρLB are defined in (15) and (16), respectively.

Example 3.

We continue our example and evaluate the GV-MR bound for the (3,2)-SWCC constrained system. In this case, the matrices of interest are

CG(z) =
[z 1 0]
[0 0 z]
[z 0 0]   and   DG×G(x,y) =
[x^2 2xy 0   1 0  0  ]
[0   0   x^2 0 xy 0  ]
[x^2 xy  0   0 0  0  ]
[0   0   0   0 0  x^2]
[0   0   x^2 0 0  0  ]
[x^2 0   0   0 0  0  ].

Here, we observe that Δ(x) is a monotone decreasing function, and so we set x# = 0.01 (a small positive proxy for x# = 0) and δmax = lim_{x→0} Δ(x) ≈ 0.426. If we apply Procedure 2 to compute RMR(δ) for 100 points in [0, δmax], we require 437 Newton–Raphson iterations and 85,500 power iterations. In contrast, we use Theorem 3 to compute (δ(x), ρMR(x)) for 100 values of x in the interval [1, x#]. This requires 323 Newton–Raphson iterations and involves 22,296 power iterations. The resulting GV-MR curve is given in Figure 1a.

Remark 2.

Strictly speaking, the GV-MR curve described by (17) may not be equal to the curve defined by the optimization problem (9). Nevertheless, the curve provides a lower bound for the optimal asymptotic code rates, and we conjecture that the GV-MR curve described by (17) is a lower bound for the curve defined by the optimization problem (9).

5. Single-State Graph Presentation

In this section, we focus on graph presentations that have exactly one state. Here, we allow these single-state graph presentations to contain parallel edges, and we allow their labels to be binary strings of length possibly greater than one. For these constrained systems, the procedures to evaluate the GV bound and its MR improvements can be greatly simplified. This is because the matrices B_{G×G}(y), C_G(z), and D_{G×G}(x, y) are all 1 × 1. Therefore, determining their respective dominant eigenvalues is straightforward and does not require the power iteration method. The results in this section follow directly from the previous sections, and our objective is to provide explicit formulas whenever possible.

Formally, let S be the constrained system with graph presentation G = (V, E, L) such that |V| = 1 and L : E → Σ^s with s ≥ 1. (Existing methods that determine the GV bound for constrained systems with |V| ≥ 1 assume that the edge labels are single letters, i.e., s = 1. In other words, the previous methods developed in [12,14] do not apply.)

We further define α_t ≜ #{(x, y) ∈ L(E)² : dH(x, y) = t} for 0 ≤ t ≤ s. Then, the corresponding adjacency and reduced distance matrices are as follows:

A_G = |E|  and  B_{G×G}(y) = Σ_{t≥0} α_t y^t.

Then, we compute the capacity using its definition as Cap(S)=(log|E|)/s.
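For a concrete single-state presentation, these quantities can be tabulated directly from the list of edge labels. The following sketch (the helper names are our own; labels are assumed to be equal-length binary strings) computes the coefficients α_t and the capacity:

```python
import math
from itertools import product

def alpha_counts(labels):
    """alpha_t = #{(x, y) in L(E)^2 : d_H(x, y) = t} for 0 <= t <= s."""
    s = len(labels[0])
    alpha = [0] * (s + 1)
    for u, v in product(labels, repeat=2):
        # Hamming distance between the two edge labels.
        alpha[sum(a != b for a, b in zip(u, v))] += 1
    return alpha

def capacity(labels):
    """Cap(S) = (log2 |E|) / s for a single-state presentation."""
    return math.log2(len(labels)) / len(labels[0])
```

For instance, the four binary labels of length 3 and weight at least 2 give α = (4, 6, 6, 0) and Cap(S) = 2/3, matching Example 4 below.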

To compute T(δ), we consider the following extension of the optimization problem (3) for the case s ≥ 1:

T(δ) = (1/s) inf{ −δs log y + log λ(y; B) : 0 < y ≤ 1 } = (1/s) inf{ −δs log y + log Σ_{t≥0} α_t y^t : 0 < y ≤ 1 }. (18)

As before, following the convexity of the objective function in (18), we have that the optimal y is the zero (in the interval [0,1]) of the function

F(y) ≜ Σ_{t≥0} (t − δs) α_t y^t. (19)

So, for fixed values of δ, we can use the Newton–Raphson procedure to compute the root y of (19), and, hence, evaluate RGV(δ). It should be noted that the power iteration method is not required in this case.
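As an illustration, the Newton–Raphson step just described can be sketched in a few lines (a minimal sketch; the function name, the starting point y = 1/2, and the use of base-2 logarithms are our own choices):

```python
import math

def gv_T(alpha, s, delta, tol=1e-12):
    """Solve F(y) = sum_t (t - delta*s) * alpha[t] * y^t = 0 by
    Newton-Raphson and return (y, T(delta)), following (18) and (19)."""
    F = lambda y: sum((t - delta * s) * a * y**t for t, a in enumerate(alpha))
    dF = lambda y: sum(t * (t - delta * s) * a * y**(t - 1)
                       for t, a in enumerate(alpha) if t >= 1)
    y = 0.5  # starting point in (0, 1]
    for _ in range(100):
        step = F(y) / dF(y)
        y -= step
        if abs(step) < tol:
            break
    T = (-delta * s * math.log2(y)
         + math.log2(sum(a * y**t for t, a in enumerate(alpha)))) / s
    return y, T
```

No eigenvalue computation is involved: the 1 × 1 "matrix" B(y) is evaluated directly.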

On the other hand, to plot the GV curve, we have the following corollary of Theorem 1.

Corollary 2.

Let G be the single-state graph presentation for a constrained system S. Then, the corresponding GV curve is given by

GV(S) ≜ {(δ, RGV(δ)) : δ ∈ [0, 1]} = {(δ(y), ρ(y)) : y ∈ [0, 1]} ∪ {(δ, 0) : δ ≥ δmax}, (20)

where

δmax = Σ_{t≥0} t α_t / ( s |E|² ),  δ(y) = Σ_{t≥0} t α_t y^t / ( s Σ_{t≥0} α_t y^t ),  ρ(y) = (1/s) ( log( |E|² / Σ_{t≥0} α_t y^t ) + ( Σ_{t≥0} t α_t y^t / Σ_{t≥0} α_t y^t ) log y ).

We illustrate this evaluation procedure via an example from the class of subblock energy-constrained codes (SECCs). Formally, we fix a subblock length L and an energy constraint w. A binary word x of length mL is said to satisfy the (L, w)-subblock energy constraint if, when we partition x into m subblocks of length L, the number of ones in every subblock is at least w. We refer to the collection of words that meet this constraint as the (L, w)-SECC constrained system. The class of SECCs was introduced by Tandon et al. for the application of simultaneous energy and information transfer [7]. Later, in [21], a GV-type bound was introduced (see Proposition 12 in [21] and also (28)), and we make comparisons with the GV bound (20) in the following example.

Example 4.

Let L = 3 and w = 2, and consider the (3,2)-SECC constrained system. It is straightforward to observe that the graph presentation is as follows, with the single state x. Here, s = L = 3.


Then, the corresponding adjacency and reduced distance matrices are as follows:

A_G = 4  and  B_{G×G}(y) = 4 + 6y + 6y².

First, we determine the GV bound at δ = 1/3. We observe that F(y) = −4 + 6y² and, so, the optimal point y for (18) is √(2/3) (the unique root of F in the interval [0, 1]). Hence, we have that T(1/3) ≈ 1.327. On the other hand, the capacity of the (3,2)-SECC constrained system is Cap(S) = 2/3. Therefore, the GV bound is given by RGV(1/3) = 2Cap(S) − T(1/3) ≈ 0.006.

In contrast, the GV-type lower bound given by Proposition 12 in [21] is zero for δ > 0.174. Hence, the evaluation of the GV bound yields a significantly better lower bound. In fact, we can show that RGV(δ) > 0 for all δ < δmax = 3/8.

To plot the GV curve, using the fact that δmax=3/8, we have that

GV(S) = { ( (y + 2y²)/(2 + 3y + 3y²), (1/3)( log( 8/(2 + 3y + 3y²) ) + ( (3y + 6y²)/(2 + 3y + 3y²) ) log y ) ) : y ∈ [0, 1] } ∪ { (δ, 0) : δ ≥ 3/8 }.

We plot the curve in Section 6.
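Before doing so, we note that the displayed parametrization can be evaluated directly; a small sketch (the function name is our own, logarithms base 2):

```python
import math

def delta_rho(y):
    """One point (delta(y), rho(y)) of the GV curve of the (3,2)-SECC system."""
    q = 2 + 3 * y + 3 * y * y
    delta = (y + 2 * y * y) / q
    rho = (math.log2(8 / q) + ((3 * y + 6 * y * y) / q) * math.log2(y)) / 3
    return delta, rho
```

At y = 1, the curve reaches the endpoint (δmax, 0) = (3/8, 0).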

From this example, we see that our methods yield better lower bounds on the asymptotic coding rates for a specific pair (L, w). It remains open to determine how much improvement can be achieved for general pairs (L, w).

Next, we evaluate the GV-MR bound. To this end, we consider some proper subset P of E and define

α_t ≜ #{(x, y) ∈ L(E)² : dH(x, y) = t, x, y ∈ P},
β_t ≜ #{(x, y) ∈ L(E)² : dH(x, y) = t, (x ∈ P, y ∉ P) or (x ∉ P, y ∈ P)},
γ_t ≜ #{(x, y) ∈ L(E)² : dH(x, y) = t, x, y ∉ P}.

Then, we consider the following matrices:

C_G(z) = |E| − |P| + |P|z  and  D_{G×G}(x, y) = Σ_{t≥0} ( α_t x² + β_t x + γ_t ) y^t.

Setting p to be the normalized frequency of edges in P, we obtain S(p) by solving the optimization problem (10).

Specifically, we have that

S(p) = (1/s) ( H(p) + p log|P| + (1 − p) log(|E| − |P|) ), (21)

and this value is achieved when

z = p (|E| − |P|) / ( (1 − p) |P| ). (22)
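As a quick consistency check, the closed-form minimizer (22) can be compared against the objective of the optimization problem (10), which for a single state reads (−p log z + log(|E| − |P| + |P|z))/s under our reading; the helper names below are our own:

```python
import math

def S_objective(z, p, E, P, s):
    """Objective (-p*log2(z) + log2(|E| - |P| + |P|z)) / s."""
    return (-p * math.log2(z) + math.log2(E - P + P * z)) / s

def z_opt(p, E, P):
    """Closed-form minimizer, Equation (22)."""
    return p * (E - P) / ((1 - p) * P)

def S_closed(p, E, P, s):
    """Closed-form value of S(p), Equation (21), in bits."""
    H = -p * math.log2(p) - (1 - p) * math.log2(1 - p)
    return (H + p * math.log2(P) + (1 - p) * math.log2(E - P)) / s
```

Evaluating the objective at z_opt reproduces the closed-form value (21), while nearby values of z give a strictly larger objective.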

To compute T(p, δ), we consider the following extension of the optimization problem (12) for the case s ≥ 1:

T(p, δ) = (1/s) inf{ −2p log x − δs log y + log λ(x, y; D) : x > 0, 0 < y ≤ 1 } = (1/s) inf{ −2p log x − δs log y + log Σ_{t≥0} ( α_t x² + β_t x + γ_t ) y^t : x > 0, 0 < y ≤ 1 }. (23)

As before, following the convexity of the objective function in (23), we have that the optimal x and y are the zeroes (with y in the interval [0, 1]) of the functions

G2(x, y) ≜ 2(1 − p)( Σ_{t≥0} α_t y^t ) x² + (1 − 2p)( Σ_{t≥0} β_t y^t ) x − 2p ( Σ_{t≥0} γ_t y^t ),
G3(x, y) ≜ Σ_{t≥0} (t − δs)( α_t x² + β_t x + γ_t ) y^t. (24)

So, for fixed values of p and δ, we can use the Newton–Raphson procedure to compute the roots x and y of (24) and, hence, evaluate RMR(p, δ). It should be noted that the power iteration method is not required in this case. We then find x# as defined in Section 4 and set

ρMR(x) ≜ (1/s) ( 2 log(|E| − |P| + |P|x) − log Σ_{t≥0} ( α_t x² + β_t x + γ_t ) y(x)^t ) + δ(x) log y(x). (25)

Furthermore, if y(x#)<1, we set

ρLB(y) ≜ (1/s) ( 2 log(|E| − |P| + |P|x#) − log Σ_{t≥0} ( α_t (x#)² + β_t x# + γ_t ) y^t ) + δ(y) log y. (26)
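The fixed-(p, δ) Newton–Raphson solve for the zeroes of (24) can be sketched as follows (a sketch with our own helper name; the Jacobian is approximated by finite differences, and the starting point is an assumption that may need adjusting per instance):

```python
import numpy as np

def solve_G24(alpha, beta, gamma, p, delta, s, v0=(1.9, 0.3)):
    """Find (x, y) with G2(x, y) = G3(x, y) = 0, as in (24)."""
    A = lambda y: sum(a * y**t for t, a in enumerate(alpha))
    B = lambda y: sum(b * y**t for t, b in enumerate(beta))
    C = lambda y: sum(c * y**t for t, c in enumerate(gamma))
    def G(v):
        x, y = v
        g2 = 2 * (1 - p) * A(y) * x**2 + (1 - 2 * p) * B(y) * x - 2 * p * C(y)
        g3 = sum((t - delta * s) * (a * x**2 + b * x + c) * y**t
                 for t, (a, b, c) in enumerate(zip(alpha, beta, gamma)))
        return np.array([g2, g3])
    v, h = np.array(v0, dtype=float), 1e-7
    for _ in range(100):
        g = G(v)
        if np.linalg.norm(g) < 1e-10:
            break
        # Finite-difference Jacobian, one column per variable.
        J = np.column_stack([(G(v + h * np.eye(2)[j]) - g) / h for j in range(2)])
        v = v - np.linalg.solve(J, g)
    return v
```

With the returned (x, y), the value T(p, δ) of (23) follows by direct substitution.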

Next, to plot the GV-MR curve, we have the following corollary of Theorem 3.

Corollary 3.

Let G be the single-state graph presentation for a constrained system S. For x ∈ [1, x#], we set

p(x) = |P|x / ( (|E| − |P|) + |P|x ),  δ(x) = Σ_{t≥1} t ( α_t x² + β_t x + γ_t ) y(x)^t / ( s Σ_{t≥0} ( α_t x² + β_t x + γ_t ) y(x)^t ),

where y(x) is the smallest root of the equation

2(|E| − |P|)( Σ_{t≥0} α_t y^t ) x + (|E| − |P| − |P|x)( Σ_{t≥0} β_t y^t ) − 2|P| ( Σ_{t≥0} γ_t y^t ) = 0.

If y(x#) < 1, then, for y ∈ [y(x#), 1], we set

δ(y) = Σ_{t≥1} t ( α_t (x#)² + β_t x# + γ_t ) y^t / ( s Σ_{t≥0} ( α_t (x#)² + β_t x# + γ_t ) y^t ),

Then, the corresponding GV-MR curve is given by

{(δ(x), ρMR(x)) : x ∈ [1, x#]} ∪ {(δ(y), ρLB(y)) : y ∈ [y(x#), 1]} ∪ {(δ, 0) : δ ≥ δmax}, (27)

where ρMR and ρLB are defined in (25) and (26), respectively.

Example 5.

We continue our example and evaluate the GV-MR bound for the (3,2)-SECC constrained system, with the single-state graph presentation given in Example 4.


Then, the matrices of interest are:

C_G(z) = 1 + 3z  and  D_{G×G}(x, y) = (3 + 6y²)x² + 6xy + 1.

Since C_G(z) and D_{G×G}(x, y) are both singleton matrices, we have Λ(z; C) = 1 + 3z and Λ(x, y; D) = (3 + 6y²)x² + 6xy + 1. Then, G1(z) = −p(1 + 3z) + 3z, G2(x, y) = 3(1 − p)(1 + 2y²)x² + 3(1 − 2p)xy − p, and G3(x, y) = 4x²y² − 3δ(1 + 2y²)x² + 2xy(1 − 3δ) − δ. Now, we apply Corollary 3 and express p, y, and δ in terms of x, where x ∈ [1, x#] with x# = ∞.

p = 3x/(1 + 3x),  y = (x − 1)/(2x),  δ = 2x(x − 1)/(9x² − 1).
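These closed forms can be checked mechanically; the following sketch (under our reading of the signs in G2) verifies that y(x) satisfies G2 = 0 and that δ(x) matches the ratio in Corollary 3:

```python
def closed_forms(x):
    """Parametrization of the (3,2)-SECC GV-MR data in terms of x."""
    p = 3 * x / (1 + 3 * x)
    y = (x - 1) / (2 * x)
    delta = 2 * x * (x - 1) / (9 * x**2 - 1)
    return p, y, delta

def G2(x, y, p):
    """Stationarity condition of the example: 3(1-p)(1+2y^2)x^2 + 3(1-2p)xy - p."""
    return 3 * (1 - p) * (1 + 2 * y**2) * x**2 + 3 * (1 - 2 * p) * x * y - p
```

For instance, x = 2 gives p = 6/7, y = 1/4, and δ = 4/35, and this triple indeed annihilates G2.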

Now, we observe that y(x#) = 1/2. Since we can still increase y to 1, once we reach the boundary p = 1, we apply the bound with p = 1 and x = z = x#. Hence, at the boundary, we solve the following problem:

S(1) = (1/3) log 3,
T(1, δ) = (1/3) inf{ −2 log x − 3δ log y + log( 3(1 + 2y²)x² + 6xy + 1 ) : 1/2 ≤ y ≤ 1, x = x# } = (1/3) inf{ −3δ log y + log 3 + log(1 + 2y²) : 1/2 ≤ y ≤ 1 },
RMR(δ) = 2S(1) − T(1, δ).

By setting F(y) = −3δ(1 + 2y²) + 4y² = 0, we get δ = 4y²/(3(1 + 2y²)), where y ∈ [1/2, 1], and we plot the respective curve.

6. Numerical Plots

In this section, we apply our numerical procedures to compute the GV and GV-MR bounds for some specific constrained systems. In particular, we consider the (L, w)-SWCC constrained systems defined in Section 3, the ubiquitous (d, k)-runlength limited systems (see, for example, p. 3 in [11]), and the (L, w)-subblock energy-constrained codes recently introduced in [7]. In addition to the GV and GV-MR curves, we also plot a simple lower bound. For each δ ∈ [0, 1/2], the size of any ball of radius δn is at most 2^{H(δ)n}. So, for any constrained system S, we have that T̃(δ) ≤ Cap(S) + H(δ). Therefore, we have that

R(δ; S) ≥ Cap(S) − H(δ). (28)

From the plots in Figure 1, Figure 2 and Figure 3, it is also clear that the computations of (7) and (17) yield a significantly better lower bound.

6.1. (L,w)-Sliding Window Constrained Codes

We fix L and w. We recall from Section 3 that a binary word satisfies the (L, w)-sliding window weight constraint if the number of ones in every window of L consecutive bits is at least w, and the (L, w)-SWCC constrained system refers to the collection of words that meet this constraint. From [8,9], we have a simple graph presentation that uses only C(L, w) states. To validate our methods, we choose (L, w) ∈ {(3, 2), (10, 7)}; the corresponding graph presentations have 3 and 120 states, respectively. Applying the plotting procedures described in Theorems 1 and 3, we obtain Figure 1.

6.2. (d,k)-Runlength Limited Codes

Next, we revisit the ubiquitous runlength constraint. We fix d and k. We say that a binary word satisfies the (d, k)-RLL constraint if each run of zeroes in the word has a length of at least d and at most k. Here, we allow the first and last runs of zeroes to have a length of less than d. We refer to the collection of words that meet this constraint as the (d, k)-RLL constrained system. It is well known that a (d, k)-RLL constrained system has a graph presentation with k + 1 states (see, for example, [11]). Here, we choose (d, k) ∈ {(1, 3), (3, 7)} to validate our methods and apply Theorems 1 and 3 to obtain Figure 2. For (d, k) = (3, 7), we corroborate our results with those derived in [15]. Specifically, Winick and Yang determined the GV bound (1) for the (3,7)-RLL constraint and remarked that the “evaluation of the (GV-MR) bound required considerable computation” for “a small improvement”. In Table 3, we verify this statement.

Table 3.

Comparison of the GV-MR bound with the lower bound of [15] for the (3,7)-RLL constrained system.

δ GV-MR Bound (15) GV Bound [15] (see Equation (1))
0 0.406 0.406
0.05 0.255 0.225
0.1 0.163 0.163
0.15 0.095 0.094
0.2 0.048 0.044
0.25 0.018 0.012

6.3. (L,w)-Subblock Energy-Constrained Codes

We fix L and w. We recall from Section 5 that a binary word satisfies the (L, w)-subblock energy constraint if each subblock of length L has a weight of at least w, and the (L, w)-SECC constrained system refers to the collection of words that meet this constraint. Then, the corresponding graph presentation has a single state x with Σ_{i=w}^{L} C(L, i) edges, where each edge is labeled by a word of length L and weight at least w. We apply the methods in Section 5 to determine the GV and GV-MR bounds.

For the GV bound, we provide the explicit formula for α_t (writing C(n, k) for the binomial coefficient) and proceed as in Example 4.

α_t = C(L, t) ( |E| − Σ_{j=1}^{t} Σ_{k=0}^{⌈j/2⌉−1} C(L−t, w−j+k) C(t, k) ). (29)

Similarly, for the GV-MR bound, we provide the explicit formulas for α_t, β_t, and γ_t and proceed as in Example 5.

α_t = C(L, w) C(L−w, t/2) C(w, t/2) if t is even; otherwise, α_t = 0. (30)
β_t = 2 C(L, w) Σ_{b=0}^{⌈t/2⌉−1} C(w, b) C(L−w, t−b). (31)
γ_t = C(L, t) ( |E| − Σ_{j=1}^{t} Σ_{k=0}^{⌈j/2⌉−1} C(L−t, w−j+k) C(t, k) ) − α_t − β_t. (32)
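These counts can be sanity-checked against the (3,2)-SECC example, whose reduced distance matrix has coefficients α = (3, 0, 6), β = (0, 6, 0), and γ = (1, 0, 0). A sketch (using our reading of the binomial sums; C(n, k) is taken to be zero outside 0 ≤ k ≤ n):

```python
from math import comb

def C(n, k):
    """Binomial coefficient, zero outside 0 <= k <= n."""
    return comb(n, k) if 0 <= k <= n else 0

L, w = 3, 2
E = sum(C(L, i) for i in range(w, L + 1))  # |E| = 4 edges

def total_pairs(t):
    """All ordered label pairs at distance t, following Equation (29)."""
    inner = sum(C(L - t, w - j + k) * C(t, k)
                for j in range(1, t + 1) for k in range((j + 1) // 2))
    return C(L, t) * (E - inner)

def alpha(t):
    """Equation (30): both labels of weight exactly w."""
    return C(L, w) * C(L - w, t // 2) * C(w, t // 2) if t % 2 == 0 else 0

def beta(t):
    """Equation (31): exactly one label of weight w."""
    return 2 * C(L, w) * sum(C(w, b) * C(L - w, t - b) for b in range((t + 1) // 2))

def gamma(t):
    """Equation (32): total minus the two preceding counts."""
    return total_pairs(t) - alpha(t) - beta(t)
```

The three families partition all ordered label pairs, so they sum to the GV count (29) for every t.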

In Figure 3, we plot the GV bound and GV-MR bounds. We remark that the simple lower bound (28) corresponds to Proposition 12 in [21].

Acknowledgments

The authors would like to thank the assistant editor for her skillful handling and the anonymous reviewers for their valuable suggestions.

Appendix A. Power Iteration Method for Derivatives of Dominant Eigenvalues

Throughout this appendix, we assume that A is a diagonalizable matrix with dominant eigenvalue λ1 and whose corresponding eigenspace has dimension one. Let e1 be the unit eigenvector whose entries are positive in this space. Then, the power iteration method is a well-known numerical procedure that finds the dominant eigenvalue λ1 and the corresponding eigenvector e1 efficiently.

Now, in the preceding sections, the entries of the matrix A are functions in either one or two variables and, thus, the dominant eigenvalue λ1 is a function in the same variables. Moreover, the numerical procedures in those sections require us to compute higher-order (partial) derivatives of this dominant eigenvalue function λ1. To the best of our knowledge, there are no existing algorithms or numerical procedures that estimate the values of these derivatives. Hence, in this appendix, we modify the power iteration method to compute these estimates.

Formally, let A be an irreducible nonnegative diagonalizable square matrix with dominant eigenvalue λ1 and corresponding unit eigenvector e1. Since A is diagonalizable, A has n eigenvectors e1, e2, …, en that form an orthonormal basis for R^n. Let λ1, λ2, …, λn be the corresponding eigenvalues and, so, we have that

A e_i = λ_i e_i for all i = 1, 2, …, n. (A1)

Since A is irreducible, the dominant eigenspace has dimension one and, also, the dominant eigenvalue is real and positive. Therefore, we can assume that λ1 > |λ2| ≥ ⋯ ≥ |λn|.

We first assume that the entries of A are functions in the variable z. Hence, the λ_i and the entries of the e_i are functions in z too. Power Iteration I then evaluates both λ1 and λ1′ for some fixed value of z, while Power Iteration II additionally evaluates the second-order derivative λ1″.

The case where the entries of A are functions in two variables x and y is discussed at the end of the appendix. Here, Power Iteration III evaluates higher-order partial derivatives of λ1 for certain fixed values of x and y. For ease of exposition, we provide detailed proofs of the correctness of Power Iteration I; the proofs can be extended to Power Iteration II and Power Iteration III.

We continue our discussion for the case where the entries of A are univariate functions in z. We differentiate each entry of A with respect to z to obtain the matrix A′. Furthermore, for all 1 ≤ i ≤ n, we differentiate the entries of the eigenvector e_i and the eigenvalue λ_i to obtain e_i′ and λ_i′, respectively. Specifically, it follows from (A1) that

A′ e_i + A e_i′ = λ_i′ e_i + λ_i e_i′ for all i = 1, 2, …, n. (A2)

Then, the following procedure computes both λ1 and λ1′.

Power Iteration I.

Input: Irreducible nonnegative diagonalizable matrix A

Output: Estimates of λ1 and λ1′

  • (1)
    Initialize q(0) such that all its entries are strictly positive.
    • Fix a tolerance value ϵ.
    • While ‖q(k) − q(k−1)‖ > ϵ,
      • Set
        λ(k) = ‖A q(k−1)‖,  q(k) = A q(k−1)/λ(k),
        μ(k) = ‖A′ q(k−1) + A r(k−1) − λ(k) r(k−1)‖,  r(k) = ( A r(k−1) + A′ q(k−1) − μ(k) q(k−1) )/λ(k).
      • Increment k by one.
  • (2)
    Set λ1 ← λ(k) and λ1′ ← μ(k).
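A minimal Python sketch of Power Iteration I follows. The matrix A(z) = [[z², z], [z, 1]] is our own test case: it is symmetric and nonnegative with dominant eigenvalue 1 + z², so at z = 2 we expect λ1 = 5 and λ1′ = 4.

```python
import numpy as np

def power_iteration_I(A, dA, q0, tol=1e-10, max_iter=500):
    """Estimate the dominant eigenvalue lam1 of A and its derivative
    lam1', where dA is the entrywise derivative A'. q0 must be positive."""
    q = q0 / np.linalg.norm(q0)
    r = np.zeros_like(q)  # running estimate related to e1'
    lam = mu = 0.0
    for _ in range(max_iter):
        qp, rp = q, r
        lam = np.linalg.norm(A @ qp)
        q = (A @ qp) / lam
        mu = np.linalg.norm(dA @ qp + A @ rp - lam * rp)
        r = (A @ rp + dA @ qp - mu * qp) / lam
        if max(np.linalg.norm(q - qp), np.linalg.norm(r - rp)) < tol:
            break
    return lam, mu

A = np.array([[4.0, 2.0], [2.0, 1.0]])   # A(z) at z = 2
dA = np.array([[4.0, 1.0], [1.0, 0.0]])  # A'(z) at z = 2
lam, mu = power_iteration_I(A, dA, np.array([1.0, 1.0]))
```

Note that only matrix-vector products are used, so the cost per iteration stays linear in the number of nonzero entries of A.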

Theorem A1.

If A is an irreducible nonnegative diagonalizable matrix and q(0) has positive components with unit norm, then, as k → ∞, we have

λ(k) → λ1,  q(k) → e1,  μ(k) → λ1′.

Here, q(k) → e1 means that ‖q(k) − e1‖ → 0 as k → ∞.

Before we present the proof of Theorem A1, we remark that the usual power iteration method computes only λ(k) and q(k). Then, it is well-known (see, for example, [22]) that λ(k) and q(k) tend to λ1 and e1, respectively.

Now, since the e_i span R^n, we can write q(0) = Σ_{i=1}^n α_i e_i for any initial vector q(0). The next technical lemma provides closed-form expressions for λ(k), q(k), μ(k), and r(k) in terms of the λ_i, e_i, and α_i.

Lemma A1.

Let q(0) = Σ_{i=1}^n α_i e_i. Then,

q(k) = Σ_{i=1}^n α_i λ_i^k e_i / ‖ Σ_{i=1}^n α_i λ_i^k e_i ‖, (A3)
λ(k) = ‖ Σ_{i=1}^n α_i λ_i^k e_i ‖ / ‖ Σ_{i=1}^n α_i λ_i^{k−1} e_i ‖, (A4)
r(k) = Σ_{i=1}^n [ (α_i′ e_i + α_i e_i′) λ_i^k + ( kλ_i′ − Σ_{j=1}^k μ(j) ) α_i λ_i^{k−1} e_i ] / ‖ Σ_{i=1}^n α_i λ_i^k e_i ‖, (A5)
μ(k) = ‖ Σ_{i=1}^n [ (α_i′ e_i + α_i e_i′) λ_i^{k−1} (λ_i − λ(k)) + α_i λ_i^{k−1} λ_i′ e_i + ( (k−1)λ_i′ − Σ_{j=1}^{k−1} μ(j) ) α_i λ_i^{k−2} (λ_i − λ(k)) e_i ] ‖ / ‖ Σ_{i=1}^n α_i λ_i^{k−1} e_i ‖. (A6)

Proof. 

Since q(k) is defined recursively as q(k) = A q(k−1)/λ(k) = A q(k−1)/‖A q(k−1)‖, we have that

q(k) = A^k q(0) / ‖ A^k q(0) ‖.

Then, it follows from Equation (A1) that

A^k q(0) = A^k Σ_{i=1}^n α_i e_i = Σ_{i=1}^n α_i (A^k e_i) = Σ_{i=1}^n α_i λ_i^k e_i, (A7)

and, so, we obtain (A3). Similarly, from (A1), we have that

λ(k) = ‖A q(k−1)‖ = ‖A^k q(0)‖ / ‖A^{k−1} q(0)‖ = ‖ Σ_{i=1}^n α_i λ_i^k e_i ‖ / ‖ Σ_{i=1}^n α_i λ_i^{k−1} e_i ‖,

as required for (A4).

Next, we note that r(0) = Σ_{i=1}^n α_i′ e_i + Σ_{i=1}^n α_i e_i′. Then, using the recursive definition of r(k), we have

r(k) = ( A^k r(0) + Σ_{j=0}^{k−1} A^j A′ A^{k−j−1} q(0) − ( Σ_{j=1}^k μ(j) ) A^{k−1} q(0) ) / ‖ A^k q(0) ‖. (A8)

Then, from (A1), we have

A^k r(0) = A^k ( Σ_{i=1}^n α_i′ e_i + Σ_{i=1}^n α_i e_i′ ) = Σ_{i=1}^n α_i′ λ_i^k e_i + Σ_{i=1}^n α_i (A^k e_i′), (A9)

and, from (A2),

A′ Σ_{i=1}^n α_i λ_i^{k−j−1} e_i = Σ_{i=1}^n α_i λ_i^{k−j−1} (A′ e_i) = Σ_{i=1}^n α_i λ_i^{k−j−1} ( λ_i′ e_i + λ_i e_i′ − A e_i′ ).

Therefore, using (A1) again,

Σ_{j=0}^{k−1} A^j A′ Σ_{i=1}^n α_i λ_i^{k−j−1} e_i = Σ_{j=0}^{k−1} A^j Σ_{i=1}^n α_i λ_i^{k−j−1} ( λ_i′ e_i + λ_i e_i′ − A e_i′ ) = k Σ_{i=1}^n α_i λ_i^{k−1} λ_i′ e_i + Σ_{i=1}^n α_i λ_i^k e_i′ − Σ_{i=1}^n α_i (A^k e_i′).

Therefore, we obtain (A5).

Finally, we recall that μ(k) is defined as

μ(k) = ‖ A′ q(k−1) + A r(k−1) − λ(k) r(k−1) ‖.

Then, by replacing r(k1) and q(k1) from (A5) and (A3), respectively, and then using Equation (A2), we obtain (A6). □

Finally, we are ready to demonstrate the correctness of Power Iteration I.

Proof of Theorem A1.

Since A is an irreducible nonnegative diagonalizable matrix, λ1 is real and positive, and there exists 0 < ϵ < 1 such that |λ_i|/λ1 < ϵ for all i = 2, 3, …, n (see, for example, [11]). For the purposes of brevity, we write

Φ_k = Σ_{i=1}^n α_i λ_i^k e_i (A10)

and, so, we can rewrite (A3) as

q(k) = Φ_k/‖Φ_k‖ = ( λ1^k/‖Φ_k‖ ) ( α1 e1 + Σ_{i=2}^n α_i (λ_i^k/λ1^k) e_i ).

Now, since |λ_i|^k/λ1^k ≤ ϵ^k for all i = 2, …, n, we have that

‖ Φ_k/λ1^k − α1 e1 ‖ ≤ C1 ϵ^k for some constant C1. (A11)

Then, using the triangle inequality, we have that, as k → ∞, ‖Φ_k‖/λ1^k → α1 and, thus, λ1^k/‖Φ_k‖ → 1/α1. Therefore, ‖q(k) − e1‖ → 0, as required.

It should be noted that since λ1^k/‖Φ_k‖ tends to a finite limit, we have that λ1^k/‖Φ_k‖ is bounded above by some constant. In other words, we have that

λ1^k/‖Φ_k‖ ≤ C2 for some constant C2. (A12)

Next, we show the following inequality:

|λ(k) − λ1| ≤ C3 ϵ^{k−1} for some constant C3. (A13)

Using (A4), we have that

|λ(k) − λ1| = | ‖Φ_k‖ − λ1 ‖Φ_{k−1}‖ | / ‖Φ_{k−1}‖ ≤ ‖ Φ_k − λ1 Φ_{k−1} ‖ / ‖Φ_{k−1}‖ = ( λ1^{k−1}/‖Φ_{k−1}‖ ) · λ1 · ‖ Σ_{i=2}^n α_i ( λ_i^k/λ1^k − λ_i^{k−1}/λ1^{k−1} ) e_i ‖.

Now, observe that |λ_i^k/λ1^k − λ_i^{k−1}/λ1^{k−1}| ≤ 2ϵ^{k−1} for i = 2, …, n. Since λ1^{k−1}/‖Φ_{k−1}‖ ≤ C2, we obtain (A13) after applying the triangle inequality.

Again, to reduce clutter, we introduce the following abbreviations:

D_k = Σ_{i=1}^n (α_i′ e_i + α_i e_i′) λ_i^{k−1} (λ_i − λ(k)),  E_k = Σ_{i=1}^n α_i λ_i^{k−1} λ_i′ e_i,  F_k = Σ_{i=1}^n ( (k−1)λ_i′ − Σ_{j=1}^{k−1} μ(j) ) α_i λ_i^{k−2} (λ_i − λ(k)) e_i.

Thus, we can rewrite (A6) as

μ(k) = ‖ D_k + E_k + F_k ‖ / ‖Φ_{k−1}‖ ≤ λ1′ + ‖D_k‖/‖Φ_{k−1}‖ + ‖ E_k − λ1′ Φ_{k−1} ‖/‖Φ_{k−1}‖ + ‖F_k‖/‖Φ_{k−1}‖.

Next, we bound each of the summands on the right-hand side. Specifically, we show the following inequalities:

‖D_k‖/‖Φ_{k−1}‖ + ‖ E_k − λ1′ Φ_{k−1} ‖/‖Φ_{k−1}‖ ≤ C4 ϵ^{k−1} for some constant C4, (A14)
‖F_k‖/‖Φ_{k−1}‖ ≤ C5 (k−1) ϵ^{k−1} + C5 ( Σ_{j=1}^{k−1} μ(j) ) ϵ^{k−1} for some constant C5. (A15)

To demonstrate (A14), we consider

‖D_k‖/λ1^{k−1} = ‖ Σ_{i=1}^n (α_i′ e_i + α_i e_i′) (λ_i^{k−1}/λ1^{k−1}) (λ_i − λ(k)) ‖ ≤ ‖α1′ e1 + α1 e1′‖ |λ1 − λ(k)| + ϵ^{k−1} Σ_{i=2}^n ‖α_i′ e_i + α_i e_i′‖ |λ_i − λ(k)|.

We use (A13) to bound the first summand by some constant multiple of ϵ^{k−1}. On the other hand, we have |λ_i − λ(k)| ≤ |λ_i − λ1| + |λ1 − λ(k)| ≤ max{ |λ_i − λ1| : 2 ≤ i ≤ n } + C3 ϵ^{k−1} for 2 ≤ i ≤ n. In other words, the second summand is also bounded by some constant multiple of ϵ^{k−1}. Next, we consider

‖ E_k − λ1′ Φ_{k−1} ‖ / λ1^{k−1} = ‖ Σ_{i=1}^n α_i (λ_i^{k−1}/λ1^{k−1}) (λ_i′ − λ1′) e_i ‖ ≤ ϵ^{k−1} Σ_{i=2}^n | α_i (λ_i′ − λ1′) |,

and, so, ‖E_k − λ1′ Φ_{k−1}‖/λ1^{k−1} is also bounded by a multiple of ϵ^{k−1}. Therefore, since λ1^{k−1}/‖Φ_{k−1}‖ ≤ C2, we have (A14). Using similar methods, we can establish (A15).

Next, we apply (A14) and then recursively apply (A15) until the right-hand side is free of the μ(i)'s. Then, it follows that

μ(k) ≤ λ1′ + C4 ϵ^{k−1} + C5 (k−1) ϵ^{k−1} + C5 ϵ^{k−1} ( Π_{j=2}^{k−1} (1 + C5 ϵ^{k−j}) + Σ_{i=1}^{k−1} ( λ1′ + C4 ϵ^{k−i−1} + C5 (k−i−1) ϵ^{k−i−1} ) Π_{j=2}^{i} (1 + C5 ϵ^{k−j}) ). (A16)

Furthermore, since Π_{j=2}^{i} (1 + C5 ϵ^{k−j}) ≤ Π_{j=2}^{k−1} (1 + C5 ϵ^{k−j}) for all i ≤ k − 1, we can rewrite (A16) as

μ(k) ≤ λ1′ + C4 ϵ^{k−1} + C5 (k−1) ϵ^{k−1} + C5 ϵ^{k−1} Π_{j=2}^{k−1} (1 + C5 ϵ^{k−j}) ( 1 + Σ_{i=1}^{k−1} ( λ1′ + C4 ϵ^{k−i−1} + C5 (k−i−1) ϵ^{k−i−1} ) ). (A17)

Next, it follows from standard calculus that Π_{j=2}^{k−1} (1 + C5 ϵ^{k−j}) < e^{C5/(1−ϵ)}. Furthermore, since ϵ < 1, we have Σ_{j=0}^{k−2} ϵ^j < 1/(1 − ϵ) and Σ_{j=0}^{k−2} j ϵ^j < 1/(1 − ϵ)². Putting everything together, we have

μ(k) ≤ λ1′ + C4 ϵ^{k−1} + C5 (k−1) ϵ^{k−1} + C5 ϵ^{k−1} e^{C5/(1−ϵ)} ( 1 + (k−1) λ1′ + C4/(1−ϵ) + C5/(1−ϵ)² ). (A18)

As k → ∞, since ϵ < 1, we have ϵ^k → 0 and kϵ^k → 0. Therefore, lim_{k→∞} μ(k) ≤ λ1′. Using similar methods, we have that lim_{k→∞} μ(k) ≥ λ1′ and, so, lim_{k→∞} μ(k) = λ1′, as required. □

Next, we modify Power Iteration I so as to compute the higher order derivatives. We omit a detailed proof as it is similar to the proof of Theorem A1.

Power Iteration II.

Input: Irreducible nonnegative diagonalizable matrix A

Output: Estimates of λ1, λ1′, and λ1″

  • (1)
    Initialize q(0) such that all its entries are strictly positive.
    • Fix a tolerance value ϵ.
    • While ‖q(k) − q(k−1)‖ > ϵ,
      • Set
        λ(k) = ‖A q(k−1)‖,  q(k) = A q(k−1)/λ(k),
        μ(k) = ‖A′ q(k−1) + A r(k−1) − λ(k) r(k−1)‖,  r(k) = ( A r(k−1) + A′ q(k−1) − μ(k) q(k−1) )/λ(k),
        ν(k) = ‖A″ q(k−1) + 2A′ r(k−1) + A s(k−1) − λ(k) s(k−1) − 2μ(k) r(k−1)‖,
        s(k) = ( A″ q(k−1) + 2A′ r(k−1) + A s(k−1) − 2μ(k) r(k−1) − ν(k) q(k−1) )/λ(k).
      • Increment k by one.
  • (2)
    Set λ1 ← λ(k), λ1′ ← μ(k), and λ1″ ← ν(k).
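Extending the Power Iteration I sketch with the ν and s updates gives an estimate of the second derivative as well. We again use our own test matrix A(z) = [[z², z], [z, 1]], whose dominant eigenvalue is 1 + z²; at z = 2 we therefore expect λ1 = 5, λ1′ = 4, and λ1″ = 2.

```python
import numpy as np

def power_iteration_II(A, dA, ddA, q0, tol=1e-10, max_iter=500):
    """Estimate lam1, lam1', lam1'' given A, A' = dA, and A'' = ddA."""
    q = q0 / np.linalg.norm(q0)
    r = np.zeros_like(q)  # estimate related to e1'
    s = np.zeros_like(q)  # estimate related to e1''
    lam = mu = nu = 0.0
    for _ in range(max_iter):
        qp, rp, sp = q, r, s
        lam = np.linalg.norm(A @ qp)
        q = (A @ qp) / lam
        mu = np.linalg.norm(dA @ qp + A @ rp - lam * rp)
        r = (A @ rp + dA @ qp - mu * qp) / lam
        nu = np.linalg.norm(ddA @ qp + 2 * dA @ rp + A @ sp
                            - lam * sp - 2 * mu * rp)
        s = (ddA @ qp + 2 * dA @ rp + A @ sp - 2 * mu * rp - nu * qp) / lam
        if max(np.linalg.norm(q - qp), np.linalg.norm(r - rp),
               np.linalg.norm(s - sp)) < tol:
            break
    return lam, mu, nu

A = np.array([[4.0, 2.0], [2.0, 1.0]])    # A(z) at z = 2
dA = np.array([[4.0, 1.0], [1.0, 0.0]])   # A'(z) at z = 2
ddA = np.array([[2.0, 0.0], [0.0, 0.0]])  # A''(z) at z = 2
lam, mu, nu = power_iteration_II(A, dA, ddA, np.array([1.0, 1.0]))
```

The ν update mirrors the second derivative of (A1): differentiating A e = λ e twice gives A″e + 2A′e′ + Ae″ = λ″e + 2λ′e′ + λe″.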

Theorem A2.

If A is an irreducible nonnegative diagonalizable matrix and q(0) has positive components with unit norm, then, as k → ∞, we have

λ(k) → λ1,  q(k) → e1,  μ(k) → λ1′,  ν(k) → λ1″.

Finally, we end this appendix with a power iteration method that computes the partial derivatives when the elements of the given matrix are bivariate functions.

Power Iteration III.

Input: Irreducible nonnegative diagonalizable matrix A

Output: Estimates of λ1, (λ1)_x, (λ1)_y, (λ1)_xx, (λ1)_yy, and (λ1)_xy

  • (1)
    Initialize q(0) such that all its entries are strictly positive.
    • Fix a tolerance value ϵ.
    • While ‖q(k) − q(k−1)‖ > ϵ,
      • Set
        λ(k) = ‖A q(k−1)‖,  q(k) = A q(k−1)/λ(k),
        λ_x(k) = ‖A_x q(k−1) + A q_x(k−1) − λ(k) q_x(k−1)‖,  q_x(k) = ( A_x q(k−1) + A q_x(k−1) − λ_x(k) q(k−1) )/λ(k),
        λ_y(k) = ‖A_y q(k−1) + A q_y(k−1) − λ(k) q_y(k−1)‖,  q_y(k) = ( A_y q(k−1) + A q_y(k−1) − λ_y(k) q(k−1) )/λ(k),
        λ_xx(k) = ‖A_xx q(k−1) + 2A_x q_x(k−1) + A q_xx(k−1) − λ(k) q_xx(k−1) − 2λ_x(k) q_x(k−1)‖,
        q_xx(k) = ( A_xx q(k−1) + 2A_x q_x(k−1) + A q_xx(k−1) − 2λ_x(k) q_x(k−1) − λ_xx(k) q(k−1) )/λ(k),
        λ_yy(k) = ‖A_yy q(k−1) + 2A_y q_y(k−1) + A q_yy(k−1) − λ(k) q_yy(k−1) − 2λ_y(k) q_y(k−1)‖,
        q_yy(k) = ( A_yy q(k−1) + 2A_y q_y(k−1) + A q_yy(k−1) − 2λ_y(k) q_y(k−1) − λ_yy(k) q(k−1) )/λ(k),
        λ_xy(k) = ‖A_xy q(k−1) + A_x q_y(k−1) + A_y q_x(k−1) + A q_xy(k−1) − λ(k) q_xy(k−1) − λ_x(k) q_y(k−1) − λ_y(k) q_x(k−1)‖,
        q_xy(k) = ( A_xy q(k−1) + A_x q_y(k−1) + A_y q_x(k−1) + A q_xy(k−1) − λ_x(k) q_y(k−1) − λ_y(k) q_x(k−1) − λ_xy(k) q(k−1) )/λ(k).
      • Increment k by one.
  • (2)
    Set λ1 ← λ(k), (λ1)_x ← λ_x(k), (λ1)_y ← λ_y(k), (λ1)_xx ← λ_xx(k), (λ1)_yy ← λ_yy(k), and (λ1)_xy ← λ_xy(k).

Theorem A3.

If A is an irreducible nonnegative diagonalizable matrix and q(0) has positive components with unit norm, then, as k → ∞, we have λ_xx(k) → (λ1)_xx, λ_yy(k) → (λ1)_yy, and λ_xy(k) → (λ1)_xy.

Author Contributions

Conceptualization, K.G. and H.M.K.; software, K.G.; writing—original draft preparation, K.G.; writing—review and editing, K.G. and H.M.K. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Funding Statement

The work of Han Mao Kiah was supported by the Ministry of Education, Singapore, under its MOE AcRF Tier 2 Award under Grant MOE-T2EP20121-0007 and MOE AcRF Tier 1 Award under Grant RG19/23.

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Yazdi S.M.H.T., Kiah H.M., Garcia-Ruiz E., Ma J., Zhao H., Milenkovic O. DNA-Based Storage: Trends and Methods. IEEE Trans. Mol. Biol. Multi-Scale Commun. 2015;1:230–248. doi: 10.1109/TMBMC.2016.2537305. [DOI] [Google Scholar]
  • 2.Immink K.A.S., Cai K. Efficient balanced and maximum homopolymer-run restricted block codes for DNA-based data storage. IEEE Commun. Lett. 2019;23:1676–1679. doi: 10.1109/LCOMM.2019.2930970. [DOI] [Google Scholar]
  • 3.Nguyen T.T., Cai K., Immink K.A.S., Kiah H.M. Capacity-Approaching Constrained Codes with Error Correction for DNA-Based Data Storage. IEEE Trans. Inf. Theory. 2021;67:5602–5613. doi: 10.1109/TIT.2021.3066430. [DOI] [Google Scholar]
  • 4.Kovačević M., Vukobratović D. Asymptotic Behavior and Typicality Properties of Runlength-Limited Sequences. IEEE Trans. Inf. Theory. 2022;68:1638–1650. doi: 10.1109/TIT.2021.3134871. [DOI] [Google Scholar]
  • 5.Popovski P., Fouladgar A.M., Simeone O. Interactive joint transfer of energy and information. IEEE Trans. Commun. 2013;61:2086–2097. doi: 10.1109/TCOMM.2013.031213.120723. [DOI] [Google Scholar]
  • 6.Fouladgar A.M., Simeone O., Erkip E. Constrained codes for joint energy and information transfer. IEEE Trans. Commun. 2014;62:2121–2131. doi: 10.1109/TCOMM.2014.2317480. [DOI] [Google Scholar]
  • 7.Tandon A., Motani M., Varshney L.R. Subblock-constrained codes for real-time simultaneously energy and information transfer. IEEE Trans. Inf. Theory. 2016;62:4212–4227. doi: 10.1109/TIT.2016.2559504. [DOI] [Google Scholar]
  • 8.Immink K.A.S., Cai K. Block Codes for Energy-Harvesting Sliding- Window Constrained Channels. IEEE Commun. Lett. 2020;24:2383–2386. doi: 10.1109/LCOMM.2020.3012301. [DOI] [Google Scholar]
  • 9.Immink K.A.S., Cai K. Properties and Constructions of Energy-Harvesting Sliding-Window Constrained Codes. IEEE Commun. Lett. 2020;24:1890–1893. doi: 10.1109/LCOMM.2020.2993467. [DOI] [Google Scholar]
  • 10.Wu T.Y., Tandon A., Varshney L.R., Motani M. Skip-sliding window codes. IEEE Trans. Commun. 2021;69:2824–2836. doi: 10.1109/TCOMM.2021.3058965. [DOI] [Google Scholar]
  • 11.Marcus B.H., Roth R.M., Siegel P.H. An Introduction to Coding for Constrained Systems. 2001. [(accessed on 1 October 2020)]. Lecture Notes. Available online: https://ronny.cswp.cs.technion.ac.il/wp-content/uploads/sites/54/2016/05/chapters1-9.pdf.
  • 12.Kolesnik V.D., Krachkovsky V.Y. Generating functions and lower bounds on rates for limiting error-correcting codes. IEEE Trans. Inf. Theory. 1991;37:778–788. doi: 10.1109/18.79947. [DOI] [Google Scholar]
  • 13.Gu J., Fuja T. A generalized Gilbert-Varshamov bound derived via analysis of a code-search algorithm. IEEE Trans. Inf. Theory. 1993;39:1089–1093. doi: 10.1109/18.256522. [DOI] [Google Scholar]
  • 14.Marcus B.H., Roth R.M. Improved Gilbert-Varshamov bound for constrained systems. IEEE Trans. Inf. Theory. 1992;38:1213–1221. doi: 10.1109/18.144702. [DOI] [Google Scholar]
  • 15.Winick K.A., Yang S.H. Upper bounds on the size of error-correcting runlength-limited codes. Eur. Trans. Telecommun. 1996;37:273–283. doi: 10.1002/ett.4460070309. [DOI] [Google Scholar]
  • 16.Goyal K., Kiah H.M. Evaluating the Gilbert-Varshamov Bound for Constrained Systems; Proceedings of the 2022 IEEE International Symposium on Information Theory (ISIT); Espoo, Finland, 26 June–1 July 2022; pp. 1348–1353. [Google Scholar]
  • 17.Tolhuizen L.M.G.M. The generalized Gilbert-Varshamov bound is implied by Turan’s theorem. IEEE Trans. Inf. Theory. 1997;43:1605–1606. doi: 10.1109/18.623158. [DOI] [Google Scholar]
  • 18.Luenberger D.G. Introduction to Linear and Nonlinear Programming. Addison-Wesley; Reading, MA, USA: 1973. [Google Scholar]
  • 19.Rockafellar R.T. Convex Analysis. Princeton University Press; Princeton, NJ, USA: 1970. [Google Scholar]
  • 20.Kashyap N., Roth R.M., Siegel P.H. The Capacity of Count-Constrained ICI-Free Systems; Proceedings of the 2019 IEEE International Symposium on Information Theory (ISIT); Paris, France. 7–12 July 2019; pp. 1592–1596. [Google Scholar]
  • 21.Tandon A., Kiah H.M., Motani M. Bounds on the size and asymptotic rate of subblock-constrained codes. IEEE Trans. Inf. Theory. 2018;64:6604–6619. doi: 10.1109/TIT.2018.2864137. [DOI] [Google Scholar]
  • 22.Stewart G.W. Introduction to Matrix Computations. Academic Press; New York, NY, USA: 1973. Computer Science and Applied Mathematics. [Google Scholar]
