Abstract
We introduce a class of linear compartmental models called identifiable path/cycle models which have the property that all of the monomial functions of parameters associated to the directed cycles and paths from input compartments to output compartments are identifiable and give sufficient conditions to obtain an identifiable path/cycle model. Removing leaks, we then show how one can obtain a locally identifiable model from an identifiable path/cycle model. These identifiable path/cycle models yield the only identifiable models with certain conditions on their graph structure and thus we provide necessary and sufficient conditions for identifiable models with certain graph properties. A sufficient condition based on the graph structure of the model is also provided so that one can test if a model is an identifiable path/cycle model by examining the graph itself. We also provide some necessary conditions for identifiability based on graph structure. Our proofs use algebraic and combinatorial techniques.
Keywords: Structural identifiability, Linear compartmental model, Identifiable functions of parameters, Identifiable combinations
Introduction
The parameter identifiability problem is the question of whether or not the unknown parameters of a mathematical model can be determined from known data. This paper is concerned with structural identifiability analysis, that is, whether the model parameters can be identified from perfect input–output data (noise-free and of any duration required). Structural identifiability is a necessary condition for practical identifiability which is identifiability analysis in the presence of noisy and imperfect data. Thus, structural identifiability is an important step in the parameter estimation problem, since failure to recover parameters in the ideal case implies failure in the imperfect case as well. If all of the parameters of a model can be determined, we say the model is (at least) locally identifiable, but if some subset of the parameters can take on an infinite number of values yet yield the same input–output data, the model is said to be unidentifiable.
In this work, we examine a special class of models called linear compartmental models. Linear compartmental models are an important class of biological models used in the areas of cell biology, pharmacology, toxicology, ecology, physiology, and many other areas (DiStefano 2015). In a typical biological application, the mass or concentration of a substance (e.g., drug concentration in an organ) is represented by a compartment, and the transfer of material from one compartment to another is given by a constant rate parameter, called an exchange rate. The transfer of material from a compartment leaving the system is given by a constant rate parameter called the leak rate, and any compartment containing such a leak is called a leak compartment. An input represents the input of material to a particular compartment of the system (e.g., IV drug input) and an output represents a measurement from a compartment (e.g., drug concentration in an organ), where such compartments are called input compartments and output compartments, respectively. The resulting ODE system of equations (see Eq. (1)) is linear. This linearity feature has a nice mathematical consequence in that the model can be represented by a directed graph. Thus, we can analyze identifiability problems in terms of the combinatorial structure of that graph.
Recent work on identifiable reparametrizations (Meshkat and Sullivant 2014; Baaijens and Draisma 2016), sufficient conditions for identifiability (Meshkat et al. 2015; Gross et al. 2019; Gerberding et al. 2020), and identifiable functions of parameters (Meshkat and Sullivant 2014; Meshkat et al. 2018) has examined the combinatorial structure of the graph in a linear compartmental model to answer questions about what to do with an unidentifiable model. One approach to dealing with an unidentifiable model is to reparametrize the model over identifiable functions of parameters in the model. In other words, although not all the parameters are identifiable, one can attempt to reparametrize over a set of functions of parameters that can be determined from input–output data. In Meshkat and Sullivant (2014), necessary and sufficient conditions were given to obtain an identifiable scaling reparametrization in the case of a linear compartmental model with a single input and output in the same compartment, leaks from every compartment, and having a strongly connected graph. In Meshkat et al. (2015), these models were called identifiable cycle models because the monomial functions associated to the directed cycles are identifiable and it was shown that removing all but one leak from such models results in identifiability. Additionally, it was shown that removing a subset of leaks, but adding inputs or outputs to the remaining leak compartments, results in identifiability.
In this paper, we expand upon the results in Meshkat and Sullivant (2014); Meshkat et al. (2015) in the following ways. First, we consider the case of inputs and outputs not necessarily in the same compartment and define the analogous identifiable path/cycle model (Definition 3.4), which is a model where all the monomial functions associated to the directed cycles and paths from input to output are identifiable. Just as in Meshkat and Sullivant (2014), this occurs when the model has a coefficient map whose image has maximal dimension (Theorem 3.13). We then take these identifiable path/cycle models and remove leaks from all compartments except input/output compartments to achieve identifiable models (Theorem 3.19). A similar result was demonstrated in Meshkat et al. (2015), but in that version, the intersection of input and output compartments was nonempty, whereas in the present work the input and output compartments need not coincide. We then show that these identifiable path/cycle models yield the only identifiable models with certain conditions on their graph structure (Theorem 4.1). We thus provide necessary and sufficient conditions for identifiable models with certain graph properties (Corollary 4.3). We also give a sufficient condition for a model to be an identifiable path/cycle model which can be tested simply by examining the graph itself (Theorem 3.22). In addition, we weaken the conditions on the graph structure to obtain some necessary and sufficient conditions for identifiability (Corollary 5.11). We also give some necessary conditions for identifiability in terms of the structure of the graph (Theorems 6.1, 6.3, 6.4). Finally, we give a construction of identifiable models using results from Baaijens and Draisma (2016) (Algorithm 9.4).
Our results apply to a large class of linear compartmental models which arise in many real-world applications. Path models of the form in Proposition 3.29 arise in physiological models involving metabolism, biliary, or excretory pathways (DiStefano 2015) and models of neuronal dendritic trees (Bressloff and Taylor 1993). Path models also arise when modeling the delayed response to input and are called time-delay models (DiStefano 2015). One such example is Example 4.13 from DiStefano (2015) on oral dosing losses and delays in the gastrointestinal tract. Some other path models are considered in Sect. 7. More generally, we consider models that are strongly input–output connected. Mammillary and catenary models (DiStefano 2015) fall into this category, as well as a variation of mammillary and catenary models where input and output are in distinct neighboring compartments, but the edge from output to input is missing (see Fig. 1). More generally, our results apply to models that can be thought of as path models combined with catenary models and are considered in Sect. 7. Such a model could, for example, represent a time-delay model coupled with a catenary model.
Fig. 1.

Graph for Example 2.1
The organization of the paper is as follows. Section 2 gives the necessary background. Section 3 gives the definition of an identifiable path/cycle model and how to obtain one. Section 4 gives a classification of all identifiable models with certain graph properties. Section 5 examines weaker conditions on the graph structure for necessary and sufficient conditions for identifiability. Section 6 gives necessary conditions for identifiability in terms of the graph structure of the model. Section 7 demonstrates our results on some real-world examples. Section 8 gives computations on the number of models with maximal dimension with a certain number of inputs and outputs. Sect. 9 gives a construction of identifiable models. Section 10 gives a conclusion and a conjecture on identifiable scaling reparametrizations of identifiable path/cycle models. We give a summary of our main results in Table 1 and a summary of which of our results generalize previous results in Table 2.
Table 1.
Summary of main results
| Result | Explanation |
|---|---|
| Corollary 4.3 | Gives necessary and sufficient conditions for a strongly input–output connected model to be an identifiable path/cycle model |
| Theorem 3.22 | Gives a sufficient condition to be an identifiable path/cycle model based on graph structure |
| Corollary 5.11 | Gives necessary and sufficient conditions for an output connectable model to be an identifiable path/cycle model |
Table 2.
Summary of which new results in this paper generalize the prior results from Meshkat and Sullivant (2014) and Meshkat et al. (2015)
| Prior result | New result | Explanation |
|---|---|---|
| Theorem 1.2 of Meshkat and Sullivant (2014) | Theorem 3.13 | Generalizes conditions for identifiable cycle model to identifiable path/cycle model |
| Theorem 5 from Meshkat et al. (2015) | Theorem 3.19 | Generalizes removing leaks to obtain identifiability |
| Theorem 5.13 from Meshkat and Sullivant (2014) | Theorem 3.22 | Generalizes inductively strongly connected to almost inductively strongly connected |
| Proposition 5.4 from Meshkat and Sullivant (2014) | Proposition 3.29 | Generalizes identifiable cycle to identifiable path |
| Proposition 5.5 from Meshkat and Sullivant (2014) | Proposition 3.30 | Generalizes adding a new vertex |
| Proposition 5.3 from Meshkat and Sullivant (2014) | Theorems 6.3 and 6.4 | Generalizes necessary condition of having an exchange to having a path |
Background
Let G be a directed graph with vertex set V and set of directed edges E. Each vertex corresponds to a compartment in our model, and an edge denotes a direct flow of material from compartment j to compartment i. Also introduce three subsets of the vertices corresponding to the set of input compartments, output compartments, and leak compartments respectively. To each edge we associate an independent parameter , the rate of flow from compartment j to compartment i. To each leak node , we associate an independent parameter , the rate of flow from compartment i leaving the system.
We associate a matrix A(G), called the compartmental matrix to the graph and the set Leak in the following way:
For brevity, we will often use A to denote A(G). Also, define the vector consisting of nonzero parameters of A.
Then we construct a system of linear ODEs with inputs and outputs associated to the quadruple (G, In, Out, Leak) as follows:
| 1 |
where for . The coordinate functions are the state variables, the functions are the output variables, and the nonzero functions are the inputs. The resulting model is called a linear compartmental model.
We will indicate output compartments by this symbol:
Input compartments are labeled by “in,” and leaks are indicated by edges which go to no vertex.
Example 2.1
The model with G given in Figure 1 is a linear compartmental model with equations given by:
| 2 |
with output equation .
For a model (G, In, Out, Leak) where there is a leak in every compartment (i.e., ), it can greatly simplify the representation to use the fact that the diagonal entries of A(G) are the only places where the parameters appear. Since these are algebraically independent parameters, we can introduce a new algebraically independent parameter for the diagonal entries (i.e., we make the substitution ) to get generic parameter values along the diagonal. Identifiability questions in such a model are equivalent to identifiability questions in the model with this reparametrized matrix.
We will be considering graphs that have some special connectedness properties. We define these properties now, as well as the basic algebraic structures (monomial paths and cycles) we will be working over.
Definition 2.2
A directed graph G is connected if each pair of vertices in the graph is joined by an undirected path. A directed graph G is strongly connected if there exists a directed path from each vertex to every other vertex. A directed graph G is inductively strongly connected with respect to vertex 1 if each of the induced subgraphs is strongly connected for for some ordering of the vertices which must start at vertex 1.
Definition 2.3
A closed path in a directed graph G is a sequence of vertices with and such that is an edge for all . A cycle in G is a closed path with no repeated vertices. To a cycle , we associate the monomial , which we refer to as a monomial cycle. If a monomial cycle has length k, we refer to it as a k-cycle.
Note that we also include the monomial cycles which are 1-cycles, or self-cycles.
Definition 2.4
A path from vertex to vertex in a directed graph G is a sequence of vertices such that is an edge for all . To a path , we associate the monomial , which we refer to as a monomial path. If a monomial path has length k, we refer to it as a k-path.
Remark
We will sometimes drop the word “monomial” from “monomial path” or “monomial cycle” and simply refer to these as paths and cycles, as is the case in the next definition.
We now define the path/cycle map for a linear compartmental model :
Definition 2.5
Let be the set of all directed cycles and paths from input to output vertices in the graph G. Define the path/cycle map by:
| 3 |
Example 2.6
(Continuation of Example 2.1) Consider the model as described in Example 2.1. The path/cycle map for this model is
Now we give some definitions from Gross et al. (2019) regarding an important subgraph to this work:
Definition 2.7
For a linear compartmental model , let . The output-reachable subgraph to i (or to ) is the induced subgraph of G containing all vertices j for which there is a directed path in G from j to i. A linear compartmental model is output connectable if every compartment has a directed path leading from it to an output compartment.
We add the following definition:
Definition 2.8
A linear compartmental model is output connectable to every output if every compartment has a directed path leading from it to every output compartment.
We will be using the so-called differential algebra approach to structural identifiability (Ljung and Glad 1994; Ollivier 1990). Other methods to test identifiability include the Taylor Series approach (Pohjanpalo 1978), the generating series approach (Walter and Lecourtier 1982), the similarity transformation approach (Vajda et al. 1989), and the observability-identifiability condition (Villaverde 2019). In the differential algebra approach, we view the model equations as differential polynomials in a differential polynomial ring R(p)[u, y, x], i.e., the ring of polynomials in state variable vector x, output vector y, input vector u, and their derivatives, with coefficients in R(p) for parameter vector p. Since the unmeasured state variables cannot be determined, the goal in this approach is to use differential elimination to eliminate all unknown state variables and their derivatives. The resulting equations are only in terms of input variables, output variables, their derivatives, and parameters, so these equations have the following form:
| 4 |
An equation of the form (4) is called an input–output equation for .
For nonlinear models, one standard “reduced” generating set for these input–output equations is formed by those equations in a characteristic set (defined precisely in Glad 1990) that do not involve the ’s or their derivatives. In a characteristic set, which can be computed using the software DAISY (Bellu et al. 2007), each in each input–output equation (4) is a differential monomial, i.e., a monomial purely in terms of input variables, output variables, and their derivatives. The terms are called the coefficients of the input–output equations. These coefficients can be fixed uniquely by normalizing the input–output equations to make them monic (Bellu et al. 2007).
However, for linear models, it has been shown that these input–output equations can be found much more easily using the Transfer Function approach (Bellman and Åström 1970) or even a trick with Cramer’s Rule (Meshkat and Sullivant 2014). We will be taking the latter approach to get an explicit formula for the input–output equations.
We now state Theorem 3.8 from Gross et al. (2019) with input i and output j, which gives the input–output equation in in terms of the output-reachable subgraph to .
Theorem 2.9
Let be a linear compartmental model with at least one input. Let , and assume that there exists a directed path from some input compartment to compartment-j. Let H denote the output-reachable subgraph to , let denote the compartmental matrix for the restriction , and let be the the product of the differential operator d/dt and the identity matrix. Then the following is an input–output equation for involving :
| 5 |
where denotes the matrix obtained from by removing the row corresponding to compartment-i and the column corresponding to compartment-j. Thus, this input–output equation (5) involves only the output-reachable subgraph to .
Example 2.10
(Continuation of Example 2.1) The model with G given by the graph { } has leaks from every compartment, thus writing the diagonal elements as , we have the following input–output equation:
Identifiability
In the differential algebra approach to structural identifiability, one tests identifiability by using the input–output equations determined from the characteristic set (Bellu et al. 2007). However, for the case of a linear model, we have shown that the input–output equations can be formed using Eq. (5) and thus we would like to define identifiability using these equations. An important question arises here: can we use the input–output equations in Eq. (5) to test identifiability?
In the case of a single output, there is just a single input–output equation of the form in Eq. (5). However, in the case of multiple outputs, there is an input–output equation for each output in Out. The equations formed in Theorem 2.9 are not necessarily minimal, i.e., of lowest degree, but this condition of minimality is required in order for identifiability to be well-defined (Saccomani et al. 2003). In Theorem 3 of (Ovchinnikov et al. 2022), it is shown that the input–output equations in Eq. (5) can be used to analyze identifiability in the case of a linear compartmental model with at least one input and whose graph is strongly connected. Thus, in the rest of this paper, we will be assuming G is strongly connected and the model has at least one input when we are analyzing models with more than 1 output so that our definition of identifiability is well-defined.
The next step of the differential algebra approach assumes that the coefficients of the input–output equations can be recovered uniquely from input–output data, and thus are presumed to be known quantities (Soderstrom and Stoica 1989). While there are examples of linear models where this assumption can lead to incorrect conclusions about identifiability (see Example 2.14 of Hong et al. 2020), this assumption that the coefficients can be recovered uniquely from input–output data holds for all linear compartmental models with corresponding graphs such that one can reach a leak or an input from every vertex (see Theorem 2 of Ovchinnikov et al. 2022). We will only be considering either strongly connected or strongly input–output connected models where the output compartment contains a leak, so the coefficients of the input–output equations can be uniquely recovered.
We give some definitions of identifiability from Meshkat et al. (2015).
Definition 2.11
Let (G, In, Out, Leak) be a linear compartment model and let c denote the vector of all nonzero and nonmonic coefficient functions of all the linear input–output equations derived in Theorem 2.9 for each . The function c defines a map , where k is the total number of coefficients which we call the coefficient map. The linear compartment model (G, In, Out, Leak) is:
globally identifiable if c is a one-to-one function, and is generically globally identifiable if global identifiability holds everywhere in , except possibly on a set of measure zero.
locally identifiable if around any neighborhood of a point in , c is a one-to-one function, and is generically locally identifiable if local identifiability holds everywhere in , except possibly on a set of measure zero.
unidentifiable if c is infinite-to-one.
Remark
Throughout the rest of this paper, we will be concerned with generic local identifiability. Thus, we will drop the word “generic” and just state a model is locally identifiable. Additionally, if a model is “identifiable,” this means it is (at least) locally identifiable.
Example 2.12
(Continuation of Example 2.1) Consider the model as described in Example 2.1. The coefficient map for this model is
Since we will be concerned with generic local identifiability in this work, we will be using the following proposition from Meshkat et al. (2015) which follows from the fact that the rank of the Jacobian of c evaluated at a generic point is equal to the dimension of the image of c (Harris (1992)[Prop. 14.4]).
Proposition 2.13
(Proposition 2 from Meshkat et al. 2015) The model is generically locally identifiable if and only if the rank of the Jacobian of c is equal to when evaluated at a random point.
If a model is unidentifiable, then this means that not all of the parameters can be determined (uniquely or finitely). However, we may still be interested in finding identifiable functions of parameters or identifiable combinations, and in using these functions to attempt to reparametrize the model.
Definition 2.14
Let c be a function . A function is globally identifiable from c if there exists a function such that . The function f is locally identifiable if there is a finitely multivalued function such that .
Strongly Input–Output Connected
In order to consider identifiable path/cycle models, we will be considering graphs G that have the special property of being connected and every edge is contained in a cycle or path from input to output. We call this strongly input–output connected:
Definition 2.15
We say a graph G is strongly input–output connected if it is connected and every edge is contained in a cycle or path from input to output.
We first show that, in the case of a single output, being strongly input–output connected implies being output connectable, so if we assume the former we get output connectable and can use Theorem 2.9 with the whole matrix A. Likewise, for the case of multiple outputs, we show that being strongly connected implies being output connectable to every output.
Proposition 2.16
(1) Consider a model . Assume G is strongly input–output connected. Then G is output connectable. (2) Now consider a model . Assume G is strongly connected. Then G is output connectable to every output.
Proof
Let . Assume G is strongly input–output connected, i.e., it is connected and every edge is contained in a cycle or path from input to output. Since every edge contained in a path from input to output is connected to the output, we need only consider the edges in cycles. If a vertex in a cycle coincides with a vertex on a path from input to output, then we are done. Thus, assume that there exists a cycle whose vertices do not intersect with the vertices on paths from input to output. Since the graph is connected, the cycle must be attached via a directed edge from either the cycle to a path from input to output or vice versa. But the attaching edge must also be on a path from input to output. Thus for any edge from the path to the cycle, there must be a corresponding edge from the cycle to the path. Thus the graph is output connectable.
If and G is strongly connected, then since there is a path from each vertex to every other vertex, then G is output connectable to every output.
Remark
Note that in Proposition 2.16, G must be strongly connected as opposed to strongly input–output connected when , or else not every vertex may connect to every output.
Remark
A model that is strongly input–output connected in the case of a single output or strongly connected in the case of multiple outputs is always structurally observable (Godfrey and Chapman 1990), as it is output connectable to every output by Proposition 2.16.
We now show that the property of being strongly input–output connected is almost strongly connected, in the sense that the graph becomes strongly connected once an edge is added (if not already there) from the output to every input if or an edge is added from every output to the input if .
Proposition 2.17
(1) Consider a model . The model is strongly connected if an edge is added from output j to every input if and only if it is strongly input–output connected. (2) Now consider a model . The model is strongly connected if an edge is added from every output to input i if and only if it is strongly input–output connected. Strongly connected implies strongly input–output connected.
Proof
A model is strongly connected if and only if it is connected and every edge is contained in a cycle. Thus a model is strongly connected if a path from output j to every input is added if and only if it is connected and every edge is contained in a cycle or path from input to output, i.e., strongly input–output connected. Likewise, a model is strongly connected if a path from every output to input i is added if and only if it is connected and every edge is contained in a cycle or path from input to output, i.e., strongly input–output connected. Additionally, if a model is strongly connected, then it is strongly input–output connected, as every edge is contained in a cycle.
We can also examine the minimum number of edges in order to be either strongly connected or strongly input–output connected:
Proposition 2.18
If G is strongly connected, the minimum number of edges is |V|. If G is strongly input–output connected for input i and output j, the minimum number of edges is .
Proof
For G to be strongly connected, each vertex must have at least one incoming and one outgoing edge. Thus the minimum number of edges is |V|. If a graph is strongly input–output connected for input i and output j, then this means it becomes strongly connected if an edge is added from output j to input i (if not already there). This means the minimum number of edges is .
Expected Number of Coefficients
We first give a result from Meshkat et al. (2015), which we have reworded to agree with the new terminology in this work and have split into two parts: Proposition 2.19 shows that the coefficient map factors through, i.e., can be written purely in terms of, cycles, self-cycles and paths, and Lemma 2.23 gives the degree of the highest–order term on the right-hand side of the input–output equation.
Proposition 2.19
(Proposition 5 from Meshkat et al. 2015) Let represent a linear compartmental model that is output connectable. The coefficient map c factors through cycles, self-cycles, and paths from input to output.
Proof
Let be the set of all cycles in G, corresponding to a matrix A. Recall that the coefficients of the characteristic polynomial of A can be written as
where the sum is over all collections of vertex disjoint cycles involving exactly i edges of G, and if C is odd length and if C is even length. This means for every i, all cycles of length i appear as monomial terms in , and for , these cycles of length i appear as monomial products with other cycles in .
By Theorem 2.9 and the fact that G is output connectable, meaning that the output-reachable subgraph of G is all of G, the input–output equation for is given by:
| 6 |
This means the coefficients on the left-hand side factor through the cycles in G.
Let us now examine these coefficients of the terms in Eq. (6). For , the term gives the coefficients of the characteristic polynomial for the matrix with row i and column i removed, thus these coefficients factor through cycles of the induced subgraph removing vertex i.
Now assume . The characteristic polynomial of A can be determined by expanding along the row. Let be the matrix A with the entry nonzero. Then for , taking the partial derivative of the characteristic polynomial of with respect to precisely gives the polynomial , up to a minus sign. Since the coefficients of the characteristic polynomial of factor through the cycles, then taking the derivative of these coefficients with respect to has the effect of removing all monomial terms not involving and setting to one in the monomial terms that do involve . This effectively transforms all cycles involving to paths from the vertex to the vertex. Thus, each of the monomial terms are products of paths from the vertex to the vertex, cycles, and self-cycles. In other words, coefficients are of the form:
where the sum is over all collections of vertex disjoint cycles and paths from i to j involving exactly m edges of G, and if P is odd length and if P is even length.
Thus the coefficients can be factored over cycles, self-cycles, and paths from input to output. In other words, there is a polynomial map
which we will refer to as the path/cycle to coefficient map where k is the number of coefficients, such that where is the path/cycle map from Eq. 3.
Definition 2.20
Let the polynomial map from the path/cycle space of an output connectable model to its corresponding input–output equation coefficient space be defined as the path/cycle to coefficient map.
Example 2.21
(Continuation of Example 2.1) Consider the model as described in Example 2.1. The path/cycle to coefficient map for this model is
Note that the composition of the path/cycle map in Example 2.6 and the path/cycle to coefficient map in Example 2.21 yields the coefficients of the input–output equation for as shown in Example 2.10.
We will be writing the number of coefficients in terms of the minimal distance between an input and output compartment. We define this now:
Definition 2.22
Let i be an input compartment and let j be an output compartment, . Let be the set of all paths from vertex i to vertex j. Let l(P) denote the length of a path . Then we can define the minimum length of all paths from vertex i to vertex j as l(P).
Lemma 2.23
(Proposition 5 from Meshkat et al. 2015) Let represent a linear compartmental model with output connectable. The highest-order term in where on the right-hand side of the input–output equation, Eq. (6), is of degree .
Proof
Let be the matrix A with the entry nonzero. Let be the set of all cycles in , corresponding to a matrix . To determine the coefficient of the highest-order term in , recall that the coefficients of the characteristic polynomial of can be written as
where the sum is over all collections of vertex disjoint cycles involving exactly i edges of , and if C is odd length and if C is even length. This means for every m, all cycles of length m appear as monomial terms in , and for , these cycles of length m appear as monomial products with other cycles in .
We now determine the highest-order term in . Since is just the partial derivative of the characteristic polynomial of with respect to , up to a minus sign, then the right-hand side of the input–output equation for output is of the form, where :
We note that not all of these coefficients for are nonzero and thus we must determine the first nonzero coefficient.
Recall Definition 2.22 for the minimal distance between i and j. Let the length of the shortest cycle involving be of length , so that the length of the shortest path from i to j is of length . Then the coefficient of the highest-order term in is , which is a sum of the shortest paths (of length ) from i to j. Thus it is of the form . This means the highest-order term in is of degree .
We now give a formula for the number of coefficients of the input–output equation in the case of either single input or single output.
Theorem 2.24
(Number of nonzero coefficients) Let represent a linear compartmental model with G output connectable with at least leaks with . There are nonzero coefficients where and . Now let represent a linear compartmental model with G output connectable to every output with at least leaks with . There are nonzero coefficients where and .
Proof
Assume is a linear compartmental model with G output connectable with at least leaks with . By Eq. (6), the highest degree term is |V| on the left-hand side. For the right-hand side, is either 1 or 0. If , then the highest-order term in is of degree on the right-hand side. The highest-order term in is monic, so there are coefficients of terms in . For each , the highest degree term in is of order on the right-hand side by Lemma 2.23. In this case, the highest-order term in is not monic, so there are coefficients of terms in when . Altogether, there are nonzero coefficients where and .
For the case with multiple outputs, let represent a linear compartmental model with G output connectable to every output with at least leaks with . By applying the formula for the case of single output above for each input–output equation, we obtain that there are nonzero coefficients where and .
We need only show that the coefficients are nonzero (for a generic choice of parameters).
If there are leaks from every compartment, Proposition 2.19 shows that the coefficients factor through cycles, self-cycles, and paths from input to output.
Now consider the case of removing leaks. We will be substituting as the negative sum of all outgoing edges when , but if then stays the same. Since , we have that every compartment has an outgoing edge or leak, as every vertex has an outgoing edge except for possibly the output vertex by the output connectable assumption. This means the substitution as the negative sum of all outgoing edges and leaks retains the (i, i) entry of A to be nonzero.
Recall by Proposition 2.19, these coefficients can be factored over cycles, self-cycles, and paths when . Each coefficient, except for the highest order coefficient in when (which is a sum of paths from to j), must have a term involving a self-cycle. If the self-cycles in every coefficient are only from leak compartments, we are done. Otherwise, consider a coefficient that has terms involving self-cycles from non-leak compartments which we must substitute into for the case . We want to show that the substitution of the non-leak diagonal terms as the negative sum of all outgoing edges does not cancel every term in that coefficient, so that the coefficients remain nonzero after substitution. We claim that the substitution of as the negative sum of all outgoing edges for cannot create only terms that are products of cycles and paths from input to output. Since the graph must be output connectable, any cycle formed from the non-leak vertices must connect to the output. In other words, for a chain of vertices in a cycle , one of these vertices must connect to the output via a path from that vertex to the output. Without loss of generality, assume it is vertex . Thus the substitution of the non-leaks , cannot create a single monomial term of the form , but must also create a monomial where we have substituted as for some vertex r that connects via a path to the output.
This monomial cannot itself be a path from input to output, as the input and output vertices have leaks and thus the corresponding diagonal terms do not get substituted. Thus, it is not a path from input to output, and thus cannot cancel with any other terms in that coefficient.
Remark
We note that the assumption of at least leaks with and G to be output connectable is to prevent the situation where there are no outgoing edges or leaks from a non-leak vertex and thus upon substitution of the diagonal element as the negative sum of all outgoing edges and leaks, it becomes zero.
Definition 2.25
(Expected number of coefficients) We say a model
with has the expected number of coefficients if there are nonzero coefficients in the input–output equations (5) where and . We say a model with has the expected number of coefficients if there are nonzero coefficients in the input–output equations (5) where and .
Identifiable Path/Cycle Models
Definition 3.1
We say a model has a coefficient map with expected dimension if the dimension of the image of the coefficient map is maximal.
We will be examining a special class of models we call identifiable path/cycle models. This class of models is a generalization of identifiable cycle models as defined in Meshkat et al. (2015):
Definition 3.2
(Identifiable cycle models) We say a model with G strongly connected is an identifiable cycle model if all of the independent monomial cycles in the model are locally identifiable.
Example 3.3
The models and where G corresponds to a chain of exchanges , etc, and H corresponds to a central compartment given by compartment 1 and exchanges , etc, are identifiable cycle models due to Theorem 5.13 of Meshkat and Sullivant (2014). The first model is commonly called a catenary model and the second model is called a mammillary model (DiStefano 2015).
It was shown in Meshkat and Sullivant (2014) that a sufficient condition for a model to be an identifiable cycle model is that the dimension of the image of the coefficient map is . We now define the main object of interest in this paper, identifiable path/cycle models and spend the rest of this section forming analogous sufficient conditions on the dimension of the image of the coefficient map.
Definition 3.4
(Identifiable path/cycle models) We say a model is an identifiable path/cycle model if all of the independent monomial cycles and monomial paths from input to output in the model are locally identifiable and each parameter is contained in such a cycle or path.
Remark
As identifiable path/cycle models require all of the cycles and paths from input to output in the model to be identifiable, it only makes sense to consider models that are connected and every edge is contained in a cycle or path from input to output, i.e. strongly input–output connected. Otherwise, an edge that is not contained in a cycle or path from input to output will not appear in the coefficient map.
Example 3.5
(Continuation of Example 2.1) The model with G given by the graph { } is an identifiable path/cycle model, with identifiable paths and cycles given by . This can be demonstrated by writing each path and cycle as a function of the coefficients , e.g., using Groebner Bases. Further justification will come from Theorem 3.13.
For models with leaks from every compartment, the dimension of the image of the coefficient map is bounded above by the number of independent paths and cycles in the graph from Proposition 2.19. We now determine what this number is. We first show that when G is output connectable for the case of single output, there are independent directed paths and undirected cycles. We then examine the case where G is strongly input–output connected so that the indicator vectors for the independent directed paths and directed cycles in the graph correspond to 0/1 vectors. We show that this number of independent paths and cycles is equal to .
We define the |V| by |E| incidence matrix E(G) as:
| 7 |
In other words, E(G) has column vectors corresponding to the edges with a 1 in the jth row, in the kth row, and 0 otherwise. We define the indicator vector of a directed cycle C as the vector such that if and if , where is the set of edges associated to the directed cycle C.
We can also define the indicator vector of an undirected cycle with associated directed cycle C (reversing arrows to all point in the same direction) as the vector such that if , if , and if , where is the set of edges associated to the directed cycle C and corresponds to an edge s going in the opposite direction. In other words, if s corresponds to , then corresponds to .
The rank of the directed incidence matrix is well known:
Proposition 3.6
(Proposition 4.3 of Biggs 1993) Let G be a graph with |V| vertices, |E| edges, and l connected components. Then the rank of E(G) is . Thus, the dimension of the kernel of E(G) is .
We state one final result from Meshkat and Sullivant (2014); Meshkat et al. (2015), which shows that the kernel of E(G) can be written in terms of directed cycles when G is strongly connected, thus the indicator vectors are 0/1 vectors:
Proposition 3.7
(Proposition 4 of Meshkat et al. 2015) Let G be a strongly connected graph. Then a set of linearly independent indicator vectors of directed cycles form a basis for the kernel of E(G).
In other words, this proposition shows that the space of all undirected cycles can be generated by the space of all directed cycles when G is strongly connected. We now prove a similar result in terms of cycles and paths from input to output when G is strongly input–output connected.
Proposition 3.8
Let represent a linear compartmental model with G strongly input–output connected. Then the space of all directed paths and undirected cycles can be generated by the space of all directed paths and directed cycles and vice versa.
Proof
Let B have as its columns the indicator vectors of all directed paths and undirected cycles. We show that, for every undirected cycle, we can add a positive integer multiple of a directed path vector or directed cycle vector to obtain either a directed cycle or directed path from input to output. Since G is strongly input–output connected, every edge is in either a cycle or path from input to output. For every edge with a negative entry in the indicator vector of an undirected cycle, that edge either belongs to a cycle or path from input to output. If it belongs to a path from input to output, one can add a positive multiple of the path to the undirected cycle to achieve only non-negative entries corresponding to a closed path or path from input to output. If it does not belong to a path from input to output, then it belongs to a directed cycle. Thus one can add a positive multiple of the directed cycle to the undirected cycle to achieve only non-negative entries corresponding to a closed path or path from input to output. In either case, this corresponds to a multigraph with the property that the indegree of each vertex equals the outdegree of each vertex except possibly at input and output vertices. Cycles can be removed so that the result is a cycle or a path from input to output.
Lemma 3.9
Let represent a linear compartmental model with G output connectable if or G strongly input–output connected otherwise. Then the number of independent undirected cycles and directed paths from input to output is .
Proof
Let the matrix B have as columns the indicator vectors of the undirected cycles and directed paths from input to output vertices. Since G is either output connectable in the single output case or strongly input–output connected, then G is certainly connected, and thus there are undirected cycles that form a basis for the kernel of E(G) and there must be at least one path from input to output, so B is certainly not the zero matrix. Form the product E(G)B. If column k of B corresponds to a cycle, then column k of E(G)B will be zero, and if column k of B corresponds to a path from input in i and output in j, then column k of E(G)B will have a 1 in row i and a in row j.
Remove the zero columns and duplicate columns (which occur when there is more than one path from an input to an output) and zero rows from this matrix E(G)B and call the resulting matrix M. We claim M is the incidence matrix of the graph where there are vertices corresponding to each input/output compartment and there is a directed edge from an input compartment to an output compartment if and only if there is a path from the corresponding input to the corresponding output in the graph G. Call this graph . Note that there are only vertices in because we deleted zero rows from the matrix E(G)B to obtain the matrix M, thus deleting vertices that do not correspond to inputs or outputs. This graph must be connected because we assumed G is output connectable in the single output case and strongly input–output connected otherwise. Since the rank of the incidence matrix for a connected graph is the number of vertices minus one, this means the rank of E(G)B is .
Since the rank of E(G)B is equal to the rank of B minus the dimension of the column space of B intersected with the kernel of E(G), which is exactly because B is generated by paths and undirected cycles and a basis for the kernel of E(G) is given by undirected cycles, then this means the rank of B is exactly . Adding the |V| self-cycles, we obtain that the dimension of the path/cycle map is .
Corollary 3.10
Let represent a linear compartmental model with G strongly input–output connected. Then the number of independent directed cycles and directed paths from input to output is .
Proof
If , strongly input–output connected implies output connectable and if , we have strongly input–output connected. The statement follows from Lemma 3.9 and Proposition 3.8 to achieve a set of independent directed cycles and directed paths from input to output.
We now show that the dimension of the image of the coefficient map is bounded above by the number of independent paths and cycles. We will add the important assumption of either or so that the number of distinct input–output pairs equals , described in the Remark below. For we can assume G is strongly input–output connected as stated in Corollary 3.10, but for the case of we will assume G is strongly connected in order to ensure the input–output equations are irreducible as shown in Sect. 2. For the special case where and , we note that strongly input–output connected reduces to strongly connected. For the special case where and , then strongly input–output connected is sufficient in what follows, i.e., we can take the weaker of the two conditions strongly input–output connected and strongly connected.
Lemma 3.11
Let represent a linear compartmental model. Assume that either G is strongly input–output connected and or G is strongly connected and . The dimension of the image of the coefficient map is bounded above by .
Proof
The coefficient map factors through the cycles, self-cycles, and paths from input to output from Proposition 2.19. By Corollary 3.10, the number of independent paths and cycles is . Thus the dimension of the image of the coefficient map is bounded above by .
Remark
We require or so that there are either or distinct input–output pairs, respectively, which equals , the rank of E(G)B in the proof of Lemma 3.9. Example 3.12 demonstrates this.
Example 3.12
The model shown in Fig. 2 where G is given by { } has independent paths and cycles but the coefficient map factors over 11 paths and cycles given by , , , , , , , , , , . The problem here is that there are only 10 parameters, but we are attempting to factor over 11 paths and cycles. This is why we require or . However, this is a sufficient condition but not a necessary condition, as the model also shown in Fig. 2 where is given by { } has independent paths and cycles given by , , , , , , , , , , , and the coefficient map factors over these as well.
Fig. 2.

The models and from Example 3.12
Remark
Notice that in Lemma 3.11, we assume that G is either strongly input–output connected or strongly connected. We have just shown that the expected dimension in this case is . But a natural question that arises is, what if we do not assume this connectedness condition on G? Clearly, we still have that the coefficient map factors through cycles and paths. In this case there will be at most independent cycles and paths that appear in the coefficient map, i.e., the coefficient map may factor over fewer than independent paths and cycles. See Sect. 5.
This gives us the following theorem:
Theorem 3.13
Let represent a linear compartmental model with either G strongly input–output connected and or G strongly connected and . If the image of the coefficient map has dimension , then the model is an identifiable path/cycle model.
Proof
By Lemma 3.11, the dimension of the image of the coefficient map is bounded above by , which is also the number of independent paths and cycles. Recall a function f is locally identifiable if there is a finitely multivalued function such that . Let be the path/cycle map from Eq. 3. Since factors over paths and cycles, then there exists a function as defined in Definition 2.20 such that . If the dimension of the image of the coefficient map is precisely , then this function is locally invertible with and thus . Thus the paths and cycles are identifiable.
Example 3.14
(Continuation of Example 2.1) The model from Example 2.1 can be shown to have dimension of the image of the coefficient map equal to . Thus there are 7 identifiable paths and cycles given by the monomials .
Necessary Condition for Number of Edges and the Edge Inequality
Proposition 3.15
Let represent a linear compartmental model with either G strongly input–output connected and or G strongly connected and . If is an identifiable path/cycle model, then the expected number of coefficients.
Proof
We have that the dimension of the image of the coefficient map is bounded above by the expected number of coefficients, as it cannot exceed the number of coefficients. Thus the expected number of coefficients.
Definition 3.16
(Edge inequality) We say that a model has a number of edges given by the edge inequality if the number of edges |E| satisfies the expected number of coefficients.
We can also show that the property of being strongly input–output connected is a necessary condition for having expected dimension in the case of maximal number of edges.
Proposition 3.17
Let with G output connectable and represent a linear compartmental model for which the edge inequality is an equality with expected dimension . Then the graph must be strongly input–output connected.
Proof
Assume the coefficient map has expected dimension with the maximal number of edges. This means the expected dimension is the number of coefficients. If the graph is not connected with every edge in a cycle or path from input to output, then there are parameters that do not appear in the coefficient map, and thus the coefficient map factors over fewer than independent paths and cycles. But this contradicts having expected dimension .
Remark
If there are fewer edges, we can still achieve expected dimension without this condition of strongly input–output connected. In other words, being strongly input–output connected is a sufficient but not necessary condition to achieve independent cycles and paths in the coefficient map. See Sect. 5.
Obtaining Identifiability by Removing Leaks
In this section, we show that removing all leaks except leaks from input/output compartments results in identifiability, much like the results in Meshkat et al. (2015). We will follow the same proof. Recall that for an identifiable path/cycle model, we have the coefficient map . Let be the path/cycle map from Eq. 3, that is is a cycle or path from input to output of G). Then Proposition 2.19 tells us that c factors through without loss of dimension. Thus where is defined as in Definition 2.20 and the dimension of the image of c equals the dimension of the image of .
Passing from a model (G, In, Out, V) to a model (G, In, Out, Leak) such that amounts to restricting the parameter space to a linear subspace of dimension and we would like the image of under the coefficient map c to have dimension . Since c factors through the path/cycle map it suffices to prove that the image of under has dimension .
Lemma 3.18
Let be a directed graph with corresponding identifiable path/cycle model (G, In, Out, V). Assume that either G is strongly input–output connected and or G is strongly connected and . Consider a model (G, In, Out, L) where . Let denote the path/cycle map. Let be the linear space satisfying
Then the dimension of the image of under the map is .
Proof
Since is a linear space, we just consider the natural map from which maps to the path/cycle space. To show that the dimension of the image of this map is correct, we consider the Jacobian of this map and show that it has full rank.
Note that the rows corresponding to the |L| self-cycles are linearly independent so we focus on the by |E| submatrix ignoring those rows and columns, which we will call J. Arrange the matrix so that the first rows correspond to the paths and cycles of G and the last rows correspond to the non-leak diagonal elements. Let the first rows be called A and the last rows be called B. Clearly the rows of A are linearly independent by Lemma 3.11. The rows of B are linearly independent since they are in triangular form since each involves distinct parameters. To show that the full set of rows are linearly independent, we need only show that the row space of A and the row space of B intersect only in the origin.
To prove that J generically has maximal possible rank, it is enough to show that there is some point where the evaluation of J at said point yields the maximal rank. We choose the point where we set all the edge parameters for all . This specialization yields that the row space of A is exactly the path/cycle space of the graph G, i.e., all of the weightings on the edges of the graph where the indegree equals the outdegree of every vertex in a cycle and every vertex except the first and last in a path from input to output. Also, we have that the matrix B which has dimension , which consists of the rows corresponding to the vertices in , and the (negated) row corresponding to vertex i has a one for an edge if and only if , with all other entries zero.
Since A spans the path/cycle space of G, each element in the row space of A corresponds to a weighting on the edges of G where the total weight of all incoming edges at a vertex i equals the total weight of all outgoing edges at vertex i except at input or output vertices . On the other hand, we claim that the only vector in the row span of B with the same property is the zero vector. To show this, let be the row vector associated to some vertex i. Note that a vector in the row span of B will have zero weight on any of the outgoing edges of vertices in .
In order for the indegree to equal the outdegree, we would need to include a with an edge pointing toward vertex i. Continuing in this way, we can only stop when we have included an input or output vertex since the indegree need not equal the outdegree for those vertices. However, this contradicts the fact that a vector in the row span of B will have zero weight on any of the outgoing edges of vertices in .
Theorem 3.19
(Removing leaks) Let represent a linear compartmental model. Assume that either G is strongly input–output connected and or G is strongly connected and . Assume it is an identifiable path/cycle model. Then, the corresponding model where for any such L has expected dimension. In particular, if , then is locally identifiable.
Proof
By Lemma 3.18 and the comments preceding it we know that the image of the restricted parameter space under the path/cycle map has dimension , which is equal to the dimension of the image of the full parameter space under the path/cycle map. Since, for an identifiable path/cycle model, the dimension of the image of the coefficient map c is , this must be the same for the restricted model. In particular, if , then the model has parameters, hence it is locally identifiable.
Example 3.20
(Continuation of Example 2.1) The model obtained from the model in Example 2.1 by removing two leaks and leaving the leaks in the input and output compartments is locally identifiable (see Fig. 3).
Fig. 3.

Graph for model corresponding to in Example 3.20
Remark
We note that, while is sufficient in Theorem 3.19, it is certainly not necessary, as there are other possible configurations of leaks that also result in identifiability. The next example demonstrates this.
Example 3.21
Consider the model shown in Fig. 4 where and G is given by the edges { }. The identifiable models are the ones where and the unidentifiable models have .
Fig. 4.

On the left is the graph corresponding to with leak set , and on the right is the graph corresponding to with leak set from Example 3.21
We note that while appears to be sufficient for identifiability in this model, we can consider another model given by also shown in Fig. 4 where and is given by the edges { }. The identifiable models are the ones where and the unidentifiable models have . This shows the pattern of identifiability depends on the graph structure itself and not just the placement of inputs and outputs.
Sufficient Condition for Identifiable Path/Cycle Model
We now give a sufficient condition for a model to be an identifiable path/cycle model with 1 input and 1 output. This sufficient condition is analogous to the sufficient condition from Meshkat and Sullivant (2014) of inductively strongly connected for models with input and output in the same compartment. In fact, Theorem 3.22 reduces to Theorem 5.13 of Meshkat and Sullivant (2014) if the input and output compartments are the same.
Theorem 3.22
Let represent a linear compartmental model with G strongly input–output connected and . If has no path from compartment j to compartment i but becomes inductively strongly connected if an edge from compartment j to compartment i is added, then is an identifiable path/cycle model.
Before we prove Theorem 3.22, we define a graph structure which will be useful in the proof.
Definition 3.23
(Definition 5.6 from Meshkat and Sullivant 2014) A chain of cycles is a graph H which consists of a sequence of directed cycles that are attached to each other in a chain by joining at the vertices.
Theorem 3.24
Suppose represents a linear compartmental model with strongly input–output connected and . Suppose too that has expected dimension and also that has no path from j to i and becomes inductively strongly connected if the edge from j to i is added. Then, if G is a new graph obtained from by adding a vertex n and two edges and such that G has a chain of cycles containing either i and n or j and n, then the model has expected dimension.
Recall that we can induce a weight order on a polynomial ring for some weight vector where the weight of a monomial is where . We can then define the initial forms of a polynomial f as to be the sum of all terms of f whose monomial has the highest weight with respect to said .
Now, if we define the coefficient map associated to the graph G as , then we can also consider the pull-back of said map defined as where c, d correspond to the coefficients of the left and right-hand side of the input–output equation respectively, and a corresponds to the parameters found in compartmental matrix A. Now define to be the initial parameterization defined as the parameterization with pullback where for a given weight .
Lemma 3.25
(Corollary 5.9, Meshkat and Sullivant 2014) Let be a -algebra homomorphism and a weight vector, then
We will use Lemma 3.25 in the following way. We want to compute the dimension of the image of a polynomial parametrization . We know for other reasons an upper bound d on this dimension. We have a weight vector where we can compute the dimension of the image of the polynomial parametrization , and we show it is equal to d. Then, by Lemma 3.25, we know that the dimension of the image of must be d.
Proof of Theorem 3.24
Suppose is a linear compartmental model with expected dimension such that is strongly input–output connected, . Also suppose that if we add the edge from j to i, the new graph becomes inductively strongly connected. Note that by Theorem 2.24, the input–output equation of the model has nonzero, non-monic coefficients. Let and .
Define , to be the coefficient map associated to a graph G as above with corresponding pull-back . Choose weight as follows:
Recall that for each coefficient, the corresponding polynomial function is homogeneous in terms of the parameters. Also, recall that the left-hand side coefficients are generated by cycles of the corresponding graph, while the right-hand side coefficients are generated by products of cycles of the corresponding graph along with paths from the input to output.
Applying this weight to the polynomial coefficients has the effect of removing any monomial containing a cycle which is incident to compartment n, in all coefficients except for the lowest order terms in both c and d. Note that in the case of and , each of the monomials in the sum will have a cycle incident to n, meaning that each of them has the same weight.
More explicitly, in terms of the pull-back maps we have in all cases that
Thus, agrees with everywhere except for the highest order coefficients on either side of the input–output equation, in which case matches . This implies that the Jacobian matrix corresponding to defined as (whose generic rank yields the dimension of the image of the map), has the form
where is the Jacobian matrix of and C is the matrix
where l and k are the nodes to which the added node n has an edge to and from respectively.
Note that we assume that the model corresponding to has expected dimension, hence has rank . Since is lower block triangular, to show that it has rank , we need only show that C has generic rank 2.
Let H be a chain of cycles in G defined as in order such that is a cycle containing either the input or the output and is the cycle containing the node n. Also, define to be one of the shortest paths from i to j.
Now we will choose entries for the matrix A such that the matrix C has rank 2. First, let all diagonal elements of A be 1, i.e., for all . Also, let for all edges . For all edges in H, for each cycle , choose the edge weights so that the product of edges’ weights is equal to , that is so that the product of the edges in the cycle is equivalent to the sign of the cycle. For , choose edge weights so that the product of the edges’ weights is also equal to
First, consider the entry . The only nonzero monomials appearing here will arise from taking products of the cycles , since the cycle cannot be involved, as we only consider the elements of the sum of with as a factor. Also, is not a cycle, hence cannot be part of any of the . Since each cycle touches its two neighboring cycles, and no other cycles, and in the expansion we expand over all products of nontouching cycles that cover all n vertices, we get that the number of monomials will equal the number of subsets of with no adjacent elements. By Lemma 3.26, this is exactly .
Now consider the entry , which will arise from taking products of , since we must have , hence cannot have , and by similar reasoning above cannot have . Thus, we get that the number of monomials are the number of nonadjacent subsets of , hence we get the Fibonacci number .
In the case of the entry or equivalently , we must use the cycle prohibiting us from using , and again must not use . This yields that the number of monomials is the number of nonadjacent subsets of , i.e., the Fibonacci number .
Finally, when considering entry or equivalently , we must use and cycles , hence cannot use cycles or . This means that the number of monomials will be the number of nonadjacent subsets of , i.e., the Fibonacci number .
Thus, the submatrix C will have the form
The classical identity of Fibonacci number yields that this matrix has full rank. hence, the Jacobian of has full rank.
Note that the upper bound for the number of coefficients of the input–output equation, i.e., the upper bound on the dimension of the image of the coefficient map is via Theorem 2.24. Thus, because by Lemma 3.25, and is bounded above by , we have that as desired.
Lemma 3.26
The number of subsets S of such that S contains no pair of adjacent numbers is the -nd Fibonacci number, which satisfies the recurrence , , and .
We can now prove Theorem 3.22.
Proof of Theorem 3.22
By Theorem 3.24 and the inductive nature of inductively strongly connected graphs, it suffices to show that every inductively strongly connected graph beginning with a cycle between i and j has a chain of cycles containing the vertices i and n (or, analogously, a chain of cycles containing the vertices j and n).
We prove this by induction on n. Since G is inductively strongly connected if the edge from j to i is added, there is a nontrivial cycle c that passes through the vertex n. If c contains i, we are done. Otherwise, let q be the smallest vertex appearing in c, and let be the induced subgraph on . By induction, has a chain of cycles containing i and q. Attaching c to H gives a chain of cycles in G containing i and n. A similar argument can be applied to give a chain of cycles in G containing j and n.
Theorem 3.22 is not only useful as a sufficient condition for an identifiable path/cycle model, but it is also useful as a means to start with an identifiable path/cycle model and then remove leaks to obtain identifiability:
Corollary 3.27
Let represent a linear compartmental model with G strongly input–output connected and . If has no path from compartment j to compartment i but becomes inductively strongly connected if the edge from compartment j to compartment i is added, then
is locally identifiable, i.e. removing all but two leaks in the input/output compartments.
Proof
Example 3.28
The model shown in Fig. 5 with G given by the edges is an identifiable path/cycle model by Theorem 3.22. Thus, the model where we remove all but two leaks from the input/output compartments is locally identifiable.
Fig. 5.

(Color Figure Online) The model corresponds to the above graph with all four leaks, while the graph has the same graph with only the black leaks, that is leaks in compartments 1 and 2, all from Example 3.28
We also note that, as cycles with input/output in the same compartment were shown to have expected dimension in Meshkat and Sullivant (2014) (see Proposition 5.4), paths from input to output can be shown to have expected dimension as well.
Proposition 3.29
Let be a linear compartmental model with G given by a path from input 1 to output |V| with edges. Then is an identifiable path/cycle model and the model is locally identifiable.
Proof
Let . Assume is a linear compartmental model with G given by a path from input 1 to output n with edges. Recall the coefficients on the left hand side of the input–output equation are given by the characteristic polynomial of A, which is:
Since the roots of a polynomial can be determined from its coefficients, then all of are locally identifiable. Since the degree of the highest-order term on the right-hand side of the input–output equation is by Lemma 2.23, this reduces to zero so the right-hand side is . Thus the monomial path is identifiable. This means the dimension of the image of the coefficient map is which is the number of paths and cycles, thus the model is an identifiable path/cycle model. By Theorem 3.19, the model is locally identifiable.
In Meshkat and Sullivant (2014), it was shown in Proposition 5.5 that if a model has expected dimension, then the model also has expected dimension, where is the new graph obtained from G by adding a new vertex and an exchange and making the new input–output node. We show an analogous result now:
Proposition 3.30
Let be a linear compartmental model that has expected dimension where and . Let be a new graph obtained from G by adding a set of new vertices and a set of edges and making the new input node. Then the model also has expected dimension.
Proof
Let A be the full matrix associated to the graph where the first k rows and k columns correspond to the added path from compartment to compartment 1, be the matrix where the first k rows and first k columns have been deleted (and, hence associated to the graph G), and let be the edges of the graph G. We assume that the dimension of the image of the map c associated to the model is , and we want to show that for the model we get , as we are adding k new edges.
The input–output equation for the model is:
where and . The input–output equation for the model is:
For notational ease, write as , , , and . Note that p is the vector of new self-cycles and q can be interpreted as the added monomial path from compartment to compartment 1. We can write as:
and we can write as
Thus can be written as (up to a minus sign):
| 8 |
where are the k elementary symmetric polynomials in the parameter vector p. Here we assumed , but an analogous formula follows for the case of .
We will refer to the non-constant coefficients of as . Note that these are by assumption identifiable. Likewise,
| 9 |
We will refer to the coefficients of as . Note that these are by assumption identifiable. We must now show that if the mapping given by has expected dimension, then the new mapping also has expected dimension. Since the parameters in p are roots of the polynomial , then this means these parameters can be written in terms of the coefficients , which are identifiable, and thus the parameters in p are identifiable. This means each of the elementary symmetric polynomials are identifiable. Since and are identifiable from Eq. 8, then can be recovered from the first coefficient from Eq. 8. Likewise, since , , , and are identifiable, then can be recovered from the second coefficient of Eq. 8. Continuing in this fashion, we can recover , i.e. are identifiable. This means the dimension of the image of is k more than the dimension of the image of . Since the coefficients of Eq. 9 are just the coefficients scaled by q, which contains disjoint parameters from the parameters in the coefficients , then the dimension of the image of is the same as the dimension of the image of . The parameters in p do not appear in , thus combining the maps and we have that the dimension of the image of the new map must be k more than the dimension of the image of due to the identifiability of the parameters in p. Thus the model has dimension of the image of the coefficient map equal to , i.e., k more than .
Classification of All Identifiable Models that are Strongly Input–Output Connected with 1 Output or Strongly Connected with 1 Input and Leaks in Input/Output Compartments
Theorem 4.1 gives necessary conditions for strongly input–output connected models with 1 output or strongly connected models with 1 input with leaks in input/output compartments to be identifiable, namely that they must be identifiable path/cycle models when all the leaks are added to the model.
Necessary Conditions for Identifiability
Theorem 4.1
(Adding leaks) Let represent a linear compartmental model with and either G strongly input–output connected and or G strongly connected and which we assume has expected dimension, i.e., has dimension of the image of the coefficient map equal to . Then, the corresponding model with an additional leak also has expected dimension. Thus the model where and also has expected dimension.
Proof
Suppose is a linear compartmental model with either G strongly input–output connected and or G strongly connected and and which we assume has expected dimension, i.e., has dimension of the image of the coefficient map equal to .
Note that because we assume that has expected dimension and |L| leaks, this implies that the Jacobian of the coefficient map has the expected number of coefficients as the number of rows and columns with full rank . Note too that the addition of the parameters from the leaks being added to the model will not increase the number of coefficients in the resulting input–output equation, as the number of coefficients is the maximal amount by Theorem 2.24.
Therefore, the Jacobian of the coefficient map of the model generated by forcing every compartment in to have a leak, has the same number of rows but now columns. The dimension of the image of the coefficient map is bounded above by the number of cycles and paths when there are |V| leaks, which is . Thus adding leaks to a |L|-leak model cannot increase the dimension of the image of the coefficient map above if it has already achieved that dimension with |L| leaks.
Note then that if we consider the specialization generated by substituting zero for every added leak, and consider the submatrix of said Jacobian with expected number of coefficients as the number of rows and columns generated by the columns corresponding to the edges and leaks in L, we have exactly the Jacobian of the coefficient map of , which we know is full rank. Therefore, we have that the Jacobian of the coefficient map of is also full rank, implying that the model has expected dimension.
The same argument applies for adding any number of leaks up to |V| total leaks.
Remark
We note that while Theorems 3.19 and 4.1 assume opposite operations of adding or subtracting leaks, we have the condition in Theorem 3.19 that , while in Theorem 4.1 only is assumed, i.e., only the number and not the placement of leaks matters.
Necessary and Sufficient Conditions for Identifiability
Combining Theorem 4.1 with Theorem 3.19, we now come to the main result of this section and obtain the following necessary and sufficient conditions for identifiable models:
Corollary 4.2
Let represent a linear compartmental model with G strongly connected and let be the corresponding model with a leak in every compartment. is locally identifiable if and only if is an identifiable cycle model.
Proof
This follows from combining Theorems 4.1 and 1 of Meshkat et al. (2015) (also written as Theorem 9.3).
Corollary 4.3
Let represent a linear compartmental model with and and assume that either G is strongly input–output connected and or G is strongly connected and . Let be the corresponding model with a leak in every compartment. is locally identifiable if and only if is an identifiable path/cycle model.
Proof
Remark
Corollary 4.3 gives us a complete classification of all identifiable models that are strongly input–output connected with 1 output or strongly connected with 1 input and leaks in input/output compartments. We note that this class of models has the very special property of being dimension-preserving when leaks are added or subtracted from non-input/output compartments, up to a point. To demonstrate that this special dimension-preserving property when removing leaks is not always the case, we revisit Example 3.21.
Example 4.4
Recall Example 3.21 where we had the model where and G is given by the edges . The identifiable models are the ones where and the unidentifiable models have , so removing leaks from output compartments is not dimension-preserving for this example.
In the next section, we give a conjecture about removing leaks from non-input/output compartments in the general output connectable case for models with one output.
Other Expected Dimension Results
We first show that there are at most independent paths and cycles appearing in the coefficient map c if we relax the condition of strongly input–output connected to output connectable instead:
Proposition 5.1
Let represent a linear compartmental model with G output connectable. Assume that . Then there are at most independent paths and cycles in the coefficient map c.
Proof
If G is strongly input–output connected (or strongly connected) then we have already shown in Lemma 3.11 that the coefficient map factors through independent paths and cycles. If G is output connectable but not strongly input–output connected, then there may be fewer than independent directed paths and directed cycles because there are independent directed paths and undirected cycles by Lemma 3.9. Since the coefficient map factors over the directed paths and directed cycles, then there are at most independent paths and cycles in the coefficient map c.
This means the expected dimension is now the number of independent directed paths and directed cycles, which may be less than .
We can relax the connectedness conditions in Theorem 4.1 to output connectable instead and still obtain statements about expected dimension, although now the models with a full set of leaks are not identifiable path/cycle models.
Theorem 5.2
Let represent a linear compartmental model with G output connectable and and which we assume has dimension of the image of the coefficient map equal to . Then, the corresponding model with a leak in every compartment also has dimension of the image of the coefficient map equal to .
Proof
The proofs mirrors the one in Theorem 4.1.
Example 5.3
The model shown in Fig. 6 where G is the graph given by is output connectable and has dimension of the image of the coefficient map equal to , thus it is locally identifiable. By Theorem 5.2, the model also has dimension of the image of the coefficient map equal to . Thus the identifiable functions are . Note that it is not an identifiable path/cycle model because the parameter does not appear in the coefficient map (as it is not strongly input–output connected).
Fig. 6.

The graph corresponding to model from Example 5.3
This result shows that if a model has its dimension of the image of the coefficient map is equal to , then adding leaks alone maintains the dimension of the image of the coefficient map. This result is perhaps more useful for its contrapositive, i.e., if a model with leaks from every compartment does not have dimension for c, then no amount of removing leaks up to a certain point () can attain identifiability.
Corollary 5.4
Let represent a linear compartmental model with G output connectable and which does not have the dimension of the image of the coefficient map equal to . Then, the corresponding model with is not locally identifiable.
Example 5.5
The model shown in Fig. 7 where G is the graph given by is output connectable and has dimension of the image of the coefficient map not equal to , but equal to 4 instead. By Corollary 5.4, the model where is thus not locally identifiable.
Fig. 7.

The graph corresponding to model from Example 5.5
These results also give us insight into models that have dimension of the image of the coefficient map equal to but are not strongly input–output connected. We can show that the self-cycles are always identifiable:
Theorem 5.6
Let represent a linear compartmental model with G output connectable and . If has dimension of the image of the coefficient map equal to , then the self-cycles are locally identifiable.
Proof
Since the coefficient map always factors over , this means the self-cycles are locally identifiable.
Finally, we give a conjecture on removing leaks from non-input/output compartments for output connectable models and prove this conjecture in a special case:
Conjecture 5.7
Let represent a linear compartmental model. Assume that G is output connectable and . Assume the dimension of the image of the coefficient map is k. Then, the corresponding model where also has dimension of the image of its coefficient map as k.
In other words, we conjecture that this property of being dimension-preserving applies to all output connectable models. However, if the dimension to begin with is not maximal, the dimension-preserving property will not lead to identifiability. We give a proof of this conjecture in the special case where the dimension of the image of the coefficient map is :
Theorem 5.8
Let represent a linear compartmental model with G output connectable and . If has dimension of the image of the coefficient map equal to , then the the corresponding model where also has dimension of the image of its coefficient map as . In particular, if , then is locally identifiable.
To prove Theorem 5.8, we give a variation of Lemma 3.18 and then a variation of the proof of Theorem 3.19.
Lemma 5.9
Let be a directed graph with corresponding model (G, In, Out, V). Assume that G is output connectable and . Consider a model (G, In, Out, L) where . Let denote the path/cycle map. Let be the linear space satisfying
If the dimension of the image of is , then the dimension of the image of under the map is .
Proof
Removing the assumption of strongly input–output connected from Lemma 3.18 means that we cannot guarantee there are independent directed paths and directed cycles. However, if we assume the dimension of the image of is , then the rest of the proof follows that of Lemma 3.18.
Proof of Theorem 5.8
By Lemma 5.9 we know that the image of the restricted parameter space under the path/cycle map has dimension , which is equal to the dimension of the image of the full parameter space under the path/cycle map. Since the dimension of the image of the coefficient map c is , this must be the same for the restricted model. In particular, if , then the model has parameters, hence it is locally identifiable.
Example 5.10
(Example 5.3 revisited) The model where G is the graph given by is output connectable and has dimension of the image of the coefficient map equal to , thus the model given by is locally identifiable.
Combining Theorems 5.2 and 5.8, we get the following necessary and sufficient conditions:
Corollary 5.11
Let represent a linear compartmental model with and and assume that G is output connectable and . Let be the corresponding model with a leak in every compartment. is locally identifiable if and only if has dimension of the image of the coeffiicent map as .
Remark
Corollary 5.11 shows that this dimension-preserving property when adding or removing leaks from non-input/output compartments also holds in the output connectable case when the dimension of the image of the coefficient map is .
Necessary Conditions for Identifiable Models Based on Model Structure
Outside of checking the conditions in Theorem 3.22, i.e., if a model is an inductively strongly connected model if edges from output to input are added, checking if a model is an identifiable path/cycle model amounts to checking the dimension of the image of the coefficient map, and thus cannot be ascertained by simply examining the graph of the model. However, it is possible to provide necessary conditions for identifiable models and identifiable path/cycle models based on the graph itself, and thus can be used to rule out identifiability.
Theorem 6.1
Let represent a linear compartmental model with either G strongly input–output connected and or G strongly connected and . If , then is unidentifiable.
Proof
If the number of parameters , where is the maximal dimension by Lemma 3.11, then the model is unidentifiable. This reduces to .
We can now make some statements about necessary conditions in the case of the maximal amount of edges.
Theorem 6.2
Let represent a linear compartmental model with G strongly connected and edges. If is locally identifiable (or equivalently, is an identifiable cycle model), it must have an exchange.
Proof
From Proposition 5.3 of Meshkat and Sullivant (2014), we know that G must have an exchange in order for to be an identifiable cycle model. Thus, if G does not have an exchange, is not an identifiable cycle model and thus is not an identifiable model.
Theorem 6.3
Let represent a linear compartmental model with G strongly input–output connected, , and edges and . If is locally identifiable (or equivalently, is an identifiable path/cycle model), it must have an edge from i to j (i.e., a path of length ).
Proof
If there is no path from i to j, then the coefficient of the highest-order term on the right-hand side of the input–output equation is zero and there would be fewer than coefficients. But there are edges, so if is locally identifiable, then it has expected dimension , which is impossible if there are fewer than coefficients.
We can also have an analogous necessary condition in the case of fewer than edges.
Theorem 6.4
Let with represent a linear compartmental model with G strongly input–output connected and edges where and . If is locally identifiable (or equivalently, is an identifiable path/cycle model), it must have a path from i to j of length at most k.
Proof
The coefficient of the highest-order term on the right-hand side of the input–output equation is a sum of shortest paths from input to output of length and this must be nonzero for to have expected dimension. Since there are nonzero coefficients and expected dimension is , this means k is at least . So there must be a path from i to j of length at most k.
Examples
We now provide some real-world examples that fall into the categories of models considered in this paper. In particular, we obtain identifiability or unidentifiability results in Examples 7.1 and 7.2 without any symbolic computation, i.e., purely based on the graph structure alone.
Example 7.1
Consider Example 13.6 from DiStefano (2015) on HIV vaccine development (Part 1). Three models are considered that fall into the category of path models with leaks from every compartment as in Proposition 3.29, shown in Fig. 8. The top model corresponds to Experiment 1, the middle model corresponds to Experiment 2, and the bottom model corresponds to Experiment 3. It is clear that the Experiment 1 model is identifiable. Using Proposition 3.29, we can easily obtain that the Experiment 2 model is identifiable. By Proposition 3.29, the Experiment 3 model has expected dimension and is thus unidentifiable with identifiable functions given by the paths and cycles (where the “self-cycles” have been expanded out): , , , .
Fig. 8.

The three models considered in Example 7.1
Example 7.2
Consider Example 13.16 from DiStefano (2015) on HIV vaccine development (Part 3). The models from Example 7.1 are amended by adding on exchanges to the output compartments, shown in Fig. 9, and the numbering scheme has changed to agree with DiStefano (2015). The Experiment 1 model is identifiable by Theorem 3.19 as it is an identifiable cycle model (it is inductively strongly connected) with a single leak in the input/output compartment. The Experiment 2 model is almost inductively strongly connected and thus is identifiable by Corollary 3.27. A variation on the Experiment 3 model with leaks from all compartments can be shown to be an identifiable path/cycle model by a direct calculation, thus removing the leak from compartment 9 retains the dimension by Theorem 3.19, which means the model in Experiment 3 is unidentifiable. Alternatively, one can apply Proposition 3.30 to a variation on the model in Experiment 2 with leaks from every compartment (which has expected dimension) and thus obtain that the variation on the model in Experiment 3 with leaks from every compartment also has expected dimension. Now removing the leak from compartment 9 retains the dimension by Theorem 3.19, and thus the model in Experiment 3 is unidentifiable. The identifiable functions are given by the paths and cycles (where the “self-cycles” have been expanded out): , , , , , .
Fig. 9.

The three models considered in Example 7.2
Computations
In the table below we outline the number of graphs with n vertices and m edges that have the expected dimension with input in i and output in j, assuming leaks from every compartment:
| (n, m) | Total | Strongly connected | Strongly input–output connected | Strongly input–output connected | ||||
|---|---|---|---|---|---|---|---|---|
| (3,2) | 15 | NA | NA | NA | 1 | 1 | 3 | 3 |
| (3,3) | 20 | 2 | 2 | 2 | 7 | 4 | 10 | 8 |
| (3,4) | 15 | 9 | 7 | 3 | 11 | NA | 12 | 4 |
| (4,3) | 220 | NA | NA | NA | 2 | 2 | 7 | 7 |
| (4,4) | 495 | 6 | 6 | 6 | 37 | 25 | 72 | 59 |
| (4,5) | 792 | 84 | 54 | 62 | 193 | 70 | 267 | 167 |
| (4,6) | 924 | 316 | 166 | 118 | 445 | NA | 518 | 184 |
| (4,7) | 792 | 492 | NA | 86 | 565 | NA | 603 | 96 |
| (5,4) | 4845 | NA | NA | NA | 6 | 6 | 24 | 24 |
| (5,5) | 15,504 | 24 | 24 | 24 | 222 | 162 | 518 | 432 |
| (5,6) | 38,760 | 720 | 576 | 600 | 2470 | 1288 | 4130 | 1110 |
| (5,7) | 77,520 | 6440 | 4052 | 4030 | 13,004 | 3154 | 17,708 | 1552 |
| (5,8) | 125,970 | 26,875 | 9565 | 10,336 | 40,126 | NA | 48,277 | 17,113 |
| (5,9) | 167,960 | 65,280 | NA | 15,984 | 82,159 | NA | 91,658 | 20,272 |
| (5,10) | 184,756 | 105,566 | NA | 9841 | 120,202 | NA | 128,003 | 10,689 |
The number of strongly connected graphs and the number of strongly input–output connected graphs with different input/output configurations is noted.
We then computed the number of models with expected dimension for 4 notable cases: the case of identical single input and single output with a strongly connected graph G (as in Meshkat and Sullivant 2014), the case of single input but multiple outputs with a strongly connected graph G, the case of distinct single input and single output with a strongly input–output connected graph G, and the case of single output but multiple inputs with a strongly input–output connected graph G. Due to restrictions on the number of edges, not all cases are possible, and those are labeled “NA.”
Construction of Identifiable Models
In this section, we consider the special case of single input and single output in the same compartment with G strongly connected and , as in Meshkat and Sullivant (2014), i.e. with G strongly connected. Since Leak is assumed to be V and input/output are assumed to be the same vertex, we can just discuss the graph G in what follows.
In Meshkat and Sullivant (2014), Theorem 5.7 gives a way of constructing a new model with expected dimension from a smaller model with expected dimension by adding an incoming and outgoing edge to a chain of cycles (See Definition 3.23):
Theorem 9.1
(Theorem 5.7 of Meshkat and Sullivant 2014) Let be a graph that has the expected dimension with vertices. Let G be a new graph obtained from by adding a new vertex and two edges and and such that G has a chain of cycles containing both 1 and n. Then G has the expected dimension.
In Baaijens and Draisma (2016), the authors strengthened this result to allow for adding loops of any length, not just length two:
Proposition 9.2
(Proposition 4.14 of Baaijens and Draisma 2016) Let on vertices be a graph with the expected dimension. Construct from G by adding new vertices and edges , , and for where are vertices of G. Then has the expected dimension.
We will use Proposition 4.14 from Baaijens and Draisma (2016) combined with Theorem 1 of Meshkat et al. (2015) to form identifiable models. In other words, we will use Proposition 4.14 to construct identifiable cycle models and then use Theorem 1 to eliminate all but one leak to form an identifiable model. We state Theorem 1 here:
Theorem 9.3
(Theorem 1 from Meshkat et al. 2015) Let M be an identifiable cycle model. If the model is changed to have exactly one leak, then the resulting model is locally identifiable.
Algorithm 9.4
(Construction of identifiable models with and one leak )
Begin with where and .
Construct from G by adding new vertices and edges , , and for where are vertices of G and adding leaks from every new vertex.
Repeat Step 2 by starting a some vertex and ending at some vertex for and adding leaks from every new vertex.
Continue adding edges, vertices, and leaks as described in Steps 2 and 3.
Remove all leaks except one leak.
Theorem 9.5
Let be a model constructed from Algorithm 9.4. The model is identifiable.
Proof
By Proposition 9.2 the model is an identifiable cycle model and by Theorem 9.3 the model with only one leak is identifiable.
Example 9.6
The model shown in Fig. 10 where G is given by the edges is an identifiable cycle model by Proposition 9.2. Thus we can remove all leaks except one, e.g., and the resulting model is identifiable. We note that this model is not inductively strongly connected, thus Proposition 9.2 does expand upon the results in Meshkat et al. (2015) to construct identifiable models.
Fig. 10.

The graph corresponding to from Example 9.6
Note that the authors in Baaijens and Draisma (2016) only considered identifiable cycle models. We suspect there may be a similar result to Proposition 9.2 for the case of identifiable path/cycle models.
Conclusion and Future work
In this work, we have defined identifiable path/cycle models and found sufficient conditions for obtaining them. We have also formed necessary and sufficient conditions for identifiable models with certain graph properties. Most importantly, we have demonstrated this notion of maximal dimension of the image of the coefficient map in terms of the number of edges and the number of inputs and outputs and have shown that the only identifiable models with certain graph properties are models where the dimension of the image of the coefficient map is preserved to be maximal while adding or subtracting leaks up to a point. This shows that identifiable models of this form have a special dimension-preserving property even when they are no longer identifiable from added leaks. We hope that this motivates the investigation of other classes of models that have this same dimension-preserving property.
As mentioned in the introduction, one of the main uses of identifiable functions of parameters is to find identifiable reparametrizations over those functions of parameters. In Meshkat and Sullivant (2014), necessary and sufficient conditions for an identifiable scaling reparametrization were found and an algorithm to find such a scaling reparametrization was discussed for the special case of single input/single output in the same compartment, G strongly connected, and leaks from every compartment. We end this work with the following conjecture to generalize these necessary and sufficient conditions for the case of multiple inputs or multiple outputs:
Conjecture 10.1
Let represent a linear compartmental model with G strongly input–output connected and or G strongly connected and . An identifiable scaling reparametrization exists if and only if is an identifiable path/cycle model.
While the models described in Corollary 5.11 also have the dimension of the image of the coefficient map as , we note that these models will not have an identifiable scaling reparametrization since these models do not necessarily have every parameter appearing in a cycle or path from input to output. However, it may be possible to reparametrize those models another way, e.g., scaling and adding. We leave these problems of identifiable reparametrizations of models with maximal dimension of the image of their coefficient map for future work.
Acknowledgements
This project began at an AIM workshop on “Identifiability problems in systems biology,” and the authors thank AIM for providing financial support and an excellent working environment. The authors also thank Seth Sullivant, Anne Shiu, Gleb Pogudin, and two referees for their helpful comments. Cashous Bortner was partially supported by the US National Science Foundation (DMS 1615660). Nicolette Meshkat was partially supported by the Clare Boothe Luce fellowship from the Henry Luce Foundation and the US National Science Foundation (DMS 1853525).
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Baaijens JA, Draisma J. On the existence of identifiable reparametrizations for linear compartment models. SIAM J Appl Math. 2016;76(4):1577–1605. doi: 10.1137/15M1038013. [DOI] [Google Scholar]
- Bellman R, Åström KJ. On structural identifiability. Math Biosci. 1970;7(3–4):329–339. doi: 10.1016/0025-5564(70)90132-X. [DOI] [Google Scholar]
- Bellu G, Saccomani MP, Audoly S, D’Angiò L. DAISY: a new software tool to test global identifiability of biological and physiological systems. Comput Methods Prog Biol. 2007;88(1):52–61. doi: 10.1016/j.cmpb.2007.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biggs N. Algebraic graph theory. Cambridge: Cambridge University Press; 1993. [Google Scholar]
- Bressloff PC, Taylor JG. Compartmental-model response function for dendritic trees. Biol Cybern. 1993;70(2):199–207. doi: 10.1007/BF00200833. [DOI] [Google Scholar]
- DiStefano JJ., III . Dynamic systems biology modeling and simulation. New York: Academic Press; 2015. [Google Scholar]
- Gerberding S, Obatake N, Shiu A. Identifiability of linear compartmental models: the effect of moving inputs, outputs, and leaks. Linear Multilinear Algebra. 2020;0:1–22. doi: 10.1080/03081087.2020.1812497. [DOI] [Google Scholar]
- Glad ST. Differential algebraic modelling of nonlinear systems. Boston: Birkhäuser Boston; 1990. pp. 97–105. [Google Scholar]
- Godfrey KR, Chapman MJ. Identifiability and indistinguishability of linear compartmental models. Math Comput Simul. 1990;32:273–295. doi: 10.1016/0378-4754(90)90185-L. [DOI] [Google Scholar]
- Gross E, Harrington H, Meshkat N, Shiu A. Linear compartmental models: input–output equations and operations that preserve identifiability. SIAM J Appl Math. 2019;79(4):1423–1447. doi: 10.1137/18M1204826. [DOI] [Google Scholar]
- Harris J. Algebraic geometey. Graduate Texts in Mathematics. Berlin: Springer; 1992. [Google Scholar]
- Hong H, Ovchinnikov A, Pogudin G, Yap C (2018) Global identifiability of differential models. preprint, arXiv:1801.08112
- Ljung L, Glad T. On global identifiability for arbitrary model parametrizations. Automatica. 1994;30(2):265–276. doi: 10.1016/0005-1098(94)90029-9. [DOI] [Google Scholar]
- Meshkat N, Rosen Z, Sullivant S (2018) Algebraic tools for the analysis of state space models. In: The 50th anniversary of Gröbner bases, Adv. Stud. Pure Math., vol 77. Math. Soc. Japan, Tokyo, pp 171–205
- Meshkat N, Sullivant S. Identifiable reparametrizations of linear compartment models. J Symb Comput. 2014;63:46–67. doi: 10.1016/j.jsc.2013.11.002. [DOI] [Google Scholar]
- Meshkat N, Sullivant S, Eisenberg M. Identifiability results for several classes of linear compartment models. Bull Math Biol. 2015;77(8):1620–1651. doi: 10.1007/s11538-015-0098-0. [DOI] [PubMed] [Google Scholar]
- Ollivier F (1990) Le Problème de l’Identifiabilité Structurelle Globale: Étude Théorique, Méthodes Effectives et Bornes de Complexité. PhD thesis, École Polytéchnique
- Ovchinnikov A, Pogudin G, Thompson P (2019) Input–output equations and identifiability of linear ode models. preprint, arXiv:1910.03960
- Pohjanpalo H. System identifiability based on the power series expansion of the solution. Math Biosci. 1978;41(1):21–33. doi: 10.1016/0025-5564(78)90063-9. [DOI] [Google Scholar]
- Saccomani MP, Audoly S, D’Angiò L. Parameter identifiability of nonlinear systems: the role of initial conditions. Automatica. 2003;39(4):619–632. doi: 10.1016/S0005-1098(02)00302-3. [DOI] [Google Scholar]
- Soderstrom T, Stoica P. System identification. Hoboken: Prentice-Hall; 1989. [Google Scholar]
- Vajda S, Godfrey KR, Rabitz H. Similarity transformation approach to identifiability analysis of nonlinear compartmental models. Math Biosci. 1989;93(2):217–248. doi: 10.1016/0025-5564(89)90024-2. [DOI] [PubMed] [Google Scholar]
- Villaverde AF. Obervability and structural identifiability of nonlinear biological systems. Complexity. 2019;2019:8497093. [Google Scholar]
- Walter E, Lecourtier Y. Global approaches to identifiability testing for linear and nonlinear state space models. Math Comput Simul. 1982;24(6):472–482. doi: 10.1016/0378-4754(82)90645-0. [DOI] [Google Scholar]
