Hyperbolic systems with non-diagonalisable principal part and variable multiplicities, I: well-posedness

Claudia Garetto; Christian Jäh; Michael Ruzhansky

doi:10.1007/s00208-018-1672-1

. 2018 Mar 22;372(3):1597–1629. doi: 10.1007/s00208-018-1672-1

Hyperbolic systems with non-diagonalisable principal part and variable multiplicities, I: well-posedness

Claudia Garetto ¹, Christian Jäh ¹, Michael Ruzhansky ^2,^✉

PMCID: PMC6411233 PMID: 30930490

Abstract

In this paper we analyse the well-posedness of the Cauchy problem for a rather general class of hyperbolic systems with space-time dependent coefficients and with multiple characteristics of variable multiplicity. First, we establish a well-posedness result in anisotropic Sobolev spaces for systems with upper triangular principal part under interesting natural conditions on the orders of lower order terms below the diagonal. Namely, the terms below the diagonal at a distance k to it must be of order $- k$ . This setting also allows for the Jordan block structure in the system. Second, we give conditions for the Schur type triangularisation of general systems with variable coefficients for reducing them to the form with an upper triangular principal part for which the first result can be applied. We give explicit details for the appearing conditions and constructions for $2 \times 2$ and $3 \times 3$ systems, complemented by several examples.

Mathematics Subject Classification: 35L45 (primary), 46E35 (secondary)

Introduction

The main aim of this paper is to consider the Cauchy problem for hyperbolic systems

\begin{matrix} \{\begin{matrix} D_{t} u = A (t, x, D_{x}) u + B (t, x, D_{x}) u + f (t, x), & (t, x) \in [0, T] \times R^{n}, \\ {u|}_{t = 0} = u_{0}, & x \in R^{n}, \end{matrix} \end{matrix}

with the usual notation $D_{t} = - i \partial_{t}$ and $D_{x} = - i \partial_{x}$ . Here, we assume that $A (t, x, D_{x}) = [a_{ij} (t, x, D_{x})]_{i, j = 1}^{m}$ is an $m \times m$ matrix of pseudo-differential operators of order 1, i.e. $a_{ij} \in C ([0, T], Ψ_{1, 0}^{1} (R^{n}))$ with possibly complex valued symbols. In the first part of the paper we will also assume that

\begin{matrix} A (t, x, D_{x}) = Λ (t, x, D_{x}) + N (t, x, D_{x}), \end{matrix}

with real-valued symbols in

\begin{matrix} Λ (t, x, D_{x}) = diag (λ_{1} (t, x, D_{x}), λ_{2} (t, x, D_{x}), \dots, λ_{m} (t, x, D_{x})), \end{matrix}

and

\begin{matrix} N (t, x, D_{x}) = [\begin{matrix} 0 & a_{12} (t, x, D_{x}) & a_{13} (t, x, D_{x}) & \dots & a_{1 m} (t, x, D_{x}) \\ 0 & 0 & a_{23} (t, x, D_{x}) & \dots & a_{2 m} (t, x, D_{x}) \\ ⋮ & ⋮ & ⋮ & \dots & ⋮ \\ 0 & 0 & 0 & \dots & a_{m - 1 m} (t, x, D_{x}) \\ 0 & 0 & 0 & \dots & 0 \end{matrix}] . \end{matrix}

Finally, we assume that

\begin{matrix} B (t, x, D_{x}) = [b_{ij} (t, x, D_{x})]_{i, j = 1}^{m}, b_{ij} \in C ([0, T], Ψ_{1, 0}^{0} (R^{n})), \end{matrix}

is an $m \times m$ matrix of pseudo-differential operators of order 0 with possibly complex valued symbols. We can take any $n \geq 1$ and we can assume that $m \geq 2$ since in the case $m = 1$ there are no multiplicities and thus much more is known. It is also well-known that even if all the coefficients in A and B depend only on time, due to multiplicities, the best one can hope for is the well-posedness of the Cauchy problem (1) in suitable classes of Gevrey spaces. Thus, the main questions that we address in this paper are:

Under what structural conditions on the zero order part $B (t, x, D_{x})$ is the Cauchy problem (1) well-posed in $C^{\infty}$ or, even better, in suitable scales of Sobolev spaces?
Under what conditions on the general matrix $A (t, x, D_{x})$ of first order pseudo-differential operators can we reduce it (microlocally) to another system with A satisfying the upper triangular condition (2)?

Note that this paper is part of a wider analysis of hyperbolic systems with multiplicities. Here we investigate the well-posedness of these systems. In the second part of this paper we plan to carry out the microlocal analysis of their solutions.

In the case of $2 \times 2$ systems the questions above have been analysed with the answer to (Q1) given by the following theorem:

Theorem A

([27, Theorem 7.2]) Let $m = 2$ . Suppose that the pseudo-differential operator $b_{21}$ is of order not greater than $- 1$ . Then the Cauchy problem (1) is well-posed in $C^{\infty}$ . Moreover, it is well-posed in the anisotropic Sobolev space $[\begin{matrix} H^{s_{1}} (R^{n}) \\ H^{s_{2}} (R^{n}) \end{matrix}]$ provided $s_{2} - s_{1} \geq 1$ . In that case the solution satisfies the following estimates:

\begin{matrix} ‖ u_{1} {(t, \cdot) ‖}_{H^{s}} + {‖ u_{2} (t, \cdot) ‖}_{H^{s + 1}} \leq c e^{ct} (‖ u_{1}^{0} ‖_{H^{s}} + {‖ u_{2}^{0} ‖}_{H^{s + 1}}), 0 \leq t \leq T, \end{matrix}

for $u_{j}^{0} \in H_{comp}^{s + j - 1} (R^{n})$ , $j = 1, 2$ , with $c > 0$ depending on s, T, and the support of the initial data.

The case of systems of general size but for coefficients depending only on t and for $n = 1$ was also considered. More precisely, in [26] the authors considered the Cauchy problem

\begin{matrix} \{\begin{matrix} D_{t} u = A (t) D_{x} u + B (t) D_{x} u + f (t, x), & (t, x) \in [0, T] \times R, \\ {u|}_{t = 0} = 0, & x \in R, \end{matrix} \end{matrix}

with $A (t) = [a_{ij} (t)]_{i, j = 1}^{m} \in C {([0, T])}^{m \times m}$ in the form

\begin{matrix} A (t) = Λ (t) + N (t), \end{matrix}

similar to (2). They showed the following result in the absence of lower order terms and for zero Cauchy data:

Theorem B

([26, Proposition 1]) Let $B (t) \equiv 0$ and let $s \in R$ . Then the Cauchy problem (3) is $C^{\infty}$ -well-posed. Moreover, there exist $r_{1}$ ,..., $r_{m - 1} \in [0, 1]$ such that for every $f \in C (R, {(H^{s} (R))}^{m})$ identically 0 at $t = 0$ it admits a unique solution $u \in C (R, {(S^{'} (R))}^{m})$ satisfying

\begin{matrix} u_{m} \in C (R, H^{s} (R)), u_{m - j} \in C (R, H^{s - r_{1} - \dots - r_{j - 1}} (R)), \end{matrix}

for $j = 1, \dots, m - 1$ , and identically 0 at $t = 0$ . In particular, if $λ_{j} (t) \neq λ_{k} (t)$ , $t \in R$ , $1 \leq j < k \leq m$ , no loss of anisotropic regularity appears.

The case of (microlocally) diagonalisable systems of any order with fully variable coefficients was considered by Rozenblum [41] under the condition of transversality of the intersecting characteristics. Also allowing the variable multiplicities, this transversality condition was later removed in [32, 33] with sharp $L^{p}$ -estimates for solutions, with further applications to the spectral asymptotics of the corresponding elliptic systems.

Before stating our main results and collecting some necessary basic notions we give a brief overview of the state of the art for hyperbolic equations and systems. We have a complete understanding of strictly hyperbolic systems, i.e., systems without multiplicities, with $C^{\infty}$ -coefficients. This starts with the groundbreaking work of Lax [35] and Hörmander [28] and heavily relies on the modern theory of Fourier integral operators (FIO). Well-posedness is here obtained in the space of distributions $D^{'}$ . There are also well-posedness results for less regular coefficients with respect to t. For instance, well-posedness with loss of derivatives has been obtained by Colombini and Lerner [9] for second order strictly hyperbolic equations with Log-Lipschitz coefficients with respect to t and smooth in x. It is possible to further drop the regularity in t (for instance Hölder), however, this has to be balanced by stronger regularity in x (Gevrey) and leads to more specific (Gevrey) well-posedness results (see [3, 31] and references therein). Paradifferential techniques have been recently used for this kind of strictly hyperbolic equations by Colombini et al. [6, 7].

The analysis of hyperbolic equations with multiplicities (weakly hyperbolic) has started with the seminal paper by Colombini et al. [5] in the case of coefficients depending only on time. Profound difficulties in such analysis have been exhibited by Colombini et al. [4, 8] showing that even the second order wave equation in $R$ with smooth time-dependent propagation speed (but with multiplicity) and smooth Cauchy data need not be well-posed in $D^{'}$ . However, they turn out to be well-posed in suitable Gevrey classes or spaces of ultradistributions. In the last decades many results were obtained for weakly hyperbolic equations with t-dependent coefficients ([3, 11, 16, 18–20, 34], to quote only very few). More recently, advances in the theory of weakly hyperbolic systems with t-dependent coefficients have been obtained for systems of any size in presence of multiplicities with regular or low regular (Hölder) coefficients [16, 22, 23]. In addition, in [17] precise conditions on the lower order terms (Levi conditions) have been formulated to guarantee Gevrey and ultradistributional well-posedness. Previously very few results were known in the field for systems of a certain size ( $2 \times 2$ , $3 \times 3$ ) [12, 13] or of a certain form (for instance without lower order terms or with principal part of a certain form) [44].

Weakly hyperbolic equations with x-dependent coefficients were considered for the first time in the celebrated paper by Bronshtein [2]. As shown already in some earlier works by Ivrii, the corresponding Cauchy problem is well-posed under “almost analytic regularity”, namely, if the coefficients and initial data are in suitable Gevrey classes. Bronshtein’s result was extended to (t, x)-dependent scalar equations by Ohya and Tarama [38] and to systems by Kajitani and Yuzawa [31]. The regularity assumptions are always quite strong with respect to x (Gevrey) and not below Hölder in t. See also [10, 37]. Geometrical and microlocal analytic approaches are known for equations or systems under specific assumptions on the characteristics and/or lower order terms. See [29, 30, 33, 36, 39], to quote only a few. Time-dependent coefficients of low regularity (distributional) have been considered in [21].

In this paper we will be interested in the case of coefficients depending on both t and x and we will make use of the usual definitions of symbol classes. We say that a (possibly) complex valued function $a = a (x, ξ) \in C^{\infty} (R^{n} \times R^{n})$ belongs to $S_{1, 0}^{m} (R^{n} \times R^{n})$ if there exist constants $C_{α, β} > 0$ such that

\begin{matrix} \forall α, β \in N_{0}^{n} : | \partial_{x}^{α} \partial_{ξ}^{β} a (x, ξ) | \leq C_{α, β} {〈ξ〉}^{m - | β |} \forall (x, ξ) \in R^{n} \times R^{n} . \end{matrix}

The set of pseudo-differential operators associated to the symbols in $S_{1, 0}^{m} (R^{n} \times R^{n})$ is denoted by $Ψ_{1, 0}^{m} (R^{n} \times R^{n})$ .

If there is no question about the domain under consideration, we will abbreviate the symbol- and operator-classes by $S_{1, 0}^{m}$ and $Ψ_{1, 0}^{m}$ , respectively, or simply by $S^{m}$ and $Ψ^{m}$ .

We also denote by $C ([0, T], S_{1, 0}^{m} (R^{n} \times R^{n}))$ the space of all symbols $a (t, x, ξ) \in S_{1, 0}^{m} (R^{n} \times R^{n})$ which are continuous with respect to t. The set of operators associated to the symbols in $C ([0, T], S_{1, 0}^{m} (R^{n} \times R^{n}))$ is denoted by $C ([0, T], Ψ_{1, 0}^{m} (R^{n} \times R^{n}))$ .

Again, if there is no question about the domain under consideration, we will abbreviate the symbol- and operator-classes by $C S_{1, 0}^{m}$ and $C Ψ_{1, 0}^{m}$ , respectively, or simply by $C S^{m}$ and $C Ψ^{m}$ .

Let us give our main result concerning the first question (Q1) for the systems with the principal part A satisfying the upper triangular condition (2). Here, $f_{k}$ , $u_{k}$ and $u_{k}^{0}$ , for $k = 1, \dots, m$ , stand for the components of the vectors f, u and $u_{0}$ , respectively.

Theorem 1

Let $n \geq 1$ , $m \geq 2$ , and let

\begin{matrix} \{\begin{matrix} D_{t} u = A (t, x, D_{x}) u + B (t, x, D_{x}) u + f (t, x), & (t, x) \in [0, T] \times R^{n}, \\ {u|}_{t = 0} = u_{0} (x), & x \in R^{n}, \end{matrix} \end{matrix}

where $A (t, x, D_{x}) \in {(C S^{1})}^{m \times m}$ is an upper-triangular matrix of pseudo-differential operators of order 1 in the form (2), and $B (t, x, D_{x}) \in {(C S^{0})}^{m \times m}$ is a matrix of pseudo-differential operators of order 0, continuous with respect to t. Hence, if

\begin{matrix} the lower order terms b_{ij} belong to C ([0, T], Ψ^{j - i}) for i > j, \end{matrix}

$u_{k}^{0} \in H^{s + k - 1} (R^{n})$ and $f_{k} \in C ([0, T], H^{s + k - 1})$ for $k = 1, \dots, m$ , then (4) has a unique anisotropic Sobolev solution u, i.e., $u_{k} \in C ([0, T], H^{s + k - 1})$ for $k = 1, \dots, m$ .

Remark 1

As stated earlier, we allow A and B to have complex valued symbols as long as the symbols of $Λ$ in (2), i.e. the eigenvalues of $A (t, x, ξ)$ , are real valued.

The main condition of Theorem 1 for the Sobolev well-posedness is that the pseudo-differential operator $b_{ij}$ below the diagonal (i.e. for $i > j$ ) must be of order $j - i$ . In other words, the terms below the diagonal at a distance k to it must be of order $- k$ .

In solving the Cauchy problem (4) we will make use of Fourier integral operators depending on the parameter $t \in [0, T]$ . Namely, we will work with operators of the type

\begin{matrix} \int_{0}^{t} \int_{R^{n}} e^{i φ (t, s, x, ξ)} a (t, s, x, ξ) \hat{g} (s, ξ) d ξ d s \end{matrix}

where $φ$ is the solution of a certain eikonal equation and the symbol a is determined via asymptotic expansion and transport equations. In Sect. 2.1 we will recall some well-known Sobolev estimates for this type of operators.

In Sect. 2 we will prove Theorem 1 after we explain its idea in the cases of $m = 2$ and $m = 3$ .

Consequently, in Sect. 3 we give an answer to the second question (Q2) above in the form of a suitable variable coefficients extension of the Schur triangularisation. For constant matrices such a procedure is well known (see e.g. [1, Theorem 5.4.1]).

Theorem C

(Schur’s triagularisation theorem) Given a (constant) $m \times m$ matrix A with eigenvalues $λ_{1}, \dots, λ_{m}$ in any prescribed order, there is a unitary $m \times m$ matrix T such that $R = T^{- 1} A T$ is upper triangular with the diagonal elements $r_{ii} = λ_{i}$ . Furthermore, if the entries of A and its eigenvalues are all real, T may be chosen to be real orthogonal.

It follows that R can be written as $D + N$ , where $D = diag (λ_{1}, \dots, λ_{m})$ and N is a nilpotent upper triangular matrix.

If the matrix A depends on one or several parameters, namely $A = A (t, x, ξ)$ , the situation becomes less clear and it is difficult to give a complete description, in particular together with a prescribed regularity of the involved transformation matrices. The regularity of the matrix A and the desire to maintain it through the transformation puts already constrains on the matrix as, in general, the eigenvalues can only be expected to be Lipschitz continuous in the parameters even if all the entries depend smoothly on the parameters (see, e.g., [2, 40] and the references therein). In the sequel, we will present some sufficient conditions to ensure the existence of an upper triangularisation for $A (t, x, ξ)$ which respects its regularity. For example, it will apply to the case when A is a matrix of first order symbols continuous with respect to t, i.e., $A (t, x, ξ) \in (C S^{1})^{m \times m}$ .

Our main result for this part of the problem is the following theorem.

Theorem 2

Let $A (t, x, ξ) \in {(C S^{1})}^{m \times m}$ , be a $m \times m$ -matrix with eigenvalues $λ_{1}, \dots, λ_{m} \in C S^{1}$ , and let $h_{1}, \dots, h_{m - 1} \in (C S^{0})^{m}$ be the corresponding eigenvectors. Suppose that for $e_{1} = [1, 0, \dots, 0]^{T} \in R^{m - i + 1}$ the condition

\begin{matrix} 〈h^{(i)} (t, x, ξ) | e_{1}〉 \neq 0, \forall (t, x, ξ) \in [0, T] \times R^{n} \times R^{n} \end{matrix}

holds for all $i = 1, \dots, m - 1$ , with the notation for $h^{(i)}$ explained in (37). Then, there exists a matrix-valued symbol $T (t, x, ξ) \in {(C S^{0})}^{m \times m}$ , invertible for $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ with $T^{- 1} (t, x, ξ) \in {(C S^{0})}^{m \times m}$ , such that

\begin{matrix} T^{- 1} (t, x, ξ) A (t, x, ξ) T (t, x, ξ) = Λ (t, x, ξ) + N (t, x, ξ) \end{matrix}

for all $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ , where

\begin{matrix} Λ (t, x, ξ) = diag (λ_{1} (t, x, ξ), λ_{2} (t, x, ξ), \dots, λ_{m} (t, x, ξ)) \end{matrix}

and

\begin{matrix} N (t, x, ξ) = [\begin{matrix} 0 & N_{12} (t, x, ξ) & N_{13} (t, x, ξ) & \dots & N_{1 m} (t, x, ξ) \\ 0 & 0 & N_{23} (t, x, ξ) & \dots & N_{2 m} (t, x, ξ) \\ ⋮ & ⋮ & ⋮ & \dots & ⋮ \\ 0 & 0 & 0 & \dots & N_{m - 1 m} (t, x, ξ) \\ 0 & 0 & 0 & \dots & 0 \end{matrix}], \end{matrix}

and N is a nilpotent matrix with entries in $C S^{1}$ .

Furthermore, there is an expression for the matrix symbol T which will be given in Theorem 6. Also, the assumption (5) can be relaxed, see Remark 6. In Sect. 3 we will prove this result as well as describe the procedure how to obtain the desired upper triangular form. Moreover, we work out in detail the cases of $m = 2$ and $m = 3$ clarifying this Schur triangualisation procedure and give a number of examples.

The results and techniques of this paper are a natural outgrowth of the paper [27] where the case $m = 2$ was considered and to which the results of the present paper reduce in the case of $2 \times 2$ systems. It is with great sorrow that we remember the untimely departure of our colleague and friend Todor Gramchev who was the inspiration for both [27] and the present paper.

Well-posedness in anisotropic Sobolev spaces

This section is devoted to proving the well-posedness of the Cauchy problem (1). For the reader’s convenience we first give a detailed proof in the cases $m = 2$ and $m = 3$ . This will inspire us in proving Theorem 1. We note that the case $m = 2$ has been studied in [27] and we will briefly review its derivartion. However, first we collect a few results about Fourier integral operators that we will need in the sequel.

Auxiliary remarks

In solving the Cauchy problem (1), we will deal with solutions of certain scalar pseudo-differential equations. For each characteristic $λ_{j}$ of A, we will be denoting by $G_{j}^{0} θ$ the solution to

\begin{matrix} \{\begin{matrix} D_{t} w = λ_{j} (t, x, D_{x}) w + b_{jj} (t, x, D_{x}) w, \\ w (0, x) = θ (x), \end{matrix} \end{matrix}

and by $G_{j} g$ the solution to

\begin{matrix} \{\begin{matrix} D_{t} w = λ_{j} (t, x, D_{x}) w + b_{jj} (t, x, D_{x}) w + g (t, x), \\ w (0, x) = 0 . \end{matrix} \end{matrix}

The operators $G_{j}^{0}$ and $G_{j}$ can be microlocally represented by Fourier integral operators

\begin{matrix} G_{j}^{0} θ (t, x) = \int_{R^{n}} e^{i φ_{j} (t, x, ξ)} a_{j} (t, x, ξ) \hat{θ} (ξ) d ξ \end{matrix}

and

\begin{matrix} G_{j} g (t, x) = \int_{0}^{t} \int_{R^{n}} e^{i φ_{j} (t, s, x, ξ)} A_{j} (t, s, x, ξ) \hat{g} (s, ξ) d ξ d s, \end{matrix}

with $φ_{j} (t, s, x, ξ)$ solving the eikonal equation

\begin{matrix} \{\begin{matrix} \partial_{t} φ_{j} = λ_{j} (t, x, \nabla_{x} φ_{j}), \\ φ_{j} (s, s, x, ξ) = x \cdot ξ, \end{matrix} \end{matrix}

and with the notation

\begin{matrix} φ_{j} (t, x, ξ) = φ_{j} (t, 0, x, ξ) . \end{matrix}

Here we also have the amplitudes $A_{j, - k} (t, s, x, ξ)$ of order $- k$ , k $\in N$ , giving $A_{j} \sim \sum_{k = 0}^{\infty} A_{j, - k}$ , and they satisfy the transport equations with initial data at $t = s$ , and we have $a_{j} (t, x, ξ) = A_{j} (t, 0, x, ξ)$ .

If $a_{j} \in S^{m}$ , i.e. if the amplitude $a_{j}$ in (6) is a symbol of order m, we will write $G_{j}^{0} \in I_{1, 0}^{m} .$ However, in the above construction of propagators for hyperbolic equations, we have $a_{j} \in S^{0}$ , so that $G_{j}^{0} \in I_{1, 0}^{0}$ .

By $I_{1, 0}^{m}$ , we denote the class of Fourier integral operators with amplitudes in $S_{1, 0}^{m}$ . For further information, the reader may consult [15, 42, 43] and the references therein.

With that, we can record the following estimate:

Lemma 1

For any $σ \in R$ , for sufficiently small t, we have

\begin{matrix} {∥G_{j}^{0} θ (t)∥}_{H^{σ}} \leq C_{A, σ, u_{0}} {‖ θ ‖}_{H^{σ}}, {∥G_{j} g (t)∥}_{H^{σ}} \leq C_{A, σ} t {‖ g ‖}_{L_{s}^{\infty} H_{x}^{σ}} . \end{matrix}

This statement follows from the continuity of $λ_{j}, φ_{j}, a_{j}, A_{j}$ with respect to t and from the $H^{σ}$ -boundedness of non-degenerate Fourier integral operators, see e.g. [15] (there are also surveys on such questions [42, 43]). It is important to note that the constant for the estimate for $G_{j}$ does not depend on the initial data of the Cauchy problem; see also Remark 2.

The case $m = 2$

To motivate the higher order cases, here we review the construction for $2 \times 2$ systems adapting it for the subsequent higher order arguments. Hence, in this subsection we follow the proof in [27]. Thus, we consider the system

\begin{matrix} \{\begin{matrix} D_{t} u = A (t, x, D_{x}) u + B (t, x, D_{x}) u + f (t, x), & (t, x) \in [0, T] \times R^{n}, \\ {u|}_{t = 0} = u_{0}, & x \in R^{n}, \end{matrix} \end{matrix}

where $u_{0} (x) = [u_{1}^{0} (x), u_{2}^{0} (x)]^{T}$ , $f (t, x) = [f_{1} (t, x), f_{2} (t, x)]^{T}$ , and with the operators $A (t, x, D_{x})$ and $B (t, x, D_{x})$ given by

\begin{matrix} A (t, x, D_{x}) = [\begin{matrix} λ_{1} (t, x, D_{x}) & a_{12} (t, x, D_{x}) \\ 0 & λ_{2} (t, x, D_{x}) \end{matrix}] \end{matrix}

and

\begin{matrix} B (t, x, D_{x}) = [\begin{matrix} b_{11} (t, x, D_{x}) & b_{12} (t, x, D_{x}) \\ b_{21} (t, x, D_{x}) & b_{22} (t, x, D_{x}) \end{matrix}] . \end{matrix}

We suppose that all entries of $A (t, x, D_{x})$ belong to $C Ψ_{1, 0}^{1}$ and all entries of $B (t, x, D_{x})$ belong to $C Ψ_{1, 0}^{0}$ . By using the operators $G_{j}^{0}$ and $G_{j}$ introduced in Sect. 2.1, we can reformulate the Eq. (7) as

\begin{matrix} u_{1} = & U_{1}^{0} + G_{1} ((a_{12} + b_{12}) u_{2}), \end{matrix}

\begin{matrix} u_{2} = & U_{2}^{0} + G_{2} (b_{21} u_{1}), \end{matrix}

where

\begin{matrix} U_{j}^{0} = G_{j}^{0} u_{j}^{0} + G_{j} (f_{j}), j = 1, 2 . \end{matrix}

Plugging (10) in (9), we obtain

\begin{matrix} u_{1} = {\tilde{U}}_{1}^{0} + G_{1} (a_{12} G_{2} (b_{21} u_{1})) + G_{1} (b_{12} G_{2} (b_{21} u_{1})), \end{matrix}

where

\begin{matrix} {\tilde{U}}_{1}^{0} = G_{1}^{0} u_{1}^{0} + G_{1} (f_{1}) + G_{1} ((a_{12} + b_{12}) U_{2}^{0}) . \end{matrix}

Using the rules of composition of Fourier integral operators, see e.g. [15], and by Lemma 1, we get that the operator $G_{1} \circ a_{12} \circ G_{2} \circ b_{21}$ in (12) acts continuously on $H^{s}$ if it is of order 0. Since $a_{12} \in C Ψ_{1, 0}^{1}$ we therefore need to assume that $b_{21} \in C Ψ_{1, 0}^{- 1}$ .

The operator $G_{1} \circ b_{12} \circ G_{2} \circ b_{21}$ belongs to $C I_{1, 0}^{- 1}$ since $b_{21} \in C Ψ_{1, 0}^{- 1}$ and $b_{12} \in C Ψ_{1, 0}^{0}$ .

We now introduce the following scale of Banach spaces $X^{s} (t) : = C ([0, t], H^{s})$ , $t \in [0, T]$ , equipped with the norm

\begin{matrix} {‖ u ‖}_{X^{s} (t)} = sup_{τ \in [0, t]} {‖ u (τ, \cdot) ‖}_{H^{s}} . \end{matrix}

Let

\begin{matrix} G_{1}^{0} u_{1} : = G_{1} (a_{12} G_{2} (b_{21} u_{1})) + G_{1} (b_{12} G_{2} (b_{21} u_{1})) . \end{matrix}

It follows that (12) can be written as

\begin{matrix} u_{1} = {\tilde{U}}_{1}^{0} + G_{1}^{0} u_{1} . \end{matrix}

By composition of Fourier integral operators and Lemma 1 we have that the 0-order Fourier integral operator $G_{1}^{0}$ maps $C ([0, T], H^{s})$ continuously into itself and for small time interval it is a contraction, in the sense that there exists $T^{*} \in [0, T]$ such that

\begin{matrix} ‖ G_{1}^{0} {(u - v) ‖}_{X^{s} (T^{*})} \leq C_{A, s} T^{*} {‖ u - v ‖}_{X^{s} (T^{*})}, \end{matrix}

with $C_{A, s} T^{*} < 1$ . Banach’s fixed point theorem ensures the existence of a unique fixed point $u_{1}$ for the map $G_{1}^{0}$ . Hence, by assuming that the initial data ${\tilde{U}}_{1}^{0}$ belongs to $C ([0, T^{*}], H^{s})$ we conclude that there exists a unique $u_{1} \in C ([0, T^{*}], H^{s})$ solving (12). Note that the same argument proves that the operator $I - G_{1}^{0}$ is invertible on a sufficiently small interval in t since $G_{1}^{0} = I$ at $t = 0$ . From formula (13) it is clear that in order to get ${\tilde{U}}_{1}^{0}$ to belong to $C ([0, T^{*}], H^{s})$ we need to assume that $U_{2}^{0} \in H^{s + 1}$ . Finally, we get $u_{2}$ by substitution of $u_{1}$ in (10).

Remark 2

Note that the constant $T^{*}$ depends only on A and s. Thus, the argument above can be iterated by taking $u (T^{*}, x)$ as new initial data. In this way one can cover an arbitrary finite interval [0, T] and obtain a solution in $C ([0, T], H^{s}) \times C ([0, T], H^{s + 1})$ .

Remark 3

Since $a_{12} (t, x, D_{x})$ is a first order operator combining (11) with (13) we easily see that in order to get Sobolev well-posedness of order s we need to take initial data $u_{1}^{0}$ and $u_{2}^{0}$ in $H^{s}$ and $H^{s + 1}$ , respectively, and right hand-side functions $f_{1}$ and $f_{2}$ in $C ([0, T], H^{s})$ and $C ([0, T], H^{s + 1})$ , respectively.

We have therefore proved the following theorem stated for the first time in [27, Theorem 7.2].

Theorem 3

Consider the Cauchy problem (7), with the $2 \times 2$ matrices

\begin{matrix} A (t, x, D_{x}) \in {(C S^{1})}^{2 \times 2} and B (t, x, D_{x}) \in {(C S^{0})}^{2 \times 2}, \end{matrix}

where A is of the form (2). Assume that $b_{21} \in C ([0, T], Ψ_{1, 0}^{- 1})$ , the right hand-side functions $f_{1}$ and $f_{2}$ belong to $C ([0, T], H^{s})$ and $C ([0, T], H^{s + 1})$ , respectively, and the initial data $u_{1}^{0}$ and $u_{2}^{0}$ belong to $H^{s}$ and $H^{s + 1}$ , respectively. Then, (7) has a unique solution in $C ([0, T], H^{s}) \times C ([0, T], H^{s + 1})$ . More generally it is well-posed in the anisotropic Sobolev space $C ([0, T], H^{s_{1}}) \times C ([0, T], H^{s_{2}})$ , provided $s_{2} - s_{1} = 1$ .

Remark 4

It was also shown in [27] that the solution u satisfies the estimate

\begin{matrix} ‖ u_{1} {(t, \cdot) ‖}_{H^{s}} + {‖ u_{2} (t, \cdot) ‖}_{H^{s + 1}} \leq c e^{ct} (‖ u_{1}^{0} ‖_{H^{s}} + {‖ u_{2}^{0} ‖}_{H^{s + 1}}), 0 \leq t \leq T, \end{matrix}

for $u_{j}^{0} \in H_{comp}^{s + j - 1}$ , $j = 1, 2$ with $c > 0$ depending on s, T, and the support of the initial data. Since well-posedness is obtained for any Sobolev order s it follows that the Cauchy problem (7) is also $C^{\infty}$ well-posed.

The case $m = 3$

In this section we will extend the construction to the case of $3 \times 3$ systems. In the argument there is an additional substitution and a fixed point argument step compared to the case $m = 2$ . The advantage of giving the case of $m = 3$ here is that we can make the argument more concrete compared to the more abstract construction in the general case that will be given in the following section. Thus, let

\begin{matrix} \{\begin{matrix} D_{t} u = A (t, x, D_{x}) u + B (t, x, D_{x}) u + f (t, x), & (t, x) \in [0, T] \times R^{n}, \\ {u|}_{t = 0} = u_{0}, & x \in R^{n}, \end{matrix} \end{matrix}

where $u_{0} (x) = [u_{1}^{0} (x), u_{2}^{0} (x), u_{3}^{0} (x)]^{T}$ , $f (t, x) = [f_{1} (t, x), f_{2} (t, x), f_{3} (t, x)]^{T}$ , $A (t, x, D_{x})$ is defined by the matrix

\begin{matrix} [\begin{matrix} λ_{1} (t, x, D_{x}) & a_{12} (t, x, D_{x}) & a_{13} (t, x, D_{x}) \\ 0 & λ_{2} (t, x, D_{x}) & a_{23} (t, x, D_{x}) \\ 0 & 0 & λ_{3} (t, x, D_{x}) \end{matrix}], \end{matrix}

and

\begin{matrix} B (t, x, D_{x}) = [\begin{matrix} b_{11} (t, x, D_{x}) & b_{12} (t, x, D_{x}) & b_{13} (t, x, D_{x}) \\ b_{21} (t, x, D_{x}) & b_{22} (t, x, D_{x}) & b_{23} (t, x, D_{x}) \\ b_{31} (t, x, D_{x}) & b_{32} (t, x, D_{x}) & b_{33} (t, x, D_{x}) \end{matrix}] . \end{matrix}

We assume that all the entries of $A (t, x, D_{x})$ and $B (t, x, ξ)$ belong to $C Ψ_{1, 0}^{1}$ and $C Ψ_{1, 0}^{0}$ , respectively. Using the notations introduced earlier, we can write

\begin{matrix} \begin{matrix} u_{3} (t, x) & = U_{3}^{0} + G_{3} (b_{31} u_{1}) + G_{3} (b_{32} u_{2}), \\ u_{2} (t, x) & = U_{2}^{0} + G_{2} ((a_{23} + b_{23}) u_{3}) + G_{2} (b_{21} u_{1}), \\ u_{1} (t, x) & = U_{1}^{0} + G_{1} ((a_{12} + b_{12}) u_{2}) + G_{1} ((a_{13} + b_{13}) u_{3}), \end{matrix} \end{matrix}

where

\begin{matrix} U_{j}^{0} (t, x) = G_{j}^{0} (u_{j}^{0}) + G_{j} (f_{j}), j = 1, 2, 3 . \end{matrix}

Now, we plug $u_{3}$ into $u_{1}$ and $u_{2}$ in formula (16) and, thus, obtain

\begin{matrix} \begin{matrix} u_{2} (t, x) & = {\tilde{U}}_{2}^{0} + G_{2} (b_{21} u_{1}) + G_{2} ((a_{23} + b_{23}) G_{3} (b_{31} u_{1})) \\ + G_{2} ((a_{23} + b_{23}) G_{3} (b_{32} u_{2})), \\ u_{1} (t, x) & = {\tilde{U}}_{1}^{0} + G_{1} ((a_{13} + b_{13}) G_{3} (b_{31} u_{1})) \\ + G_{1} ((a_{13} + b_{13}) G_{3} (b_{32} u_{2})) + G_{1} ((a_{12} + b_{12}) u_{2}), \end{matrix} \end{matrix}

where

\begin{matrix} {\tilde{U}}_{j}^{0} = U_{j}^{0} + G_{j} ((a_{j 3} + b_{j 3}) (t, x, D_{x}) U_{3}^{0}), j = 1, 2 . \end{matrix}

We introduce the operator $G_{2}^{0}$ by setting

\begin{matrix} G_{2}^{0} u_{2} : = G_{2} ((a_{23} + b_{23}) G_{3} (b_{32} u_{2})) \end{matrix}

and in analogy with the case $m = 2$ we define

\begin{matrix} L_{2} u_{2} : = u_{2} - G_{2}^{0} u_{2} . \end{matrix}

By Lemma 1 we have that for any s, $G_{2}^{0}$ has the operator norm in $H^{s}$ strictly less than 1 on a sufficiently small interval $[0, T^{*}]$ , so $L_{2}$ is a perturbation of the identity operator. By the Neumann series it follows that $L_{2}$ is invertible as a continuous operator from $C ([0, T^{*}], H^{s})$ to $C ([0, T^{*}], H^{s})$ . Noting now that

\begin{matrix} u_{2} - G_{2}^{0} u_{2} = L_{2} u_{2} = {\tilde{U}}_{2}^{0} + G_{2} (b_{21} u_{1}) + G_{2} ((a_{23} + b_{23}) G_{3} (b_{31} u_{1})), \end{matrix}

we have that

\begin{matrix} u_{2} (t, x) = L_{2}^{- 1} {\tilde{U}}_{2}^{0} + L_{2}^{- 1} G_{2} ((a_{23} + b_{23}) G_{3} (b_{31} u_{1})) + L_{2}^{- 1} G_{2} (b_{21} u_{1}) . \end{matrix}

Since this expression depends only on $u_{1}$ , we can plug it into the formula for $u_{1}$ in (18) and obtain

\begin{matrix} \begin{matrix} u_{1} (t, x) & = {\tilde{U}}_{1}^{0} + G_{1} ((a_{13} + b_{13}) G_{3} (b_{31} u_{1})) + \\ + G_{1} ((a_{13} + b_{13}) G_{3} (b_{32} u_{2})) + G_{1} ((a_{12} + b_{12}) u_{2}) \\ = {\tilde{U}}_{1}^{0} + G_{1} ((a_{13} + b_{13}) G_{3} (b_{31} u_{1})) + \\ + G_{1} ((a_{13} + b_{13}) G_{3} (b_{32} (L_{2}^{- 1} {\tilde{U}}_{2}^{0})) \\ + G_{1} ((a_{13} + b_{13}) G_{3} (b_{32} L_{2}^{- 1} G_{2} ((a_{23} + b_{23}) G_{3} (b_{31} u_{1})))) \\ + G_{1} ((a_{13} + b_{13}) G_{3} (b_{32} (L_{2}^{- 1} G_{2} (b_{21} u_{1}))) \\ + G_{1} ((a_{12} + b_{12}) L_{2}^{- 1} {\tilde{U}}_{2}^{0}) \\ + G_{1} ((a_{12} + b_{12}) L_{2}^{- 1} G_{2} ((a_{23} + b_{23}) G_{3} (b_{31} u_{1}))) \\ + G_{1} ((a_{12} + b_{12}) L_{2}^{- 1} G_{2} (b_{21} u_{1})) . \end{matrix} \end{matrix}

By collecting now the terms with order $\leq 0$ we can simplify the previous formula as follows:

\begin{matrix} \begin{matrix} u_{1} (t, x) & = {\tilde{U}}_{1}^{0} + G_{1} (a_{13} G_{3} (b_{31} u_{1})) + G_{1} (a_{13} G_{3} (b_{32} (L_{2}^{- 1} {\tilde{U}}_{2}^{0}))) \\ + G_{1} (a_{13} G_{3} b_{32} L_{2}^{- 1} G_{2} (a_{23} G_{3} (b_{31} u_{1}))) \\ + G_{1} (a_{13} G_{3} b_{32} L_{2}^{- 1} G_{2} (b_{23} G_{3} (b_{31} u_{1}))) \\ + G_{1} (a_{13} G_{3} b_{32} (L_{2}^{- 1} G_{2} (b_{21} u_{1}))) \\ + G_{1} (b_{13} G_{3} (b_{32} L_{2}^{- 1} G_{2} (a_{23} G_{3} (b_{31} u_{1})))) \\ + G_{1} (a_{12} L_{2}^{- 1} {\tilde{U}}_{2}^{0}) \\ + G_{1} (a_{12} L_{2}^{- 1} G_{2} (a_{23} G_{3} (b_{31} u_{1}))) \\ + G_{1} (a_{12} L_{2}^{- 1} G_{2} (b_{23}) G_{3} (b_{31} u_{1}))) \\ + G_{1} (b_{12} L_{2}^{- 1} G_{2} ((a_{23} G_{3} (b_{31} u_{1}))) \\ + G_{1} (a_{12} L_{2}^{- 1} G_{2} (b_{21} u_{1})) + l.o.t . \end{matrix} \end{matrix}

Looking at the terms

\begin{matrix} \begin{matrix} G_{1} (a_{13} G_{3} (b_{32} (L_{2}^{- 1} {\tilde{U}}_{2}^{0}))), \\ G_{1} (a_{12} L_{2}^{- 1} G_{2} (b_{21} u_{1})), \\ G_{1} (a_{12} L_{2}^{- 1} G_{2} (a_{23} G_{3} (b_{31} u_{1}))) \end{matrix} \end{matrix}

and keeping in mind that in order to get the right Sobolev regularity we need to have operators of order 0, we deduce that $b_{21}$ and $b_{32}$ must have order $- 1$ while $b_{31}$ must have order $- 2$ . Considering now the initial data

\begin{matrix} {\tilde{U}}_{j}^{0} = U_{j}^{0} + G_{j} ((a_{j 3} + b_{j 3}) U_{3}^{0}), j = 1, 2, \end{matrix}

by using (17) we obtain

\begin{matrix} {\tilde{U}}_{j}^{0} = U_{j}^{0} + G_{j} ((a_{j 3} + b_{j 3}) (G_{3}^{0} (u_{3}^{0}) + G_{3} (f_{3}))), j = 1, 2 . \end{matrix}

Combining these formulas with an analysis of the term $G_{1} (a_{12} L_{2}^{- 1} {\tilde{U}}_{2}^{0})$ we deduce that ${\tilde{U}}_{2}^{0}$ must belong to $H^{s + 1}$ . This implies $U_{2}^{0} \in H^{s + 1}$ and $U_{3}^{0} \in H^{s + 2}$ . Concluding, similarly to the case $m = 2$ , that is by the Banach fixed point theorem argument on $u_{1}$ and substitution in $u_{2}$ and $u_{3}$ , we get anisotropic Sobolev well-posedness by assuming $u_{1}^{0}$ and $f_{1}$ in $H^{s}$ , $u_{2}^{0}$ and $f_{2}$ in $H^{s + 1}$ , and $u_{3}^{0}$ and $f_{3}$ in $H^{s + 2}$ . This well-posedness is obtained by means of one invertible operator $L_{2}$ , and in analogy with case $m = 2$ the well-posedness can be extended to the whole interval [0, T] by an iterated argument. This proves Theorem 1 in the case $m = 3$ .

The general case

We are now ready to prove the main result of our paper in the general case of an upper-triangular $m \times m$ matrix, i.e, a matrix A of the type

\begin{matrix} [\begin{matrix} λ_{1} (t, x, D_{x}) & a_{12} (t, x, D_{x}) & \dots & a_{1 m} (t, x, D_{x}) \\ 0 & λ_{2} (t, x, D_{x}) & \dots & a_{2 m} (t, x, D_{x}) \\ ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & λ_{m - 1} (t, x, D_{x}) & a_{m - 1 m} (t, x, D_{x}) \\ 0 & 0 & \dots & λ_{m} (t, x, D_{x}) \end{matrix}] . \end{matrix}

For the convenience of the reader we recall here the statement of Theorem 1.

Theorem 1

Let

\begin{matrix} \{\begin{matrix} D_{t} u = A (t, x, D_{x}) u + B (t, x, D_{x}) u + f (t, x), & (t, x) \in [0, T] \times R^{n}, \\ {u|}_{t = 0} = u_{0} (x), & x \in R^{n}, \end{matrix} \end{matrix}

where $A (t, x, D_{x})$ is an upper-triangular matrix of pseudo-differential operators of order 1 in the form (2), and $B (t, x, D_{x})$ is a matrix of pseudo-differential operators of order 0, continuous with respect to t. Hence, if the lower order terms $b_{ij}$ belong to $C ([0, T], Ψ^{j - i})$ for $i > j$ , $u_{k}^{0} \in H^{s + k - 1}$ and $f_{k} \in C ([0, T], H^{s + k - 1})$ for $k = 1, \dots, m$ then (20) has a unique anisotropic Sobolev solution u, i.e., $u_{k} \in C ([0, T], H^{s + k - 1})$ for $k = 1, \dots, m$ .

Proof

Making use of the notations introduced earlier we can write the components of the solution u as

\begin{matrix} u_{i} (t, x) & = U_{i}^{0} + G_{i} (\sum_{j > i}^{m} a_{ij} (t, x, D_{x}) u_{j}) + G_{i} (\sum_{\begin{matrix} j = 1 \\ j \neq i \end{matrix}}^{m} b_{ij} (t, x, D_{x}) u_{j}) \\ = U_{i}^{0} + \sum_{j < i} G_{i} (b_{ij} (t, x, D_{x}) u_{j}) + \sum_{i < j \leq m} G_{i} ((a_{ij} + b_{ij}) (t, x, D_{x}) u_{j}), \end{matrix}

where

\begin{matrix} U_{i}^{0} = G_{i}^{0} u_{j}^{0} + G_{i} (f_{i}), \end{matrix}

and $G_{i}, G_{i}^{0}$ are Fourier integral operator of order 0 for $i = 1, \dots, m$ . Note that from the fact that $b_{ij}$ is a symbol of order 0 for every i, j and, in particular, of order $j - i$ for $j < i$ we obtain that the operator $G_{i} (b_{ij})$ is of order $j - i$ for $j < i$ , while $G_{i} (a_{ij} + b_{ij})$ is, in general, of order 1. To simplify the argument we introduce the notations $G_{i, j}^{j - i}$ and $G_{i, j}^{1}$ for the operators $G_{i} (b_{ij})$ and $G_{i} (a_{ij} + b_{ij})$ , respectively. Here the superscript stands to remind us of the order of the operator. Hence,

\begin{matrix} u_{i} = U_{i}^{0} + \sum_{j < i} G_{i, j}^{j - i} (u_{j}) + \sum_{i < j \leq m} G_{i, j}^{1} (u_{j}), \end{matrix}

for $i = 1, \dots, m$ . By begin by substituting

\begin{matrix} u_{m} = U_{m}^{0} + \sum_{j < m} G_{m, j}^{j - m} (u_{j}), \end{matrix}

into

\begin{matrix} u_{m - 1} = U_{m - 1}^{0} + \sum_{j < m - 1} G_{m - 1, j}^{j - m + 1} (u_{j}) + G_{m - 1, m}^{1} (u_{m}) . \end{matrix}

We get

\begin{matrix} \begin{matrix} u_{m - 1} & = U_{m - 1}^{0} + \sum_{j < m - 1} G_{m - 1, j}^{j - m + 1} (u_{j}) + G_{m - 1, m}^{1} U_{m}^{0} + \sum_{j < m} G_{m - 1, m}^{1} G_{m, j}^{j - m} (u_{j}) \\ = (U_{m - 1}^{0} + G_{m - 1, m}^{1} U_{m}^{0}) + \sum_{j < m - 1} (G_{m - 1, j}^{j - m + 1} (u_{j}) + G_{m - 1, m}^{1} G_{m, j}^{j - m} (u_{j})) \\ + G_{m - 1, m}^{1} G_{m, m - 1}^{- 1} u_{m - 1} . \end{matrix} \end{matrix}

Note that it is enough to assume $U_{m}^{0} \in H^{s + 1}$ and $U_{m - 1}^{0} \in H^{s}$ to obtain $U_{m - 1}^{0} + G_{m - 1, m}^{1} U_{m}^{0} \in H^{s}$ . Since all the operators above are of order $\leq 0$ we conclude that the operator

\begin{matrix} L_{m - 1} = I - G_{m - 1, m}^{1} G_{m, m - 1}^{- 1} : = I - G_{m - 1}^{0} \end{matrix}

is invertible on a sufficiently small interval [0, T] and, therefore,

\begin{matrix} u_{m - 1} - G_{m - 1, m}^{1} G_{m, m - 1}^{- 1} u_{m - 1} & = (U_{m - 1}^{0} + G_{m - 1, m}^{1} U_{m}^{0}) \\ + \sum_{j < m - 1} (G_{m - 1, j}^{j - m + 1} (u_{j}) + G_{m - 1, m}^{1} G_{m, j}^{j - m} (u_{j})), \end{matrix}

yields

\begin{matrix} u_{m - 1} = L_{m - 1}^{- 1} {\tilde{U}}_{m - 1}^{0} + L_{m - 1}^{- 1} \sum_{j < m - 1} {\tilde{G}}_{m - 1}^{j - m + 1} u_{j}, \end{matrix}

with ${\tilde{U}}_{m - 1}^{0}$ and ${\tilde{G}}_{m - 1}^{j - m + 1}$ defined by the right-hand side of (22). We now substitute $u_{m}$ and $u_{m - 1}$ into $u_{m - 2}$ making use of (23). We obtain

\begin{matrix} u_{m - 2} & = U_{m - 2}^{0} + \sum_{j < m - 2} G_{m - 2, j}^{j - m + 2} (u_{j}) \\ + G_{m - 2, m - 1}^{1} (u_{m - 1}) + G_{m - 2, m}^{1} (u_{m}) \\ = U_{m - 2}^{0} + \sum_{j < m - 2} G_{m - 2, j}^{j - m + 2} (u_{j}) + G_{m - 2, m - 1}^{1} L_{m - 1}^{- 1} {\tilde{U}}_{m - 1}^{0} \\ + G_{m - 2, m - 1}^{1} L_{m - 1}^{- 1} \sum_{j < m - 2} {\tilde{G}}_{m - 1}^{j - m + 1} u_{j} \\ + G_{m - 2, m - 1}^{1} L_{m - 1}^{- 1} {\tilde{G}}_{m - 1}^{- 1} u_{m - 2} + G_{m - 2, m}^{1} U_{m}^{0} \\ + G_{m - 2, m}^{1} \sum_{j < m - 2} G_{m, j}^{j - m} (u_{j}) + G_{m - 2, m}^{1} G_{m, m - 2}^{- 2} u_{m - 2} \\ + G_{m - 2, m}^{1} G_{m, m - 1}^{- 1} L_{m - 1}^{- 1} {\tilde{U}}_{m - 1}^{0} \\ + G_{m - 2, m}^{1} G_{m, m - 1}^{- 1} L_{m - 1}^{- 1} \sum_{j < m - 2} {\tilde{G}}_{m - 1}^{j - m + 1} u_{j} \\ + G_{m - 2, m}^{1} G_{m, m - 1}^{- 1} L_{m - 1}^{- 1} {\tilde{G}}_{m - 1}^{- 1} u_{m - 2} . \end{matrix}

We set

\begin{matrix} {\tilde{U}}_{m - 2}^{0} & = U_{m - 2}^{0} + G_{m - 2, m - 1}^{1} L_{m - 1}^{- 1} {\tilde{U}}_{m - 1}^{0} \\ + G_{m - 2, m}^{1} U_{m}^{0} + G_{m - 2, m}^{1} G_{m, m - 1}^{- 1} L_{m - 1}^{- 1} {\tilde{U}}_{m - 1}^{0} . \end{matrix}

The operators $G_{m - 2, m - 1}^{1} L_{m - 1}^{- 1}$ and $G_{m - 2, m}^{1}$ in (25) are of order 1. Keeping in mind that we already assumed $U_{m}^{0} \in H^{s + 1}$ and $U_{m - 1}^{0} \in H^{s}$ , in order to obtain Sobolev order s the initial data $U_{m}^{0}$ , $U_{m - 1}^{0}$ and $U_{m - 2}^{0}$ must belong to $H^{s + 2}$ , $H^{s + 1}$ and $H^{s}$ , respectively. Thus,

\begin{matrix} u_{m - 2} = {\tilde{U}}_{m - 2}^{0} + G_{m - 2}^{0} u_{m - 2} + \sum_{j < m - 2} {\tilde{G}}_{m - 2}^{j - m + 2} u_{j}, \end{matrix}

where $G_{m - 2}^{0}$ is a zero order operator defined by

\begin{matrix} \begin{matrix} G_{m - 2}^{0} u_{m - 2} & = G_{m - 2, m - 1}^{1} L_{m - 1}^{- 1} {\tilde{G}}_{m - 1}^{- 1} u_{m - 2} + G_{m - 2, m}^{1} G_{m, m - 2}^{- 2} u_{m - 2} \\ + G_{m - 2, m}^{1} G_{m, m - 1}^{- 1} L_{m - 1}^{- 1} {\tilde{G}}_{m - 1}^{- 1} u_{m - 2}, \end{matrix} \end{matrix}

and the last summand in (26) is obtained by collecting all the operators acting on $u_{j}$ with $j < m - 2$ in (24). Since the norm of $G_{m - 2}^{0}$ can be taken strictly less than one in a sufficiently small interval [0, T] we have that the operator

\begin{matrix} L_{m - 2} = I - G_{m - 2}^{0} \end{matrix}

is invertible and, therefore,

\begin{matrix} u_{m - 2} = L_{m - 2}^{- 1} {\tilde{U}}_{m - 2}^{0} + \sum_{j < m - 2} L_{m - 2}^{- 1} {\tilde{G}}_{m - 2}^{j - m + 2} u_{j} . \end{matrix}

Note that ${\tilde{U}}_{m - 2}^{0} \in H^{s}$ if $U_{m}^{0} \in H^{s + 2}$ , $U_{m - 1}^{0} \in H^{s + 1}$ and $U_{m - 2}^{0} \in H^{s}$ . By iterating the same procedure we deduce that

\begin{matrix} u_{k} = {\tilde{U}}_{k}^{0} + G_{k}^{0} u_{k} + \sum_{j < k} {\tilde{G}}_{k}^{j - k} u_{j}, \end{matrix}

where ${\tilde{U}}_{k}^{0}$ depends on $U_{k}^{0}$ , $U_{j}^{0}$ and ${\tilde{U}}_{j}^{0}$ with $j > k$ and $G_{k}^{0}$ is a zero order operator defined by using invertible operators $L_{m - 1}$ , $L_{m - 2}$ ,..., $L_{k}$ . In addition, we obtain ${\tilde{U}}_{k}^{0} \in H^{s}$ since $U_{m}^{0} \in H^{s + m - k}$ , $U_{m - 1}^{0} \in H^{s + m - k - 1}, \dots, U_{k}^{0} \in H^{s}$ . It follows that for $k = 2$ we have

\begin{matrix} u_{2} = {\tilde{U}}_{2}^{0} + G_{2}^{0} u_{2} + {\tilde{G}}_{2}^{- 1} u_{1}, \end{matrix}

where the operator $G_{2}^{0}$ is of zero operator and defined by invertible operators $L_{m - 1}, L_{m - 2}, \dots, L_{2}$ , ${\tilde{G}}_{2}^{- 1}$ is of order $- 1$ , and ${\tilde{U}}_{2}^{0} \in H^{s}$ since $U_{m}^{0} \in H^{s + m - 2}$ , $U_{m - 1}^{0} \in H^{s + m - 3}, \dots, U_{2}^{0} \in H^{s}$ . Hence, by inverting the operator $L_{2} = I - G_{2}^{0}$ on a sufficiently small interval [0, T] we have

\begin{matrix} u_{2} = L_{2}^{- 1} {\tilde{U}}_{2}^{0} + L_{2}^{- 1} {\tilde{G}}_{2}^{- 1} u_{1} . \end{matrix}

Now by substitution of $u_{2}, u_{3}, \dots, u_{m}$ in the equation of $u_{1}$ we arrive at the formula (28) with $k = 1$ , i.e.,

\begin{matrix} u_{1} = {\tilde{U}}_{1}^{0} + G_{1}^{0} u_{1}, \end{matrix}

where ${\tilde{U}}_{1}^{0} \in H^{s}$ since $U_{m}^{0} \in H^{s + m - 1}$ , $U_{m - 1}^{0} \in H^{s + m - 2}, \dots, U_{2}^{0} \in H^{s + 1}, U_{1}^{0} \in H^{s}$ . Concluding, by the Banach fix point argument we prove that there exists a unique $u_{1} \in C ([0, T], H^{s})$ solving the equation above with the given initial conditions. By substitution in the equations for $u_{2}, \dots, u_{m - 1}, u_{m}$ we arrive at the desired Sobolev well-posedness with $u_{k} \in C ([0, T], H^{s + k - 1})$ for $k = 2, \dots, m$ . Note that, since the sufficiently small interval [0, T] where we get well-posedness does not depend on the initial data, by a standard iteration argument we can achieve well-posedness on any bounded interval [0, T] as stated in the theorem. $□$

Schur decomposition of $m \times m$ matrices

In this section we investigate how to reduce an $m \times m$ matrix to the upper triangular form. We recall that such decomposition is well-known for constant matrices and goes under the name of Schur’s triangularisation, with its statement given in Theorem C.

One of the difficulties when dealing with variable multiplicities is the loss of regularity in the parameters at the points of multiplicities. In the following, we will assume that A is a matrix of (possibly) complex valued first order symbols, continuous with respect to t, i.e., $A (t, x, ξ) \in (C S^{1})^{m \times m}$ .

We will now develop a parameter dependent extension of the Schur triangularisation procedure and we will describe it step by step. Then we will give an example for it for the systems of low sizes, namely, for $m = 2$ and $m = 3$ .

In the case of $m = 2$ the construction below was introduced in [27] and now we give its general version for systems of any size.

Normal forms of matrices depending on several parameters have a long history and are notoriously involved; for some remarks and related works, we refer the reader to [14, 24, 25, 45].

First step or Schur step

The first step in our triangularisation follows the construction in the constant case except that we will not get a unitary transformation matrix. For this reason we talk of a Schur step. Throughout this paper $e_{i}$ denotes the i-th vector of the standard basis of $R^{n}$ with an appropriate dimension n.

Proposition 1

(Schur step) Let the $m \times m$ matrix valued symbol $A (t, x, ξ) \in {(C S^{1})}^{m \times m}$ , have a real eigenvalue $λ \in C S^{1}$ and a corresponding eigenvector $h \in (C S^{1})^{m}$ such that there exists $j \in {1, \dots, m}$ with

\begin{matrix} 〈h (t, x, ξ) | e_{j}〉 \neq 0 \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}, \end{matrix}

for a sufficiently large $M > 0$ . Then there exist an $m \times m$ matrix valued symbol $T (t, x, ξ) {(C S^{0})}^{m \times m}$ , invertible for $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ with $T^{- 1} \in {(C S^{0})}^{m \times m}$ , and an $(m - 1) \times (m - 1)$ matrix valued symbol $E (t, x, ξ) \in {(C S^{1})}^{m \times m}$ , such that

\begin{matrix} T^{- 1} (t, x, ξ) A (t, x, ξ) T (t, x, ξ) = [\begin{matrix} λ & a_{12} & \dots & a_{1 m} \\ 0 \\ ⋮ & E (t, x, ξ) \\ 0 \end{matrix}] \end{matrix}

for all $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ .

Proof

First let us note that we can assume that $j = 1$ in (29). If that is not the case, we can exchange the rows 1 and j as well as columns 1 and j to move the jth component of the eigenvector to the first component.

We define the rescaled eigenvector $μ$ componentwise by

\begin{matrix} μ_{i} (t, x, ξ) = \frac{〈h (t, x, ξ) | e_{i}〉}{〈h (t, x, ξ) | e_{1}〉} \forall i = 1, \dots, m . \end{matrix}

Now we set

\begin{matrix} T (t, x, ξ) = [\begin{matrix} μ_{1} & 0 & \dots & 0 \\ μ_{2} \\ ⋮ & I_{m - 1} \\ μ_{m} \end{matrix}] . \end{matrix}

Since $μ_{1} \equiv 1$ it follows that

\begin{matrix} T^{- 1} (t, x, ξ) = [\begin{matrix} μ_{1} & 0 & \dots & 0 \\ - μ_{2} \\ ⋮ & I_{m - 1} \\ - μ_{m} \end{matrix}], \end{matrix}

where $I_{m - 1}$ is the $(m - 1) \times (m - 1)$ identity matrix. By direct computations we get

\begin{matrix} A T = [\begin{matrix} \sum_{j = 1}^{m} a_{1 j} μ_{j} \\ ⋮ & A_{(2)} & \dots & A_{(m)} \\ \sum_{j = 1}^{m} a_{mj} μ_{j} \end{matrix}] = [\begin{matrix} λ μ_{1} \\ ⋮ & A_{(2)} & \dots & A_{(m)} \\ λ μ_{m} \end{matrix}], \end{matrix}

where we used that

\begin{matrix} \sum_{j = 1}^{m} a_{ij} μ_{j} = λ μ_{i}, i = 1, \dots, m, \end{matrix}

and denoted the ith column of A by $A_{(i)}$ . The equations in (30) are given by the eigenvalue equation $A μ = λ μ$ . Further, from $μ_{1} \equiv 1$ we obtain

\begin{matrix} T^{- 1} A T = & [\begin{matrix} λ μ_{1}^{2} & a_{12} μ_{1} & \dots & a_{1 m} μ_{1} \\ - μ_{2} μ_{1} λ + μ_{2} λ \\ ⋮ & E \\ - μ_{m} μ_{1} λ + μ_{m} λ \end{matrix}] \\ = & [\begin{matrix} λ & a_{12} & \dots & a_{1 m} \\ 0 \\ ⋮ & E \\ 0 \end{matrix}], \end{matrix}

which concludes the proof. Note that by construction the matrix E has entries in $C S^{1}$ which depend on A. In particular its eigenvalues are the eigenvalues of A excluding $λ$ (counted as many times as they occur). $□$

Applying Proposition 1 repeatedly for $m - 2$ times to E, we obtain a full Schur transformation of A, that is a full reduction to an upper triangular form. In the next subsection we describe this iteration in detail. This triangularisation procedure is summarised in Theorem 6 where sufficient conditions on the eigenvectors of A are given.

The triangularisation procedure

The reduction to an upper triangular form or the Schur transformation of A is possible under certain conditions on its eigenvectors. More precisely, let

\begin{matrix} h_{1} (t, x, ξ), \dots, h_{m - 1} (t, x, ξ) \in (C S^{0})^{m} \end{matrix}

be $m - 1$ eigenvectors of $A (t, x, ξ) = {[a_{ij} (t, x, ξ)]}_{i, j = 1}^{m}$ , $a_{ij} \in C S^{1}$ , corresponding to the eigenvalues $λ_{1} (t, x, ξ)$ , $\dots$ , $λ_{m - 1} (t, x, ξ) \in C S^{1}$ . To formulate the sufficient conditions for the existence of such Schur transformation, we introduce a set of auxiliary vectors $h^{(i)}$ , $i = 1, \dots, m - 1$ , which depend only on $h_{i}$ and the previous vectors $h^{(j)} \in C S^{0}$ , $j = 1, \dots, i - 1$ . When $i = 1$ we set $h^{(1)} = h_{1}$ .

As in Proposition 1 we begin by assuming

\begin{matrix} 〈h^{(1)} (t, x, ξ) | e_{1}〉 \neq 0 \end{matrix}

for $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ .

Remark 5

As noted in the proof of Proposition 1, we could have that

\begin{matrix} 〈h^{(1)} (t, x, ξ) | e_{j}〉 \neq 0 \end{matrix}

for another arbitrary $j \in {1, \dots, m}$ . Then, we could transform the matrix $A (t, x, ξ)$ by a constant permutation matrix P such that $P^{- 1} h^{(1)}$ is eigenvector of $P^{- 1} A P$ corresponding to $λ_{1}$ which satisfies $〈P^{- 1} h^{(1)} (t, x, ξ) | e_{1}〉 \neq 0$ . For this reason we state (32) with $h^{(1)}$ and $e_{1}$ .

Step 1

By Proposition 1 there exists a matrix

T_{1}

such that

\begin{matrix} T_{1}^{- 1} A T_{1} = [\begin{matrix} λ_{1} & a_{12} & \dots & a_{1 m} \\ 0 \\ ⋮ & E_{m - 1} \\ 0 \end{matrix}] . \end{matrix}

The matrix

T_{1}

is given by

\begin{matrix} T_{1} = [\begin{matrix} ω_{1} & e_{2} & \dots & e_{m} \end{matrix}], ω_{1} = {[\begin{matrix} ω_{11} & \dots & ω_{1 m} \end{matrix}]}^{T} \end{matrix}

with

\begin{matrix} ω_{1 j} = \frac{〈h^{(1)} (t, x, ξ) | e_{j}〉}{〈h^{(1)} (t, x, ξ) | e_{1}〉} . \end{matrix}

In the sequel we make use of the projector

Π_{k} : R^{m} \to R^{m - k}

0 \leq k \leq m - 1

, defined by

\begin{matrix} Π_{k} [\begin{matrix} x_{1} \\ ⋮ \\ x_{m} \end{matrix}] = [\begin{matrix} x_{k + 1} \\ ⋮ \\ x_{m} \end{matrix}] . \end{matrix}

Note that

Π_{0}

is the identity map

I_{m} : R^{m} \to R^{m}

Step 2

Since

h_{2}

is an eigenvector of A with eigenvalue

λ_{2}

we get that

T_{1}^{- 1} h_{2}

is an eigenvector of

T_{1}^{- 1} A T_{1}

with eigenvalue

λ_{2}

as well. By the structure of

T_{1}^{- 1} A T_{1}

we easily see that

h^{(2)} : = Π_{1} T_{1}^{- 1} h_{2}

is an eigenvector of

E_{m - 1}

, corresponding to

λ_{2}

. Arguing as in Remark 5 we assume that

\begin{matrix} 〈Π_{1} T_{1}^{- 1} h_{2} | e_{1}〉 \neq 0 \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}, \end{matrix}

to be able to apply Proposition 1 to

E_{m - 1}

. We get that there exists an

(m - 1) \times (m - 1)

matrix

{\tilde{T}}_{2}

such that

{\tilde{T}}_{2}^{- 1} E_{m - 1} {\tilde{T}}_{2}

is of form

\begin{matrix} [\begin{matrix} λ_{2} & * & \dots & * \\ 0 \\ ⋮ & E_{m - 2} \\ 0 \end{matrix}], \end{matrix}

where in the first row the first row of

E_{m - 1}

appears. Thus, setting

\begin{matrix} T_{2} = [\begin{matrix} 1 & 0 & \dots & 0 \\ 0 \\ ⋮ & {\tilde{T}}_{2} \\ 0 \end{matrix}], \end{matrix}

we obtain

\begin{matrix} T_{2}^{- 1} T_{1}^{- 1} A T_{1} T_{2} = [\begin{matrix} λ_{1} & * & * & \dots & * \\ 0 & λ_{2} & * & \dots & * \\ 0 & 0 \\ ⋮ & ⋮ & E_{m - 2} \\ 0 & 0 \end{matrix}] . \end{matrix}

Note that in (34) we write explicitly only the entries most relevant to our triangularisation. To compute the matrix

{\tilde{T}}_{2}

, we set

\begin{matrix} ω_{2} = {[\begin{matrix} ω_{22} & \dots & ω_{2 m} \end{matrix}]}^{T}, \end{matrix}

where

\begin{matrix} ω_{2 j} (t, x, ξ) : = \frac{〈h^{(2)} (t, x, ξ) | e_{j}〉}{〈h^{(2)} (t, x, ξ) | e_{1}〉}, j = 2, \dots, m, \end{matrix}

and then

\begin{matrix} {\tilde{T}}_{2} = [\begin{matrix} ω_{2} & e_{2} & \dots & e_{m - 1} \end{matrix}] . \end{matrix}

It is clear that

T_{2}

has the same structure as

T_{1}

, i.e., it is defined via a rescaled eigenvector as the first column and an identity matrix (

I_{m - 1}

for

T_{1}

and

I_{m - 2}

for

T_{2}

Step K

By iterating the method

k - 1

times we can find

k - 1

matrices

T_{1}, T_{2}, \dots, T_{k - 1}

of size

m \times m

such that

\begin{matrix} T_{k - 1}^{- 1} \cdot \dots \cdot T_{1}^{- 1} A T_{1} \cdot \dots \cdot T_{k - 1} = \\ [\begin{matrix} λ_{1} & * & * & \dots & \dots & * \\ 0 & ⋱ & * & \dots & \dots & * \\ 0 & 0 & λ_{k - 1} & * & \dots & * \\ 0 & 0 & 0 \\ ⋮ & ⋮ & ⋮ & E_{m - k + 1} \\ 0 & 0 & 0 \end{matrix}], \end{matrix}

where

E_{m - k + 1}

is a

(m - k + 1) \times (m - k + 1)

matrix and the equality is true on

[0, T] \times R^{n} \times {| ξ | \geq M}

. Since

h_{k}

is an eigenvector of A corresponding to

λ_{k}

, the vector

\begin{matrix} T_{k - 1}^{- 1} T_{k - 2}^{- 1} \cdot \dots \cdot T_{1}^{- 1} h_{k} \end{matrix}

is an eigenvector of

\begin{matrix} T_{k - 1}^{- 1} T_{k - 2}^{- 1} \cdot \dots \cdot T_{1}^{- 1} A T_{1} T_{2} \cdot \dots \cdot T_{k - 1} \end{matrix}

and

\begin{matrix} h^{(k)} : = Π_{k - 1} T_{k - 1}^{- 1} T_{k - 2}^{- 1} \cdot \dots \cdot T_{1}^{- 1} h_{k} \in (C S^{0})^{m - k + 1} \end{matrix}

an eigenvector of

E_{m - k + 1}

corresponding to

λ_{k}

. Thus, to satisfy the assumptions of Proposition 1 and keeping in mind Remark 5, we require that

\begin{matrix} 〈h^{(k)} (t, x, ξ) | e_{1}〉 \neq 0 \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M} . \end{matrix}

It follows that there exists an

(m - k + 1) \times (m - k + 1)

transformation matrix

{\tilde{T}}_{k}

such that

{\tilde{T}}_{k}^{- 1} \dots {\tilde{T}}_{1}^{- 1} A {\tilde{T}}_{1} \dots {\tilde{T}}_{k}

is of the form

\begin{matrix} [\begin{matrix} λ_{k} & * & \dots & * \\ 0 \\ ⋮ & E_{m - k} \\ 0 \end{matrix}] . \end{matrix}

and set

\begin{matrix} T_{k} = [\begin{matrix} I_{k - 1} & 0 \\ 0 & {\tilde{T}}_{k} \end{matrix}] . \end{matrix}

The matrix

{\tilde{T}}_{k}

is defined by

\begin{matrix} {\tilde{T}}_{k} = [\begin{matrix} ω_{k} & e_{2} & \dots & e_{m - k + 1} \end{matrix}], ω_{k} = {[\begin{matrix} ω_{kk} & \dots & ω_{km} \end{matrix}]}^{T}, \end{matrix}

where

\begin{matrix} ω_{kj} = \frac{〈h^{(k)} (t, x, ξ) | e_{j}〉}{〈h^{(k)} (t, x, ξ) | e_{1}〉}, j = k, \dots, m . \end{matrix}

Stepm-1

This is the last step as

E_{2}

is a

2 \times 2

matrix. We have that

\begin{matrix} h^{(m - 1)} = Π_{m - 2} T_{m - 2}^{- 1} \cdot \dots \cdot T_{1}^{- 1} h_{m - 1} \in (C S^{0})^{2} \end{matrix}

is an eigenvector of

E_{2}

corresponding to

λ_{m - 1}

and that

{\tilde{T}}_{m - 1}

exists as before if

\begin{matrix} 〈h^{(m - 1)} (t, x, ξ) | e_{1}〉 \neq 0 \in \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M} . \end{matrix}

The matrix

{\tilde{T}}_{m - 1}

is given by

\begin{matrix} {\tilde{T}}_{m - 1} = [\begin{matrix} ω_{m - 1} & e_{2} \end{matrix}] = [\begin{matrix} ω_{m - 1, m - 1} & 0 \\ ω_{m - 1, m} & 1 \end{matrix}], \end{matrix}

where

\begin{matrix} ω_{m - 1, j} = \frac{〈h^{(m - 1)} (t, x, ξ) | e_{j}〉}{〈h^{(m - 1)} (t, x, ξ) | e_{1}〉}, j = m - 1, m, \end{matrix}

and then

\begin{matrix} T_{m - 1} = [\begin{matrix} I_{m - 2} & 0 \\ 0 & {\tilde{T}}_{m - 1} \end{matrix}] . \end{matrix}

We are now ready to state Theorem 6 which summarises the triangularisation procedure explained above. For the convenience of the reader we recall the notations introduced so far:

$h_{1}, \dots, h_{m - 1}$ are the eigenvectors of the matrix A corresponding to the eigenvalues $λ_{1}, \dots, λ_{m - 1}$ .
$h^{(1)} = h_{1}$ and
$\begin{matrix} h^{(i)} = Π_{i - 1} T_{i - 1}^{- 1} T_{i - 2}^{- 1} \cdot \dots \cdot T_{1}^{- 1} h_{i} \in (C S^{0})^{m - k + 1}, \end{matrix}$ 37
for $i = 2, \dots, m - 1$ .
the matrices $T_{k}$ are inductively defined as follows: $T_{0} = I_{m}$ and
$\begin{matrix} T_{k} = [\begin{matrix} I_{k - 1} & 0 \\ 0 & {\tilde{T}}_{k} \end{matrix}], {\tilde{T}}_{k} = [\begin{matrix} ω_{k} & e_{2} & \dots & e_{m - k} \end{matrix}], e_{i} \in R^{m - k}, \end{matrix}$
where
$\begin{matrix} ω_{kj} = \frac{〈h^{(k)} (t, x, ξ) | e_{j}〉}{〈h^{(k)} (t, x, ξ) | e_{1}〉}, j = k, \dots, m . \end{matrix}$

Finally, we note that $h^{(k)}$ depends only on $T_{k - 1}$ , $\dots$ , $T_{1}$ and, thus, only on the eigenvectors $h^{(k - 1)}$ , $\dots$ , $h^{(1)}$ .

Summarising, we can formulate a more precise version of Theorem 2.

Theorem 6

(Schur Decomposition) Let $A (t, x, ξ) \in {(C S^{1})}^{m \times m}$ be a matrix with eigenvalues $λ_{1}, \dots, λ_{m} \in C S^{1}$ , and let $h_{1}, \dots, h_{m - 1} \in (C S^{0})^{m}$ be the corresponding eigenvectors. Suppose that for $e_{1} \in R^{m - i + 1}$ the condition

\begin{matrix} 〈h^{(i)} (t, x, ξ) | e_{1}〉 \neq 0, \forall (t, x, ξ) \in [0, T] \times R^{n} \times R^{n} \end{matrix}

holds for all $i = 1, \dots, m - 1$ , with the notation explained above. Then, there exists a matrix-valued symbol $T (t, x, ξ) \in {(C S^{0})}^{m \times m}$ , invertible for $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ , $T^{- 1} (t, x, ξ) \in {(C S^{0})}^{m \times m}$ , such that

\begin{matrix} T^{- 1} (t, x, ξ) A (t, x, ξ) T (t, x, ξ) = Λ (t, x, ξ) + N (t, x, ξ) \end{matrix}

for all $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ , where

\begin{matrix} Λ (t, x, ξ) = diag (λ_{1} (t, x, ξ), λ_{2} (t, x, ξ), \dots, λ_{m} (t, x, ξ)) \end{matrix}

and

\begin{matrix} N (t, x, ξ) = [\begin{matrix} 0 & N_{12} (t, x, ξ) & N_{13} (t, x, ξ) & \dots & N_{1 m} (t, x, ξ) \\ 0 & 0 & N_{23} (t, x, ξ) & \dots & N_{2 m} (t, x, ξ) \\ ⋮ & ⋮ & ⋮ & \dots & ⋮ \\ 0 & 0 & 0 & \dots & N_{m - 1 m} (t, x, ξ) \\ 0 & 0 & 0 & \dots & 0 \end{matrix}], \end{matrix}

and N is a nilpotent matrix with entries in $C S^{1}$ . Furthermore, the matrix symbol T is given by

\begin{matrix} T (t, x, ξ) = T_{1} T_{2} \cdot \dots \cdot T_{m - 1}, \end{matrix}

with the notation explained above.

Remark 6

Taking into account Remark 5, let us stress that condition (38) is not restrictive as it can be replaced by the following: suppose that there exist $m - 1$ numbers $j_{i} \in {1, \dots, m - i + 1}$ , $i = 1, \dots m - 1$ , such that for all $i = 1, \dots, m - 1$

\begin{matrix} 〈h^{(i)} (t, x, ξ) | e_{j_{i}}〉 \neq 0 \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M} \end{matrix}

holds.

Remark 7

If $A (t, x, ξ)$ has complex symbols (as allowed in Theorem 1, see also Remark 1) and real eigenvalues, the eigenvalues of the Schur transformed system clearly remain real. The upper triangular entries may still be complex valued symbols.

Remark 8

Theorem 6 is quite general in the sense that the functions $a_{ij}$ could be complex-valued. In this paper, we are concerned with hyperbolic matrices, i.e. we assume that the eigenvalues $λ_{1}, \dots, λ_{m}$ are real. We stress that the Schur transform does not change the hyperbolicity of the matrix as the eigenvalues of $T^{- 1} A T$ are also $λ_{1}, \dots, λ_{m}$ .

Remark 9

For our applications in this and future work it is important that the transform T in Theorem 6 keeps the regularity of the original matrix A, i.e. that the elements of the Schur transform $T^{- 1} A T$ are in the same class as the elements of A. Here, we stated everything with $C S^{1}$ and $C S^{0}$ as that is the regularity considered in this paper. Note that one could replace C with $C^{k}$ or $C^{\infty}$ and find a matrix T such that the transformed matrix $T^{- 1} A T$ inherits the same regularity with respect to t. In addition, one could also drop the regularity in t to $L^{\infty}$ and the triangularisation procedure would still work preserving the boundedness in t through every step.

For the sake of simplicity and the reader’s convenience, in the next subsections we analyse Theorem 6 in the special cases of $m = 2$ and $m = 3$ .

The case $m = 2$

We now formulate Theorem 6 in the special case $m = 2$ . In this way we recover the formulation given in [27].

Theorem 7

([27, Theorem 7.1]) Suppose that $A (t, x, ξ) \in {(C S^{1})}^{2 \times 2}$ admits eigenvalues $λ_{j} (t, x, ξ) \in C S^{1}$ , $j = 1, 2$ , and an eigenvector $h (t, x, ξ) \in {(C S^{0})}^{2}$ satisfying

\begin{matrix} 〈h (t, x, ξ) | e_{j}〉 \neq 0, (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}, \end{matrix}

for $j = 1$ or $j = 2$ . Then, we can find a $2 \times 2$ matrix valued symbol $T (t, x, ξ) \in {(C S^{0})}^{2 \times 2}$ , invertible for ${| ξ | \geq M}$ with $T^{- 1} (t, x, ξ) \in {(C S^{0})}^{2 \times 2}$ , such that

\begin{matrix} T^{- 1} (t, x, ξ) A (t, x, ξ) T (t, x, ξ) = [\begin{matrix} λ_{1} (t, x, ξ) & a_{12} (t, x, ξ) \\ 0 & λ_{2} (t, x, ξ) \end{matrix}] \end{matrix}

for all $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ .

Proof

For $2 \times 2$ matrices the triangularisation procedure described in the previous subsection can stop at Step 1. By Remark 5, we may assume that (40) holds for the eigenvector h corresponding to $λ_{1}$ and for $j = 1$ . We set $h = h_{1}$ and $h^{(1)} = h_{1}$ . The vector

\begin{matrix} ω_{1} = [\begin{matrix} ω_{11} (t, x, ξ) \\ ω_{12} (t, x, ξ) \end{matrix}], ω_{1 j} (t, x, ξ) = \frac{〈h^{(1)} (t, x, ξ) | e_{j}〉}{〈h^{(1)} (t, x, ξ) | e_{1}〉}, \end{matrix}

belongs to $C S^{0}$ and is an eigenvector of A associated to $λ_{1}$ . We then set

\begin{matrix} T_{1} (t, x, ξ) = [\begin{matrix} ω_{1} & e_{2} \end{matrix}] = [\begin{matrix} ω_{11} (t, x, ξ) & 0 \\ ω_{12} (t, x, ξ) & 1 \end{matrix}] . \end{matrix}

With that, we obtain

\begin{matrix} A (t, x, ξ) T_{1} (t, x, ξ) = [\begin{matrix} a_{11} ω_{11} + a_{12} ω_{12} & a_{12} \\ a_{21} ω_{11} + a_{22} ω_{12} & a_{22} \end{matrix}] \end{matrix}

and finally, with

\begin{matrix} T_{1}^{- 1} (t, x, ξ) = [\begin{matrix} ω_{11} (t, x, ξ) & 0 \\ - ω_{12} (t, x, ξ) & 1 \end{matrix}], \end{matrix}

we obtain

\begin{matrix} \begin{matrix} T_{1}^{- 1} (t, x, ξ) A (t, x, ξ) T_{1} (t, x, ξ) \\ = [\begin{matrix} a_{11} ω_{11}^{2} + a_{12} ω_{12} ω_{11} & a_{12} ω_{11} \\ - a_{11} ω_{12} ω_{11} - a_{12} ω_{12}^{2} + a_{21} ω_{11} + a_{22} ω_{12} & - ω_{12} a_{12} + a_{22} \end{matrix}] \end{matrix} \end{matrix}

By construction, we have

\begin{matrix} \begin{matrix} a_{11} ω_{11} + a_{12} ω_{12} = λ_{1} ω_{11}, \\ a_{21} ω_{11} + a_{22} ω_{12} = λ_{1} ω_{12}, \end{matrix} \end{matrix}

and $ω_{1} = 1$ . This yields $a_{11} ω_{11} + a_{12} ω_{12} = λ_{1} ω_{11} = λ_{1}$ and

\begin{matrix} \begin{matrix} - a_{11} ω_{12} ω_{11} - a_{12} ω_{12}^{2} + a_{21} ω_{11} + a_{22} ω_{12} \\ = - ω_{12} (a_{11} ω_{11} + a_{12} ω_{12}) + a_{21} ω_{11} + a_{22} ω_{12} = - λ_{1} ω_{12} + λ_{1} ω_{12} = 0 . \end{matrix} . \end{matrix}

Using $a_{11} + a_{22} = λ_{1} + λ_{2}$ , we obtain

\begin{matrix} - ω_{12} a_{12} + a_{22} = - ω_{12} a_{12} + a_{22} + a_{11} ω_{11} - a_{11} ω_{11} = λ_{2} . \end{matrix}

Thus, we get that

\begin{matrix} T_{1}^{- 1} (t, x, ξ) A (t, x, ξ) T_{1} (t, x, ξ) = [\begin{matrix} λ_{1} (t, x, ξ) & a_{12} (t, x, ξ) \\ 0 & λ_{2} (t, x, ξ) \end{matrix}] \end{matrix}

for $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ . This concludes the proof. $□$

Example

(i)
By direct computations we can easily see that if $h_{1} = {[h_{11} h_{12}]}^{T} = e_{1}$ then the matrix A is automatically in the upper triangular form. Indeed,
$\begin{matrix} a_{21} h_{11} + a_{22} h_{12} = λ_{1} h_{12} \end{matrix}$
implies $a_{21} = 0$ . A typical example (already discussed in [27]) is the Jordan block matrix
$\begin{matrix} A = [\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}], \end{matrix}$
where $λ_{1} = 0$ is an eigenvalue with eigenvector $h_{1} = e_{1}$ .
(ii)
Condition (40) is trivially fulfilled when $\det A \equiv 0$ and A is of the form
$\begin{matrix} [\begin{matrix} a & a \\ - a & - a \end{matrix}], \end{matrix}$
for $a = a (t, x, ξ)$ . Indeed, also in this case one can take 0 as an eigenvalue with eigenvector $h_{1} = {[1 1]}^{T}$ .

The case $m = 3$

With the notation introduced in Sect. 3.2, we assume that the $3 \times 3$ matrix $A (t, x, ξ) \in {(C S^{1})}^{3 \times 3}$ admits three eigenvalues $λ_{i} (t, x, ξ) \in C S^{1}$ , $i = 1, 2, 3$ , and two corresponding eigenvectors $h_{i} (t, x, ξ) \in (C S^{0})^{3}$ , $i = 1, 2$ . Then, we set $h^{(1)} : = h_{1}$ and, as in Remark 6 we suppose that there is a $j_{1} \in {1, 2, 3}$ with

\begin{matrix} 〈h^{(1)} (t, x, ξ) | e_{j_{1}}〉 \neq 0 \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M} . \end{matrix}

Thus, we can set

\begin{matrix} ω_{1 j} (t, x, ξ) = \frac{〈h^{(1)} (t, x, ξ) | e_{j}〉}{〈h^{(1)} (t, x, ξ) | e_{j_{1}}〉} . \end{matrix}

Now, we rearrange the matrix A such that the first component of $ω_{1}$ becomes identically equal to 1. Then, with $j_{2}, j_{3} \in {1, 2, 3} \ {j_{1}}$ , we can write

\begin{matrix} T_{1}^{- 1} = {[\begin{matrix} ω_{1} & e_{2} & e_{3} \end{matrix}]}^{- 1} = [\begin{matrix} ω_{1 j_{1}} & 0 & 0 \\ - ω_{1 j_{2}} & 1 & 0 \\ - ω_{1 j_{3}} & 0 & 1 \end{matrix}] = [\begin{matrix} 1 & 0 & 0 \\ - ω_{1 j_{2}} & 1 & 0 \\ - ω_{1 j_{3}} & 0 & 1 \end{matrix}], \end{matrix}

which leads to

\begin{matrix} T_{1}^{- 1} h_{2} = [\begin{matrix} ω_{1 j_{1}} & 0 & 0 \\ - ω_{1 j_{2}} & 1 & 0 \\ - ω_{1 j_{3}} & 0 & 1 \end{matrix}] [\begin{matrix} h_{2 j_{1}} \\ h_{2 j_{2}} \\ h_{2 j_{3}} \end{matrix}] = [\begin{matrix} h_{2 j_{1}} \\ - ω_{1 j_{2}} h_{2 j_{1}} + h_{2 j_{2}} \\ - ω_{1 j_{3}} h_{2 j_{1}} + h_{2 j_{3}} \end{matrix}] . \end{matrix}

We then get

\begin{matrix} h^{(2)} = Π_{1} T_{1}^{- 1} h_{2} = [\begin{matrix} - ω_{1 j_{2}} h_{2 j_{1}} + h_{2 j_{2}} \\ - ω_{1 j_{3}} h_{2 j_{1}} + h_{2 j_{3}} \end{matrix}] \end{matrix}

and the condition (38) that there exists $j \in {1, 2}$ such that

\begin{matrix} 〈h^{(2)} (t, x, ξ) | e_{j}〉 \neq 0 \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M} \end{matrix}

translates to: either

\begin{matrix} - ω_{1 j_{2}} h_{2 j_{1}} + h_{2 j_{2}} \neq 0 \Rightarrow h_{2 j_{2}} h_{1 j_{1}} - h_{1 j_{2}} h_{2 j_{1}} \neq 0 \end{matrix}

\begin{matrix} - ω_{1 j_{3}} h_{2 j_{1}} + h_{2 j_{3}} \neq 0 \Rightarrow h_{2 j_{3}} h_{1 j_{1}} - h_{1 j_{3}} h_{2 j_{1}} \neq 0 \end{matrix}

holds. Thus, assuming that (41) holds, the matrix ${\tilde{T}}_{2}$ is given by

\begin{matrix} [\begin{matrix} ω_{21} & 0 \\ ω_{21} & 1 \end{matrix}] = [\begin{matrix} 1 & 0 \\ \frac{- ω_{1 j_{3}} h_{2 j_{1}} + h_{2 j_{3}}}{- ω_{1 j_{2}} h_{2 j_{1}} + h_{2 j_{2}}} & 1 \end{matrix}], ω_{2 j} = \frac{〈h^{(2)} (t, x, ξ) | e_{j}〉}{〈h^{(2)} (t, x, ξ) | e_{j_{2}}〉}, j = 1, 2, \end{matrix}

and the matrix $T_{2}$ by

\begin{matrix} [\begin{matrix} 1 & 0 & 0 \\ 0 & ω_{21} & 0 \\ 0 & ω_{22} & 1 \end{matrix}] = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & \frac{- ω_{1 j_{3}} h_{2 j_{1}} + h_{2 j_{3}}}{- ω_{1 j_{2}} h_{2 j_{1}} + h_{2 j_{2}}} & 1 \end{matrix}] . \end{matrix}

Thus, we obtain

\begin{matrix} T (t, x, ξ) = T_{1} T_{2} = [\begin{matrix} 1 & 0 & 0 \\ ω_{1 j_{2}} & 1 & 0 \\ ω_{1 j_{3}} & 0 & 1 \end{matrix}] [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & \frac{- ω_{1 j_{3}} h_{2 j_{1}} + h_{2 j_{3}}}{- ω_{1 j_{2}} h_{2 j_{1}} + h_{2 j_{2}}} & 1 \end{matrix}] . \end{matrix}

If we have (42) instead of (41), then we would need a permutation matrix

\begin{matrix} P_{j_{2} \leftrightarrow j_{3}} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{matrix}] \end{matrix}

in (43), i.e.

\begin{matrix} T (t, x, ξ) = T_{1} (t, x, ξ) P_{j_{2} \leftrightarrow j_{3}} T_{2} (t, x, ξ) \end{matrix}

and

\begin{matrix} T_{2} (t, x, ξ) = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & \frac{- ω_{1 j_{2}} h_{2 j_{1}} + h_{2 j_{2}}}{- ω_{1 j_{3}} h_{2 j_{1}} + h_{2 j_{3}}} & 1 \end{matrix}] . \end{matrix}

Theorem 8

Suppose that $A (t, x, ξ) \in {(C S^{1})}^{3 \times 3}$ admits three eigenvalues $λ_{i} \in C S^{1}$ , $i = 1, 2, 3$ , and two corresponding eigenvectors $h_{i} (t, x, ξ) \in (C S^{1})^{3}$ , $i = 1, 2$ . Suppose that there exists a $j_{1} \in {1, 2, 3}$ such that

\begin{matrix} h_{1 j_{1}} (t, x, ξ) \neq 0 \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M} . \end{matrix}

Further suppose that there exists $j_{2} \in {1, 2, 3} \ {j_{1}}$ such that

\begin{matrix} h_{2 j_{2}} h_{1 j_{1}} - h_{1 j_{2}} h_{2 j_{1}} \neq 0 \forall (t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M} . \end{matrix}

Then, there exists a matrix-valued symbol $T (t, x, ξ) \in {(C S^{0})}^{3 \times 3}$ , invertible for all $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M}$ with $T^{- 1} (t, x, ξ) \in {(C S^{0})}^{3 \times 3}$ , such that

\begin{matrix} T^{- 1} (t, x, ξ) A (t, x, ξ) T (t, x, ξ) = Λ (t, x, ξ) + N (t, x, ξ) \end{matrix}

holds for all $(t, x, ξ) \in [0, T] \times R^{n} \times {| ξ | \geq M},$ where $Λ (t, x, ξ) = diag (λ_{1}, λ_{2}, λ_{3})$ and

\begin{matrix} N (t, x, ξ) = [\begin{matrix} 0 & N_{13} (t, x, ξ) & N_{13} (t, x, ξ) \\ 0 & 0 & N_{23} (t, x, ξ) \\ 0 & 0 & 0 \end{matrix}] . \end{matrix}

We end this subsection by discussing some examples of $3 \times 3$ matrices fulfilling the assumptions above on their eigenvalues.

Examples

(i)
If the matrix A has eigenvectors
$\begin{matrix} h_{1} = [\begin{matrix} 1 \\ 0 \\ 1 \end{matrix}] and h_{2} = [\begin{matrix} 1 \\ 1 \\ 0 \end{matrix}] \end{matrix}$
then conditions (44) and (45) are easily fulfilled with $j_{1} = 1$ and $j_{2} = 2$ . Indeed, $h_{11} = 1$ and
$\begin{matrix} h_{22} h_{11} - h_{12} h_{21} = h_{22} h_{11} = 1 . \end{matrix}$
More in general to satisfy (44) and (45) it would be enough to have two eigenvectors
$\begin{matrix} h_{1} = [\begin{matrix} h_{11} \\ h_{12} \\ h_{13} \end{matrix}] and h_{2} = [\begin{matrix} h_{21} \\ h_{22} \\ h_{23} \end{matrix}] \end{matrix}$
with $h_{11} \neq 0$ , $h_{22} \neq 0$ and $h_{12} = 0$ .
(ii)
A matrix with eigenvectors
$\begin{matrix} h_{1} = [\begin{matrix} 1 \\ 0 \\ 1 \end{matrix}] and h_{2} = [\begin{matrix} 1 \\ 1 \\ 0 \end{matrix}] \end{matrix}$
has a special form. Indeed, for $λ_{1}$ and $λ_{2}$ eigenvalues corresponding to $h_{1}$ and $h_{2}$ , respectively, by using the eigenvector equations we obtain
$\begin{matrix} \begin{matrix} a_{13} & = λ_{1} - a_{11}, \\ a_{23} & = - a_{21}, \\ a_{33} & = λ_{1} - a_{31}, \end{matrix} \end{matrix}$
and
$\begin{matrix} \begin{matrix} a_{12} & = λ_{2} - a_{11}, \\ a_{22} & = λ_{2} - a_{21}, \\ a_{32} & = - a_{31} . \end{matrix} \end{matrix}$
Hence
$\begin{matrix} A = [\begin{matrix} a_{11} & λ_{2} - a_{11} & λ_{1} - a_{11} \\ a_{21} & λ_{2} - a_{21} & - a_{21} \\ a_{31} & - a_{31} & λ_{1} - a_{31} \end{matrix}] . \end{matrix}$

Footnotes

Michael Ruzhansky was supported in parts by EPSRC Grant EP/R003025/1 and by the Leverhulme Grant RPG-2017-151. No new data was collected or generated during the course of research.

Inspired by our colleague and friend Todor Gramchev (1956–2015).

Contributor Information

Claudia Garetto, Email: c.garetto@lboro.ac.uk.

Christian Jäh, Email: c.jaeh@lboro.ac.uk.

Michael Ruzhansky, Email: m.ruzhansky@imperial.ac.uk.

References

1.Bernstein DS. Matrix Mathematics—Theory, Facts, and Formulas. 2. Princeton: Princeton University Press; 2009. [Google Scholar]
2.Bronshtein MD. Smoothness of roots of polynomials depending on parameters. Sibirsk. Mat. Zh., 20(3), 493–501, (1979) Sib. Math. J. 1980;20:347–352. doi: 10.1007/BF00969937. [DOI] [Google Scholar]
3.Colombini F, Kinoshita T. On the Gevrey well posedness of the Cauchy problem for weakly hyperbolic equations of higher order. J. Differ. Equ. 2002;186:394–419. doi: 10.1016/S0022-0396(02)00009-8. [DOI] [Google Scholar]
4.Colombini F, Spagnolo S. An example of a weakly hyperbolic Cauchy problem not well posed in $C^{\infty}$ . Acta Math. 1982;148:243–253. doi: 10.1007/BF02392730. [DOI] [Google Scholar]
5.Colombini F, De Giorgi E, Spagnolo S. Sur les équations hyperboliques avec des coefficients qui ne dépendent que du temps. Ann. Sc. Norm. Super. Pisa Cl. Sci. 1979;6:511–559. [Google Scholar]
6.Colombini F, Del Santo D, Fanelli F, Métivier G. Time-dependent loss of derivatives for hyperbolic operators with non regular coefficients. Commun. Partial Differ. Equ. 2013;38(10):1791–1817. doi: 10.1080/03605302.2013.795968. [DOI] [Google Scholar]
7.Colombini F, Del Santo D, Fanelli F, Métivier G. A well-posedness result for hyperbolic operators with Zygmund coefficients. J. Math. Pures Appl. 2013;9(100):455–475. doi: 10.1016/j.matpur.2013.01.009. [DOI] [Google Scholar]
8.Colombini F, Jannelli E, Spagnolo S. Nonuniqueness in hyperbolic Cauchy problems. Ann. Math. 1987;126:495–524. doi: 10.2307/1971359. [DOI] [Google Scholar]
9.Colombini F, Lerner N. Hyperbolic operators with non-Lipschitz coefficients. Duke Math. J. 1995;77(3):657–698. doi: 10.1215/S0012-7094-95-07721-7. [DOI] [Google Scholar]
10.Colombini F, Nishitani T. Second order weakly hyperbolic operators with coefficients sum of powers of functions. Osaka J. Math. 2007;44(1):121–137. [Google Scholar]
11.D’Ancona P, Kinoshita T. On the wellposedness of the Cauchy problem for weakly hyperbolic equations of higher order. Math. Nachr. 2005;278:1147–1162. doi: 10.1002/mana.200310299. [DOI] [Google Scholar]
12.D’Ancona P, Kinoshita T, Spagnolo S. Weakly hyperbolic systems with Hölder continuous coefficients. J. Differ. Equ. 2004;203(1):64–81. doi: 10.1016/j.jde.2004.03.016. [DOI] [Google Scholar]
13.D’Ancona P, Kinoshita T, Spagnolo S. On the 2 by 2 weakly hyperbolic systems. Osaka J. Math. 2008;45(4):921–939. [Google Scholar]
14.Dieci L, Eirola T. On smooth decompositions of matrices. SIAM J. Matrix Anal. Appl. 1999;20(3):800–819. doi: 10.1137/S0895479897330182. [DOI] [Google Scholar]
15.Duistermaat JJ. Fourier Intergal Operators Progress in Mathematics. Boston: Birkhäuser Boston, Inc; 1996. [Google Scholar]
16.Garetto C. On hyperbolic equations and systems with non-regular time dependent coefficients. J. Differ. Equ. 2015;259(11):5846–5874. doi: 10.1016/j.jde.2015.07.011. [DOI] [Google Scholar]
17.Garetto C, Jäh C. Well-posedness of hyperbolic systems with multiplicities and smooth coefficients. Math. Ann. 2017;369(1–2):441–485. doi: 10.1007/s00208-016-1436-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Garetto C, Ruzhansky M. Well-posedness of weakly hyperbolic equations with time dependent coefficients. J. Differ. Equ. 2012;253(5):1317–1340. doi: 10.1016/j.jde.2012.05.001. [DOI] [Google Scholar]
19.Garetto C, Ruzhansky M. Weakly hyperbolic equations with non-analytic coefficients and lower order terms. Math. Ann. 2013;357(2):401–440. doi: 10.1007/s00208-013-0910-9. [DOI] [Google Scholar]
20.Garetto C, Ruzhansky M. A note on weakly hyperbolic equations with analytic principal part. J. Math. Anal. Appl. 2014;412(1):1–14. doi: 10.1016/j.jmaa.2013.09.011. [DOI] [Google Scholar]
21.Garetto C, Ruzhansky M. Hyperbolic second order equations with non-regular time dependent coefficients. Arch. Ration. Mech. Anal. 2015;217(1):113–154. doi: 10.1007/s00205-014-0830-1. [DOI] [Google Scholar]
22.Garetto C, Ruzhansky M. On hyperbolic systems with time dependent Hölder characteristics. Ann. Mat. Pura Appl. 2017;196(1):155–164. doi: 10.1007/s10231-016-0567-6. [DOI] [Google Scholar]
23.Garetto C, Ruzhansky M. On $C^{\infty}$ well-posedness of hyperbolic systems with multiplicities. Ann. Mat. Pura Appl. 2017;196(5):1819–1834. doi: 10.1007/s10231-017-0639-2. [DOI] [Google Scholar]
24.Gingold H. On continuous triangularization of matrix functions. SIAM J. Math. Anal. 1979;10(4):709–720. doi: 10.1137/0510065. [DOI] [Google Scholar]
25.Gingold H, Hsieh P-F. Globally analytic triangularization of a matrix function. Linear Algebra Appl. 1992;169:75–101. doi: 10.1016/0024-3795(92)90172-7. [DOI] [Google Scholar]
26.Gramchev, T., Orrú, N.: Cauchy problem for a class of nondiagonalizable hyperbolic systems. Discret. Contin. Dyn. Syst., 533–542 (2011). 10.3934/proc.2011.2011.533
27.Gramchev T, Ruzhansky M. Cauchy problem for $2 \times 2$ hyperbolic systems of pseudo-differential equations with nondiagonalisable principal part. Studies in phase space analysis with applications to PDEs. Progr. Nonlinear Differ. Equ. Appl. 2013;84:129–144. [Google Scholar]
28.Hörmander L. The Analysis of Linear Partial Differential Operators. Heidelberg: Springer; 1985. [Google Scholar]
29.Hörmander L. Hyperbolic systems with double characteristics. Comm. Pure Appl. Math. 1993;46:261–301. doi: 10.1002/cpa.3160460207. [DOI] [Google Scholar]
30.Ivrii V Ya, Petkov VM. Necessary conditions for the correctness of the Cauchy problem for non-strictly hyperbolic equations. (Russian) Russ. Math. Surv. 1974;29:3–70. doi: 10.1070/RM1974v029n05ABEH001295. [DOI] [Google Scholar]
31.Kajitani K, Yuzawa Y. The Cauchy problem for hyperbolic systems with Hölder continuous coefficients with respect to the time variable. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 2006;5(4):465–482. [Google Scholar]
32.Kamotski I, Ruzhansky M. Estimates and spectral asymptotics for systems with multiplicities. Funct. Anal. Appl. 2005;39:308–310. doi: 10.1007/s10688-005-0052-2. [DOI] [Google Scholar]
33.Kamotski I, Ruzhansky M. Regularity properties, representation of solutions and spectral asymptotics of systems with multiplicities. Comm. Partial Differ. Equ. 2007;32:1–35. doi: 10.1080/03605300600856816. [DOI] [Google Scholar]
34.Kinoshita T, Spagnolo S. Hyperbolic equations with non-analytic coefficients. Math. Ann. 2006;336:551–569. doi: 10.1007/s00208-006-0009-7. [DOI] [Google Scholar]
35.Lax P. Asymptotic solutions of oscillatory initial value problems. Duke Math. J. 1957;24:627–646. doi: 10.1215/S0012-7094-57-02471-7. [DOI] [Google Scholar]
36.Melrose RB, Uhlmann GA. Microlocal structure of involutiveconical refraction. Duke Math. J. 1982;46:571–582. doi: 10.1215/S0012-7094-79-04630-1. [DOI] [Google Scholar]
37.Nishitani T. On the Cauchy problem for $D_{t}^{2} - D_{x} a {(t, x)}^{n} D_{x}$ . Ann. Univ. Ferrara Sez. VII Sci. Mat. 2006;52(2):395–430. doi: 10.1007/s11565-006-0029-y. [DOI] [Google Scholar]
38.Ohya, Y., Tarama, S.: The Cauchy Problem with multiple characteristics in the Gevery class–Hölder coefficients in $t$ . Hyperbolic Equations and Related Topics. Kataka/Kioto, pp. 273–306 (1984)
39.Parenti C, Parmeggiani A. On the Cauchy problem for hyperbolic operators with double characteristics. Commun. Partial Differ. Equ. 2009;34:837–888. doi: 10.1080/03605300902892360. [DOI] [Google Scholar]
40.Parusinski A, Rainer A. Regularity of roots of polynomials. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 2016;16(2):481–517. [Google Scholar]
41.Rozenblum G. Spectral asymptotic behaviour of elliptic systems (Russian) Zap. LOMI. 1980;96:255–271. [Google Scholar]
42.Ruzhansky M. Singularities of affine fibrations in the theory of regularity of Fourier integral operators. Russ. Math. Surv. 2000;55(1):93–161. doi: 10.1070/RM2000v055n01ABEH000250. [DOI] [Google Scholar]
43.Ruzhansky, M.: Regularity theory of Fourier integral operators with complex phases and singularities of affine fibrations. CWI Tract, 131. Stichting Mathematisch Centrum, Centrum voor Wiskunde en Informatica, Amsterdam (2001)
44.Yuzawa Y. The Cauchy problem for hyperbolic systems with Hölder continuous coefficients with respect to time. J. Differ. Equ. 2005;219(2):363–374. doi: 10.1016/j.jde.2004.12.006. [DOI] [Google Scholar]
45.Wasow W. On holomorphically similar matrices. J. Math. Anal. Appl. 1962;4(2):202–206. doi: 10.1016/0022-247X(62)90050-1. [DOI] [Google Scholar]

[CR1] 1.Bernstein DS. Matrix Mathematics—Theory, Facts, and Formulas. 2. Princeton: Princeton University Press; 2009. [Google Scholar]

[CR2] 2.Bronshtein MD. Smoothness of roots of polynomials depending on parameters. Sibirsk. Mat. Zh., 20(3), 493–501, (1979) Sib. Math. J. 1980;20:347–352. doi: 10.1007/BF00969937. [DOI] [Google Scholar]

[CR3] 3.Colombini F, Kinoshita T. On the Gevrey well posedness of the Cauchy problem for weakly hyperbolic equations of higher order. J. Differ. Equ. 2002;186:394–419. doi: 10.1016/S0022-0396(02)00009-8. [DOI] [Google Scholar]

[CR4] 4.Colombini F, Spagnolo S. An example of a weakly hyperbolic Cauchy problem not well posed in $C^{\infty}$ . Acta Math. 1982;148:243–253. doi: 10.1007/BF02392730. [DOI] [Google Scholar]

[CR5] 5.Colombini F, De Giorgi E, Spagnolo S. Sur les équations hyperboliques avec des coefficients qui ne dépendent que du temps. Ann. Sc. Norm. Super. Pisa Cl. Sci. 1979;6:511–559. [Google Scholar]

[CR6] 6.Colombini F, Del Santo D, Fanelli F, Métivier G. Time-dependent loss of derivatives for hyperbolic operators with non regular coefficients. Commun. Partial Differ. Equ. 2013;38(10):1791–1817. doi: 10.1080/03605302.2013.795968. [DOI] [Google Scholar]

[CR7] 7.Colombini F, Del Santo D, Fanelli F, Métivier G. A well-posedness result for hyperbolic operators with Zygmund coefficients. J. Math. Pures Appl. 2013;9(100):455–475. doi: 10.1016/j.matpur.2013.01.009. [DOI] [Google Scholar]

[CR8] 8.Colombini F, Jannelli E, Spagnolo S. Nonuniqueness in hyperbolic Cauchy problems. Ann. Math. 1987;126:495–524. doi: 10.2307/1971359. [DOI] [Google Scholar]

[CR9] 9.Colombini F, Lerner N. Hyperbolic operators with non-Lipschitz coefficients. Duke Math. J. 1995;77(3):657–698. doi: 10.1215/S0012-7094-95-07721-7. [DOI] [Google Scholar]

[CR10] 10.Colombini F, Nishitani T. Second order weakly hyperbolic operators with coefficients sum of powers of functions. Osaka J. Math. 2007;44(1):121–137. [Google Scholar]

[CR11] 11.D’Ancona P, Kinoshita T. On the wellposedness of the Cauchy problem for weakly hyperbolic equations of higher order. Math. Nachr. 2005;278:1147–1162. doi: 10.1002/mana.200310299. [DOI] [Google Scholar]

[CR12] 12.D’Ancona P, Kinoshita T, Spagnolo S. Weakly hyperbolic systems with Hölder continuous coefficients. J. Differ. Equ. 2004;203(1):64–81. doi: 10.1016/j.jde.2004.03.016. [DOI] [Google Scholar]

[CR13] 13.D’Ancona P, Kinoshita T, Spagnolo S. On the 2 by 2 weakly hyperbolic systems. Osaka J. Math. 2008;45(4):921–939. [Google Scholar]

[CR14] 14.Dieci L, Eirola T. On smooth decompositions of matrices. SIAM J. Matrix Anal. Appl. 1999;20(3):800–819. doi: 10.1137/S0895479897330182. [DOI] [Google Scholar]

[CR15] 15.Duistermaat JJ. Fourier Intergal Operators Progress in Mathematics. Boston: Birkhäuser Boston, Inc; 1996. [Google Scholar]

[CR16] 16.Garetto C. On hyperbolic equations and systems with non-regular time dependent coefficients. J. Differ. Equ. 2015;259(11):5846–5874. doi: 10.1016/j.jde.2015.07.011. [DOI] [Google Scholar]

[CR17] 17.Garetto C, Jäh C. Well-posedness of hyperbolic systems with multiplicities and smooth coefficients. Math. Ann. 2017;369(1–2):441–485. doi: 10.1007/s00208-016-1436-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Garetto C, Ruzhansky M. Well-posedness of weakly hyperbolic equations with time dependent coefficients. J. Differ. Equ. 2012;253(5):1317–1340. doi: 10.1016/j.jde.2012.05.001. [DOI] [Google Scholar]

[CR19] 19.Garetto C, Ruzhansky M. Weakly hyperbolic equations with non-analytic coefficients and lower order terms. Math. Ann. 2013;357(2):401–440. doi: 10.1007/s00208-013-0910-9. [DOI] [Google Scholar]

[CR20] 20.Garetto C, Ruzhansky M. A note on weakly hyperbolic equations with analytic principal part. J. Math. Anal. Appl. 2014;412(1):1–14. doi: 10.1016/j.jmaa.2013.09.011. [DOI] [Google Scholar]

[CR21] 21.Garetto C, Ruzhansky M. Hyperbolic second order equations with non-regular time dependent coefficients. Arch. Ration. Mech. Anal. 2015;217(1):113–154. doi: 10.1007/s00205-014-0830-1. [DOI] [Google Scholar]

[CR22] 22.Garetto C, Ruzhansky M. On hyperbolic systems with time dependent Hölder characteristics. Ann. Mat. Pura Appl. 2017;196(1):155–164. doi: 10.1007/s10231-016-0567-6. [DOI] [Google Scholar]

[CR23] 23.Garetto C, Ruzhansky M. On $C^{\infty}$ well-posedness of hyperbolic systems with multiplicities. Ann. Mat. Pura Appl. 2017;196(5):1819–1834. doi: 10.1007/s10231-017-0639-2. [DOI] [Google Scholar]

[CR24] 24.Gingold H. On continuous triangularization of matrix functions. SIAM J. Math. Anal. 1979;10(4):709–720. doi: 10.1137/0510065. [DOI] [Google Scholar]

[CR25] 25.Gingold H, Hsieh P-F. Globally analytic triangularization of a matrix function. Linear Algebra Appl. 1992;169:75–101. doi: 10.1016/0024-3795(92)90172-7. [DOI] [Google Scholar]

[CR26] 26.Gramchev, T., Orrú, N.: Cauchy problem for a class of nondiagonalizable hyperbolic systems. Discret. Contin. Dyn. Syst., 533–542 (2011). 10.3934/proc.2011.2011.533

[CR27] 27.Gramchev T, Ruzhansky M. Cauchy problem for $2 \times 2$ hyperbolic systems of pseudo-differential equations with nondiagonalisable principal part. Studies in phase space analysis with applications to PDEs. Progr. Nonlinear Differ. Equ. Appl. 2013;84:129–144. [Google Scholar]

[CR28] 28.Hörmander L. The Analysis of Linear Partial Differential Operators. Heidelberg: Springer; 1985. [Google Scholar]

[CR29] 29.Hörmander L. Hyperbolic systems with double characteristics. Comm. Pure Appl. Math. 1993;46:261–301. doi: 10.1002/cpa.3160460207. [DOI] [Google Scholar]

[CR30] 30.Ivrii V Ya, Petkov VM. Necessary conditions for the correctness of the Cauchy problem for non-strictly hyperbolic equations. (Russian) Russ. Math. Surv. 1974;29:3–70. doi: 10.1070/RM1974v029n05ABEH001295. [DOI] [Google Scholar]

[CR31] 31.Kajitani K, Yuzawa Y. The Cauchy problem for hyperbolic systems with Hölder continuous coefficients with respect to the time variable. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 2006;5(4):465–482. [Google Scholar]

[CR32] 32.Kamotski I, Ruzhansky M. Estimates and spectral asymptotics for systems with multiplicities. Funct. Anal. Appl. 2005;39:308–310. doi: 10.1007/s10688-005-0052-2. [DOI] [Google Scholar]

[CR33] 33.Kamotski I, Ruzhansky M. Regularity properties, representation of solutions and spectral asymptotics of systems with multiplicities. Comm. Partial Differ. Equ. 2007;32:1–35. doi: 10.1080/03605300600856816. [DOI] [Google Scholar]

[CR34] 34.Kinoshita T, Spagnolo S. Hyperbolic equations with non-analytic coefficients. Math. Ann. 2006;336:551–569. doi: 10.1007/s00208-006-0009-7. [DOI] [Google Scholar]

[CR35] 35.Lax P. Asymptotic solutions of oscillatory initial value problems. Duke Math. J. 1957;24:627–646. doi: 10.1215/S0012-7094-57-02471-7. [DOI] [Google Scholar]

[CR36] 36.Melrose RB, Uhlmann GA. Microlocal structure of involutiveconical refraction. Duke Math. J. 1982;46:571–582. doi: 10.1215/S0012-7094-79-04630-1. [DOI] [Google Scholar]

[CR37] 37.Nishitani T. On the Cauchy problem for $D_{t}^{2} - D_{x} a {(t, x)}^{n} D_{x}$ . Ann. Univ. Ferrara Sez. VII Sci. Mat. 2006;52(2):395–430. doi: 10.1007/s11565-006-0029-y. [DOI] [Google Scholar]

[CR38] 38.Ohya, Y., Tarama, S.: The Cauchy Problem with multiple characteristics in the Gevery class–Hölder coefficients in $t$ . Hyperbolic Equations and Related Topics. Kataka/Kioto, pp. 273–306 (1984)

[CR39] 39.Parenti C, Parmeggiani A. On the Cauchy problem for hyperbolic operators with double characteristics. Commun. Partial Differ. Equ. 2009;34:837–888. doi: 10.1080/03605300902892360. [DOI] [Google Scholar]

[CR40] 40.Parusinski A, Rainer A. Regularity of roots of polynomials. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 2016;16(2):481–517. [Google Scholar]

[CR41] 41.Rozenblum G. Spectral asymptotic behaviour of elliptic systems (Russian) Zap. LOMI. 1980;96:255–271. [Google Scholar]

[CR42] 42.Ruzhansky M. Singularities of affine fibrations in the theory of regularity of Fourier integral operators. Russ. Math. Surv. 2000;55(1):93–161. doi: 10.1070/RM2000v055n01ABEH000250. [DOI] [Google Scholar]

[CR43] 43.Ruzhansky, M.: Regularity theory of Fourier integral operators with complex phases and singularities of affine fibrations. CWI Tract, 131. Stichting Mathematisch Centrum, Centrum voor Wiskunde en Informatica, Amsterdam (2001)

[CR44] 44.Yuzawa Y. The Cauchy problem for hyperbolic systems with Hölder continuous coefficients with respect to time. J. Differ. Equ. 2005;219(2):363–374. doi: 10.1016/j.jde.2004.12.006. [DOI] [Google Scholar]

[CR45] 45.Wasow W. On holomorphically similar matrices. J. Math. Anal. Appl. 1962;4(2):202–206. doi: 10.1016/0022-247X(62)90050-1. [DOI] [Google Scholar]

PERMALINK

Hyperbolic systems with non-diagonalisable principal part and variable multiplicities, I: well-posedness

Claudia Garetto

Christian Jäh

Michael Ruzhansky

Abstract

Introduction

Theorem A

Theorem B

Theorem 1

Remark 1

Theorem C

Theorem 2

Well-posedness in anisotropic Sobolev spaces

Auxiliary remarks

Lemma 1

The case m=2

Remark 2

Remark 3

Theorem 3

Remark 4

The case m=3

The general case

Theorem 1

Proof

Schur decomposition of m×m matrices

First step or Schur step

Proposition 1

Proof

The triangularisation procedure

Remark 5

Theorem 6

Remark 6

Remark 7

Remark 8

Remark 9

The case m=2

Theorem 7

Proof

Example

The case m=3

Theorem 8

Examples

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

The case $m = 2$

The case $m = 3$

Schur decomposition of $m \times m$ matrices

The case $m = 2$

The case $m = 3$