Soft Randomized Machine Learning Procedure for Modeling Dynamic Interaction of Regional Systems

Yuri S Popkov

doi:10.3390/e21040424

. 2019 Apr 20;21(4):424. doi: 10.3390/e21040424

Soft Randomized Machine Learning Procedure for Modeling Dynamic Interaction of Regional Systems

Yuri S Popkov ^1,^2,^3,⁴

PMCID: PMC7514913 PMID: 33267138

Abstract

The paper suggests a randomized model for dynamic migratory interaction of regional systems. The locally stationary states of migration flows in the basic and immigration systems are described by corresponding entropy operators. A soft randomization procedure that defines the optimal probability density functions of system parameters and measurement noises is developed. The advantages of soft randomization with approximate empirical data balance conditions are demonstrated, which considerably reduces algorithmic complexity and computational resources demand. An example of migratory interaction modeling and testing is given.

Keywords: soft randomization, entropy, entropy operator, migration, immigration, empirical balance, empirical risk

1. Introduction

The mutual influence of migratory processes in regional systems is a problem of growing significance in the modern world. The socioeconomic statuses of different regions demonstrate higher heterogeneity in response to rising political and military tension. All these factors cause an abrupt redistribution of migration flows and regional population variations, thereby increasing the cost of regional population maintenance [1,2,3,4]. Therefore, it is important to develop different tools (mathematical models, algorithms, and software) for forecasting the distribution of migration flows with adaptation to their dynamics considering available resources.

The authors of [5] suggested a dynamic entropy model for the migratory interaction of regional systems. In comparison with biological reproduction, migration mobility is a rather fast process [1,6]. Thus, the short-term dynamics of regional population size are described by the locally stationary state of a migratory process [7]. The latter can be simulated under the hypothesis that all migrants have a random and independent spatial distribution over interacting regional systems with given prior probabilities. The mathematical model of a locally stationary state is given by a corresponding entropy operator that maps the space of admissible resources into the space of migratory processes [8].

Mathematical modeling and analysis of interregional migration is considered in numerous publications. First, it seems appropriate to mention the monographs [9,10] that are dedicated to a wide range of interregional migration problems, including mathematical modeling of migration flows. Note that the problem of migration touches upon many aspects of socioeconomic, psychological and political status of the space of migratory movements. Thus, of crucial role is the structural analysis of inter- and intraregional migration flows [4] and motivations that generate them [2,11]. The results of structural and motivational analysis of migratory processes are used for computer simulation. There exist three directions of research in this field, each relying on some system of hypotheses. One of the directions involves the stochastic hypothesis about the origin of migratory motivations [12], which is simulated using agent technologies [13,14]. This direction is adjoined by investigations based on the thermodynamic model of migration flows [3,8]. Of course, the short list above does not exhaust the whole variety of migration studies, merely outlining some topics of research.

This paper studies a stochastic version of the model in [5], in which random parameters and measurement noises are characterized by probability density functions (PDFs). These functions are estimated using retrospective information on the real dynamics of regional population size with “soft” randomized machine learning [15]. The learned model was implemented in the form of computer simulations, i.e., generation of an ensemble of random trajectories with the entropy-optimal PDFs of the model parameters and measurement noises. The resulting ensemble was used for testing of the model and also for short-term forecasting.

The method developed below is illustrated by an example of the randomized modeling and forecasting of the migratory interaction among three EU countries (Germany, France, and Italy—the system $GFI$ ) and two countries as sources of immigration (Syria and Libya—the system $SL$ ).

2. Randomized Model of Migratory Interaction

Consider the dynamic discrete-time model of migratory interaction with shared resource constraints that is presented in [5]. The first sub-model represents migration flows within the system $GFI$ and is described by the dynamic regression equation

K [(s + 1) h] = (A - E) K [s h] + F (z [s h]), (K, F) \in R^{N}, s = \bar{0, K - 1},

(1)

where

A = h (\begin{matrix} 1 & α_{2} a_{21} & \dots & α_{N} a_{N 1} \\ α_{1} a_{12} & 1 & \dots & α_{N} a_{N 2} \\ \dots & \dots & \dots & \dots \\ α_{1} a_{1 N} & α_{2} a_{2 N} & \dots & 1 \end{matrix}),

(2)

E = h diag [α_{n}, n = \bar{1, N}] .

(3)

In these equations, $K [s h]$ denotes the population distribution in the regional system $GFI$ at a time $s h$ .

At a time $s h$ , the distribution of immigration flows from the regional system $SL$ to the regional system $GFI$ in terms of an entropy operator is modeled by the second sub-model, which can be described by a vector function $F (z [s h])$ with the components

f_{n} [s h] = h \sum_{j = 1}^{M} b_{j n} {(z [s h])}^{c_{j n}}, n = \bar{1, N}, s = \bar{0, K - 1},

(4)

The variable z, which is the exponential Lagrange multiplier in the entropy-optimal distribution problem of immigration flows, satisfies the equation

\sum_{k = 1}^{M} \sum_{n = 1}^{N} c_{k n} b_{k n} {(z [s h])}^{c_{k n}} = T [s h],

(5)

where $T [s h]$ is the amount of a shared resource used by all regions from the system $GFI$ to maintain immigrants.

In this model, the input data are the amounts $T [0], T [h], \dots, T [(K - 1) h]$ ; and the output data are the regional population distributions $K [0], K [h], \dots, K [(K - 1) h]$ .

The dynamic model in Equations (1)–(5) contains the following parameters:

$α_{n} \in [0, 1], n = \bar{1, N},$ as the shares of mobile population in system regions;
$a_{i n} \in [0, 1], (i, n) = \bar{1, N},$ as the prior probabilities of individual migration in the system $GFI$ ;
$b_{k n}, k = \bar{1, M}, n = \bar{1, N},$ as the prior probabilities of individual immigration from region k of the system $SL$ to region n of the system $GFI$ ; and
$c_{k n}, k = \bar{1, M}, n = \bar{1, N},$ as the normalized 1 specific generalized cost of immigration maintenance.

Normalization means that $0 < c_{k n} < 1, k = \bar{1, M}, n = \bar{1, N}$ .

The parameters form three groups: mobility, migratory movements within the system $GFI$ , and immigratory movements from the system $SL$ to the system $GFI$ . All these characteristics are specified by the regions of both systems. The dimensionality of the parametric space is reduced using the same approach as in [5]. The whole essence is to assign a relative regional differentiation of all parameters except for the weights $b_{1}$ (mobility) and $b_{2}$ (internal migration) of these groups, which are considered as model variables.

This approach leads to the parametric transformation

\begin{matrix} α_{n} = b_{1} m_{n}, a_{i n} = b_{2} h_{i n}, \\ (i, n) = \bar{1, N}; k = \bar{1, M}, \end{matrix}

(6)

where $m_{n}$ and $h_{i n}$ are given parameters which characterize the relation of variables.

Then, the dynamic model of migratory interaction in Equations (1)–(5) takes the form

K [(s + 1) h] = (b_{1} b_{2} \tilde{A} - b_{1} \tilde{E}) K [s h] + \tilde{F} (z [s h]),

(7)

with the matrix

\tilde{A} = h (\begin{matrix} 1 & m_{2} h_{21} & \dots & m_{N} h_{N 1} \\ m_{1} h_{12} & 1 & \dots & m_{N} h_{N 2} \\ \dots & \dots & \dots & \dots \\ m_{1} h_{1 N} & m_{2} h_{2 N} & \dots & 1 \end{matrix})

(8)

and the diagonal matrix

\tilde{E} = h diag [m_{n}, n = \bar{1, N}] .

(9)

The vector $\tilde{F} (μ, z) [s h]$ consists of the components

{\tilde{f}}_{n} (z [s h]) = h \sum_{k = 1}^{M} q_{k n} {(z [s h])}^{c_{k n}}, n = \bar{1, N}, s = \bar{0, K - 1} .

(10)

For each time $s h$ , the variable z satisfies the equation

\sum_{k = 1}^{M} \sum_{n = 1}^{N} c_{k n} q_{k n} {(z [s h])}^{c_{k n}} = T [s h], s = \bar{0, K - 1},

(11)

i.e., there exist K values $z = z^{*} [s h], s = \bar{0, K - 1}$ .

The randomized version of this model is described by Equations (7)–(11) but some parameters (variables) have random character. These are two randomized parameters, $b_{1}$ and $b_{2}$ , as well as the variable $z = b_{3}$ , all of the interval type. More specifically, the parameters $b_{1}$ and $b_{2}$ belong to the intervals

B_{1} = [b_{1}^{-}, b_{1}^{+}], B_{2} = [b_{2}^{-}, b_{2}^{+}] .

(12)

The interval $B_{3}$ of the variable $b_{3}$ is given by Equation (11).

Theorem 1.

Let the parameters $b_{k n}$ and $c_{k n}$ in Equation (11) be positive and $c_{k n} \in [0, 1] .$ Then, the solution $b_{3}^{*}$ of this equation belongs to the interval

$B_{3} = [b_{3}^{-}, b_{3}^{+}],$ (13)

where

$\begin{matrix} b_{3}^{-} & = & {(\frac{T [s h]}{M N c_{m a x} b_{m a x}})}^{1 / c_{m a x}}; b_{3}^{+} = {(\frac{T [s h]}{M N c_{m i n} b_{m i n}})}^{1 / c_{m i n}}; \\ c_{m i n} & = & min_{k n} c_{k n}, c_{m a x} = max_{k n} c_{k n}; b_{m i n} = min_{k n} b_{k n}, b_{m a x} = max_{k n} b_{k n} . \end{matrix}$ (14)

The proof is postponed to the Appendix A.

Therefore, the randomized dynamic model in Equations (7)–(11) includes three random parameters $b = {b_{1}, b_{2}, b_{3}}$ of the interval type that are defined over the three-dimensional cube with faces (Equations (12) and (13)), i.e.,

B = ⨂_{j = 1}^{3} B_{j} .

(15)

The probabilistic properties of the randomized parameters are described by a continuously differentiable PDF $W (b)$ .

By assumption, real distributions of regional population sizes contain errors that are simulated by a random vector $\bar{ξ} [s h] \in R^{N}$ with the interval components

\bar{ξ} [s h] \in Ξ_{s} = [{\bar{ξ}}^{-} [s h], {\bar{ξ}}^{+} [s h]] .

(16)

The probabilistic properties of this vector are described by a continuously differentiable PDF $Q (\bar{ξ})$ .

The measured output of the randomized model has an additive noise,

v [s h] = K [s h] + \bar{ξ} [s h] .

(17)

3. Characterization of Empirical Risk and Measurement Noises

Construct a synthetic functional $J [W (b), Q (\bar{ξ})]$ that depends on the PDFs of the model parameters and measurement noises for assessing in quantitative terms the empirical risk (the difference between the regional population distribution generated by the model in Equations (7)–(11) and the real counterpart) and the guaranteed power of these noises. The functional must have components characterizing an intrinsic uncertainty of randomized machine learning (RML) procedures, the approximation quality of empirical balances (the empirical risk) and the worst properties of the corresponding random interval-type noises.

1. Uncertainty. In accordance with the general concept of RML, the first component among the listed ones is an entropy functional that describes the level of uncertainty:

H [b), Q (\bar{ξ})] = - \int_{B} W (b) ln W (b) d b - \int_{Ξ} Q (\bar{ξ}) ln Q (\bar{ξ}) d \bar{ξ} .

(18)

The two other functional components are constructed using Hölder’s vector and matrix norms (The vector norm has the form ${∥ a ∥}_{\infty} = {max}_{n} | a_{n} |$ ; the matrix norm, the form ${∥ A ∥}_{\infty} = {max}_{i j} | a_{i j} |$ .) [16].

2. Approximate empirical balances. First, consider a characterization of the empirical risk. For the model in Equations (7)–(11), the deviation between the output and real data vectors is given by

\bar{ε} [s h] = (b_{1} b_{2} \tilde{A} - b_{1} \tilde{E}) Y [s h] + F (b_{3} [s h]) - Y [s h], s = \bar{0, K - 1} .

(19)

Using well-known inequalities for the matrix and vector norms, it is possible to write

\begin{matrix} ∥ \bar{ϵ} {[s h] ∥}_{\infty} & \leq & ∥ (b_{1} b_{2} \tilde{A} - b_{1} \tilde{E}) ∥_{\infty} {∥ Y [s h] ∥}_{\infty} + ∥ F (b_{3} [s h]) ∥_{\infty} + {∥ Y [(s + 1) h] ∥}_{\infty} = \\ = & φ (b_{1}, b_{2}, b_{3}, s), s = \bar{0, K - 1} . \end{matrix}

(20)

Introducing the average matrix and vector norms over the observation interval,

\begin{matrix} φ (b_{1}, b_{2}, b_{3}) \leq h (\frac{1}{K} \sum_{s = 0}^{K - 1} max_{n} y_{n} [s h]) (b_{1} max_{n} m_{n} + b_{1} b_{2} max_{i, j} h_{i j}) + \\ + & \frac{1}{K} \sum_{s = 0}^{K - 1} max_{n} y_{n} [(s + 1) h] + M N c_{m a x} b_{m a x} {(b_{3})}^{c_{m a x}} . \end{matrix}

(21)

The parameters $b_{1}$ and $b_{2}$ take values within the intervals $B_{1}$ and $B_{2}$ (Equation (12)) while the parameter $b_{3}$ within the interval

B_{3} = [{(\frac{T_{m a x}}{M N c_{m a x} q_{m a x}})}^{1 / c_{m a x}}, {(\frac{T_{m a x}}{M N c_{m i n} q_{m i n}})}^{1 / c_{m i n}}],

(22)

where

T_{m a x} = max_{s} T [s h] .

(23)

Denote

\begin{matrix} U_{1} & = & h (\frac{1}{K} \sum_{s = 0}^{K - 1} max_{n} y_{n} [s h]) max_{n} m_{n}; U_{2} = h (\frac{1}{K} \sum_{s = 0}^{K - 1} max_{n} y_{n} [s h]) max_{i, j} h_{i j}; \\ U_{3} & = & M N h c_{m a x} b_{m a x}; U_{4} = \frac{1}{K} \sum_{s = 0}^{K - 1} max_{n} y_{n} [(s + 1) h] . \end{matrix}

(24)

Then, the function $φ (b_{1}, b_{2}, b_{3})$ takes the form

φ (b_{1}, b_{2}, b_{3}) = b_{1} U_{1} + b_{1} b_{2} U_{2} + {(b_{3})}^{c_{m a x}} U_{3} + U_{4} .

(25)

Note that the coefficients $U_{1}, \dots, U_{4}$ are determined by real data on regional population distributions and also by the characteristics of internal migration within the system $GFI$ and immigration flows from the system $SL$ .

The equality in Equation (25) defines a function $φ (b_{1}, b_{2}, b_{3})$ of random variables. Let its expectation be the characteristic of the empirical risk, i.e.,

r [W (b)] = \int_{B} W (b) φ (b) d b,

(26)

where $B = B_{1} \otimes B_{2} \otimes B_{3}$ and the intervals $B_{1}$ and $B_{2}$ have given limits. At the same time, the limits of the interval $B_{3}$ are specified by the equalities in Equation (22).

Power of noises. The measurement noises are simulated by random vectors $\bar{ξ} [s h] \in R^{N}, s = \bar{0, K - 1}$ . The components of these vectors may have different domains (ranges of values) at different times $s = \bar{0, K - 1}$ . For each time, introduce the Euclidean norm $∥ \bar{ξ} {[s h] ∥}_{N}^{2}$ and its expectation

n_{s} [Q (\bar{ξ} [s h])] = \int_{Ξ} Q (\bar{ξ} [s h]) {∥ \bar{ξ} [s h] ∥}_{N}^{2} d \bar{ξ} [s h] .

(27)

The average expectation of this norm over the time interval has the form

{\bar{n}}_{s} [Q (\bar{ξ} [s h])] = \frac{1}{K} \sum_{s = 0}^{K - 1} n_{s} [Q (\bar{ξ} [s h])] .

(28)

If the measurement noises are the same on the observation interval, then the noise power functional can be written as

{\bar{n}}_{s} [Q (\bar{ξ} [s h])] = n [Q (\bar{ξ})] = \int_{Ξ} Q (\bar{ξ}) {∥ \bar{ξ} ∥}_{N}^{2} d \bar{ξ} .

(29)

This formula involves the Euclidean norm for a quantitative characterization of the noise power. However, it is possible to choose other norms depending on problem specifics.

4. Soft Randomized Estimation of Model Parameters

The model characteristics and measurement noises are estimated using a learning data collection: the real cost of immigrants maintenance $T [0], \dots, T [(K - 1) h]$ (input data) and the real distributions of regional population sizes $Y [0], \dots, Y [(K - 1) h]$ (output data).

In accordance with the general procedure of soft randomized machine learning [15], the optimal probability density functions $W (b)$ (model parameters) and $Q (\bar{ξ})$ (measurement noises) are calculated by the constrained minimization of the synthetic functional $J [W (b), Q (\bar{ξ})]$ that contains the following functionals:

the entropy
$H [W (b)] = - \int_{B} W (b) ln W (b) d b - \int_{Ξ} Q (\bar{ξ}) ln Q (\bar{ξ}) d \bar{ξ};$ (30)
the average empirical risk over the observation interval
$r [W (b)] = \int_{B} W (b) (b_{1} U_{1} + b_{1} b_{2} U_{2} + {(b_{3})}^{c_{m a x}} U_{3} + U_{4}) d b;$ (31)
and
the average error norm
$n [Q (\bar{ξ})] = \int_{Ξ} Q (\bar{ξ}) \sum_{i = 1}^{N} ξ_{i}^{2} d \bar{ξ} .$ (32)

The soft randomized learning algorithm has the form

\begin{matrix} J [W (b), Q (\bar{ξ})] = H [W (b)] - r [W (b)] - n [Q (\bar{ξ})] \Rightarrow max, \\ \int_{B} W (b) d b = 1, \int_{Ξ} Q (\bar{ξ}) d \bar{ξ} = 1 . \end{matrix}

(33)

The solution of this problem is the optimal PDFs under maximal uncertainty, for the model parameters of the form

W^{*} (b) = \frac{exp (b_{1} U_{1} - b_{1} b_{2} U_{2} - {(b_{3})}^{c_{m a x}} U_{3} - U_{4})}{P},

(34)

where

P = \int_{B} exp (b_{1} U_{1} - b_{1} b_{2} U_{2} - {(b_{3})}^{c_{m a x}} U_{3} - U_{4}) d b,

(35)

and for the measurement noises of the form

Q^{*} (\bar{ξ}) = \frac{exp (- \sum_{i = 1}^{N} ξ_{i}^{2})}{Q},

(36)

where

Q = \int_{Ξ} exp (- \sum_{i = 1}^{N} ξ_{i}^{2}) d \bar{ξ} .

(37)

In the case of soft randomization, there is no need for solving the empirical balance equations, which have high complexity and computational intensiveness due to the presence of integral components. Here, computational resources are required for the normalization procedure of the resulting PDFs. On the other hand, the morphology of the optimal PDFs depends on a specific choice of the approximate data balancing criterion and a numerical characterization of the measurement noises.

5. Randomized Forecasting of Dynamic Migratory Interaction

Consider randomized forecasting of dynamic migratory interaction using the principle of soft randomization. Let $T_{p r} = [s_{0} h, s_{p r} h]$ be the forecasting interval and assume the initial state (the regional population distribution at the initial time $s_{0} h$ ) coincides with the real distribution, i.e., $K [s_{0} h] = Y [s_{0} h]$ . The shared cost of the system $GFI$ to maintain immigrants is distributed in accordance with a given scenario. For each scenario, the value $T_{m a x}$ and also the interval $B_{3}$ in Equations (12), (22), and (23) are determined.

The forecasted trajectories are constructed using the randomized model in Equations (7), (10), and (11)

\begin{matrix} K [(s + 1) h] & = & (b_{1} b_{2} \tilde{A} - b_{1} \tilde{E}) K [s h] + F [s h | b_{3}], \\ F [s h | b_{3}] & = & {\sum_{k = 1}^{M} b_{k n} {(b_{3})}^{c_{k n}}, n = \bar{1, N}}, \\ s & = & \bar{s_{0}, s_{p r}}, K [s_{0} h] = Y [s_{0} h] . \end{matrix}

(38)

The randomized parameters $b_{1}, b_{2},$ and $b_{3}$ take values within the corresponding intervals with the probability density function $W^{*} (b)$ (Equation (34)).

An ensemble of the forecasted trajectories for the model’s output is obtained taking into account a random vector $\bar{ξ} \in Ξ$ with the PDF $Q^{*} (\bar{ξ})$ (Equation (36)):

v [s h] = K [s h] + \bar{ξ}, s = \bar{s_{0}, s_{p r}} .

(39)

For each scenario $T [s_{0} h], \dots, T [s_{p r} h],$ an ensemble $K$ of random forecasting trajectories is generated via sampling (the transformation of a PDF into a corresponding sequence of random vectors of length I) of the optimal PDFs of the model parameters and measurement noises for each time $s h$ . The resulting ensemble allows deriving empirical estimates of different numerical characteristics as follows:

the average trajectory
$\bar{K} [s h] = \frac{1}{I} \sum_{i = 1}^{I} K^{(i)} [s h], s = \bar{s_{0}, s_{p r}};$ (40)
the variance trajectory
${\bar{σ}}^{2} [s h] = \frac{1}{I - 1} \sum_{i = 1}^{I} {∥ K^{(i)} [s h] - \bar{K} [s h] ∥}^{2}, s = \bar{s_{0}, s_{p r}};$ (41)
the variance pipe, i.e., the set of random trajectories that almost surely (since an ensemble consists of a finite number of trajectories, the matter concerns not probability but its empirical estimate) belong to the domain
$D = {K [s h] : \bar{K} [s h] - {\bar{σ}}^{2} [s h] \leq K [s h] \leq \bar{K} [s h] + {\bar{σ}}^{2} [s h], s = \bar{s_{0}, s_{p r}}};$ (42)
the empirical probability distribution and its dynamics on the forecasting interval
$P (K [s h] \leq Δ, s = \bar{s_{0}, s_{p r}}) = \frac{I_{Δ}}{I},$ (43)
where $I_{Δ}$ denotes the number of vectors $K [s h]$ whose components are smaller than $Δ$ ; and
the median trajectory $\hat{K} [s h], s = \bar{s_{0}, s_{p r}}$ , which satisfies the equation
$P (K [s h]) = 0, 5; s = \bar{s_{0}, s_{p r}} .$ (44)

The ensemble $K$ can be used to calculate other characteristics, e.g., $α$ -quantiles, confidence probabilities, etc.

6. Example

The appearance of territories with low economic status always causes the growth of immigration. The early 2000s were remarkable for the formation of several such territories in Northern and Central Africa, the Near East, Afghanistan, etc. As a result, tens of millions of migrants moved to the EU as the level of life in these territories dropped below the subsistence minimum. The EU countries have to allocate considerable financial resources for their filtering and accommodation, which are often unacceptable. An example below illustrates the use of soft randomization for estimating and forecasting of immigration flows from Syria (1) and Libya (2) (the system $SL$ ) to Germany (1), France (2), and Italy (3) (the system $GFI$ ).

1. Randomized model, parameters, measurement errors, time intervals, and real data collections. Choose the randomized mathematical model (Equation (25)) with the normalized variables

p_{n} [s h] = \frac{K_{n} [s h]}{K_{m a x}}, n = \bar{1, 3} .

(45)

This gives

\begin{matrix} p_{n} [(s + 1) h] & = & (1 - b_{1} m_{n}) p_{n} [s h] + h b_{1} b_{2} \sum_{i = 1, i \neq n}^{3} m_{i} h_{i n} p_{i} [s h] + h f_{n} [s h], \\ f_{n} [s h] & = & \sum_{i = 1}^{M} b_{i n} b_{3}^{c_{i n}}, n = \bar{1, 3}, \\ T [s h] & = & \sum_{n = 1}^{3} \sum_{i = 1}^{2} c_{i n} b_{i n} b_{3}^{c_{i n}} . \end{matrix}

(46)

The state variables of the system $GFI$ and also the immigration flows from the system $SL$ are normalized, i.e.,

0 \leq p_{n} [s h] \leq 1, 0 \leq f_{n} [s h] \leq 1, n = \bar{1, N} .

(47)

The variable $z^{*}$ characterizes the entropy operator of the immigration process and satisfies the last equation in Equation (46). The values of the parameters $m_{i}, h_{i n}, b_{i n},$ and $c_{i n}$ are combined in Table 1, where columns are different values of corresponding parameter. Recall that the two lowest rows of Table 1 indicate the values of the parameters $c_{i n}$ . By assumption, the regions of both systems have the same specific cost.

Table 1.

Values of relative parameters.

$m_{n}$	0.43	0.50	0.40
$h_{1 n}$	0	0.3	0.3
$h_{2 n}$	0.3	0	0.3
$h_{3 n}$	0.5	0.4	0
$b_{1 n}$	0.4	0.3	0.3
$b_{2 n}$	0.3	0.1	0.4
$c_{1 n}$	0.4	0.4	0.3
$c_{2 n}$	0.4	0.4	0.3

Open in a new tab

In accordance with this table, $m_{m a x} = 0.5, h_{m a x} = 0.5, b_{m i n} = 0.3, b_{m a x} = 0.4,$ and $c_{m a x} = c_{m i n} = c = 0.5$ . The measurement errors of population sizes $\bar{ξ} [s h] \in R^{3}$ (in normalized units) belong to the intervals

\bar{ξ} [s h] \in Ξ = [{\bar{ξ}}_{-}, {\bar{ξ}}_{+}], ξ_{n}^{\pm} = 0.01,

(48)

and by assumption they have the same limits for times $s h$ .

The normalized observation (model output) has the form

v [s h] = p [s h] + \bar{ξ} [s h] .

(49)

The random parameter model in Equation (46) was employed for estimating parameter characteristics and testing on corresponding time intervals with step $h = 1$ year:

$T_{e s t} = 2009$ –2013 as the estimation interval; and
$T_{t s t} = 2014$ –2018 as the testing interval.

2. Entropy estimation of PDFs of model parameters and measurement noises (interval $T_{e s t}$ ). This problem was solved using available data on regional population distribution for Germany ( $n = 1$ ), France ( $n = 2$ ), and Italy ( $n = 3$ ) and also on the shared cost of immigrants maintenance on the estimation interval (see Table 2 and UNdata service at https://data.un.org/).

Table 2.

Input and output data collections.

Year	2009	2010	2011	2012	2013
s	0	1	2	3	4
$Y_{1} [s]$	81.90	81.77	80.27	80.42	80.64
$y_{1} [s]$	1.00	0.998	0.980	0.982	0.985
$Y_{2} [s]$	62.47	62.80	63.11	63.41	63.70
$y_{2} [s]$	0.762	0.767	0.771	0.774	0.778
$Y_{3} [s]$	59.39	59.53	59.63	59.71	59.75
$y_{3} [s]$	0.725	0.727	0.728	0.729	0.726
$T [s]$ (billion)	0.093	0.094	0.095	0.096	0.097

Open in a new tab

In this model, the random parameters $b_{1}, b_{2},$ and $b_{3}$ take values within the intervals

b_{1} \in B_{1} = [1.0, 2.5]; b_{2} \in B_{2} = [0.5, 1.8], b_{3} \in B_{3} = [0.3, 1.5] .

(50)

In accordance with Equation (24),

U_{1} = 0.5; U_{2} = 0.5; U_{3} = 1.2; U_{4} = 0.986 .

(51)

Then, the soft RML procedure yields the following optimal PDFs of the model parameters and measurement noises:

\begin{matrix} W^{*} (b) & = & \frac{exp (- 0.5 b_{1} - 0.5 b_{1} b_{2} - 1.2 b_{3}^{0.5} - 0.986)}{W}, \\ Q^{*} (\bar{ξ}) & = & \frac{exp (- \sum_{n = 1}^{3} ξ_{n}^{2})}{Q}, \end{matrix}

(52)

where

\begin{matrix} W & = & \int_{B_{1}} \int_{B_{2}} \int_{B_{3}} exp (- 0.5 b_{1} - 0.5 b_{1} b_{2} - 1.2 b_{3}^{0.5} - 0.986) d b_{1} d b_{2} d b_{3}, \\ Q & = & \prod_{n = 1}^{3} \int_{- 0.01}^{0.01} exp (- ξ^{2}) d ξ . \end{matrix}

(53)

The two-dimensional sections of the three-dimensional PDFs of the model parameters are shown in Figure 1a–c, while the graphs of the PDFs of the measurementnoises in Figure 2.

3. Model testing. The randomized model in Equation (49) with the optimal PDFs in Equations (52) and (53) was tested using the above data on regional population sizes from the UNdata service (https://data.un.org/) (see Table 3). This table also presents the testing results in terms of the ensemble-average trajectories ${\bar{p}}_{1} [s h], {\bar{p}}_{2} [s h],$ and ${\bar{p}}_{3} [s h]$ .

Table 3.

Input and output data collections.

Year	2014	2015	2016	2017	2018
s	0	1	2	3	4
$Y_{1} [s]$	81.489	81.707	82.063	82.386	82.674
$y_{1} [s]$	0.985	0.988	0.993	0.996	1.000
${\bar{p}}_{1} [s h]$	0.986	0.615	0.743	0.639	0.999
$Y_{2} [s]$	64.190	64.457	64.791	65.134	65.484
$y_{2} [s]$	0.721	0.472	0.564	0.529	0.708
${\bar{p}}_{2} [s h]$	0.722	0.695	0.707	0.691	0.715
$Y_{3} [s]$	59.585	59.504	59.504	59.509	59.516
$y_{3} [s]$	0.775	0.609	0.562	0.699	0.650
${\bar{p}}_{3} [s h]$	0.776	0.617	0.607	0.705	0.628
$T [s]$ (billion)	0.097	0.097	0.097	0.098	0.098

Open in a new tab

Testing was performed via sampling of the randomized interval parameters with the PDFs in Equations (52) and (53) and construction of the corresponding trajectories by Equation (49). Figure 3a–c shows ensembles of such trajectories $v_{1} [s h], v_{2} [s h], v [s h]$ as well as the ensemble-average trajectories ${\bar{v}}_{1} [s h], {\bar{v}}_{2} [s h], {\bar{v}}_{3} [s h]$ (Graph 1); the real trajectories $y_{1} [s h], y_{2} [s h], y_{3} [s h]$ of regional population sizes (Graph 2); and the limits of the variance pipes ${\bar{p}}_{1}^{*} [s h] \pm σ_{1}, {\bar{p}}_{2}^{*} [s h] \pm σ_{2}, {\bar{p}}_{3}^{*} [s h] \pm σ_{3}$ (Graph 3).

(a) ${\bar{v}}_{1}$ [4], (b) ${\bar{v}}_{2}$ [4], (c) ${\bar{v}}_{3}$ [4].

The testing accuracy was estimated in terms of the relative root-mean-square error

δ_{n} = \frac{\sqrt{\sum_{s = 0}^{4} {({\bar{p}}_{n} [s h] - y_{n} [s h])}^{2}}}{\sqrt{\sum_{s = 0}^{4} {({\bar{p}}_{n} [s h])}^{2}} + \sqrt{\sum_{s = 0}^{4} {(y_{n} [s h])}^{2}}} .

(54)

In the example under study, it constituted 4.6% (Region 1), 3.5% (Region 2), and 2.6% (Region 3).

7. Conclusions

This paper has developed a mathematical model for dynamic migratory interaction of regional systems with locally stationary states described by corresponding entropy operators. The model incorporates random parameters, and their probabilistic characteristics—the probability density functions of system parameters and measurement noises—have been calculated using soft randomized machine learning. An example of migratory interaction modeling and testing has been given.

Appendix A

Proof of Theorem 1.

Consider the function

$φ (z) = μ \sum_{k = 1}^{M} \sum_{n = 1}^{N} c_{k n} q_{k n} {(z [s h])}^{c_{k n}},$ (A1)

which appears on the left-hand side of Equation (19). Taking advantage of the obvious inequalities,

$φ_{-} (z) = M N μ c_{m i n} q_{m i n} {(z)}^{c_{m i n}} < φ (z) < M N μ c_{m a x} q_{m a x} {(z)}^{c_{m a x}} = φ_{+} (z) .$ (A2)

The variables are $0 < c_{m i n} < 1, 0 < c_{m a x} < 1, c_{m i n} < c_{m a x}$ , and $c_{m i n} < c_{k n} < c_{m a x}$ . Consider the equations

$φ_{-} (z) = T [s h], φ (z) = T [s h], φ_{+} (z) = T [s h] .$ (A3)

The functions $φ_{-} (z), φ (z),$ and $φ_{+} (z)$ are strictly convex. Therefore, the solutions of these equations has the relationship

$z_{-} < z^{*} < z_{+},$ (A4)

which concludes the proof of Theorem 1. □

Funding

This work was supported by Russian Science Foundation (project No. 17-11-01220).

Conflicts of Interest

The author declares no conflict of interest.

References

1.Bilecen B., Van Mol C. Introduction: International academic mobility and inequalities. J. Ethic Migr. Stud. 2017;43:1241–1255. doi: 10.1080/1369183X.2017.1300225. [DOI] [Google Scholar]
2.Black R., Xiang B., Caller M., Engberson G., Heering L., Markova E. Migration and development: Causes and consequences. In: Penninx R., Berger M., Kraal K., editors. The Dynamics of International Migration and Settlement in Europe: A State of the Art. Amsterdam University Press; Amsterdam, The Netherlands: 2006. pp. 41–63. [Google Scholar]
3.Imel’baev S.S., Shmul’yan B.L. Modeling of stochastic communication systems. In: Wilson A.G., editor. Entropy Methods for Complex Systems Modeling. Nauka; Moscow, Russian: 1975. pp. 170–234. [Google Scholar]
4.Van der Knaap G.A., Steegers W.F. Structural analysis of interregional and intraregional migration patterns. In: Heide H., Willekens F., editors. Demographic Research and Spatial Policy. Academic Press; Voorburg, The Netherlands: 1984. [Google Scholar]
5.Popkov Y.S. Dynamic entropy model for migratory interaction of regional systems. Tr. Inst. Sist. Analiz. Ross. Akad. Nauk. 2018;2:3–11. [Google Scholar]
6.Zelinsky W. The hypothesis of the mobility transition. Geogr. Rev. 1971;46:219–249. doi: 10.2307/213996. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Popkov Y.S. Mathematical Demoeconomy: Integrating Demographic and Economic Approaches. De Gruyter; Berlin, Germany: 2014. [Google Scholar]
8.Wilson A.G. Entropy in Urban and Regional Modelling. Routledge; London, UK: 1970. [Google Scholar]
9.Rogers A., Willekens F., Raymer J. Modelling interregional migration flows: continuity and change. J. Math. Popul. Stud. 2001;9:231–263. doi: 10.1080/08898480109525506. [DOI] [Google Scholar]
10.Rogers A., Little J., Raymer J. The Indirect Estimation of Migration: Methods for Dealing with Irregular, Inadequate, and Missing Data. Springer Science & Business Media; Berlin, Germany: 2010. [Google Scholar]
11.Volpert V., Petrovskii S., Zincenko A. Interaction of human migration and wealth distribution. Nonlinear Anal. 2017;150:408–423. doi: 10.1016/j.na.2017.02.024. [DOI] [Google Scholar]
12.Pan J., Nagurney A. Using Markov chains to model human migration in a network equilibrium framework. Math. Comput. Model. 1994;19:31–39. doi: 10.1016/0895-7177(94)90014-0. [DOI] [Google Scholar]
13.Klabunde A., Willekens F. Decision-making in agent-based models of migration: State of the art and challenges. Eur. J. Popul. 2016;32:73–97. doi: 10.1007/s10680-015-9362-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Klabunde A., Zinn S., Willekens F., Leuchter M. Multistable modelling extended by behavioural rules. An application to migration. Popul. Stud. 2017;71:61–67. doi: 10.1080/00324728.2017.1350281. [DOI] [PubMed] [Google Scholar]
15.Popkov Y.S. Soft randomized machine learning. Doklady Math. 2018;98:646–647. doi: 10.1134/S1064562418070293. [DOI] [Google Scholar]
16.Voevodin V.V., Kuznetsov Y.A. Matrices and Calculations. Nauka; Moscow, Russian: 1984. (In Russian) [Google Scholar]

[B1-entropy-21-00424] 1.Bilecen B., Van Mol C. Introduction: International academic mobility and inequalities. J. Ethic Migr. Stud. 2017;43:1241–1255. doi: 10.1080/1369183X.2017.1300225. [DOI] [Google Scholar]

[B2-entropy-21-00424] 2.Black R., Xiang B., Caller M., Engberson G., Heering L., Markova E. Migration and development: Causes and consequences. In: Penninx R., Berger M., Kraal K., editors. The Dynamics of International Migration and Settlement in Europe: A State of the Art. Amsterdam University Press; Amsterdam, The Netherlands: 2006. pp. 41–63. [Google Scholar]

[B3-entropy-21-00424] 3.Imel’baev S.S., Shmul’yan B.L. Modeling of stochastic communication systems. In: Wilson A.G., editor. Entropy Methods for Complex Systems Modeling. Nauka; Moscow, Russian: 1975. pp. 170–234. [Google Scholar]

[B4-entropy-21-00424] 4.Van der Knaap G.A., Steegers W.F. Structural analysis of interregional and intraregional migration patterns. In: Heide H., Willekens F., editors. Demographic Research and Spatial Policy. Academic Press; Voorburg, The Netherlands: 1984. [Google Scholar]

[B5-entropy-21-00424] 5.Popkov Y.S. Dynamic entropy model for migratory interaction of regional systems. Tr. Inst. Sist. Analiz. Ross. Akad. Nauk. 2018;2:3–11. [Google Scholar]

[B6-entropy-21-00424] 6.Zelinsky W. The hypothesis of the mobility transition. Geogr. Rev. 1971;46:219–249. doi: 10.2307/213996. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B7-entropy-21-00424] 7.Popkov Y.S. Mathematical Demoeconomy: Integrating Demographic and Economic Approaches. De Gruyter; Berlin, Germany: 2014. [Google Scholar]

[B8-entropy-21-00424] 8.Wilson A.G. Entropy in Urban and Regional Modelling. Routledge; London, UK: 1970. [Google Scholar]

[B9-entropy-21-00424] 9.Rogers A., Willekens F., Raymer J. Modelling interregional migration flows: continuity and change. J. Math. Popul. Stud. 2001;9:231–263. doi: 10.1080/08898480109525506. [DOI] [Google Scholar]

[B10-entropy-21-00424] 10.Rogers A., Little J., Raymer J. The Indirect Estimation of Migration: Methods for Dealing with Irregular, Inadequate, and Missing Data. Springer Science & Business Media; Berlin, Germany: 2010. [Google Scholar]

[B11-entropy-21-00424] 11.Volpert V., Petrovskii S., Zincenko A. Interaction of human migration and wealth distribution. Nonlinear Anal. 2017;150:408–423. doi: 10.1016/j.na.2017.02.024. [DOI] [Google Scholar]

[B12-entropy-21-00424] 12.Pan J., Nagurney A. Using Markov chains to model human migration in a network equilibrium framework. Math. Comput. Model. 1994;19:31–39. doi: 10.1016/0895-7177(94)90014-0. [DOI] [Google Scholar]

[B13-entropy-21-00424] 13.Klabunde A., Willekens F. Decision-making in agent-based models of migration: State of the art and challenges. Eur. J. Popul. 2016;32:73–97. doi: 10.1007/s10680-015-9362-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14-entropy-21-00424] 14.Klabunde A., Zinn S., Willekens F., Leuchter M. Multistable modelling extended by behavioural rules. An application to migration. Popul. Stud. 2017;71:61–67. doi: 10.1080/00324728.2017.1350281. [DOI] [PubMed] [Google Scholar]

[B15-entropy-21-00424] 15.Popkov Y.S. Soft randomized machine learning. Doklady Math. 2018;98:646–647. doi: 10.1134/S1064562418070293. [DOI] [Google Scholar]

[B16-entropy-21-00424] 16.Voevodin V.V., Kuznetsov Y.A. Matrices and Calculations. Nauka; Moscow, Russian: 1984. (In Russian) [Google Scholar]

PERMALINK

Soft Randomized Machine Learning Procedure for Modeling Dynamic Interaction of Regional Systems

Yuri S Popkov

Abstract

1. Introduction

2. Randomized Model of Migratory Interaction

Theorem 1.

3. Characterization of Empirical Risk and Measurement Noises

4. Soft Randomized Estimation of Model Parameters

5. Randomized Forecasting of Dynamic Migratory Interaction