1. Introduction
It has been well known for at least 100 years that many processes arising in physics, chemistry, ecology etc. can be adequately described only by nonlinear partial differential (integro-differential, functional-differential) equations (see, e.g., an extensive discussion on this matter in Chapter 1 of [
1]). During the second half of the last century, one may note also a rapidly growing number of papers devoted to applications of nonlinear partial differential equations for mathematical modeling in life sciences (see, e.g., the classical book [
2], the recent monographs [
3,
4] and references therein).
On the other hand, the rigorous mathematical models came to social sciences and humanities only recently. In particular, papers devoted to rigorous mathematical modeling interaction of communities (populations) of different language speakers were published only during the last two decades [
5,
6,
7,
8,
9,
10,
11]. These models are based on nonlinear differential equations of reaction–diffusion type.
We start from the nonlinear mathematical model describing interaction of three communities of language speakers, which was proposed in [
10]. The model is governed by three nonlinear reaction–diffusion (RD) equations, which have the following form in the one-dimensional approximation (there are some misprints in [
10], which are corrected here)
This model (of course, one needs to supply the relevant initial and boundary conditions) describes interaction of three communities of language speakers. Functions
and
describe frequencies of monolingual speakers, i.e., they speak always (or almost always) native language. Function
stands for community of speakers, who fluently speak both languages and use each language depending on circumstances. Time derivatives
and
indicate the rate of change in these frequencies, while the space-derivatives describe mobility (diffusion) in space of speakers. The second terms in each equation of (
1) are some generalization of a standard logistic terms arising in many well known biological models including the famous Fisher Equation [
12] and the diffusive Lotka–Volterra system (DLVS) for interacting species (see, e.g., [
2,
4]). The constant
K (like in the logistic terms) means the carrying capacity of environment and defines an upper size of all three communities of speakers, i.e., it is assumed that
The language shift (a process whereby speakers of a community abandon their native language in favor of another) of some numbers of monolingual speakers to bilingual those is described by the terms and . It can be noted that the language shift leads to growing the bilingual community (provided any other forces are absent).
On the other hand, the terms
and
describe an opposite tendency, when bilingual tends to be monolingual. It occurs, for example, in the case of the state politics leading to the lower status of one language comparing with another. The real example is the Russification in Ukraine during the Soviet period when a few million Ukrainians completely switched to the Russian language (actually, the main aim of paper [
10] is to study mathematically Anglicization in Scotland). The coefficients
and
represent the likelihood of bilingual speakers then becoming monolingual in community
u and
w, respectively. Notable, the inequality
(in particular, if
then one puts
) takes place if the language of community
u is under pressure.
In paper [
10], the RD system (
1) was used in order to model the Anglicization process in Scotland during the 20th century. As a result, percentages of Gaelic speakers in all parts of Scotland decreased drastically. However, there is no mathematical analysis of the governing equations therein, while those were solved numerically (with the relevant boundary and initial conditions) in order to show a good correspondence between the numerical solutions and data from successive censuses.
In this paper, a modification of the RD system (
1) is studied by analytical methods and a plausible interpretation of the mathematical results obtained is provided. The main results are presented in
Section 2. Firstly, a modification of the system in question is proposed; secondly, Lie symmetries and a variety of exact solutions (traveling waves) are found. In
Section 3, properties of the exact solutions obtained are under examination, in particular, the coefficient restrictions leading to the exact solution, which qualitatively describes the language shift occurred in Ukraine during the Soviet times, are derived. Finally, some conclusions are presented and future work is proposed in the last section.
2. Main Results
The RD system (
1) contains fractional nonlinearities and is a very difficult task to solve analytically. Having this in mind, we propose a simpler system under biologically motivated restrictions. Our idea is to reduce fractional nonlinearities to quadratic ones. It can be noted that the fractional nonlinearities arising in (
1) are a direct generalization of those introduced in the earlier work [
9]. In that work, it is assumed that speakers of both languages have the common carrying capacity
K. We think that this assumption is not well-founded because the language of a specified speaker is usually related to his/her nationality. So, one cannot claim that different nationalities have the same carrying capacities all the time. Moreover, the so-called standard model for two competing languages [
7] does not use such assumption. The basic model in [
7] contains the standard logistic terms arising in many biologically motivated models (see, e.g., [
2,
4]). Taking into account the above justification, we can replace the fractional nonlinearities by logistic terms, which also restrict unbounded growth of these communities. It means that the terms
and
are replaced by
, and
, respectively. As a result, we obtain the following modification of system (
1)
which contains only quadratic nonlinearities. Hereafter we assume that the coefficients
and
(
) are positive, while all other are nonnegative (i.e., some of them can be zero).
The nonlinear RD system (
2) can be simplified using the following re-scaling of the variables
and introducing new notations
Thus, system (
2) is reduced to the equivalent form
Notably, system (
3) with
is a particular case of the well-known DLVS, which describes a large number of processes in biology and chemistry (see, e.g., [
2,
4] and references cited therein). However the above restriction is equivalent to
in (
2), what contradicts to the basic restrictions in the model (see interpretation of the terms
and
). Thus, hereafter we assume that
, i.e., system (
3) is not equivalent to DLVS.
It is well known that there is no general theory of integrating nonlinear partial differential equations at the present time and it is very unlikely that one will be developed soon. The most effective methods for constructing
particular exact solutions of nonlinear differential equations of reaction–diffusion type are the classical Lie method and its various generalizations (see, e.g., the recent monographs [
1,
13,
14] for more details). Here we apply the classical Lie method and the so-called tanh method [
15,
16,
17].
Theorem 1. The nonlinear system (3) for any set of specified nonnegative coefficients with the additional restrictions and is invariant only with respect to the time and space translations generated by Lie symmetries The proof is based on application of the well known Lie’s algorithm to system (
3) and is reduced to examination of several cases depending on values of the coefficients arising in the system. We omit here the relevant calculations. Notably, a detailed proof is presented in our recent paper [
18] for a similar (but inequivalent) three-component system.
Remark 1. In contrast to the three-component DLVS, which admits some nontrivial Lie symmetries (provided its coefficients are correctly specified) [4,19], the RD system (3) possesses a poor symmetry. It is well known that the Lie symmetries (
4) generate only two inequivalent substitutions (following the classical Sophus Lie papers, the terminology “ansatz” is often used), which reduce system (
3) to the relevant systems of ordinary differential equations (ODEs). The first ansatz does not depend on the space variable
x, hence one leads only to time-dependent solutions. Here, we are not interested in such types solutions because their realistic interpretation is questionable.
The second ansatz follows from the linear combination
of the Lie symmetries (
4) and has the form
Here
and
W are new unknown functions. Solutions of form (
5) is often called plane wave solutions (traveling waves). From the applicability point of view, the most interesting those are
traveling fronts, i.e., solutions (
5), which are bounded and nonnegative. A huge number of papers is devoted to construction of traveling fronts for nonlinear PDEs, especially for scalar reaction–diffusion (with/without convection term). Traveling fronts for such equations are presented in the monograph [
20] (see also the handbook [
21]).
In the case of nonlinear RD systems, the progress is rather modest. To the best of our knowledge, an essential progress is derived only in the case of DLVS. Several traveling fronts are constructed in [
4,
22,
23,
24] for the two-component DLVS and in [
25,
26] for the three-component DLVS.
So, our aim is to find traveling fronts for system (
3). Substituting ansatz (
5) into system (
3), one obtains
System (
6) is a three-component system of nonlinear second-order ODEs. Although this system is simpler than the original RD system (
3), we can say nothing about its integrability because even the similar system obtained by reducing of the two-component DLVS has been not solved in [
4,
22,
23,
24]. In order to find particular solutions of (
6), we start from the steady-state points. Obviously that steady-state points of (
6) coincide with the stationary (homogenous) those of the RD system (
3) and can be easily calculated by solving algebraic equations. Assuming
, the full list of steady-state points are as follows
Obviously there are also steady-state points
, where
, however we prefer examine this case elsewhere. Notably, the 3rd and 4th points, like the 5th and 6th, are equivalent because the first and third equations of system (
6) have the same structure. So, without loss of generality we may say that there are only four essentially different points in (
7).
Typically, each traveling front possesses the following property: such a solution connects two steady-state points provided . We were able to identify the relevant traveling fronts in the cases listed below.
Case 1. (as ) and (as ).
Case 2. (as ) and (as ).
Case 3. (as ) and (as ). This case occurs provided the additional restriction takes place.
Let us consider
Case 1 and use the tanh method. To the best of our knowledge paper [
15] is one of the earliest works devoted to the tanh method (there are a lot recent papers, see, e.g., [
17,
27] and papers cited therein). However, it can be noted that there are not many papers devoted to application of this method to nonlinear systems of PDEs. The method is essentially based at the ad hoc ansatz [
15]
where
. The highest power
N should be determined by balancing the highest degree terms in
Y, upon substitution of ansatz (
8) into the equation in question. The known relation
is essentially used when one makes balancing. Typically direct calculation show that
for the second-order PDEs. So, we obtain ansatz
Having the correctly-specified
N, unknown parameters
can be easily calculated (some of them are arbitrary constants). Of course, it often happens that
, therefore a trivial solution is only obtained. So, the tanh method is not applicable to a wide range of nonlinear equations. It turns out that this technique works in the case of system (
6).
Thus, using ansatz (
8), we may look for traveling fronts of the form
where
and
(
) are real and natural numbers, respectively. Since the exact solution of the form (
9) connects steady-state points
and
, one immediately obtains the sigma-s values
Substituting (
9) into system (
6) and taking into account (
10), one can determine sufficient conditions for the coefficients
when the traveling fronts can be found explicitly.
Omitting the relevant calculations, we only present the result. So, system (
3) has the exact solution
provided its coefficients satisfy the restrictions:
The second exact solution
was constructed provided the coefficients of system (
3) satisfy the restrictions:
It is easily seen that the traveling front (
11) is more general than (
13), since its velocity
is not fixed.
In
Case 2, taking into account the corresponding steady-state points, we are looking for the traveling fronts in the form
where
is an unknown positive constant. Substituting (
15) into system (
6) and making the corresponding calculations, we arrive at the exact solution
The traveling front (
16) satisfies system (
3) if the coefficient restrictions
are satisfied.
Finally, in
Case 3, the exact solutions of system (
6) were prescribed to have the form
After the relevant calculations, the traveling front
of the nonlinear system (
3) was derived provided the coefficient restrictions
take place.
Remark 2. In Cases 1–3 there exist such sets of the positive parameters (excepting in Case 3)satisfying the restrictions (12), (14), (17) and (19) that three components of the exact solutions (11), (13), (16) and (18), respectively, are positive. Thus, all the solutions obtained are indeed traveling fronts. Remark 3. It can be easily checked that all the solutions derived above satisfy the zero Neumann conditions at . In the case of a bounded domain , one obtains at the boundaries and provided and are sufficiently large. Such boundary conditions (they often called no flux conditions) are typical requirements in many real-world models and, for instance, were used in [10]. 3. Interpretation of Traveling Fronts
In this section, we study in detail exact solution (
11). First of all, we answer the question: When do positive coefficients
and
lead automatically to positive values of
and
in formulae (
12)? It turns out that some additional restrictions are needed. The structure of such restrictions essentially depends on the sign of the parameter
, i.e., on the traveling front direction. Thus, one needs to examine separately two cases:
(i) and
(ii) .
In Case
(i), one immediately obtains
(see the formula for
in (
12)). For a simplicity, we assume additionally
and introduce the notations
Substituting these notations into (
12), we arrive at the system of the inequalities:
Since all the component of (
11) should be nonnegative (we remind the reader that each component means a frequency of the community speakers), the inequality
takes place, which follows from
. Thus, the restriction
is obtained. It can be also noted that
and
(the case
and
leads to a contradiction).
In order to satisfy all the inequalities in (
20), we set
where
is a sufficiently small parameter. Now the 4th inequality in (
20) is reduced to the form:
hence
The 2nd and 3rd those are satisfied provided
Now one realizes that the following algorithm guarantees the positivity of all the coefficients in (
12). Firstly, we fix any
from the interval
and a small
, say
. Secondly, we take any
satisfying (
21) and calculate
Finally, we choose a sufficiently large
in order to satisfy inequalities (
22).
Remark 4. In the case and , the above algorithm is simplified to the identification of the restrictions and where .
Case
(ii) is essentially simpler. In fact, one immediately obtains
and
in (
12). Assuming additionally that
and solving the inequalities
and
(see (
12)), we obtain the restrictions
which guarantee the positivity of all the coefficients in (
12).
Thus, we can use the formulae derived above in order to construct examples of traveling fronts to plot the relevant curves (using the package Maple) and to present their plausible interpretation.
Figure 1,
Figure 2 and
Figure 3 represent the exact solution (
11) in Case
(i) (
Figure 1 and
Figure 2) and Case
(ii) (
Figure 3). All the curves satisfy the natural requirement of positivity at the given space intervals.
In
Figure 1 and
Figure 2, three traveling fronts are moving to the right along the
axes as it is predicted in Case
(i). If we assume that the blue and green curves represent the communities of Russian language speakers and Ukrainian language speakers, while the red curve describes the frequency of bilingual speakers, then the real language shift occurred in Ukraine during the Soviet period (from the end of the Second WW till the USSR collapse) is qualitatively described by these curves. In fact, the language situation in Ukraine can be approximated by the 1D model because the communities of different language speakers varies very essentially from east to west (not so much from north to south).
So, taking the point
as the eastern end and the point
as the western end, one realizes that the above curves at the time moment
(see the curves in the left part of the figure) reflects the situation in the end of the Second WW (the borders of the modern Ukraine were formed in that time). The frequency of Russian language speakers (blue curve) was very high in the eastern part (see the interval
), while an opposite situation was in the western part (interval
), in which Ukrainian language dominated (actually the Russian language was unknown therein). In the central part of Ukraine (interval
), the linguistic situation was more complicated and this is shown in
Figure 1 (left plot). However, one may say that Ukrainian language speakers (green curve) formed the main part of inhibitors of the Central Ukraine and the frequency of using this language decreased in the eastern direction. Finally, the community of bilingual speakers (red curve) was concentrated mostly in the east part after the end of the Second WW.
The time moment
(see the curves in the right part of the
Figure 1) reflects the situation in the end of Soviet times, i.e., in the beginning of 1990s. In that time, the community of Russian language speakers (blue curve) dominated in the east and central part of Ukraine (interval
), the community of bilingual speakers (red curve) was also strong in these parts. However, the frequency of using Ukrainian language was very low and one may say about a rapid extinction of this community. In that time, Ukrainian language dominated only in the western part of Ukraine, while there was also a part of the Central Ukraine, in which the frequencies of using both languages was in some equilibrium (interval
).
Traveling fronts presented in
Figure 2 model the situation under the assumption that the USSR could exist 20–30 years longer doing the same language politics, which was in favor of Russian language. Of course, one can expect the almost complete extinction of Ukrainian language speakers as it is shown (see green curve), however existence of a large community of bilingual speakers (red curve) seams to be not plausible. In fact, there is no any reason to study a ‘dead’ language. So, we believe that the red curve does not describe adequately the frequency of using both languages for large values of time.
In
Figure 3, the exact solution (
11) is pictured in Case
(ii) , so that the traveling fronts are moving to the left. As a result, the relevant interpretation is different. In fact, the time evolution leads to extinction of two communities, while only one monolingual community is the winner of this language competition.
Finally, it should be pointed out that the exact solutions of the form (
11) used above for interpretation of the language shift occurred in Ukraine during the Soviet times do not express exact numbers of speakers, however these solutions describe qualitatively the real linguistic situation. In order to get accurate quantitative results, one needs to calculate correct coefficients in the RD system (
2) using census data in the former USSR. This is another nontrivial problem, which will be treated elsewhere.