A Cooperative Stochastic Differential Game of Transboundary Industrial Pollution between Two Asymmetric Nations

Because transboundary pollution control calls for cooperation between interested parties, this paper studies a cooperative stochastic differential game of transboundary industrial pollution between two asymmetric nations over an infinite horizon. We model two types of transboundary pollution: an accumulative global pollutant with uncertain evolutionary dynamics and a regional nonaccumulative pollutant. In our model, firms and governments are separate entities that play a Stackelberg game, while the governments of the two nations can cooperate in pollution reduction. We derive the feedback Nash equilibrium strategies of governments and industrial firms and find that governments that cooperate in transboundary pollution control set a higher pollution tax rate and make more pollution abatement effort than when they do not cooperate. Additionally, a payment distribution mechanism that supports the subgame consistent solution is proposed.


Introduction
The global environment is an interconnected and indivisible ecosystem. Environmental pollution, misuse of resources, and ecological damage are all likely to cross borders. Through pathways such as water or air, pollutants can spread over great distances and cause transboundary pollution. Atmospheric pollution and acid rain, ocean and river basin water pollution, the transboundary movement of hazardous waste, and other issues have gradually become a threat to human survival and development and a core issue of international concern. With the development of industrialization, transboundary pollution has become increasingly serious worldwide. To address the problem, a unilateral response by one nation or region is often ineffective; it calls for cooperation between interested parties [1,2]. In this paper, we present a cooperative stochastic differential game model of transboundary industrial pollution between two asymmetric nations. Our objective is to find the optimal pollution abatement strategy that maximizes social welfare for the two nations engaged in transboundary industrial pollution control.
Differential games have been used as an effective tool to study transboundary pollution control. For example, Dockner and Van Long [3] model a simple dynamic game between two neighboring countries engaged in transboundary pollution control; they find that when the governments of the two countries are restricted to linear strategies, noncooperative behavior may result in overall losses for both countries. In 1998, Zagonari extended the model of Dockner and Van Long to analyze a pollution control game between environmentally concerned countries and consumption-oriented countries. In 1993, Martin et al. presented a tax/subsidy scheme to combat the transboundary problem of global climate change. Petrosjan and Zaccour [4] apply the Shapley value to determine a fair distribution of the total cooperative cost incurred by countries in a cooperative game of pollution reduction. Bayramoglu [5] uses a dynamic and strategic framework to analyze transboundary pollution between Romania and Ukraine; it is found that, among three institutional systems (a noncooperative game between countries, a uniform emission policy, and a constant emission policy), the noncooperative game can provide the highest level of total welfare. Jørgensen and Zaccour [6] take emissions and investments in abatement technology as control variables to analyze the problem of cooperative transboundary industrial pollution control; in their study, a coordinated solution that maximizes joint welfare is derived together with a payment distribution mechanism that supports the subgame consistent solution. Bertinelli et al. [7] study the strategic behavior of two countries facing transboundary CO2 pollution in a differential game setting; the authors argue that feedback strategies may lead to less social waste than open-loop strategies. Taking emission permit trading into account, Li [8] studies a differential game of transboundary industrial pollution between two neighboring countries, focusing on the two countries' noncooperative and cooperative optimal emission paths. Benchekroun and Martín-Herrán [9] study the impact of foresight in a transboundary pollution game; their study shows that when all countries are myopic, that is, choose the "laisser-faire" policy, their payoffs are smaller than when all countries are farsighted.
Most cooperative environmental games do not distinguish the government from the industrial sector in any nation. In contrast to these studies, Yeung and Petrosyan [2] treat industries and governments as separate entities and develop a cooperative differential game model of transboundary industrial pollution, for which time-consistent solutions are derived. Extending the work of Yeung and Petrosyan [2], Huang et al. [10] develop a model in which there is a Stackelberg game between the industrial firms and their local government while the governments can cooperate in transboundary pollution control, and the feedback Nash equilibrium strategies of governments and industrial firms are given. In this paper, we propose a stochastic differential game model of transboundary industrial pollution between two asymmetric countries. Our work can be viewed as a continuation of Huang et al. [10] in the context of transboundary industrial pollution control. Compared with the work of Huang et al. [10], our paper has three significant features. (i) We present a stochastic differential game model of transboundary industrial pollution in which uncertain pollution stock dynamics are taken into account. This is because uncertain pollution stock dynamics are common [2,11,12,13]; many factors, such as unexpected changes in abatement technologies and unexpected changes in the decay rate of the pollutants, may give rise to uncertain pollution stock dynamics. (ii) We extend the model of Huang et al. [10] from a finite horizon to an infinite horizon; this is because, in many situations, the terminal time of the game, T, is either very far in the future or unknown to the players. A way to resolve the problem, as suggested by Dockner and Nishimura [14], is to set T = ∞, so it makes sense to use an infinite-horizon model to analyze transboundary industrial pollution control. In our model, we find that the length of the horizon has an important effect on the behavior of the players. From the government's point of view, planning is based on long-term interests, so an infinite planning horizon is more reasonable. For example, we find that, over an infinite horizon, the optimal tax rate and the pollution abatement effort are relatively stable. By contrast, over a finite horizon, Huang et al. [10] find that the optimal tax rate and the pollution abatement effort decrease late in the game; this means that a farsighted government can protect the environment better. (iii) In our model, the asymmetry of the participants is considered; we assume that the industrial firms in one country have greener and more energy-efficient technologies than those in the other, and we show that the country with the technology advantage may set a higher pollution tax and make more pollution abatement effort, whether under cooperation or noncooperation.
The paper is organized as follows. In Section 2, the game formulation is provided. In Section 3, we characterize the noncooperative outcomes. Cooperative arrangements and individual rationality are analyzed in Section 4. We illustrate the results with a numerical example in Section 5. Section 6 concludes the paper.

Government and Domestic Industrial Firm: A Stackelberg Game.
Consider a multinational economy comprising two nations. There are  and  industrial firms in nation 1 and nation 2, respectively. In order to control production pollution, the government (the leader) of each nation imposes a pollution tax on domestic industrial firms, and then the industrial firms (the followers) choose their optimal output (emissions) according to the pollution tax rate. This leads to a Stackelberg equilibrium.
Let ∑  =1   () and ∑  =1 V  () denote the quantity of goods produced by  firms in nation 1 and  firms in nation 2 at time , respectively. Firm 's (or 's) revenue function   (or   ) is assumed to be concave and increasing, with the following simple functional form: where  1 ,  2 > 0 are constants; the instantaneous profits of the industrial firms in the two nations can be expressed as: where   () is the pollution tax rate imposed on industrial firms by government  ( ∈ {1, 2}) at time . From the first-order condition of (2), the optimal outputs of the firms in nation 1 and nation 2 are obtained as follows: (3)
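The follower's step of the Stackelberg game can be illustrated with a minimal sketch. The revenue form below, R(q) = alpha*q - q**2/2, and the per-unit tax on output are assumptions made only for illustration; the functional forms (1) and (2) are not reproduced in this text, and the names `profit`, `best_response`, `alpha`, and `tax` are hypothetical. Under these assumptions the first-order condition alpha - q - tax = 0 gives the firm's output as alpha - tax, which the sketch cross-checks against a brute-force grid search.

```python
def profit(q, alpha, tax):
    """Profit under the ASSUMED revenue R(q) = alpha*q - q**2/2 and a per-unit tax."""
    return alpha * q - 0.5 * q ** 2 - tax * q


def best_response(alpha, tax):
    """Output solving the first-order condition alpha - q - tax = 0, floored at zero."""
    return max(alpha - tax, 0.0)


# Illustrative (hypothetical) values, not taken from the paper.
alpha, tax = 10.0, 3.0
q_star = best_response(alpha, tax)

# Cross-check the closed form against a brute-force search on a grid of outputs.
grid = [k * 0.01 for k in range(2001)]
q_grid = max(grid, key=lambda q: profit(q, alpha, tax))
print(q_star, q_grid)
```

In this sketch a higher tax lowers the follower's output one for one, which is the channel through which the leader's tax rate controls emissions.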

Local and Global Environmental Impacts.
Transboundary pollution may damage the environment in two ways, that is, through an accumulative global pollutant and a regional nonaccumulative pollutant. For example, some transboundary pollutants caused by firms, such as passing-by waste in waterways, wind-driven suspended particles in air, unpleasant odour, noise, dust, and heat, are all nonaccumulative pollutants. We assume that the nonaccumulative pollutant, for an output of   () produced by the firm in nation 1, would cause a short-term impact (cost) of  1 1   () and  2  1   () on nation 1 and nation 2, respectively. Similarly, an output of V  () produced by the firm in nation 2 would cause a short-term impact (cost) of  1  2 V  () and  2 2 V  () on nation 1 and nation 2, respectively. In addition to the nonaccumulative pollutant, there is an accumulative pollutant, such as greenhouse gases, CFCs, and atmospheric particulates; these pollutants can remain in the environment for a long time and add to existing pollution stocks, creating long-term global environmental impacts [2,5,10,15,16]. Let () ⊂  + denote the level of pollution at time , and the dynamics of pollution stock are governed by the stochastic differential equation where  denotes a noise parameter and () is a Wiener process;  1 ≥ 0 and  2 ≥ 0 are the amounts added to the pollution stock by a unit of firm 's and 's output in nation 1 and nation 2, respectively;   () denotes the pollution abatement effort of nation ;   is the efficiency of the abatement;     ()(()) 1/2 denotes the amount of pollution removed by   () units of abatement effort of nation ; and  denotes the natural rate of decay of the pollutants.
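To make the stock dynamics concrete, the following sketch simulates one sample path with the Euler-Maruyama scheme. It assumes a specific form that is consistent with the verbal description above but is not confirmed by the text: dx = [inflow - sum_i mu_i*u_i*sqrt(x) - delta*x] dt + sigma*x dz, with a multiplicative noise term, a constant aggregate emission inflow, and constant abatement efforts. The function name `simulate_stock`, its parameters, and all numerical values are hypothetical and chosen only for illustration.

```python
import math
import random


def simulate_stock(x0, inflow, mu, u, delta, sigma, dt=0.01, steps=1000, seed=0):
    """Euler-Maruyama path of the ASSUMED dynamics
    dx = [inflow - sum_i mu_i*u_i*sqrt(x) - delta*x] dt + sigma*x dz."""
    rng = random.Random(seed)
    x = x0
    path = [x]
    for _ in range(steps):
        # Pollution removed by abatement: efficiency * effort * sqrt(stock).
        removal = sum(m * e for m, e in zip(mu, u)) * math.sqrt(x)
        drift = inflow - removal - delta * x
        # Wiener increment over a step of length dt.
        dz = rng.gauss(0.0, math.sqrt(dt))
        # Keep the stock non-negative so that sqrt(x) stays defined.
        x = max(x + drift * dt + sigma * x * dz, 0.0)
        path.append(x)
    return path


# Illustrative values only; they are not a reproduction of the paper's example.
path = simulate_stock(x0=20.0, inflow=1.0, mu=[0.4, 0.2], u=[1.0, 1.0],
                      delta=0.05, sigma=0.01)
print(round(path[-1], 3))
```

Setting `sigma=0` recovers the deterministic path, which is a convenient check that the drift terms behave as described.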
The government's planning horizon is [ 0 , ∞). The discount rate is . Each government seeks to maximize the integral of its instantaneous objectives (5a) and (5b) over the planning horizon [ 0 , ∞) subject to pollution dynamics (4), with controls on the level of abatement effort and pollution tax.
Substituting   () and V  () from (3) into (4) and (5a) and (5b), one obtains a stochastic differential game in which government  ∈ {1, 2} seeks max max subject to Since the payoffs of nations are measured in monetary terms, the game defined by (6a), (6b), and (7) is a transferable payoff game.

Noncooperative Outcomes
In this section, we discuss the solution of the game (6a), (6b), and (7) under a noncooperative framework.
A feedback Nash equilibrium solution can be characterized as follows.
for  ∈ {1, 2}, provides a Nash equilibrium solution to the games (6a), (6b), and (7), if there exist suitably smooth functions   (, ) : [ 0 , ∞) ×  → ,  ∈ {1, 2}, satisfying the following partial differential equations: Performing the indicated maximization in (8a) and (8b) yields Substituting the results of (9a) and (9b) into (8a) and (8b), we obtain the following.
Proposition 2. The government 's payoff during [ 0 , ∞),   (, ), can be obtained as where   ,   satisfy the following set of constant coefficient quadratic ordinary differential equations (11a)-(11d). The corresponding feedback Nash equilibrium of the games (8a) and (8b) can be obtained as
Equation (12) indicates that, under noncooperation, the optimal pollution tax rate (  1 or   2 ) may depend on the regional pollutant ( 1  1 or  2 2 ) caused by domestic firms, whereas the regional pollutant ( 2  1 or  1 2 ) caused by firms in the neighboring nation may have no effect on the optimal pollution tax rate. In addition, the optimal abatement effort of a nation does not depend on the damage from the regional pollutant at home ( 1  1 or  2 2 ); this is because the abatement effort of the nation affects only the pollution stock and cannot affect current pollution.

Cooperative Arrangements
Now consider the case when both nations cooperate in pollution control.To uphold the cooperative scheme, both group rationality and individual rationality are required to be satisfied at any time.

Group Optimality and Cooperative State Trajectory.
To secure group optimality, the two participating nations would seek to maximize their joint expected payoff by solving the following stochastic control problem: subject to (7). Invoking Bellman's technique, a set of controls {ĉ *  , Î *  } = { φ  (), φ  ()} constitutes an optimal solution to the stochastic control problem (13) and (7) if there exists a continuously differentiable function (, ) : [, ∞) ×  → ,  ∈ {1, 2}, satisfying the following partial differential equations: Performing the indicated maximization in (14) yields the optimal controls under cooperation: Substituting the results in (15a) and (15b) into (14), we obtain the following.

𝑊 (𝑡, 𝑥) = [ Â𝑥 (𝑡) + B] , (16)
where Â, B satisfy the following set of constant coefficient quadratic ordinary differential equations: The corresponding feedback Nash equilibria under cooperation can be obtained as Substituting the optimal control strategy from (18) into (7) yields the dynamics of pollution accumulation under cooperation: Solving the stochastic cooperative pollution dynamics yields the cooperative state trajectory: for  ∈ [ 0 , ∞). The term  *  is used to denote an element of the set  *  . An agreed-upon optimality principle would be sought to allocate the cooperative payoff. In a dynamic framework, individual rationality has to be maintained at every instant of time within the cooperative duration [ 0 , ∞) along the cooperative trajectory (20). For  ∈ [ 0 , ∞), let the vector   ( *  ) = { 1 ( *  ),  2 ( *  )} denote the solution imputation (payoff under cooperation) over the period [, ∞) to player  ∈ {1,2} given that the state is  *  ∈  *  . Individual rationality along the cooperative trajectory requires where  () (,  *  ) denotes the payoff to nation  under noncooperation over the period. Let () = [ 1 (),  2 ()] denote the instantaneous payoff of the cooperative game at time  ∈ [ 0 , ∞) for the cooperative game Γ  ( *  0 ).
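The individual rationality requirement can be checked mechanically once an imputation is fixed. The sketch below assumes one particular optimality principle, an equal split of the cooperative surplus on top of each nation's noncooperative payoff; this rule is an assumption chosen for illustration, since the agreed-upon principle and the payment formula are not reproduced in this text. The function names and numbers are hypothetical.

```python
def equal_surplus_imputation(w_coop, v_noncoop):
    """Give each nation its noncooperative payoff plus an equal share of the
    cooperative gain. ASSUMED sharing rule, used only for illustration."""
    surplus = w_coop - sum(v_noncoop)
    share = surplus / len(v_noncoop)
    return [v + share for v in v_noncoop]


def individually_rational(imputation, v_noncoop, tol=1e-12):
    """True if every nation gets at least its noncooperative payoff."""
    return all(xi >= v - tol for xi, v in zip(imputation, v_noncoop))


# Hypothetical payoffs at one state and time: joint payoff 10, noncooperative 3 and 5.
xi = equal_surplus_imputation(10.0, [3.0, 5.0])
print(xi, individually_rational(xi, [3.0, 5.0]))
```

Under this assumed rule, individual rationality holds exactly when the joint cooperative payoff is at least the sum of the noncooperative payoffs, so the check has to be repeated at every state along the cooperative trajectory.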

Theorem 4. An instantaneous payment at time
yields a subgame consistent solution to the cooperative game Γ  ( *  0 ).
When the two nations adopt the cooperative strategies, the rate of instantaneous payment that nation  ( ∈ {1, 2}) will realize at time  with the state being  *  can be expressed as

Numerical Example
Consider a multinational economy consisting of two nations, each with two industrial firms. Industrial firm's revenue function in nation 1 and nation 2 at time  is  2 , respectively. The dynamics of pollution stock are governed by the stochastic differential equation (4), where ( 0 ) = 20,  = 0.01, and  = 0.05. We define   as the amount added to the pollution stock by a unit of output in nation  ( = 1, 2). Each is a positive number less than 1; that is to say, the stock of pollution increases with output but by no more than the amount of output. According to what we defined before,  1  ,  2  ( = 1, 2) are pollution damage cost coefficients. The larger they are, the higher the cost of environmental damage for the nation. We make the following assignments:  1 1 = 0.2,  2 1 = 0.15,  2 2 = 0.15, and  1 2 = 0.16. ℎ  ( = 1, 2) is the value of pollution damage to nation  ( = 1, 2). The larger it is, the greater the environmental impact, which differs across regions. We assign ℎ 1 = 4, ℎ 2 = 5. Other parameters are as follows:  = 0.05,  0 = 0. The parameters of the model used in the numerical solution are presented in Table 1.
Then we obtain a stochastic differential game in which government 1 and government 2 under noncooperation seek max
Through the analysis above, we have obtained the pollution tax rate, the abatement effort of each nation, and the dynamics of pollution stock under the noncooperative game and the cooperative game. We now analyze the differences between the governments under noncooperation and cooperation using Table 2 and Figures 1-5. From Table 2, we can see that, for each government, the noncooperative optimal tax rates  1 () and  2 () are lower than the cooperative tax rates ĉ1 () and ĉ2 (). This means that cooperation pushes the tax rate up in order to reduce production and cut pollution. For both governments, the optimal pollution tax rate does not change with time; this differs from Huang et al. [10] and Yeung and Petrosyan [2], who model a finite-horizon game in which, whether cooperative or noncooperative, both governments pay more attention to economic benefits than to pollution abatement in the later stage of the game. They would then decrease the pollution tax rate to encourage production and stimulate economic development [10]. In our model, however, the horizon is infinite and there is no later stage of the game, so the optimal pollution tax rate does not decrease. In addition, under noncooperation, each government takes into account only the externalities of production that fall on its own nation. Under cooperation, however, the governments consider the externalities imposed on both their own nation and the neighboring nation, so the optimal pollution tax rate is higher than under noncooperation. Figures 1 and 2 show the pollution abatement effort of both governments. From them we find that, under noncooperation, the pollution abatement effort of both governments remains stable over time, whereas, under cooperation, both governments make a high pollution abatement effort at first; the effort level then falls to a certain degree and afterwards tends to remain stable. For each government, the noncooperative pollution abatement effort is lower than the cooperative pollution abatement effort. This is determined by the externality   of pollution emissions. Under the cooperative game, the governments pay attention to the damage to the entire region, so the marginal damage of pollution emissions is greater and the governments are more concerned about pollutant reduction.
Figure 3 shows the dynamics of pollution stock under noncooperation and cooperation. We can see that the pollution stock under the noncooperative game is higher than that under the cooperative game. This is due to the higher pollution tax rate and the higher pollution abatement effort of both governments under the cooperative game. Thus, in both economic and environmental terms, cooperation is better than noncooperation. Now consider the case when the two nations want to cooperate. To secure group optimality, the two nations seek to maximize their joint expected payoff by solving the following stochastic control problem: max
According to Theorem 4, we obtain a dynamic distribution scheme under which both group optimality and individual rationality are satisfied at any time during the cooperative game. The instantaneous payments of the two nations,  1 (,  *  ) and  2 (,  *  ), at different times  with given  *  are shown in (33a) and (33b) and Figures 4 and 5. Figures 4 and 5 show that the instantaneous payments of both nations, which satisfy the property of subgame consistency, increase at the start and then remain stable. In addition, the instantaneous payment for nation 1 is always higher than that of nation 2. This is because the industrial firms in nation 1 have greener and more energy-efficient technology, so nation 1 receives a larger instantaneous payment.

Conclusions
In this paper, we have presented a stochastic differential game of transboundary industrial pollution between two asymmetric nations over an infinite horizon. In our model, the firms and governments are separate entities. As participants in the game, they act strategically to optimize their own payoffs, and the game equilibrium has been solved at the two levels of firms and governments simultaneously. The uncertain evolutionary dynamics of a pollution stock that has a long-term global impact have been taken into account in our model. In addition, another negative environmental externality, a regional nonaccumulative pollutant that causes a short-term local impact, has also been considered. We characterize the parameter spaces in which two neighboring governments can cooperate and give the feedback Nash equilibrium strategies of governments and industrial firms. Through analysis, we have shown that, under the noncooperative game, the optimal pollution tax rate hinges on two effects: the regional damage in the source region and the global pollution stock. Under the cooperative game, in addition to these two effects, it also hinges on the regional damage imposed on the neighboring nation, so the cooperative tax rate is higher than the noncooperative one. Moreover, we have shown that the pollution abatement effort under cooperation is greater than under noncooperation. By analyzing this stochastic differential game model, we have also provided a payment distribution mechanism that supports the subgame consistent solution and presented a numerical example to illustrate the implementation of this mechanism.

4.2. Individually Rational and Time-Consistent Imputation and Payment Distribution Mechanism.
Let  *  denote the set of realizable values of  * () at time  generated by (20).

Figure 1: The pollution abatement effort under noncooperation, where  1 and  2 denote the pollution abatement effort of governments under noncooperation in nations 1 and 2, respectively.

Figure 3: The dynamics of pollution stock under noncooperation or cooperation, where  1 and  2 denote the pollution stock under noncooperation and cooperation, respectively.

Figure 4: The instantaneous payments of nation 1 under cooperation.

Figure 5: The instantaneous payments of nation 2 under cooperation.

Table 1: The parameters of the model used in the numerical solutions.

Table 2: The optimal pollution tax rate.

 11 = 0.4,  21 = 0.3,  12 = 0.5,  22 = 0.4.   ( = 1, 2) can also be considered as pollution abatement effort efficiency, which is linked to pollution control policies and environmental governance efficiency in different nations. Generally speaking, pollution abatement effort efficiency is higher in developed countries. We assign  1 = 0.4,  2 = 0.2. Abatement costs are 0.4( 1 ()) 1/2 and 0.5( 2 ()) 1/2 for nations 1 and 2, respectively. Local pollution impacts are closely linked to the level of production activities. The accumulation of a pollution stock such as greenhouse gases often involves interactions between the natural environment and the pollutants emitted, so stochastic elements appear.