Extended Oligopolies with Pollution Penalties and Rewards

Copyright © 2018 Akio Matsumoto et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. An extended n-firm oligopoly with product differentiation is considered. It is assumed that the government selects an emission standard for the industry and based on the output and technology of each firm it selects a maximum allowed amount of emission for each firm. If the actual amount is higher than the allowed maximum, then the firm has to pay a constant multiple of the excess to the government; otherwise it is rewarded similarly based on the saved emission amount. The existence of the unique interior equilibrium is first proved, and then the effect of the level of penalty or reward and that of the emission standard on the industry output and therefore on the total emission level is also examined. Time delay is introduced into the penalties the firms have to pay and into the rewards the firms receive. In analyzing the stability of the equilibrium both discrete and continuous time scales are considered. For mathematical simplicity the case of symmetric firms is analyzed. In the discrete case the various values of the delay length are examined. The equilibrium is stable if either the total industry output is sufficiently large or the common speed of adjustment of the firms is sufficiently small. In the continuous case, either the equilibrium is always stable or stability occurs if the delay is sufficiently small and at the critical value Hopf bifurcation occurs.


Introduction
In classical duopoly models the firms are competing in a market and their production outputs or prices are the decision variables.Their profits are the differences of their selling revenues and production costs.These classical models have been extended in many different ways including the consideration of environmental issues.In this direction, the effect of different environmental regulation policies has been investigated by many researchers ( [1][2][3] among others).Mostly single firms were considered in relation to environmental R&D, and very few works were devoted to the extension of oligopoly models in this direction.Montero [3] examined the effect of R&D investment for pollution abatement technology with different environmental policies in duopolistic product markets.Static models were investigated in earlier stages; the existence of equilibrium in Cournot oligopoly with pollution treatment cost sharing was proved by Okuguchi and Szidarovszky [4], whose work was further extended by including emission standard and R&D in the oligopoly model [5].If the government is unable to assess the individual emission levels of the different firms, then it can measure only the total level of pollution, and ambient charges are introduced.In this policy [6] the government defines a cut-off for the total pollution level and, regardless of the specific emission level of each firm, all are equally punished or rewarded.Ganguli and Raju [7] demonstrated that in a Bertrand duopoly increasing ambient charges could lead to greater pollution; however Raju and Ganguli [8] showed the opposite effect in a Cournot duopoly framework when the increase of ambient charges reduces pollution.This result was further generalized by Matsumoto et al. [9] for -firm oligopolies where the stability of the dynamic model with naive expectation was also examined.If the government is familiar with the used technology and production output of each firm, then it is able to assess the proportion of each firm from the total pollution level.Therefore each firm can be punished or rewarded according to its assessed individual emission level compared to its allowed proportional maximum from the government defined cut-off threshold.In this paper this idea will be elaborated.After the formulation of the mathematical model the existence of the static equilibrium will be proved.Then the effect of penalty and reward parameters on the industry output and the total pollution level will be investigated.Assuming gradient adjustment of the firms and introducing time delay into the penalties and the rewards, dynamic models will be developed with both discrete and continuous time scales, and the stability conditions of the equilibrium will be derived, analyzed, and compared.

Model and Cournot-Nash Equilibrium
Consider  firms in an oligopoly with differentiated products.Let   be output of firm .The price of the product of firm  is seen as with  ≥ 2 and   > 0 and 0 <   < 1. Firm  emits pollution     in connection with its production with   > 0. The value of   is technology-dependent and assumed to be fixed.(In our next project we will consider the profitability of technology changes with their additional costs and environmental benefits.)The government can measure the total emission quantity and has an exogenously selected environmental standard .So the maximum allowed emission of firm  is clearly If a firm exceeds this amount then it has to pay a penalty of  times the exceeded amount, and if its emission amount is below the maximum allowed amount, then the firm is rewarded by  times the saved emission amount.With (2), the payoff of firm  becomes where   is the marginal cost of firm .Substituting (1) into (3) yields The first term corresponds to revenue and production cost and the second term refers to emission penalty or reward.
Assuming interior optimum, the first order condition implies that Notice that (5) strictly decreases in   with fixed values of   ( ̸ = ).At   = 0, its value is If this value is nonpositive, then   = 0 is optimum, which is not interior.As   → ∞, the value of (5) tends to −∞, so there is always a unique best response.For mathematical simplicity, let us assume symmetric firms in the sense that firms have identical substitutability and technology in emission production.
Since the maximum prices   are different, the firms might have different output levels.Then (5) becomes and, with notation  = ∑  =1   , we have So By adding these equations for  = 1, 2, . . .,  and dividing by , which is a quadratic equation for , At  = 0, the left-hand side is negative and as  → ∞, it converges to +∞, so there is real root.Since the constant term is negative, one root is positive and the other is negative.So only the positive root has economic meaning.Then the corresponding equilibrium levels of the firms are given by (9).

Effect of Penalty or Reward on Pollution Levels
The penalty factor  and the emission standard  are the strategic (or fiscal-policy) variables of the government and should be determined so as to maximize a social welfare function which includes the sum of the firms' profits, consumer's surplus, and any technological external effects.(Determining the optimal values of these variables could be a research subject of a next paper.)In this study, however, we treat them as exogenously determined parameters and analyze how the total production level (and therefore the total emission level) depends on  and .
Considering  as a function of , and implicitly differentiating (11) with respect to , we have implying that Based on (11), the denominator can be rewritten as which is positive.So the sign of / depends on the sign of the numerator.The first term is negative; the second term is positive.Using again (11), we see that the -multiple of the numerator equals It is reasonable to assume that the first term is negative.Therefore / > 0 if and only if Let  denote the right-hand side of this inequality and () again the left-hand side of (11); then this is the case when It is easy to see that the numerator is positive.This inequality means that increase in the value of  has an increasing effect on the total industry output as well as in the total emission level if the emission standard  is sufficiently large.Otherwise the opposite effect can be observed.
Next we examine the effect of increasing the value of .Considering now  as function of  and implicitly differentiating (11) we have showing that We already established that the denominator is positive, so / > 0 showing that the increase in the emission standard always has an increasing effect on the industry output as well as on the total emission level.From (11), we have that which is a convex parabola in  with all other parameters assumed to be fixed.Its roots are zero and The cases of  * > 0 and  * < 0 are illustrated in Figures 1(a

Dynamic Extensions and Stability Analysis
Assume the government has a time delay  > 0 in posing penalty or giving reward to the firms.If the firms use gradient adjustment, the adjustment process in discrete time scales is that turns to be the following form in continuous time scales where for  = 1, 2, . . ., . (24) In the literature best response dynamics are also used frequently; however, for linear systems, they are equivalent with gradient adjustment processes [10].We will first examine discrete time dynamics and then continuous time dynamics to detect the stability conditions.
if there is no dominant firm.Let these derivatives be denoted by 1 +   ,   ,   and   , respectively; then the linearized equation has the following form, where   and  are now their distances from equilibrium levels: It is challenging to consider this -dimensional system of delay difference equations.For the sake of simplicity, we make the following two assumptions.
Assumption 2. The firms are identical in sense that they have the same adjustment coefficients, the same reservation prices, and the same marginal costs: Assumption 3. The firms have the same initial level of output: Under these assumptions, the equilibrium levels of the firms are identical and dynamic equation ( 22) generates identical trajectories of   () for  = 1, 2, . . ., .In addition, the coefficients   ,   ,   , and   are also identical, () = () and ( − ) = ( − ).The delay difference equation is now written as where It should be noticed that, at the equilibrium, Equation ( 29) is a linear delay difference equation.A lot of effort has been devoted to detect the corresponding stability condition.According to Čermák [11], there are three versions of the stability condition so far.The following is Theorem 4 of Čermák [11] that presents a necessary and sufficient condition on  and  for the asymptotic stability of (29).(Theorem 4 is a slightly modified version of the original theorem of Papanicolaou [12].Variables  and  are introduced to be consistent with the notation of this paper.)where  ∈ (0, /).

Theorem 4. The zero solution of (29) is asymptotically stable if and only if (𝛼, 𝛽) is an internal point of the finite area bounded by two lines
In Figure 2(a), three stability regions corresponding to  = 2, 4 and 6 are illustrated.(We can construct the stability region for any number of  by applying Theorem 4.) In particular for  = 2, the stability region becomes the isosceles triangle, the right side is described by  = 1−, the left side by  = 1 + , and the base by  = −1.For  = 4, 6, the horizontal base becomes a positive-sloping concave dashed curve for  > 0 and a negative-sloping concave dashed curve for  < 0, leading to a quadrilateral-wise stability region.As the value of  increases with an increment of 2, the dashed curves shift upward and thus the stability region shrinks.On the other hand, in Figure 2(b), three stability regions corresponding to  = 3, 5 and 7 are illustrated.The stability regions with odd  have parallelogram-wise shape with nonlinear upper and lower sides which are the concave and convex dotted curves.As  increases, the upper side rotates downward-around the vertical axis and the lower side rotates upward-around the vertical axis.As a result increasing  has a decreasing effect on the stability region.The right and left sides described by  = 1 −  and  = −1 −  are not affected by a change in the value of .In short, we summarize the properties of the stability region as follows: (1) The stability regions with even  are slightly different than those with odd .
(3) The stability region is the intersection of two subregions: one is delay-independent and the other is delay-dependent.
We can derive the characteristic polynomial by looking for the solution in a special form, which is the power function: Substituting this solution into (29), we have, after simplification, In principle, finding the locations of  yields the stability conditions for the zero solution of the delay difference equation.In particular, if  = 2, then the characteristic equation ( 36) is quadratic: Asymptotic stability is guaranteed by the following conditions [10]: These three conditions construct the isosceles triangle in Figure 2(a).Returning to the model parameters, (31) and (32) immediately imply that the second condition is clearly satisfied: Since the first condition is using ( 31) and (32), it reduces to and the third condition has the form In the case with  = 2, we notice the following: (a) If  −  −  < 0, then (41) holds and ( 42) is satisfied if  is sufficiently large or the speed of adjustment  is sufficiently small.
(b) If  −  −  ≥ 0, then the discrete system is stable if  is sufficiently large or the value of  is sufficiently small.
An equivalent condition can be given by rewriting ( 41) and ( 42 ( Letting the stability condition becomes where () denotes the left-hand side of (11).
If  = 3, then characteristic equation ( 36) is cubic: The stability conditions (except trivial one) are as follows [13]: These three inequalities construct the parallelogram-wise stability region.The parallel negative-sloping lines are described by the first two conditions of (47) with the equality.In the same way, replacing the inequality of the third condition of (47) with the equality and solving for  yields two solutions: The first equation presents a positive-sloping convex dotted curve in Figure 2(b), some part of which overlaps the upper side of the stability region whereas the second equation gives a positive-sloping concave dotted curve, some part of which overlaps the lower side.
As is already seen, the first condition of (47) always holds.The second condition can be written as The inequality holds for sufficiently small , ,  and sufficiently large .The third condition is equivalent to since  < 0 from (32).The right-hand side is positive for  > −1 with which this inequality holds as  < 0.
The absolute value of  can be small when , , and  are small and/or  is large.Therefore, the stability conditions for  = 3 might be satisfied if , , and  are sufficiently small and/or  is sufficiently large.For larger  values, the stability conditions in terms of the model parameters become even more complicated although the stability regions can be visualized due to Theorem 4.

Continuous Time Model.
We now draw attention to the dynamic equation ( 23) that is a nonlinear delay differential equation.If the right-hand side of ( 23) is denoted by    , the followings are easily verified: Under Assumptions 1 and 2, the firms become symmetric and linearizing (23) in the neighborhood of the equilibrium yields q () = ( − 1)  () +  ( − ) .
This is a linear delay differential equation and its asymptotic behavior depends on the location of the eigenvalues.As usual in the theory of delay differential equations [14], we look for the solution in exponential form: then we have the characteristic equation: As  = 0, when the equation is without delay, the characteristic root is negative: implying that the system is asymptotically stable.Stability switch occurs if  =  with some  > 0, when Separating the real and imaginary parts, where  +  − 1 < 0 is already confirmed.In our case The sign of the braced terms on the right-hand side of (60) is ambiguous.Since  +  − 1 < 0, there is no solution for  if the right-hand side is nonnegative, and therefore there is no stability switch.If it is negative, then there is a unique value of , Notice that this is the case if which occurs if environmental standard and/or penalty/ reward factor is large or the industry output and/or substitutability factor is small.Hopf bifurcation is used to find the direction of the stability switch.Let  be the bifurcation parameter and consider  as function of :  = ().By implicitly differentiating (54) with respect to , we have (65) Hence the real part is showing that the sign of the real part changes from negative to positive as the value of  increases from zero, so stability is lost.That is, at the smallest stability switching point, stability is lost and it cannot be regained later.If there is positive solution for , then || > | − 1|, and since from ( 57) and (58) we know that sin  > 0 and cos  < 0, the smallest (critical) value of  is the following: At the critical value of  there is the possibility of the birth of limit cycles.From (60), we know that the system is always asymptotically stable if From (11), the left-hand side equals

Concluding Remarks
This paper examines -firm oligopolies with product differentiation when the firms face penalties or rewards depending on the amounts of their pollution levels.The government selects an emission standard for the entire industry, and based on the specific technology and output of each firm, the government determines its maximum allowed emission level.The amount of penalty or that of the reward is determined by the difference of the actual emission level and the maximum allowed amount.This simple mathematical model can be considered as the counterpart of models with ambient pollution charges discussed earlier in the literature.The existence of the unique interior equilibrium is first proved, and then the effects caused by changes in the strategic or policy variables,  and , of the government are examined.We establish that an increase in the value of  has an increasing effect on the total pollution level if the emission standard  is sufficiently large; otherwise the opposite effect occurs.An increase in the value of  always has an increasing effect on the total pollution level of the industry.Dynamic extensions are introduced and the stability of the equilibrium is examined with both discrete and continuous time scales.In the case of discrete time scales, the different values of the delay length are assumed in the penalty and reward terms and in the continuous time scales, a positive delay is introduced.In the case of discrete time scales the equilibrium is stable if either the industry output is sufficiently large or the common speed of adjustment of the firms is sufficiently small.In the case of continuous time scales the equilibrium is always stable if the total marginal profit of the firms at zero output levels is nonnegative; otherwise it is stable if the length of the delay is sufficiently small.At the critical value of the delay Hopf bifurcation occurs.Symmetric firms are assumed for mathematical simplicity, in which case the equilibrium could be given analytically; however, in the nonsymmetric case, it is not possible making the further analysis much more complicated.This issue will be the subject of our next project.We will also examine the possibility of technology changes.In this case the additional costs and reduced penalties have to be compared.
) and 1(b).With positive values of  > 0,  increases in  in both cases illustrating the conclusion based on relation (19).
69) so the system is stable if multiplier of  is nonnegative.(The inequality   >   is usually assumed to have a positive level of output of firm k in a standard Cournot oligopoly model.So if me is small, then   ≥   + .However, me can be large enough to have   <   + .)Notice that the multiplier of  is the total marginal profit of the firms at zero environmental standard when all firms have zero production levels.