Complementary Variety: When Can Cooperation in Uncertain Environments Outperform Competitive Selection?

Evolving biological and socio-economic populations can sometimes increase their growth rate by cooperatively redistributing resources among their members. In unchanging environments this simply comes down to reallocating resources to fitter types. In uncertain and fluctuating environments, cooperation cannot always outperform blind competitive selection. When can it? The conditions depend on the particular shape of the fitness landscape. The article derives a single measure that quantifies by how much a resource redistribution intervention in stochastic environments can possibly outperform the blind forces of natural selection. It is a multivariate and multilevel measure that essentially quantifies the amount of complementary variety between different population types and environmental states. The more complementary the fitness of types in different environmental states, the proportionally larger the potential benefit of strategic cooperation over competitive selection. With complementary variety, holding population shares constant will always outperform natural and market selection (including bet-hedging, portfolio management, stochastic switching). The result can be used to both determine the acceptable cost of learning the details of a fitness landscape, and to design multilevel classification systems of population types and environmental states that maximize population growth. Two empirical cases are explored, one from the evolving economy and another one from migrating birds. from information theory (Cover & Thomas, 2006; Shannon, 1948). We measure selection forces with the Kullback-Leibler divergence (so-called relative entropy). This also allows us to derive a straightforward way to analyze multilevel dynamics over nested levels of coarse-graining in a single equation.


Introduction
According to Darwin, natural selection "is as immeasurably superior to man's feeble efforts, as the works of Nature are to those of Art" [1, p. 65].Nevertheless, we often have the ambition to go beyond the contributions competitive selection among different types has to offer.We know of a large variety of redistribution strategies in nature and society that aim at creating fitter populations.For example, in economic evolution, conscious portfolio managers can increase the growth rates of the entire market by constantly reshuffling resources despite current market selection [2][3][4][5].In the political economy, countercyclical fiscal policies have been a common policy tool over the last century [6,7].And, in biological populations, stochastic phenotype switches redistribute phenotypes among offspring to offset the acting forces of selection [8][9][10][11][12].
The details of the implementation of those redistribution mechanisms might be diverse but do not matter for our purposes.We ask under which conditions it is possible that an interventionist resource distribution strategy among types can outperform blind competitive selection among types in a stochastic environment.What are the conditions under which, for the sake of population's growth, it is more efficient to relocate resources among its individuals than to let nature and markets do the selection?We answer the question by coining a new term: complementary variety.We find that it is only beneficial for members of a population to cooperate if different population types complement each other's fitness in different environmental states.We also see that, with complementary variety, it is always beneficial to cooperate in the light of an uncertain future environment.Complementary variety provides the sine qua non boundary condition for beneficial resource redistributing cooperation in stochastic environments.The contribution of the article consists in providing a single measure that quantifies the complementary variety of a population in a specific environment and linking it to the potential gain of strategic intervention.

The Fitness Gains of Intervention.
For the case of a constant environment (a single environmental state), during Figure 1: Fitness landscape where the bottom dimension refers to different population types (groups of the overall population) and environmental states (temporal periods), while the height represents fitness (the growth rate of each type in each state, following the traditional definition of fitness in fitness landscapes (28)).Both (a) and (b) represent the same case, while (a) is coarse-grained into 8 population types and 8 environmental states; and (b) is fine-grained into 28 population types and 36 environmental states (based on data from [22][23][24]).
which population types have stable relative growth rates, the best resource distribution strategy always simply comes down to placing all resources on the fastest growing type.If the fitness matrix does not change as a result of this intervention (no density dependence), the population fitness after this strategic intervention will be equal to the growth rate of the fittest type, which is always at least as large as the population fitness eventually attainable through handsoff natural selection.In the first case, we say that types use strategic cooperation ( SC ) to maximize overall population growth, while, in the second case, we talk about hands-off competitive selection ( CS ).In order to obtain a gain from strategic cooperation, we need to identify the growth rate of each type/group and make the effort to redistribute resources to the type with maximal fitness.This might have a cost, but as long as the obtained fitness gain of the intervention ( SC −  CS ) is larger than the involved cost, it is still worthwhile.We are interested in the potential benefit of this sort.
What about fluctuating environments during which the growth rates of different members change over time?For example, Figure 1(a) shows a fitness landscape in which types have different fitness values in different environmental states that appear with different probability.Our first research question relates to such cases: RQ1: When can strategic cooperation outperform competitive selection in stochastic environments?
In terms of the previously introduced notation, we ask for a measure of the fitness matrix that allows us to quantify when a resource redistribution strategy in fluctuating environments is worthwhile:  SC −  CS > 0. This quantity also gives us an upper bound for the maximal cost for which it is profitable to obtain information about the fitness landscape and on executing a resource distribution mechanism.Only if there is a potential benefit for intervention do we need to study the landscape.Otherwise we just let blind selection run its course.
As a general lead, we know the optimal strategy or, in the words of Richard Levins, the "correspondence between the optimal structures of populations and species and the pattern of environmental heterogeneity in space and time" [13].It is provided by the literature of bet-hedging [2,[9][10][11][12][14][15][16][17][18], also known as stochastic switching in biology [8,19] and logoptimal portfolios in economics [3][4][5]20] (see Supporting Information S4, in Supplementary Material available online at https://doi.org/10.1155/2017/5052071,for an introduction to this literature).This literature shows that in some cases one can increase population fitness beyond the growth rate obtainable by any single type by spreading resources among different types.This is done by constantly redistributing resources back to the same (optimized) distribution of resources among types, counteracting the up and down of the changes produced by the varying fitness landscape.It can be shown that bet-hedging achieves optimal long term population growth [21].In the worst case (under some mathematical conditions referred to as being "outside the region of bet-hedging"), bet-hedging optimization suggests betting all resources to the fastest growing type, which is equivalent to the optimal strategy for unchanging environments [10,[14][15][16].
Unfortunately, the literature of bet-hedging is surprisingly quiet about the general boundary conditions under which it makes sense to apply it.RQ1 explores these boundary conditions and we generalize it to even nonoptimized strategies.With "strategic cooperation" we refer to any redistribution of resources that holds population shares constant over time.In a stochastic environment with uncertainty about the realization of the next environmental state, that is the best one can do.In an uncertain stochastic environment, we know the distribution, and during our strategy we might sometimes win and sometimes loose as lady luck rolls the environmental dice.However, we aim at winning on average, holding on to our strategy in every turn.In the final discussion we relate our generalized results back to the literature of optimized bethedging, which turns out to be a special case of our result.Our results will therefore provide the conditions sine qua non for the applicability of bet-hedging.

The Fitness Gains of Coarse-Graining.
The foregoing question leads to the related question when it is worthwhile for the population to undergo the effort of working with a more fine-grained fitness landscape in space and time.If it pays off to allocate resources on the fitter type in a constant environment, then it is also always beneficial to fine-grain the population by splitting types into subgroups.This is because-per definition-the growth rate of a more aggregate higher level type is the average between subtypes of which at least one has to have higher fitness than the average (or at least all equal).We can then allocate all resources on this even fitter lower level type.This shows that the question of optimal intervention is inherently linked to the level of coarse-graining of population types.
For example, both Figures 1(a) and 1(b) show the same empirical evidence over the same period, with the difference that the latter distinguishes among almost 16 times more points in the fitness landscape, while the former averages over these details.The 36 environmental states and 26 population types of Figure 1(b) are coarse-grained into 8 states and 8 types in Figure 1(a).
The question is equivalent to asking when it pays off to learn the detailed particularities of a fitness landscape to execute resource redistribution strategy among more finegrained subtypes (such as Figure 1(b)) and when it is enough for the population to simply consider a more coarse-grained outlook (such as Figure 1(a)), which implies letting the "blind forces" of natural selection "ride the waves" of the lower levels of the landscape.
This question is essentially the same as our previous question and comes down to quantifying the potential benefit obtained by a strategic intervention on the more detailed fitness landscape.If  SC −  CS < 0, we might as well let competitive selection ride over the more detailed peaks and valleys of the fitness matrix in a blind manner.
In practice we see that fine-graining seems to pay off, as we observe that both biological and social evolution seem to distinguish among ever more detailed characteristics.Throughout history, the increasing number of branches of the tree of life has grown life's diversity [26], and the evolving economy, once made of a handful of generic professions, adds new job types much quicker as it takes old ones away, evolving an ever more fine-tuned division of labor [27].At the same time, evolving populations also seem to distinguish among more fine-grained environmental states.Animals that learn how to read weather situations in more detail increase survival chances [28], and data analytic companies that distinguish among ever more detailed market conditions maximize sales [29,30].While both processes seem to be general tendencies, it is not clear which one of them is more beneficial and how they relate to each other.We can analyze these questions by applying the newly developed methodological toolkit from RQ1: RQ2: How much fitness can potentially be gained by disaggregating the fitness landscape in more detailed population types and/or in more detailed environmental states?1.3.Outline.The following section answers RQ1 by deriving our theoretical toolkit.As a first step, we compare the achievable fitness of competitive selection and strategic cooperation with help of the more common analytical frameworks of the variance and covariance of fitness.This is useful to build basic intuition but is limited to simple cases.In order to advance to more realistic multivariate cases, we subsequently analyze the same setup with the help of a measure from information theory [21,31].We measure selection forces with the Kullback-Leibler divergence (so-called relative entropy).This also allows us to derive a straightforward way to analyze multilevel dynamics over nested levels of coarse-graining in a single equation.
We then continue to explore RQ2 by applying the developed measures to two empirical examples from biology and the evolving economy.We analyze in which cases it would increase the fitness of the population of migrating birds in North America if they would select their temporal homes according to more detailed geographical locations and if the export portfolio of Japan's economy would benefit from creating a trade strategy that goes beyond a coarse-grained distinction of manufactured and nonmanufactured goods and would make more detailed distinctions among subtypes such as machinery, chemicals, and oils.
In the Discussion, we link the identified concept to the well-established literature on bet-hedging and review both limitations and arising research questions in the conclusions.

Theoretical Methods: RQ1
The evolving population is subdivided into different population groups , with each group-type  consisting of a certain number of discrete units that appear with frequency ().Types can be identified by any exclusive and exhaustive variable or property of a group, such as allele, genotype, phenotype, location, attribute, characteristic, or community membership.The application of the resulting equations does not depend on the choice of any ecological, morphological, physiological, molecular, behavioral, geographic, sociodemographic, or any other way to partition the population into types.
Population fitness   is the factor of reproduction or growth of the total number of population units .The population fitness indicates the average number of offspring: Table 1: Fitness landscapes in matrix form with types  and environmental states  and corresponding fitness values (, ).The larger fitness value of the binary type split (horizontal) is denoted with capital letter  and the less fit of the two with the small letter .Despite this simple case of two types, the covariance relation of (2) also holds also for more than two types.(a) schematizes a positive covariance between type fitness in environments  =  and  = , while (b) schematizes a negative covariance, with superior type fitness along the diagonal of the matrix.If the environment changes, types can have different fitness values in different environmental states.In a stationary time series, each environmental state has a certain probability of occurrence and can be represented by a random variable with distribution () (for simplicity assumed to be discrete).The fitness of type  in each environmental state  is represented with the following: (, ) (see Table 1 for the basic setup of a fitness matrix between types and environmental states).
Much in the sense of game theory, we ask about potential fitness benefits of strategic cooperation if resources can be redistributed among types.Resource redistribution converts units from one type into units of another type.Such interventionist strategies are not possible for all evolving populations (genetic material cannot easily be "redistributed").It is straightforward to apply such strategy to social populations in which the population units  represent redistributable resources (like money).It is also applicable to biological populations where a genotype can reproduce a nonproportional share of phenotypes (so-called "stochastic switching" [8][9][10][11][12]) and when organisms with cognitive abilities change their behavior according to different environmental patterns.
The constant redistribution of strategic intervention implies counteracting the forces of competitive natural selection to persistently return to the same stable portfolio distribution.In a varying environment, this means that the population before updating in any environmental state is the same (( | ) = constant for all e).For our comparative purposes between the mean population fitness of strategic cooperation ( SC ) and competitive selection ( CS ), we set the initial population distribution for the case of natural selection equal to the stable portfolio population distribution for the case of strategic intervention:  0 () = ( | ) for all .This makes it straightforward to compare both cases directly with each other.

One-and Two-State Environments:
Variance and Covariance.We start by analyzing the two simple cases of unchanging environments and of stochastic environments with two states.We can analyze both in terms of more common variance and covariance measures.
For unchanging environments where the fitness of each type is unchanging, Fisher's fundamental theorem of natural selection applies.It equates the rate of increase in mean population fitness to the variance in fitness of its types [32].After about half a century of much confusion in the literature, it was finally clarified that Fisher meant his theorem only to apply to this case of unchanging type fitness, as only in this case can natural selection select consistently among types [33][34][35].What is sometimes confusing is that Fisher regarded all possible exogenous and endogenous reasons for changing type fitness as a matter of the environment (such as dominance, density dependence, epistasis, climate, and interactions with other species).He therefore defined the state of fixed type fitness as one single (unchanging) environmental state: "what Fisher's theorem tells us is that natural selection [⋅ ⋅ ⋅ ] at all times acts to increase the fitness of a species to live under the conditions that existed an instant earlier" [35, p. 131].
In this case, any strategic cooperation that holds shares of types constant over time can never lead to higher population fitness than competitive selection.By artificially holding the shares of types constant, selection cannot contribute to the increase of population fitness by increasing the share of the fitter types: The best one can do is to equate both by mimicking the end-result of competitive selection (only the fittest type survived) and place all resources on the fittest type (equation [S5] in Supporting Information).
If there are two different environmental states, the role of the variance can sometimes be represented as a covariance of type fitness in both environmental states.This applies to a special case where both environmental states occur with equal probability (in line with Table 1, where ( = ) = ( = )).
It turns out that no strategic cooperation can be beneficial if the fitness values among different types in different environmental states do not complement each other.Complementarity means that some types grow faster in some environmental states, while other types grow faster in other environmental states (on average).We can use the covariance of type fitness in both environmental states to quantify the average tendency of type fitness in both environmental states.Table 1(a) provides an illustrative sketch of a positive correlation: here type  is the fitter type in both environmental states  and .In this case there is no complementary variety and it is not possible to increase population fitness by a strategic cooperation.This also means that it is not even worthwhile to distinguish between those two environmental states  =  and  = .We might as well simply let natural selection run its course and not worry about the environmental distribution and its more fine-grained fitness landscape.The average population fitness achieved by competitive selection ( ,  CS ) will always be greater than or equal to the population fitness achieved by a strategic cooperation ( ,  SC ) (Supporting Information S3.1): where ,  is the geometric mean of the population fitness over environmental states: This shows that a negative covariance is the necessary condition for a potential benefit of strategically cooperating among types by holding the type shares constant (in the optimal case through portfolio theoretic bet-hedging).The schematization of a negative covariance in Table 1(b) visualizes that this implies that there is a complementary variety among the different types in the two environmental states: one type that is more specialized for one environmental state and the other type for the other state.
The covariance measure provides useful intuition for the concept of complementary variety of a fitness landscape.The problem with it is that it can only handle two covarying environmental states.We will therefore develop the same ideas by using the information theoretic measure of relative entropy, which allows quantifying the same logic for any kind of multivariate environmental distribution with any number of stochastically varying environmental states.

Multistate Environments: Relative Entropies.
When working with information theoretic decomposition of fitness we usually represent fitness on a logarithmic scale.This does not change the generality of the results.Logarithmic growth factors are the norm in economics and in biology often referred to as Malthusian fitness [32,36,37].It is useful since fitness "adds up" through multiplication over time (∏ ) and the logarithm converts this geometric process into a more intuitive linear summation (log (∏ ) = ∑ log ()).We will use logarithms of base 2, which represents fitness in terms of the number of population doublings at each time step and the arising informational quantities in bits.

Single
Step Updating.We start by expressing a simple replicator step in terms of relative entropy.The proportion of a type after reproduction is defined by the so-called "replicator equation":  +1 () = () * ()/  .The log population fitness can be decomposed in the following way (Supporting Information S2): where [⋅ ⋅ ⋅ ] stands for the expected value of population types ([⋅ ⋅ ⋅ ] = ∑  () * [⋅ ⋅ ⋅ ]) and  KL is the relative entropy or the Kullback-Leibler divergence:  KL (() ‖  +1 ()) = ∑  () * log(()/ +1 ()) [21,38].The relative entropy  KL is perhaps the most general measure of information theory and has Shannon's famous absolute entropy and mutual information as special cases (for a short primer on  KL , see Supporting Information S1).It has some insightful information theoretic interpretations as the inefficiency of an average code (see S1) but is essentially a directed measure of the degree of divergence between two distributions, in our case  and  +1 (see Figure 2). KL is always nonnegative and only zero if there is no change in the population, with  =  +1 [21].
As such,  KL can be used as a measure that quantifies the strength of evolutionary selection [39][40][41][42].As shown by Frank [43][44][45],  KL can be related to the variance in Complexity type fitness and therefore to Fisher's fundamental theorem of natural selection [32].The larger the variance among the fitness of different types, var[()], the larger the difference between () and  +1 () and the larger  KL (() ‖  +1 ()).

Figure 2 visualizes this relation.
With this in mind, (3) says that population fitness is large when the expected value term is large and the  KL term large.The expected value is taken over the updated generation  +1 and is therefore large when the fitter types increase and the less fit ones decrease their shares, while  KL is small when the population has reached equilibrium and does not change anymore (being zero with no divergence, at  +1 = ).Equal to our previous argument for unchanging environments, this provides another way of showing that the mean population fitness   is at its maximum if we move all resources to the fittest type.In this case the expected value term is at its maximum and the  KL term at its minimum.

Average Updating.
When considering multiple stochastic environmental states, we apply the decomposition of (3) to the joint probability distributions between the population and the environment, (, ) [46].This leads to the following version of our previous equation (see Supporting Information S2): A subtle but very important detail of this decomposition is that in this case  − and  + refer not to the initial and final distribution of the entire observational period (the superscript + is not equal to the superscript +1 we used before) but to the average distribution before and after typical updating over these different environmental states.
For all practical purposes, these average distributions can be calculated in the following way.(),  (), and (, ) can be detected empirically by measuring the fitness values as geometric means during a given environmental state.This defines the average population distributions in a certain environment,  − ( | ), since  () = ∑  ( | ) * (, ).Based on this result we can calculate the average population share after average updating in a certain environment by using the replicator equation: Note that both resulting distributions  − ( | ) and  + ( | ) represent average distributions which depend both on the fitness matrix and also on the timeframe of evolutionary observation (which affects the average population fitness per environmental state  ()).Finally, we can calculate the joint distributions  − (, ) and  + (, ) by multiplying the conditional distributions per environmental state with the observed environmental distribution ().The relative entropy  KL in (4) works with these joint and conditional distributions.
The question of whether competitive selection is potentially superior to strategic cooperation turns out to hinge on the superiority of the arithmetic mean of the geometric mean or, vice versa, the geometric mean of the arithmetic mean: where   [⋅ ⋅ ⋅ ] stands for the expected value over all types and   for the number of periods of occurrence of environmental state  during the timeframe of observation.It turns out that this question is equivalent to the question of whether the average relative entropy between the first and the last periods of evolutionary observation is proportionally larger or smaller than the joint relative entropy during average updating.We quantify this with The term  CS is the Kullback-Leibler divergence that measures the average strength of selection between the first and the last period of the entire observational period, while in between natural selection runs its course. 0 stands for the distribution of the initial population and   for the final distribution of the population after the total number  of observed periods of the evolutionary dynamic ( = ∑    ).This is much in line with our (3), since we simply look at population change between two points in time.The term  SC is in line with (4).It quantifies the average updating per period that is counteracted by a resource distribution strategy.Since  SI measures the average updating per period and  CS the span over all periods, the former is multiplied with the total number of periods .
Supporting Information S3.2 shows Δ KL is proportional to the potential increase in population fitness Δ This quantifies the level of complementary variety in the fitness landscape.To see this, note that, during each new period (and therefore at each environmental state ), strategic cooperation assures that  − ( | ) is held constant at the same distribution as was set at the beginning ( − ( | ) =  0 () for all ).Based on this stable portfolio distribution,  + ( | ) is calculated on the basis of the replicator equation and therefore considers the occurring selection forces despite any intervening redistribution.This is justified because, even with a proactive strategy that keeps shares constant, some types effectively increase and others decrease their share temporally in a particular environmental state.For example, a portfolio manager on the stock market has real gains and losses as market selection effectively acts on the portfolio but constantly counteracts them by constant intervention that counterbalances market selection in order to keep the portfolio stable. SC quantifies the average strength of selection that is counteracted by the strategy.
With complementary variety in the fitness landscape, the fitter types in one environment are mainly the less fit types in the other environment, and selection works in opposing directions in different environments.As a result, growth tendencies in the different environmental states rather balance each other out by shifting gains back and forth.There will be relatively little change between the population distributions of the first and the last periods.The divergence  CS ( 0 ‖   ) will be relatively small.However, the fact that selection conditioned on one environmental state will counteract selection in another environmental state increases the aggregated conditional divergence over all environmental states:  SC ( − ‖  + ) (for more on conditional Kullback-Leibler divergence see Supporting Information S1).The counterbalancing tendencies represent complementary variety.
In this sense, in line with the interpretation of  KL as a measure of the strength of selection (see Figure 2), it can be used as a nonnegative multivariate measure to quantify the amount of complementary variety among fitness values in different environmental states.The more the fitness landscape is skewed to opposing directions in different environmental states, the larger the average  SC divergence of different environmental states is and the proportionally larger the potential benefit of strategic cooperation is.
The surprising finding of ( 6) is that the existence of complementary variety assures that any population distribution that is held constant will always increase population fitness in comparison to competitive selection.It is not necessary to calculate the optimal population distribution in the sense of an optimized bet-hedging strategy.The law of complementary variety states that any intervention that holds population shares constant will always outperform hands-off natural selection, if the environmentally conditioned relative entropy over all periods is larger than the relative entropy between the first and last period.

Coarse-Graining of Types and States.
We now link our new results to the previously introduced question of the benefit of coarse-graining.One of the benefits of working with relative entropy  KL is that it naturally lends itself to multilevel decomposition.This is done through the chain rule of relative entropies [21], which exploits the nestedness of conditional probabilities.
The question of course-graining applies to both population types and environmental states.For both we track the levels of coarse-graining with a subscript index  = {0, 1, 2, 3, . . ., }, in which the first level  = 0 refers to the most coarse-grained aggregation.
For the coarse-graining of types,  0 represents the entire population and the lowest level  =  refers to the most fine-grained level of indivisible individuals, with the set   containing  different members.The intermediate level consists of nested subdivisions of higher level types into lower level types,  +1 ∈   .This does not require that each higher level type consists of the same number of lower level types.In other words, levels consist of nested hierarchies, where a lower level is some kind of subdivision of higher level types into lower level types.The shares of the types in lower levels,  2 , are conditioned on a higher level type,  1 : ( 2 |  1 ).For the corresponding type fitness, nested conditioning does not have an effect and ( 2 |  1 ) = ( 2 ).In this sense, unconditioned higher level fitness is the average of conditioned lower level constellations: (  ) = ∑  +1 ( +1 |   ) * ( +1 ).
For the coarse-graining of environmental states, the ultimate lowest level  depends on the smallest meaningful discretization of time.The highest level  0 represents the entire period of observation.If we take the geometric mean of the growth rates of smaller periods, we can calcualte one single average growth rate over all periods.Any subsequent lower level environmental states are the temporal subperiods that create any aggregated higher level geometric mean fitness: ( 1 ) = ∏  ( 2 , ) () (see Supporting Information S3.3 for an example).
Using these two ways of coarse-graining types and states allows us to expand (4) over multiple levels.The logic is similar to Price's well-known recursive multilevel decomposition of evolutionary dynamics [43,44,47,48].We decompose type fitness in a recursive manner by applying the same logic with which we decomposed population fitness.Equation ( 8) works with joint probabilities of  and .However, for readability, we use shorthand subscripts, such as  2|1 , to indicate the nested conditioning of the joint distribution ( 2 ,  2 |  1 ,  1 ).For reasons of readability, we will focus on the population level,  = 0, different types,  = 1, and subtypes,  = 2, but in general this logic can be followed for any   and  +1 .Using the same shorthand, ( 8) is the same as (4): log The term in the curly brackets decomposes each type of fitness from level 1 into conditioned lower level types on level 2 by applying a conditional version of the same equation.This results in a  KL term for the higher level and (a conditional) one for the lower level.
We can now apply the chain rule for relative entropy to (9), which brings us back to the form of (4), just on the more fine-grained level (see Supporting Information S1): The nested nature of hierarchies results in the fact that the sum of the conditional relative entropies over all levels (see (9)) is equal to the unconditioned relative entropy on the lowest (most fine-grained) level of types (see (10)).We can always apply the chain rule from the highest level to the lowest discernable level of indivisible individuals, with maximally log 2 () nested levels of types (since the smallest possible group size at each level is 2).).The 8 main export groups of the first-digit classification are aggregated into two binary groupings on higher levels.The highest level distinguishes between basic goods and manufactured goods.Basic goods consist of alimentation and fuels and minerals and manufactured goods of oils and chemicals and manufactured devices (semilog plot).
Equations ( 9) and ( 10) allow us to explore multilevel trade-offs in strategic cooperation.Equation ( 9) proposes that we can intervene and redistribute at different levels.We can redistribute resources among types of level 1 and/or for subtypes within those types.This leads to a well-known problem to the literatures of both evolutionary biology and social science.On the one hand, cooperating on the lower level aims at increasing the fitness of a specific population, which ultimately aims at increasing the competitiveness of this population with other populations on the next higher level.We do strategic cooperating on the lower level in order to influence competitive selection on the higher level.On the other hand, strategic cooperation on certain levels sets constraints on lower levels.For example, if governments cross-subsidize among industries or sectors, it does constrain market forces on the firm level and potentially hamper the emergence of disruptive innovations.In practice, the viability of any strategic cooperation among lower level types (see (10)) depends on the reach of resource redistribution flexibility.In some cases resource redistribution might be restricted to a particular higher level type (within an economic sector or genetic family).

Empirical Results: RQ2
Equipped with the theoretical toolbox that we developed to answer RQ1, we now explore RQ2 with two empirical cases.The goal is to explore the involved orders of magnitude of fitness gains arising from strategic cooperation.We do so by calculating the potential gains of optimized strategic cooperation for a successively fine-grained fitness matrix given by empirical data.We then quantify fitness gained by distinguishing more detailed population types and by considering more fine-grained environmental states.
The first case is taken from the evolving economy, that is, Japan's trade export (data taken from [24], that is, the cleaned version of [22,23]) (see Figure 3).The second case uses the data provided by the North American Breeding Bird Survey and divides the population up into geographic locations of birds [25].It is important to emphasize that the idea of the following applications is not to learn anything new about Japanese exports or North American birds but to study the behavior of our equations when applied to real-world data.
This leads us to the question of how to define population types and environmental states.The United Nations Statistical Commission reports working on the basis of 870 different national classifications from 154 countries to identify economically relevant types [49].The question of meaningful environmental states is even trickier.An environmental state can be defined as any recurring characteristic in time, such as the weather, the state of the global economy, or seasons, such as "every other year." This being said, we can use our decomposition for two goals.We can use it descriptively, by working with some arbitrarily chosen criteria for types and states, and then calculate how much we could improve fitness if we would strategically intervene.We can also use it normatively, by defining states and types in a way that aims at creating complementary variety.This means that we create environmental states in which different types complement each other and/or vice versa.
In the following, we do not explore the question of the best way of doing this, but we opt for a simple mixed approach.For the classification of types, we use traditional groupings from existing taxonomies: UN export classifications (see Figure 3) and geographic boundaries (see Figure 4).Based on these types, we then use a simplistic way to create a certain degree of complementary variety through the definition of corresponding environmental states: we define an environmental state as the state during which a particular type grows faster than any other type in the population.This can provide as many environmental states as types (one environmental state that is optimal for one particular type) but can also result in less environmental states than types (since a type could never be the fittest).

Japanese Exports.
In 1974, the total population  0 of the Japanese export economy counted US$ 54.7 billion and in 2010 some $701.1 billion.This results in a compound annual growth factor ,  of 1.07341 over 36 periods.We decompose this total population over three nested levels to the 8 main export groups of the official first-digit classification of UN COMTRADE SITC rev. 2 (see Figure 3).
The first row in Figure 5 successively fine-grains the total export population  0 into finer types (from 2 types at the first level types  1 , to 4 types at the second level  2 , to eight types at  3 ).Naturally, at each lower level we discover faster growing subtypes.Fitness optimization suggests placing all resources on the fastest growing type, which increases population fitness (see shaded types in Figure 5).We calculate the potential gain of this strategic intervention Δ ,  as the difference between the reallocated resources and the empirically detected growth rate (competitive natural selection).For example, for the case "Space 2 Time 1" in Figure 5   = 1.0738 − 1.07341 = 0.039%.This case is visualized in Figure 6(a).
We now consider respective environmental states.For a binary split of the Japanese export economy, we end up with two environmental states: one in which basic products grow faster than manufactured products (which happens in 15 of the 36 years in our sample or () = 0.42) and vice versa for the other type (see Figure 5, "Space 2 Time 2").
Fitness optimization over this 2-by-2 fitness matrix suggests a bet-hedging strategy that allocates a constant share of 55% of available resources to basic export products and 45% to manufactured products.This is visualized in Figure 6(b).The rest of the first column of cases in Figure 5 fine-grains the entire period into 4 and then 8 different environmental states.Figure 5 also shows the values for (4); that is, ,  = 2 [log ] * 2  KL ( − ‖ + ) (exponentiation gets rid of the logarithm).
Optimizing the fitness landscape with four population types and two environmental states (case "Space 4 Time 2" in Figure 5) suggests neglecting exports in alimentation and in technology products and holding a constant share of 33.8% of the resources for fuels and minerals and 66.2% for oils and chemicals.Going one level deeper and distinguishing among eight types suggests placing all resources on the subtype mineral fuels ("Space 8 Time 2" in Figure 5).
The rest of Figure 5 continues more fine-grained distinctions among population types and among lower level environmental states and mainly speaks for itself.Figures 6(c) and 6(d) visualize the cases with a square fitness matrix (equal number of types and states), with four types and states (Figure 6(c)) and eight types and states (Figure 6(d)).The values in Figure 6(d) are the same as in Figure 1(a), just discretized.

Migrating Birds.
The following example asks about potential fitness gains if the North American population of migrating birds would distinguish among more finegrained geographic locations.If the bird population sensed differences in growth potentials across North America and if it were free to reallocate to exploit stationary environmental patterns, how much could the population of migrating birds gain from more detailed distinctions?
The overall population ( 0 ) increased from 1,520,305 birds in 1980 to 1,896,379 birds in 2014.This results in a compound annual growth factor ,  of 1.00652 (geometric mean over 34 periods).The database provides migrating bird populations according to political states in North America [25].Figure 4 shows the selected coarse-graining between north and south (referring to our first level types  1 ), then east and west (adding the second level types  2 ), and then a more fine-grained distinction between midnorth and midsouthern states to distinguish among 8 different thirdlevel groups  3 (roughly following different climate regions).
We follow the same logic as we did for the case of Japanese exports in Figure 5. On the first level we distinguish only two types in space (North and South of North America) and treat entire period of 34 years as one single updating reproductive cycle.Optimal growth with a binary distinction between north and south occurs when some 47% of the birds migrate to the Northern part (which is favorable 44% of the time).This strategic intervention contributes almost 1% of additional fitness to the population (Δ ,  = 0.00969 or about 15,000 additional birds).On the lowest level we distinguish among eight types in space and eight environmental states in time.Growth optimization here recommends that some 20% of the birds migrate to the Midnorthwest, while the remaining 80% should migrate to the Midnortheast (see Figure 4).This  4 (see for full names of types).Vertical environmental states are identified by superior relative fitness of a type within a squared split (see text).Fitness optimization suggests placing resources only at the shaded types, while nothing should be bet on the nonshaded ones.Optimal population shares were calculated with constrained nonlinear optimization algorithms in R (NLoptr).The values we calculated with the Augmented Lagrangian Algorithm, using the Broyden-Fletcher-Goldfarb-Shanno method (auglag-lbfsg), and confirmed with the Sequential Quadratic Programming method (auglag-slsqp), as well as with Sequential Quadratic Programming (slsqp).fine-grained distinction would add some 1.2% of additional fitness to the population growth rate.

3.3.
The Fitness Gains of Fine-Graining. Figure 7 visualizes the proportionality relation of complementary variety from (7) for the different cases of Japanese exports and migrating birds in North America.Each point refers to a specific coarsegraining and a specific strategic intervention.The larger the complementary variety, the larger the fitness value of strategic cooperation among types stemming from fine-graining.
Figure 8 visualizes the resulting fitness gains of strategic cooperation according to increasingly detailed population types (horizontal -axis) and environmental states (vertical -axis) (i.e., Figure 8(a) visualizes the values reported in Figure 5).It shows that more detailed distinctions among types tend to lead to higher fitness gains than more detailed distinctions among environmental states.It turns out that, in both cases, the distinction among more environmental states than the number of population types actually slightly decreased fitness: the highest fitness increase was detected with as many types as states (a squared fitness matrix, as depicted in Table 1).This is not necessarily the case for all fitness landscapes, as sometimes the distinctions among more environmental states than types can lead to an increase in population fitness (for a proof, see Supporting Information S3.3).However, this presented mathematical counterexample might also be a theoretical exception ("angels dancing on the head of a pin"), while our empirical data suggests that a match between the number of states and types plays an important role in the exploitation of fitness landscapes.Squared fitness matrixes with one specialized type per state are often the starting assumption in the bet-hedging literature [2,14,15,50], which brings us to the final discussion that links our results to this well-known literature.

Discussion: Optimal Intervention
The main contribution of this paper is the derivation of conditions for cooperative resource redistribution in terms of information theoretical measures.The derived proportionality relation of complementary variety applies for any strategic intervention that holds the population shares constant in an uncertain, fluctuating environment.Holding the shares constant is the best intervention one can do under "uncertainty" regarding the realization of the next environment (if the future environmental state is known, there is no uncertainty and one can do better).For our empirical exploration we used a series of numeric optimization algorithms to detect the  ∝ Δ KL .For the cases of (a) Japanese exports and (b) migrating birds in North America.The deltas are calculated for the empirical case of natural selection versus the case that the strategic cooperation holds the initial empirical population distribution constant over all periods (S T:emp); and for the case that compares the population distribution that turns out to be optimal with the case in which this same optimized distribution would be the starting distribution of competitive selection over the entire period (S T:opt).Optimal population shares were calculated with constrained nonlinear optimization algorithms in R (NLoptr).optimal portfolio distribution that is to be held constant (see Figure 5, i.e., Broyden-Fletcher-Goldfarb-Shanno, Sequential Quadratic Programming, and Sequential Quadratic Programming).There is also a well-known analytical result that allows determining the optimal distribution (for all practical purposes, at least for binary cases with two types).The analytical setup is given by the literature of bet-hedging, which goes at least back to ideas from Bernoulli in 1738 [51].Bet-hedging is a special case of strategic cooperation; it is the optimal strategic cooperation.Being a special case, our results about complementary variety also provide the condition sine qua non for bet-hedging.
In his pioneering work, Kelly [2] has shown that if there is only one type that survives per environmental state (perfectly specialized type, or "winner takes it all"), then the optimal correspondence between () and () is one-toone proportional.So if a certain environmental state shows up % of the time, the population should always allocate exactly % of its resources on the type that is specialized for this state.With a diagonal fitness matrix, in order to survive, types are forced to complement the fitness strengths and weaknesses of each.Without cooperation, nobody will survive in the long term, since sooner or later an environmental state will come along that is fatal for each type.A diagonal fitness matrix exhibits an extreme case of complementary variety.
In cases where the fitness matrix is not diagonal (more than one nonzero value per row and/or column), bet-hedging optimization leads to the suggestion to distort proportionality between () and () by the mixed fitness matrix [9,10,15,16].In some constellations, it turns out to be beneficial not to allocate any resources to some types (referred to as being outside the "region of bet-hedging" [10,[14][15][16]).These types are superfluous (see, e.g., the cases of Space 4 and Space 8 in Figure 5).
Employing a "clutter" of matrix algebra ("a clutter is the universal measure of just-enough yuckiness to deter referees and editors from reading proofs of lemmas" (Weitzman, 1998; p. 338)) it can be shown that if the entries of the fitness matrix are linearly dependent among different environmental states, then bet-hedging over all possible types is never possible (i.e., is always outside the so-called "region of bet-hedging") (for the formal proof see Supporting Information S4).Linear dependence of the matrix means that the fitness values of at least two environmental states are perfectly positively correlated.This suggests that the distinction among these different environmental states is redundant.On the other extreme, Supporting Information S4 also shows that the only way bethedging among all types is always possible is if the underlying fitness matrix is diagonal (exactly one nonzero entry in each row and each column and 0 s elsewhere).In a diagonal fitness matrix the rows (environmental states) are perfectly negatively correlated and assured to be independent.This is the groundbreaking case that Kelly studied [2] and again the most extreme case of complementary variety.
In between these two extremes, the size of the region of bet-hedging-and therefore the likelihood of being able to optimize a strategic intervention-is defined by the degree of specialization among types.This refers to the difference between type that has the highest fitness in a particular environmental state (most specialized) and the less fit types in this state.The larger this difference, the larger the area of bet-hedging (this can be shown with the matrix determinant; for details see Supporting Information S4).Specialization of different types to respective environmental states implies complementary variety, as the different types potentially complement each other in different environments.This implies that the higher the degree of specialization, the larger the complementary variety, the larger the area of bet-hedging, and therefore the larger the chance of optimizing fitness with the help of all different types.

Conclusions
The decision if it is beneficial to intervene and to cooperate in evolutionary dynamics in uncertain, stochastic environments hinges on the shape of the underlying fitness landscape, including the information we have about it.We derived a measure that allows us to quantify the potential benefit of strategic intervention.Its interpretation leads to the concept of complementary variety of different population types in different environmental states.If there is complementary variety, it is beneficial for types to cooperate in order to exploit this complementarity.The more type fitness is skewed to opposing directions in different environmental states, the more the specialization of different types to different environmental states is extreme, the larger the complementary variety among them is, and the proportionally larger the potential benefit of strategic cooperation over competitive selection is.An important special case is Kelly's bet-hedging with one perfectly specialized type per state [2].In this case, types need to collaborate and cross-subsidize each other in order to survive.
Complementary variety is the necessary condition for strategic intervention to trump the "blind forces" of natural selection.In this sense, the identified proportionality relation also provides a measure for the acceptable cost of any strategic intervention.This cost has two components.For one, it is necessary to learn the environmental distribution and the related fitness values.Furthermore, resource redistribution might be costly.In many cases, there might be a diverse range of costs for redistribution among different types and levels.In reality, redistribution might not be possible at all.In practice, it might often be the case that more detailed lower levels provide both more reliable information and more easily accessible resource redistribution possibilities, while dynamics on higher levels might be more obscure and make it more difficult of exchange resources.This would suggest that lower levels lend themselves more to strategic cooperation, while higher levels would lend themselves more to competitive selection, but this hypothesis is to be empirically tested.This discussion about information and evolution, combined with the information theoretic logic presented here, provides multiple connection to the quickly growing body of literature that links evolutionary dynamics and information theory [11, 14-18, 30, 39, 40, 43-45].
Another major limitation to our results is that we assume the fitness matrix to be fixed and our time series stationary.This assumes that intervention would not alter the fitness matrix and neglects density dependence.Allocating all 2 million birds along the Northern border of the USA (see Figure 4) might be optimal if the identified fitness matrix is fixed, but in reality this overpopulation of migrating birds might saturate the carrying capacity of this region.Density dependence continues to be an active field of research for both naturalists and social scientists, but general conclusions seem difficult to obtain given the vast amount of possible interaction effects [17,52,53].
Finally, in our empirical exploration we have seen that our concept of complementary variety lends itself as criteria for the creation of a classification system of population types and environmental states.We have used a very simplistic way of doing so, even when adopting previously existing type taxonomies.There are surely ways to optimize the classification of types and states to maximally benefit from complementary variety.An ingenious classification system could even create different levels of complementary variety over multiple levels.The number of possible multilevel hierarchies to partition the total population is very large.It is defined by the so-called Bell number.A population with only 53 individuals allows for more multilevel partitions than there are atoms on earth (some 10 50 ), while 75 individuals allow for more different multilevel hierarchies than the estimated number of atoms in the observable universe (some 10 80 ).In a biological or social behavioral sense, certainly not all possible partitions are meaningful or allow for resource exchange.But even with these constraints we are still left with a myriad of combinatorial possibilities.The concept of complementary variety can help in the search for classification systems that identify meaningful types and environmental states over multiple levels.
i r o n m e n t a l s t a t e s P o p u la ti o n ty p e s M a n u f. i r o n m e n t a l s t a t e s Fitness (growth factor) P o p u la ti o n ty p e s (b)

Figure 2 :
Figure2: Illustration of Kullback-Leibler relative entropy as a measure of strength of evolutionary selection.The left hand side illustrates a case with less variance in type fitness, less population change through selection, and smaller  KL divergence between  (0.5; 0.5) and  +1 (0.4; 0.6).The right-hand side illustrates the case with larger variance in type fitness, more population change through selection, and larger  KL divergence between  (0.5; 0.5) and  +1 (0.2; 0.8).

Figure 3 :
Figure 3: Japan's export products according to the first-digit classification of Standard International Trade Classification (SITIC), revision 2 for 1974-2010 in nominal value US$ (based on[22][23][24]).The 8 main export groups of the first-digit classification are aggregated into two binary groupings on higher levels.The highest level distinguishes between basic goods and manufactured goods.Basic goods consist of alimentation and fuels and minerals and manufactured goods of oils and chemicals and manufactured devices (semilog plot).

Figure 6 :
Figure 6: Fine-graining of the Japanese export economy (1974-2010), visualization of selected cases from Figure 5: (a) Space 2 Time 1; (b) Space 2 Time 2; (c) Space 4 Time 4; (d) Space 8 Time 8. Optimization suggests only working with the types that are outlined with colored borders, while neglecting the other types.

Figure 7 :
Figure 7: Empirical application of the proportionality of complementary variety of (7): Δ

Figure 8 :
Figure 8: Visualization of the fitness gain of fine-graining Δ Figure 5: Space-time fitness matrixes of Japanese exports 1974-2010.Fitness values are calculated as geometric means based on annual averages.Horizontal types are based on Figure