Representation Bias, Return Forecast, and Portfolio Selection in the Stock Market of China Daping

Representation bias is a cognitive tendency that can affect investors' behavior in the stock market. Whether representation bias can help return forecasting and portfolio selection is an interesting problem that has received little study. In this paper, based on representation bias theory and the current market situation in China, a new hierarchical stock measurement system is constructed and a corresponding set of criteria is proposed. On each criterion, we measure the influence among stocks with an adapted fuzzy AHP. The Hausdorff distance is then applied to weight and compute the horizontal representation returns. For the forecast returns, a new computation method based on representation behaviors is also proposed. Empirical results show that representation bias information is useful for return forecasting as well as portfolio selection.


Introduction
The concept of representation bias was proposed by Tversky and Kahneman [1,2] as a normal behavioral characteristic in financial decisions. They believe that the "representation heuristic" badly affects people's decisions when they form opinions and reason. DeBondt and Thaler [3] argue that there is an overreaction: after correcting the probabilities, investors may overweight newly obtained information. As for investors' behaviors in the security market, Fuller [4] defines one of them as representation bias, which may mislead investors into believing they have already processed the information correctly right before they make a decision. In general, there are two kinds of representation bias: horizontal representation bias and vertical representation bias [5]. According to Zhang [6], horizontal bias means that people tend to classify one thing with its analogues and forecast its future according to these similarities. Meanwhile, vertical bias implies that, in financial markets, people tend to judge or forecast a stock according to its own historical records.
As for the effects that behavioral bias may have on the financial market, many scholars have done interesting research. In the Chicago Stock Exchange market, Shefrin and Statman's tests show that investors' behavioral bias can significantly affect afternoon stock prices in the short term [7]. According to Coval and Shumway [8], prices set by loss-averse traders are reversed significantly and more quickly than those set by unbiased traders. In recent years, behavioral portfolio theory has been applied to derive the behavioral portfolio frontier and has also been used for the portfolio selection problem. In this field, researchers focus on how behavioral bias affects decision making. Chira et al. [9] run experiments with college students and analyze the effects of different behavioral biases on financial decisions. Xu et al. [10] extend Tversky's model based on investors' representation bias within a utility-maximization framework; then they examine it with the vertical representation bias as an example. Zhao and Fang [11] propose a computation method for both the vertical and horizontal representation bias returns and try to find out whether representation information can help return forecasting in the financial market.
In asset allocation, for the measurement of subjective factors such as behaviors and emotions, Saaty et al. [12] were the first to use the AHP to deal with financial problems. Then, with the development of financial theory, complex financial systems drew much attention. Fuzzy theory and methods, which, compared with the traditional ones, are less subjective and can better characterize the fuzzy investment environment and process, were gradually introduced into the computation. Enea and Piazza [13] combine fuzzy theory and the AHP method and put forward the fuzzy AHP, but they do not solve some problems with special values. Based on their work, Tiryaki and Ahlatcioglu [14] apply an adapted method, which solves the zero-value problem, to Turkey's stock market, and the investment decisions are made with the mean-variance model. However, the optimal investment weights are not shown. In [11], the adapted fuzzy analytic hierarchy process method is used for the first time to measure horizontal representation bias. That work is based on the consideration that, in the complex financial market, the exact path by which representation bias affects investors' decisions is still unknown. Although investors evaluate the market before they make investment decisions, they probably do not calculate things as strictly and accurately as a specific AHP or similar method would. In other words, it is like a fuzzy process.
This paper can be seen as an updated version of our previous paper mentioned above. It follows the main idea of how to measure the effect of horizontal and vertical representation bias on stock returns, but, considering the current financial environment in China and the related policies, we renovate the evaluation system in terms of hierarchy, criteria, and weights. In the computation part, we apply the Hausdorff distance to deal with the weighting problems. Because investors' vertical representation bias may influence their expectations of future returns, we propose another weighting method based on the degree of matching between a stock's historical data and its present trend, which also overcomes the zero-denominator problem. Then we run empirical experiments with data from the Chinese stock market, and the results are acceptable. The new method is also tested empirically, and we compare it with Chira et al.'s method in [9]. Finally, we put the forecast returns into a behavioral portfolio selection model and show the effective frontiers, which suggest that representation bias can help return forecasting and improve portfolio selection to a certain extent.
This paper is organized as follows. In the next section we present a fuzzy measurement of the representation behaviors and a utility model. In Section 3, we apply the methods in an empirical experiment and discuss the computational results. We end the paper with a summary of conclusions in Section 4.

Representation Bias and Returns.
In general, there are two kinds of representation bias: horizontal bias and vertical bias. Horizontal representation bias is the tendency to classify one thing with other similar things and to forecast it following the rules of those similar things. Vertical representation bias is the tendency to judge or forecast a thing according to its own historical records (see [5]). Xu et al. put forward a method to calculate the vertical and horizontal representation returns; then Zhao and Fang propose a new one (see [10,11]). Here we follow their interpretation of the representation returns, but we extend the computation in detail.

Horizontal Representation Returns.
The horizontal representation return is the return that investors forecast and compute with horizontal representation bias and information. Taking stocks as an example, the horizontal representation return of a stock is mainly influenced by other stocks with similar characteristics, such as stocks in similar industries or held by the same fund company. Investors with horizontal representation bias tend to judge a stock in light of the situation of other similar stocks. Therefore, building a proper stock hierarchy system is vitally important. In this paper, calculating the horizontal representation returns takes two steps as follows.
Step 1 (pick the initial stocks). Select some stocks for the initial portfolio. In Section 3 of this paper, for example, we select 15 stocks and name them $S_1, S_2, S_3, S_4, S_5, \ldots, S_{15}$.
Step 2 (weighting and calculating the horizontal representation bias returns). Choose some characteristics of the stocks that investors care about. Here we divide the indicators into four groups: the investment environment, company issues, profitability of the stocks, and the investors' objectives. We choose 30 indicators and denote them as $C_1, C_2, C_3, C_4, C_5, \ldots, C_{30}$.
In recent years, the Chinese government has relaxed its regulation of the stock market to some extent, and the "invisible hand" has played a larger role in the market than before. Hence, compared with our former work, here we reduce the weight of government supervision and raise the weights of industry development and regional economic conditions, because regional and industry economic performance has improved significantly. The new stock hierarchy system is shown in Table 1. Define
$$ r^h_i = \sum_{k} w_k \sum_{j \neq i} e_{k,j}\, r_j, $$
where $r^h_i$ is the horizontal representation return of stock $i$; $r_j$ is the return of another similar stock, $j = 1, \ldots, n$, $j \neq i$; and $e_{k,j}$ is the effect factor of stock $j$ compared with the target stock for criterion $k$. On criterion $k$, if stock $j$ has a great influence on stock $i$, $e_{k,j}$ is given a large value. For example, if stock 1 has a bigger impact on stock $i$ than stock 2 (here $i \neq 1, 2$), then $e_{k,1} > e_{k,2}$. $w_k$ is the weight of criterion $k$ in the whole stock hierarchy system, with $\sum_k w_k = 1$. Clearly, the horizontal representation return of stock $i$ is a weighted sum of the other stocks' returns.
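As a minimal sketch of this weighted sum, the following Python function combines criterion weights, effect factors, and the other stocks' returns. The function name `horizontal_return`, the nested-list layout of `effect`, and any sample numbers are illustrative assumptions rather than part of the original method; in particular, how the effect factors are scaled is not fixed here.

```python
def horizontal_return(weights, effect, returns, target):
    """Weighted sum of the other stocks' returns for one target stock.

    weights[k]   : weight of criterion k (assumed to sum to 1)
    effect[k][j] : effect factor of stock j on the target for criterion k
    returns[j]   : return of stock j
    target       : index of the target stock, excluded from the sum
    """
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("criterion weights must sum to 1")
    total = 0.0
    for k, w_k in enumerate(weights):
        for j, r_j in enumerate(returns):
            if j == target:
                continue  # a stock does not influence itself
            total += w_k * effect[k][j] * r_j
    return total
```

With two criteria and three stocks, the call would look like `horizontal_return([0.5, 0.5], effect, returns, target=0)`, where `effect` has one row per criterion.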
The key to measuring the horizontal representation return is therefore to calculate the effect factor $e_{k,j}$; the representation bias is then quantified in some way.
The effect factor is obtained from $\tilde v_{k,j}$, the fuzzy value of stock $j$ on criterion $k$, which is mainly calculated by the adapted fuzzy analytic hierarchy process method. It can then be put into the measurement of the similarity between stocks. For the distance between fuzzy numbers, we apply the Hausdorff distance (see [14,15]). Take triangular fuzzy numbers, for example. First we define the distance between a point $x$ and a fuzzy number $\tilde v_2$ as
$$ d(x, \tilde v_2) = \inf_{y \in \tilde v_2} |x - y|, $$
where $y \in \tilde v_2$ means that the value of the membership function at $y$ is greater than 0, that is, $\mu_{\tilde v_2}(y) > 0$. Then the distance between two fuzzy numbers is
$$ d(\tilde v_1, \tilde v_2) = \sup_{x \in \tilde v_1} d(x, \tilde v_2). $$
A distance should be symmetric. Therefore, the Hausdorff distance between two triangular fuzzy numbers is defined as
$$ d_H(\tilde v_1, \tilde v_2) = \max\{ d(\tilde v_1, \tilde v_2),\ d(\tilde v_2, \tilde v_1) \}. $$
With the methods above, the effect factor $e_{k,j}$ can be figured out, and then the horizontal representation return is calculated.
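For intuition, the sketch below computes the Hausdorff distance between the supports of two triangular fuzzy numbers. It assumes each fuzzy number is given as a triple `(low, mode, high)` and that membership is positive exactly on the interval between `low` and `high`; the function names are hypothetical. Under that assumption the directed distance has a closed form, and symmetrizing it yields the larger of the two endpoint gaps.

```python
def directed_distance(v1, v2):
    """Supremum over x in supp(v1) of the distance from x to supp(v2)."""
    low1, _, high1 = v1
    low2, _, high2 = v2
    # The point-to-interval distance is convex in x, so the supremum is
    # reached at an endpoint of supp(v1).
    return max(0.0, low2 - low1, high1 - high2)


def hausdorff_distance(v1, v2):
    """Symmetric Hausdorff distance: the larger of the two directed distances."""
    return max(directed_distance(v1, v2), directed_distance(v2, v1))
```

The directed distance alone is not symmetric, which is why the maximum over both directions is taken.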

Vertical Representation Returns.
The vertical representation bias suggests that investors tend to judge or forecast a stock based on its history rather than on other related things. Therefore, we assume that the vertical representation return of a stock is mainly influenced by its own historical data. The procedure for calculating the vertical representation returns is shown below.
Step 1 (pick the initial stocks). Select some stocks for the initial portfolio.
Step 2 (weight and calculate the vertical representation bias returns). Investors with vertical representation behavioral bias focus on a stock's return history and adapt their expectations based on it. For stock $i$, $i = 1, \ldots, n$, we choose its historical returns over $T$ periods and denote them as $r_{i,1}, r_{i,2}, \ldots, r_{i,T}$. We try to measure the similarity between the historical data and the present data of a stock and, according to that, weight the historical data of different periods with $\omega_{i,1}, \omega_{i,2}, \ldots, \omega_{i,T}$. Chira et al. believe the weights of different periods should satisfy $\omega_{i,1} < \omega_{i,2} < \cdots < \omega_{i,T}$, which means the further a period is from now, the smaller its weight (see [9]). But we argue that this treatment of each period's effect on the present performance is not so appropriate. Under their approach, nearer periods have more effect on the forecast, which tends to put too much weight on the latest ones, so that the forecast follows the recent trend to a great extent. Therefore, we propose another weighting method that stresses the matching of history and present. We suppose that when investors find a similar history, they learn from it and forecast future returns based on that learning. In the calculation, we also use the concept of "distance" to handle the weights, where the distance is the absolute value of a difference.
Define the vertical representation return as
$$ r^v_i = \sum_{t=1}^{T} \omega_{i,t}\, r_{i,t}, \tag{6} $$
where $r^v_i$ is the vertical representation return of stock $i$, $r_{i,t}$ is the historical return of stock $i$ at time $t$, and $\omega_{i,t}$ is the weight of $r_{i,t}$, which reflects the effect of the history on the present. The weight $\omega_{i,t}$ is defined in (7) through a distance. For the present value, we choose the average of the last $t_0$ periods as a proxy variable and denote it as $r_{i,0}$; $t_0$ can be determined by time series regressions of the returns. $d_{i,t} = |r_{i,t} - r_{i,0}|$ is the absolute value of the period-$t$ return minus the present value for stock $i$, which acts like a distance, $t = 1, \ldots, T$, since we mainly care about the effects of the past $T$ periods. It should be noted that, to avoid a zero denominator, the denominator in the weight is set to this absolute value plus 1, that is, $d_{i,t} + 1$.
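The matching-based weighting can be sketched as follows. This is only an illustration under stated assumptions: the raw weight of a period is taken as the reciprocal of its distance plus 1, and the raw weights are then normalized to sum to one (the normalization is an assumption, not stated in the text). The names `vertical_weights` and `vertical_return` are hypothetical.

```python
def vertical_weights(history, n_present):
    """Weights that favor periods whose return is close to the present level.

    history   : past returns, oldest first
    n_present : number of most recent periods averaged as the present proxy
    """
    present = sum(history[-n_present:]) / n_present
    # Distance of each period to the present level; adding 1 keeps the
    # denominator away from zero when a return equals the present level.
    raw = [1.0 / (abs(r - present) + 1.0) for r in history]
    total = sum(raw)
    return [w / total for w in raw]  # normalization is an assumption


def vertical_return(history, n_present):
    """Weighted sum of historical returns using the weights above."""
    weights = vertical_weights(history, n_present)
    return sum(w * r for w, r in zip(weights, history))
```

Because of the added 1, a perfectly flat history produces equal weights instead of a division by zero.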

Representation Returns.
In real life, however, for investors with representation behavioral bias, it is hard to isolate the two biases clearly from each other. Hence we combine the horizontal and vertical representation returns and build a new measurement, the representation return. We introduce a horizontal representation bias preference parameter $\theta$, which is between 0 and 1. Define
$$ r_i = \theta\, r^h_i + (1 - \theta)\, r^v_i, \tag{8} $$
where $r_i$ is the combined representation return for stock $i$ and $\theta$ is the horizontal representation bias preference parameter.
From (8), we can see that when $\theta$ is 1, the investors totally trust the horizontal representation returns; when $\theta$ is 0, the investors rely on the vertical representation returns. Here we mainly analyze the forecast errors. According to (8), assume that the real return is $\bar r$, the forecast error of $r^h_i$ is denoted as $\varepsilon_1$, the forecast error of $r^v_i$ is denoted as $\varepsilon_2$, and the forecast error of $r_i$ is $\varepsilon$. Then we have
$$ r^h_i = \bar r + \varepsilon_1, \qquad r^v_i = \bar r + \varepsilon_2, $$
$$ \varepsilon = r_i - \bar r = \theta\, \varepsilon_1 + (1 - \theta)\, \varepsilon_2. \tag{11} $$
By (11), we can see that the forecast error of the representation returns is influenced by the forecast errors of both the horizontal and vertical representation returns. It is also affected by the horizontal representation bias preference parameter $\theta$. It should be noted that the parameter $\theta$ depends on the investors' representation preference. If an investor prefers horizontal representation information, then $\theta$ tends to be greater than 0.5; otherwise, the parameter is smaller.
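A quick numerical check of how the two forecast errors combine: the sketch below mixes a horizontal and a vertical forecast with a preference parameter and confirms that the combined error is the same mix of the two individual errors. The function name and the sample numbers are made up for illustration and are not taken from the paper's data.

```python
def combined_return(r_horizontal, r_vertical, preference):
    """Convex combination of the horizontal and vertical forecasts."""
    if not 0.0 <= preference <= 1.0:
        raise ValueError("preference parameter must lie in [0, 1]")
    return preference * r_horizontal + (1.0 - preference) * r_vertical


# Illustrative numbers only (not taken from the paper's data).
real = 0.010
r_h, r_v, pref = 0.016, 0.007, 0.6
err_h, err_v = r_h - real, r_v - real
err_combined = combined_return(r_h, r_v, pref) - real
# Combined error equals the preference-weighted mix of the two errors.
assert abs(err_combined - (pref * err_h + (1 - pref) * err_v)) < 1e-12
```

When the two errors have opposite signs, as in this toy case, they partly offset each other in the combined forecast.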

Portfolio Selection Based on Representation Returns and Prospect Theory.
Generally speaking, there are two frameworks for portfolio selection: utility maximization and the return-risk trade-off. The mean-variance portfolio theory allows investors to minimize risk for an acceptable return or maximize expected return for a reasonable risk (see [16,17]). Nowadays, it is widely used in real markets. However, considering that the traditional mean-variance model might not suit investors' behaviors, we choose the portfolio selection model based on prospect theory in the empirical experiment.
Prospect theory was proposed by Kahneman and Tversky in 1979. In this theory, the reference point is a vitally important concept. It acts as a benchmark that people use for comparison when they judge something. Kahneman and Tversky find that investors evaluate an asset mostly relative to the reference point against which the gain or loss is compared, rather than by its real value. In other words, when investors compare with a certain reference level, they care about the relative value even more than the absolute value. When the reference point changes, investors may make totally different decisions. He and Zhou [18] suppose that the reference point is always set as the riskless coupon rate of a long-maturity bond, since investors tend to compare the return rate with the bond's coupon rate. In the next section of this paper, we introduce a new parameter showing the reference level.
Assume a single-stage model in which the market is frictionless and short selling is not allowed. There are $n$ risky assets, and the initial wealth is $W_0$. The representation returns are denoted by a vector $r = (r_1, r_2, \ldots, r_n)$. Define $x = (x_1, x_2, \ldots, x_n)$, in which $x_i$ is the proportion of wealth invested in asset $i$, and $\sum_{i=1}^{n} x_i = 1$. At the end of the investment, the wealth is $W = W_0 (1 + x \cdot r)$.
Define the utility of investors with representation behavioral bias using the Fibbo function. Its classical form is given in (12), where $U$ is the utility function and $\gamma$ measures the sensitivity of investors to changes in returns. In addition, we use prospect theory to measure the changes, as in (13). Here $v(\cdot)$ is the value function, and $r_0$ denotes the investor's reference level. According to Kahneman and Tversky, losses have a greater impact than gains on decision making, so the value function is S-shaped. In particular, according to Kahneman and Tversky, $v(\cdot)$ can be stated as
$$ v(z) = \begin{cases} z^{\alpha}, & z \ge 0, \\ -\lambda (-z)^{\beta}, & z < 0. \end{cases} \tag{14} $$
Substituting (14) and (13) into (12) expresses the utility in terms of the portfolio weights. According to the rule of maximizing utility and the fact that short selling is not allowed in the Chinese market, we get the mathematical programming model
$$ \max\ U(\gamma, W) \quad \text{s.t.} \quad \sum_{i=1}^{n} x_i = 1, \quad x_i \ge 0, \quad i = 1, \ldots, n. $$
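The value function of Kahneman and Tversky can be sketched in a few lines. The code below uses the standard piecewise power form with the commonly cited parameter values 0.88 and 2.25 as defaults; the function name `value` and the argument names are assumptions for illustration, and the utility function of the model itself is not reproduced here.

```python
def value(z, alpha=0.88, beta=0.88, loss_aversion=2.25):
    """Prospect-theory value of a gain or loss z relative to the reference level."""
    if z >= 0:
        return z ** alpha
    # Losses are scaled up by the loss-aversion coefficient.
    return -loss_aversion * (-z) ** beta
```

With these defaults a loss weighs 2.25 times as much as a gain of the same size, which is the asymmetry the text describes.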

Empirical Experiments
In order to cover different industries and areas, we select 15 stocks from the A-share market of China. The stocks are Poly Real Estate, Daqin Railway, Gree Electric Appliances, ICBC, Gezhouba Dam, Conch Cement, Minsheng Bank, Shandong Gold, Sany, Vanke A, Wuliangye, Yunnan Baiyao, Sinopec, Zoomlion, and ZTE. Denote the stocks by $S_1, S_2, S_3, S_4, S_5, \ldots, S_{15}$. All the data are from the Wind Database, and the sample is weekly, from January 6, 2012, to December 28, 2012. The returns are calculated as logarithmic returns before computation.
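Logarithmic returns from a weekly price series can be computed as in the sketch below. The function name and any sample prices are hypothetical; the paper's data come from the Wind Database and are not reproduced here.

```python
import math


def log_returns(prices):
    """Return ln(P_t / P_{t-1}) for consecutive prices (oldest first)."""
    if any(p <= 0 for p in prices):
        raise ValueError("prices must be positive")
    return [math.log(curr / prev) for prev, curr in zip(prices, prices[1:])]
```

A convenient property of logarithmic returns is that they add up over time: the sum of the weekly values equals the logarithmic return over the whole sample.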

Calculation of the Horizontal Representation Returns.
With the steps stated in Section 2, the horizontal representation returns are computed as follows.
Step 1. Set the weight $w_k$ of every criterion as shown in Table 1.
Step 3. Construct the comparison matrix for every single criterion. Here we show the comparison matrix of the indicator $C_1$ (tradable shareholders) as an example in Table 2.
Step 4. Calculate the  and  matrices, which are 30 in all.
Step 5. Calculate the fuzzy number $\tilde v_{k,j}$ for every stock on every criterion; then we can get the effect factors $e_{k,j}$. Here we show the fuzzy numbers of every stock on criterion $C_1$ as an example in Table 3.
Following the example above, we can similarly figure out the fuzzy values of the stocks for the other 29 indicators. Moreover, according to the importance of the different hierarchies, we can obtain different values when computing the similarity between stocks. For example, we assume that the numerical relationship between the four hierarchies is 1 : 1 : 1 : 1. Then we can standardize the similarities and put them into the calculation of the horizontal representation returns. According to Welch and Goyal [19], the average of the historical returns over a certain period can be set as the forecasting benchmark, since the plain arithmetic average is supposed to contain no information. With this assumption, if the forecast horizontal representation returns perform better, this implies that horizontal representation information is useful and can help in judging the market. In the empirical experiment, we calculate the average of the last four historical observations as the benchmark and produce rolling forecasts of the returns in the next four weeks. The results are shown in Table 4.
From Table 4, we can see that the four forecasts with horizontal representation information all perform better than the benchmarks do, and the average error reduction is 29.77%.
Considering that the benchmarks are assumed to contain no information and the new horizontal representation returns forecast better, this shows to some extent that horizontal representation bias can help forecast the returns. In other words, investors' representation behaviors can provide useful information for return forecasting. It should be noted that here we mainly focus on whether representation behaviors contain meaningful information rather than on the accuracy of the forecast. Since the benchmarks do not forecast very well, the error reductions are sometimes large.
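The benchmark comparison can be sketched as follows. The historical-average benchmark uses the last four observations, as in the text; the definition of error reduction as the relative drop in absolute forecast error is an assumption made for this illustration, as are the function names.

```python
def rolling_mean_benchmark(returns, window=4):
    """Forecast each period by the mean of the preceding `window` returns."""
    return [
        sum(returns[t - window:t]) / window
        for t in range(window, len(returns))
    ]


def error_reduction(actual, forecast, benchmark):
    """Relative reduction in absolute error versus the benchmark (assumed metric)."""
    err_forecast = abs(forecast - actual)
    err_benchmark = abs(benchmark - actual)
    if err_benchmark == 0:
        raise ValueError("benchmark error is zero; reduction is undefined")
    return (err_benchmark - err_forecast) / err_benchmark
```

Under this assumed metric, a positive value means the representation-based forecast is closer to the realized return than the historical average is.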

Calculation of the Vertical Representation Returns.
For the computation and tests of the vertical representation returns, we select the returns of the last four months (sixteen weekly returns) as the sample used in the forecasting. Then we apply two weighting methods, the method in [10] and ours, and compare the results. The assumption is similar to the horizontal case above: if the forecast returns perform better, the information in vertical representation behaviors can be used in the predictions.
According to the method mentioned in [10], the weights follow an arithmetic sequence, which ensures that the nearer a period is to the present, the heavier its weight, and the weights grow evenly over time. Hence, we set the initial weight to 0.01293 and the common difference to 0.0128. In this way, the sum of the last 12 weights is 1. With (6), the returns of the next 4 periods can be forecast.
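The arithmetic-sequence weights can be checked directly. The sketch below builds twelve weights from the stated initial weight and common difference and shows that they increase evenly and sum to approximately 1; the function name is hypothetical.

```python
def arithmetic_weights(first=0.01293, step=0.0128, count=12):
    """Evenly increasing weights: first, first + step, first + 2*step, ..."""
    return [first + step * k for k in range(count)]


weights = arithmetic_weights()
# 12 * 0.01293 + 0.0128 * (0 + 1 + ... + 11) = 0.15516 + 0.8448 = 0.99996
total = sum(weights)
```

The small gap between 0.99996 and 1 comes from rounding the two stated parameters.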
In Section 2, we adapt a method to calculate the vertical representation return that avoids the zero-denominator situation. In the computation, we first find the lags of the return series by regression and decide how many periods to average as the proxy for the present return. We then select the average of the last four periods as the present return level for stock $i$ and denote it as $r_{i,0}$. Then the weights can be calculated according to (7). Finally, we compute the return forecasts. Table 5 shows the results of the two methods. The results with our method are better than those with the method in [10], which suggests that the information in the vertical representation measurements can be useful for return forecasting in some way.

Results Based on the Portfolio Models of Representation Returns and Prospect Theory.
We take the vertical representation returns calculated with our method as an example and put them into the prospect theory portfolio selection model along with their benchmarks. According to Tversky and Kahneman (see [1,2]), in the model mentioned in 2.2, the behavioral characteristics are best measured when $\alpha = \beta = 0.88$ and $\lambda = 2.25$.
We can draw the frontiers of the portfolios with the vertical representation returns $r^v$ and the decision variables $x$ according to the portfolio selection model. Considering that the portfolio returns are between −0.12 and 0.12, we divide the interval [0, 0.12] into 20 different levels and calculate every utility value. Then we get the frontiers for different values of $\gamma$. The frontiers are shown in Figure 1.
In Figure 1, the horizontal axis shows the portfolio returns, while the vertical axis shows the expected utility. All three frontiers are smooth, downward-sloping curves, showing the frontiers when $\gamma = 1$, $\gamma = 3$, and $\gamma = 5$. We can see clearly that when $\gamma$ is greater, the curve is steeper. That is because when $\gamma$ is greater, the impact of the excess returns on investors' utility is greater and the investors are more sensitive to changes in returns. The figure also shows that utility decreases as the return increases. This is because when the portfolio returns go up, the expectations of the investors also increase, and then the losses from the investment increase, too. We already know that losses have a greater impact than gains on the utility, so the utility falls. It should be noted that when $\gamma = 1$, the investors' attitudes to profit and loss are the same.

Conclusions
In this paper, we continue to focus on the effect that investors' representation behavior may have on stock returns and investment decisions. First, we update the analytic hierarchy and criteria we constructed before and analyze the stock characteristics for investors with horizontal representation behaviors. Then we use the adapted fuzzy AHP to quantify the impact of the criteria on stocks and measure the horizontal and vertical representation returns based on the concept of "distance," which reflects the similarity between stocks. In this way, the Hausdorff distance is applied to weight and compute the horizontal representation returns, and the zero-denominator problem in the vertical representation return computation is preliminarily solved. The empirical experiments on the Chinese stock market indicate that horizontal representation behavior is useful for forecasting returns in some way. The effective frontiers of the behavioral portfolios with vertical representation returns are also shown, which suggest that representation behaviors can provide useful information to improve stock return forecasts, and that the portfolio frontiers vary with the investor's attitude to return changes.

Table 1: Hierarchy, criteria, and weights in the stock selection.

Table 2: The comparison matrix of the indicator $C_1$.

Table 3: The fuzzy value of every stock on criterion $C_1$.

Table 4: The error reduction of horizontal representation returns forecast.

Table 5: The errors of vertical representation returns forecast.