Comparing the State-ofthe-Art Efficient Stated Choice Designs Based on Empirical Analysis

The stated choice (SC) experiment has been generally regarded as an effective method for behavior analysis. Among all the SC experimental design methods, the orthogonal design has been most widely used since it is easy to understand and construct. However, in recent years, a stream of research has put emphasis on the so-called efficient experimental designs rather than keeping the orthogonality of the experiment, as the former is capable of producing more efficient data in the sense that more reliable parameter estimates can be achieved with an equal or lower sample size. This paper provides two state-of-the-art methods called optimal orthogonal choice (OOC) and D-efficient design. More statistically efficient data is expected to be obtained by either maximizing attribute level differences, or minimizing theD-error, a statistic corresponding to the asymptotic variance-covariance (AVC) matrix of the discrete choice model, when using these two methods, respectively. Since comparison and validation in the field of these methods are rarely seen, an empirical study is presented. D-error is chosen as the measure of efficiency. The result shows that both OOC andD-efficient design are more efficient. At last, strength and weakness of orthogonal, OOC, andD-efficient design are summarized.


Introduction
Abundant and accurate data is the foundation of study.To date, emerging technologies are largely introduced in data mining and processing [1].In [2,3], how to obtain highquality data with the high-tech in the field of Intelligent Transportation System is deeply discussed.However, the importance of refining data collection technique has not raised researchers' attention in behavior analysis area until last decades, such as travel mode choice and safety in [4,5].The purpose of conducting choice experiments is to collect data that can be used to estimate the independent influence of attributes on observed choices.There are two paradigms of choice data: revealed preference (RP) and stated choice (SC) data.Typically, in RP surveys, respondents are asked to recall information about his/her last choice, including alternatives and attribute levels available in real market.Differently, SC experiments present sampled respondents with a number of different hypothetical choice situations, each consisting of a universal but finite set of alternatives defined on a number of attribute dimensions.Thus, SC data collected on 300 respondents, each of whom is asked to make 8 choices produces a total of 2400 choice observations while RP experiment on the same sample size of respondents collects only 300 observed choices.
SC experiment has been widely used because it can observe choices on alternatives which do not exist in the current market.So analysts are able to predict, for example, the share rate of a newly introduced transportation mode.Another reason of its popularity lies in the ability to provide variability in attributes in a relatively small sample size compared with RP experiment, with which better estimation of influence of each attribute on choice can be achieved.Usually, respondents in a SC experiment will be faced with some "selected" choice situations, considering that making choices among all the possible combinations of attribute levels is too many to accomplish for a single respondent.Thus, how analysts distribute the levels of the design attributes in an experiment plays a big role.It may impact upon not only whether or not an independent assessment of contribution of each attribute to the choices observed can be determined, but also the ability of the experiment to detect statistical relationships that may exist within the data.
Historically, researchers have relied on orthogonal experimental designs, in which the attributes of the experiment are statistically independent by forcing them to be orthogonal [6].As such, orthogonal designs theoretically allow for an independent determination of each attribute's influence upon the observed choices.To generate an orthogonal design, usually but not necessarily, the first step is to generate a full factorial design, a design which contains all possible attribute level combinations.Mathematically, a full factorial design will produce ∏  =1   choice situations, where   is the number of levels assigned to attribute  (e.g., a design with 4 attributes, two with 2 attribute levels, one with 3 levels, and one with 4 levels will produce a full factorial design with 2×2×3×4 = 48 choice situations).The second step is to take a subset of choice situations from the full factorial design, which is known as fractional factorial designs.Randomized, cyclical, Bayesian, and fold over procedures are the common approaches used to generate fractional factorial design in [7][8][9][10].It is noteworthy that, in either a full factorial or a fractional factorial design, orthogonality is kept between two random attributes.The only difference between them is that orthogonality will not be kept in terms of interaction effects (i.e., the influence of two or more attribute columns multiplied together) in fractional factorial design.Here are some simple rules to check whether a fractional factorial design is orthogonal or not: (i) Every level of every attribute appears at same times.
(ii) All possible attribute level combinations of random two attributes appear at same times.
While orthogonal design has long been used in practice, in [11,12], there is a stream of researchers in recent years doubted the importance of orthogonality in SC data when it is used to estimate discrete choice model, not to mention whether orthogonality can be kept in reality.Orthogonality is important in linear models since it avoids multicollinearity problem and also minimized variance-covariance matrix of the estimated model, in which way standard errors of parameter estimates are also minimized.Unfortunately, discrete choice model is nonlinear; thus the derivation of its parameters' variance-covariance matrix is very different from the way in linear models.Seeing from that, keeping orthogonality of the parameters has little to do with minimizing their standard errors.
Acknowledgment of this fact has led researchers to transfer their efforts to obtain experimental designs that minimize the asymptotic variance-covariance (AVC) matrix of discrete choice models and provide more reliable parameter estimates with an equal or lower sample size.Such designs are called efficient designs.To date, most research has been focused on developing methods to generate efficient designs; comparison and validation of these methods in practice are rarely seen.This paper provides two state-of-the-art methods called optimal orthogonal choice (OOC) and -efficient design.Their performance on both theoretical efficiency and practical use are compared with the conventional orthogonal design.
The remainder of this paper is organized as follows.In Section 2, OOC and -efficient design are introduced.In Section 3, SC experiment design using orthogonal, OOC, and -efficient methods separately is generated.The result in terms of -errors are presented in Section 4, along with the summarized strength and weakness.

Optimal Orthogonal Choice Design.
Considering the popularity and convenience of orthogonal design for analyzers in practice, there is a stream of researchers in [13,14] who kept on exploring improved SC design method maintaining orthogonality, which is called optimal orthogonal choice design.The essential idea of OOC design is to maximize the differences of attribute levels across alternatives, so that the parameters can be estimated in the largest extent of variety of attribute levels as well as independently.The basic process of generating an OOC design is as follows.
Step 1. Generate a fractional factorial orthogonal design for alternative 1.  represents the number of choice situations of the design.
Step 2. Choose some systematic changes to get the allocation of attribute levels in alternative 2 from alternative 1. Systematic changes are certain rules to decide how the attribute levels change from alternative 1 and will be discussed in later context.
Step 3. Choose another systematic change to get the allocation of attribute levels in alternative 3 from alternative 1.
Step 4. Keep doing this until all the alternatives are determined.
It will be much easier to understand this method by starting with a binary attribute level design.Again, we assume that   is the number of levels assigned to generic attribute  for alternative , represented by 0, 1, . . .,   − 1.In a design for 2 alternatives and 3 attributes each with 2 levels, an orthogonal design in 4 choice situations for alternative 1 can be firstly generated.Then 0's and 1's in alternative 1 are interchanged in alternative 2. Thus the attribute levels of each attribute are forced to be different across alternatives.The result is shown in Table 1.
To generate OOC design for more alternatives, it is necessary to introduce   to represent the largest number of different pairs appeared between alternatives for a specific attribute.The equation of   is shown as follows, where  stands for the number of alternatives in choice set: In an example of OOC design for 3 alternatives and 3 attributes each with 2 levels shown in Table 1, we can get   = ( 2 − 1)/4 = 2 for all the attributes.For instance, in the first choice situation (000, 110, and 001), the attribute levels differ twice for each attribute (i.e., for attribute  1 , the levels are 010, creating 3 pairs (01, 10, and 00) in which two of them (01, 10) are different).
We can see from Table 1 that the distribution of attribute levels in alternative 2 is obtained by interchanging 0's and 1's in  1 and  2 in alternative 1 and that in alternative 3 is obtained by interchanging 0's and 1's in  3 in alternative 1.These systematic changes can be also described as adding a generator in alternative 1 to get alternative 2 and adding another generator to get alternative 3. The addition is performed in modulo arithmetic according to the number of levels for a specific attribute.Here in the example,   = 2 for all attributes, thus when a generator 110 is added to the choice situations in alternative 1 in modulo 2 arithmetic like this: 000 + 110 ≡ 110, 011 + 110 ≡ 101, and so on, alternative 2 is obtained.Alternative 3 is generated in the same way by adding a generator 001 to alternative 1.Notice that the generators added to alternatives must have a value of   = 2 (i.e., generators (000,110, and 001) used in Table 2 can meet the requirement while another generators (000, 100, and 010) cannot not).
Designs for any choice set size with any number of attributes each having any number of levels can be generated in similar way.However, a big limitation of OOC design is that it can only generate designs for generic attributes according to the principle of this method.How alternative-specific attributes distribute across alternatives is rarely discussed in the literature.Another shortcoming of OOC design is that it may produce a lot of unreasonable combinations by forcing maximum level differences in attributes across alternatives.Answers from responders who have to make decision in such choice situations may not reflect their actual choice statements since the "proper alternative" may not be contained in the questionnaire.

𝐷-Efficient
Design.While people who raise OOC method keep on improving design with remaining orthogonality, another stream of researchers goes straight forward to increasing the statistical efficiency of the design by minimizing the elements of the asymptotic variance-covariance (AVC) matrix of discrete choice models.The AVC matrix can be obtained by taking the negative inverse of the expected second derivatives of the log-likelihood function of the model proofed in [15].To interpret the process of calculating the AVC matrix and the measure of efficiency of a design, here we briefly introduce the most well-known multinomial logit model.
Assume an individual faced with alternative  = 1, 2, . . .,  in choice situation  = 1, 2, . . ., .The utility of an individual for alternative  in choice situation  can be expressed as where   represents observed part of utility for each alternative  in choice situation .It is assumed to be a linear additive function of several attributes with corresponding weights.These weights are unknown parameters to be estimated and can be divided into two categories: generic parameters and alternative-specific parameters.The generic parameters and alternative-specific parameters can be denoted by  *  ,  = 1, . . .,  * , and   ,  = 1, . . .,   , respectively, with their associated attribute levels  *  and   for each choice situation .Thus, the total number of parameters to be estimated equal to  =  * + ∑  =1   .  can be expressed as where   is the unobserved component, which is independently and identically extreme value type one distributed.
Considering that the most popular way to estimate parameters is maximum likelihood estimation, the loglikelihood function of parameters for a single respondent can be expressed as where  represents the binary outcome of all choice situations.While alternative  is chosen in choice situation ,   equals one; otherwise it is zero.Then the AVC matrix can be expressed as the second derivative of the log-likelihood function as follows: Equations ( 6)-( 8) represent functions that allow generic and alternative-specific parameters.In the case where only generic parameters exist, only (6) remains, and when there are only alternative-specific parameters, (8) remains.In addition, if there are  identical respondents, these second derivatives are multiplied by .Let ( * , ) denote the true values of the parameters.The Fisher information matrix  is defined as the expected values of the second derivative of the log-likelihood function: Hence, the AVC matrix can be expressed as a  ×  matrix that is equal to the negative inverse of the Fisher information matrix [16]: Rather than working with each element within the AVC matrix directly, a preferred measure within the literature is error, calculated by taking the determinant of the AVC matrix and scaling this value by the number of parameters .It is common to assume that a single respondent (i.e.,  = 1) represents all respondents, an assumption consistent with the multinomial logit (MNL) model form [8,17]. Designs that aim at minimizing -error are called -efficient designs.
Since the calculation of -error involves the values of parameters, approaches to determine -error have been improved in recent years.In our empirical study later, we use   -error as the statistic to measure the efficiency of experimental designs.To calculate   -error, nonzero priors are needed.It can be expressed as follows: According to the -efficient method, we can figure out that, for a given sample size, attempts to minimize error statistic will directly lead to the minimization of the AVC matrix.Meanwhile, by taking the square root of the diagonal elements (including ) of the AVC matrix, the minimization of asymptotic standard errors is achieved.Thus the asymptotic standard error of the parameter estimates will diminish in terms of statistical significance from each additional respondent added to a survey.Figure 1(a) reveals that the standard error decreases by increasing sample size at the beginning of a given design  I .Exceeding a certain limit, enlarging sample size has little impact on demising standard error.On the other side, Figure 1(b) shows that investing a more efficient design  II can lead to larger decreases in standard error.In other words, a smaller sample size may be satisfied at a certain level of standard error when using efficient designs rather than common ones.

Empirical Study
In this section, three experimental designs which used different methods (orthogonal, OOC, and -efficient) are generated.They will be used to obtain trip mode choice data on a corridor connecting two large business districts in Chengdu.Shawan conference and exhibition center and Jinsha station are chosen as the origin and destination for the survey.Three alternatives are involved: car, taxi, and bus.A typical multinomial logit model in transportation will be formulated and serve as the basis of most of the analyses in the subsequent section.The observed part of utility of every alternative is expressed as follows: where TT represents travel time.TC represents travel cost.For car users,  equals the fuel cost.For taxi users, TC equals the money paid for the trip.For bus users, TC equals the ticket price.Seeing from (12), parameters for TT and TC are generic across three alternatives.Thus, four parameters are going to be estimated in total (two of them are alternativespecific constant, which has nothing to do with any attribute).
The attribute levels and prior information about parameters are given in Table 3 based on previous study results as well as to preserve realistic estimates for the private and public transport alternatives.In order to obtain better estimation of parameters, three levels are set for each attribute for maximum variation as Obviously, this number is too large for a single respondent.Thus, we introduce a block variable to divide the design into smaller parts (i.e., here we block the design into three parts so that six choice situations are provided to a single respondent).Each block is not orthogonal by itself, but in combination with other blocks.Attribute level balance is maintained as much as possible in each block.We generate three different (attribute level balanced) designs with 18 choice situations assuming the above MNL model, using the software Ngene 1.1.1.The design results are shown in Table 4 as well as -error value for each design.

Results and Discussion
As expected, the two efficient designs produce lower error value (0.118313 and 0.114612 for the OOC design and -efficient design, resp.), while orthogonal design produces higher -error value (0.194724), seeing from Table 4.The -error of the orthogonal design is 1.64 times greater than the -error value of the OOC design and 1.69 times greater than -efficient design.This suggests that, on average, the asymptotic standard errors of the parameter estimates using the orthogonal design will be 1.28 to 1.31 times larger than the efficient designs.Clearly, the efficient designs are able to provide more reliable parameter estimates than orthogonal design.
On the other hand, comparing the two efficient designs, -efficient outperforms OOC in terms of the -error value (0.114612 versus 0.118313).'Furthermore, since the OOC method can only generate designs with generic attributes, the use of it is largely limited.In a word, with high statistical

𝐷-efficient
(i) The smaller the asymptotic standard errors achieved, the smaller the width of the confidence intervals observed around the parameters estimates will be (ii) -Radios will be maximized thus producing more reliable study results and analyst is able to minimize the sample size (i) In general not orthogonal (not that important) (ii) Advanced knowledge of the parameter estimates is needed (iii) It needs more computation power efficiency and wide applicability, -efficient design achieves the best performance.
Further, most research focused on developing one of these experimental design methods by far.Though there may be a few of discussions about strength and shortcomings of every method separately, comparisons are rarely found in theory or practice area [12,18,19].Here, we conclude the advantages and disadvantages of these three methods in Table 5.Also, a popularity rate is given as a reference of their applications in the field.

Conclusion
The SC experiment has been generally regarded as an effective method for discrete choice analysis, especially for newly introduced alternatives.The high cost on survey forces researchers to find more efficient design methods to obtain better estimation on parameters instead of investing a larger sample size.Though orthogonal design has been used as the major experimental design method, orthogonality is not that important in the nonlinear discrete choice models.In this paper, we provide two state-of-the-art efficient designs: OOC and -efficient design.By comparing orthogonal, OOC, and -efficient design in both theory and practice, we find that efficient designs are more capable of producing more efficient data in the sense that more reliable parameter estimates can be achieved with an equal or lower sample size.The generation process requires the assumption of prior parameter estimates and model structure to construct AVC matrix.The result suggests a move away from orthogonal designs for SC experiments towards -efficient designs, which make relative discrete choice models being more fit with such data.

N se 1 Figure 1 :
Figure 1: Comparison of investing in larger sample sizes versus more efficient designs.

Table 1 :
Optimal orthogonal choice design for 2 alternatives with 3 binary attributes.

Table 2 :
Optimal orthogonal choice design for 3 alternatives with 3 binary attributes.

Table 3 :
Prior parameter values and attribute levels for case study.The values of TT car , TT taxi , and TT bus are measured in free, normal, and congested traffic flow.The values of TC car are calculated as the kilometers between the origin and destination by the consumed oil price under 7.3 RMB/liter, 7.7 RMB/liter, and 8 RMB/liter.The values of TC taxi are measured in free, normal, and congested traffic flow.The value of TC bus is based on the current price and plus/minus 1 RMB.The number of choice situations (i.e., 18) is selected such that attribute level balance can be achieved.

Table 4 :
Experimental designs for empirical study.