Exploring Route Choice Behaviours Accommodating Stochastic Choice Set Generations

Modelling route choice behaviours are essential in traffic operation and transportation planning. Many studies have focused on route choice behaviour using the stochastic model, and they have tried to construct the heterogeneous route choice model with various types of data. -is study aims to develop the route choice model incorporating travellers’ heterogeneity according to the stochastic route choice set. -e model is evaluated from the empirical travel data based on a radio frequency identification device (RFID) called dedicated short-range communication (DSRC).-e reliability level is defined to explore the travellers’ heterogeneity in the choice set generation model. -e heterogeneous K-reliable shortest path(HKαRSP-) based route choice model is established to incorporate travellers’ heterogeneity in route choice behaviour. -e model parameters are estimated for the mixed path-size correction logit (MPSCL)model, considering the overlapping paths and the heterogeneous behaviour in the route choice model. -e different behaviours concerning the chosen routes are analysed to interpret the route choice behaviour from revealed preference data by comparing the different coefficients’ magnitude.-ere are model validation processes to confirm the prediction accuracy according to travel distance. -is study discusses the policy implication to introduce the traveller specified route travel guidance system.


Introduction
Many studies have focused on the modelling behaviours of choosing routes using mathematical and empirical solutions. ey have used various stochastic models to provide mathematical approaches for searching the available paths and choosing the most feasible routes [1][2][3][4]. Also, transportation researchers have attempted to formulate the route choice behaviours using empirical data. Recently, the development of intelligent transportation system (ITS), such as the vehicle detection system (VDS), automatic video system (AVS), closed-circuit television (CCTV), and variable message sign (VMS), has made it possible to collect and process the various data. ese various types of travel information provide drivers' judgement about alternative routes [5].
Nevertheless, many travellers usually acquire limited information from experienced travel time [6]. e enormous amounts of data have allowed researchers to analyse travel behaviours and consider mathematical solutions.
e process of generating a set of routes has been constructed using the travellers' cognitive process in choosing a route, and a reasonable number of routes have been derived from increasing the accuracy of the modelling process [7]. Furthermore, the researcher's interest in travel time reliability has increased during the last decades. e travel time reliability problem has required consideration of individuals' perceptions of the uncertainty of the travel time. e travel time perceived by an individual has been defined as a cumulative distribution function based on travellers' experiences [8,9]. e concepts of perceived travel time and travel time reliability have been widely used to evaluate the traffic states in transportation operation.
Each traveller has a perceived travel time for a specific origin and destination (OD) pair to set up the travel time with travel time uncertainty when they start to travel. Even though there are several effective routes, travellers choose another route due to their travel experience. e process of generating a choice set involves modelling the cognitive process incorporating traveller's heterogeneity. e previous models based on rationality are somewhat limited in their ability to explain irrational choice behaviours. is problem results from the limited information about alternative routes. Many studies have recently explored these personal characteristics in the models, e.g., prospect theory, bounded rationality, and choice inertia. e route choice behaviour modelling should incorporate travellers' heterogeneity in the choice set generation and the route choice model. is study aims to construct the route choice model accommodating heterogeneous route choice behaviour for travel time reliability. e route choice model is developed through the heterogeneous K-α-reliable shortest path searching (HKαRSP) method using the reliability level derived by comparing the travel time budget (TTB) of the network of the individuals. Section 2 of this study reviews the choice set generation model and the route choice model comparing travel time reliability studies. In Section 3, the definitions of the terms are presented to establish the models. Section 4 introduces the methodology of determining the size of the choice set and the modelling of route choice behaviour which is developed using the concept of travel time reliability. An empirical analysis is conducted in Section 5 to estimate the route choice behaviour using processed travel data. Section 6 is the conclusion, which summarises the results of this study and discusses future research.

Literature Review
e traditional K shortest paths were modelled to determine the shortest paths concerning the travel time, assuming the determined link travel time. However, link travel times in the network have consistently been observed to have stochastic characteristics recognised by travellers. Several methodologies have been proposed to measure travel time reliability, e.g., the probability of on-time arrival, the TTB, α-reliable mean-excess travel time, and α-reliable travel time for generating the route choice set [10]. Among these methodologies, the on-time arrival probability was applied to the route choice model. e probability distribution function enabled calculating the probability of occurrence for each path, and it generated K paths according to their probabilities [11]. Mathematical modelling has been introduced to determine the optimal path from the sum of the distribution probabilities of the link travel time with the TTB [12,13]. Researchers have employed the label-correcting algorithm to analyse the time-dependent problem and search for reliable paths [14]. e deviation-based path set generation models have defined the distribution of link travel time as normal distribution and constructed the stochastic travel time between the OD pairs to compare the network's reliable paths. ese studies made it possible to remove nondominant paths and derive dominant paths through the various constraints [15,16]. Other studies derived a user equilibrium model by dividing the travel time variation into predictable and unpredictable travel times in the route choice process using the α-reliable, mean-excess approach [17,18]. A route choice model was constructed which reflected risk-aversion characteristics by generating the probability using the TTB [19,20]. e other stochastic travel time-based models employed the TTB for travel time reliability to reflect individuals' heterogeneous risk-averse characteristics [21,22]. e α-reliable travel time was used to determine the optimal path based on travellers' risk preference using the TTB. e models classified individual risk preference levels into riskseeking, risk-neutral, and risk-averse travellers to derive the optimal paths for each scenario [16,23]. ere was other research dealing with the system optimum model reflecting the fuzzy network theory. Research has considered the perceived travel time and risk-taking properties in traffic assignment problem by incorporating fuzzy utility theorem. ey discussed the differences between conventional and fuzzy network theory-based equilibrium model [24][25][26][27][28][29].
Numerous studies have been conducted to reflect the individual's choice behaviour in the model. McFadden (1973) developed the multinomial logit (MNL) model, which is a general form of the random utility-based choice model [30]. In transportation demand analysis, the MNL model has been applied at the mode choice stage before using the route choice model. e probit model and the MNL model also were used in stochastic or probabilistic assignment models [31,32]. Many researchers have used explanatory variables to make the models more feasible, such as landmark dummy, percentage of the major road, percentage of uninterrupted flow, and delay percentage [1,3,[33][34][35]]. e MNL model had some drawbacks in the route choice model, i.e., (1) it does not consider the identification of an individual traveller's choice set, (2) it does not reflect the overlapping links in routes, and (3) it does not consider travellers' heterogeneity in choice behaviour. Several models have been proposed to improve the MNL model, including extended logit models to overcome overlapping links among routes. ese models were composed of a deterministic term and a random error term that includes the additional overlapping term in the utility function. A modified MNL model, called the C-logit model, was proposed by subtracting the utility function's commonality [5]. Researchers have developed the implicit availability/perception (IAP) logit model by aggregating the path generation model and the route choice model using travellers' perceptions of routes [36]. e most considered route choice model to overcome the overlapping problem was the path-size logit model (PSL). e PSL model was introduced to modify the MNL model and considered the degree of overlapping of the routes in the MNL model [37][38][39]. e other researchers also proposed an improved PSL model, known as the path-size correction logit model (PSCL), by suggesting detailed and systematic derivations of the assumption for correcting the path-size factor [35].
ere were other types of models based on the generalised extreme value (GEV) theory that considers the hierarchical structure for choices to capture the error component of overlapping links. Such models included the paired combinational logit (PCL) model [40,41], the cross nested logit (CNL) model [34,41,42], the generalised nested logit (GNL) model [34,43,44], and the mixed multinomial logit (MMNL) model [45,46] summarised in Table 1. Since individual travellers have different travel experiences, they have different characteristics in determining choice sets and route choice behaviours. is study contributes a new approach for the generation of choice sets that incorporate individual travel behaviours and uses actual travel data to validate the choice behaviour. Significant differences compare the shortest path problem by perceived travel time from the previous research [16,29]. e different methodologies of choice set generation are compared to improve the accuracy of route choice models, i.e., the k shortest paths-based choice model (KSP), the k reliable shortest paths-based choice model (KαRSP), and the heterogeneous k reliable shortest paths-based choice model (HKαRSP). e route choice models are used to compare the accuracy of choice probability according to the choice set generation methodologies from various choice models, i.e., MNL, PSL, PSCL, CNL, PCL, and MMNL models. e route choice model makes it possible to determine whether the choice set generation models are well-formulated.

Measuring of Individual Travel
Time Reliability

Travel Time Budget (TTB).
Travel time reliability is generated from the travel experience of individual travellers in specific OD pairs. It defines the distribution function of perceived travel time to obtain the probability of on-time arrival. e TTB is introduced to identify the risk preferences from the distribution as the value determined by the confidence level. e TTB has been defined the minimum total travel time threshold satisfied the reliability requirement with constraints, concerning the percentile of total travel time distribution specified by decision-makers using the confidence level, α. e meaning of this definition is interpreted as the value derived from the distribution of travel time by using the predetermined confidence level [9,47]. Based on actual travel data and previous research, a lognormal distribution, a nonnegative and asymmetrical distribution, represents the stochastic travel time [16,47]. erefore, the travel time distributions for OD pairs are assumed to follow a lognormal distribution, lognormal (μ, σ), represented to the probability density function (PDF) and cumulative distribution function (CDF). In this study, TTB means that travellers plan for the travel time before departure to achieve their requirement of travel time reliability, which is expressed by the distribution of travel time experienced in the network for the confidence level, α, and the reliability level, α l . ere are three kinds of TTBs, i.e., TTB in the network, TTB of route k, and TTB for an individual. e TTB is required to achieve an α confidence level in the network from the origin, i, to the destination, j. e TTB in the network is TTB T ij (α) in equation (1). TTB of the k th path required achieving the α confidence level from the origin, i, to the destination, j. e TTB of route k in the network is TTB T ij k (α) in equation (2). e TTB required to achieve the α confidence level for individual l from the origin node i to the destination node j. e TTB for individuals is TTB T ijl (α) in equation (3): where i is the origin, j is the destination, k is the order of the α-reliable path or the predetermined number of the route choice set, l is the individual traveler, α is the confidence level (i.e., on-time arrival probability), μ ij is the mean of the travel time distribution from the origin, i, to the destination, j, μ ij k is the mean of the travel time distribution of the k th α-reliable path from the origin, i, to the destination, j, μ ijl is the perceived mean of travel time distribution for individual, l, from the origin, i, to the destination, j, σ ij is the standard deviation of travel time distribution from the origin, i to the destination, j, σ ij k is the standard deviation of the travel time distribution of the k th α-reliable path from the origin, i, to the destination, j, and σ ijl is the perceived standard deviation of the travel time distribution for individual, l, from the origin, i, to the destination, j.

Risk Preferences.
e TTB has a structure combined with the predictable travel time in the travel time distribution. Travellers accept the predictable risk from their experiences to meet the predetermined travel time, which is defined as the TTB in the OD pair. Individual travellers set up a TTB for a specific OD pair using the perceived travel time based on their experience. e distribution of perceived travel time is expressed more clearly as individual travellers accumulated travel time for a specific OD pair. e travel time distribution in the network causes individual travellers to incur late arrivals because the distribution of the individual travel time is different from the distribution of the travel time determined by the network.
Reliability level (α l ) means that individuals determine the value of the probability of on-time arrival by cumulative distribution for a specific OD pair in the network based on repeated travels. When the TTB of an individual at the confidence level α has the same TTB of the network for a specific OD pair, the TTB represents the reliability level, α l on the cumulative distribution of the network. e reliability level, α, is expressed as the on-time arrival probability for an individual's perceived TTB from the travel time distribution in this study. Risk preference is defined that the travellers have the characteristics of risk-taking for travel failure or delay due to travel time reliability. Since individual travellers have different risk preferences based on their travel experiences for each specific OD pair (i, j), the reliability level, α l , are determined individual risk preference. e Journal of Advanced Transportation characteristics of individual travellers were categorised, as shown below [16]: α l > 0.5, risk-averse for on-time arrival α l � 0.5, risk-neutral for on-time arrival α l < 0.5, risk-seeking for on-time arrival Risk preference is an essential factor in the choice set generation model. e process of choice set generation modelling is formulated using the reliability level, α l , which is referred to as risk preference. e reliability level, α l , is determined according to the difference between the individual's perceived travel time and the travel time provided by the network, so a difference occurred in generating the choice set. is analysis develops a route choice model that reflected travellers' behaviour according to whether they were risk-seeking or risk-averse in Figure 1. When the TTB for an individual is derived from the confidence level, α, according to the mean and standard deviation in the travel time distribution, it is possible to compare the TTB for the individual and the network confidence level, α. In other words, when the travel time experienced by an individual is less than the travel time in the network, the traveller would be concerned about late arrival based on the perceived travel time, in case of which it is defined as the risk-seeking characteristic. However, individual travellers' experiences indicate that they have more travel time than the network's travel time because they have experienced more travel time for the specific OD pair (i, j). Risk preference makes travellers calculate the TTB to arrive on time, which is a characteristic of risk-averse travellers.

Route Choice Behaviour.
e travel behaviour models have developed the following structure by dividing the choice set generation and route choice model. Researchers have tried to construct the modelling framework of route choice behaviour [7,46]. e model is constructed to determine the size of the consideration set and individual choice set. Consideration choice set is derived by the number of experienced routes using the observed data from the universal set occurring in the network for a specific OD pair. A modelling process also includes a different choice set for individuals using TTB and risk preference in the individual choice set generation. e route choice model using the individual choice set is derived from the collective individual travel data. e individual choice set is a set of routes for incorporating traveller's heterogeneity. It is Cascetta et al. [36], schussler and Axhausen [5], zhou et al. [18] PSL (path-size logit) Frejinger et al. [38], schussler and Axhausen [5], Li et al. [39] Multiplications of unobserved probability(P ij ) Bliemer and Bovy [35] CNL (crossnested logit)

Multiplications of Marginal(nested) probability
Prato and bekhor [34], bliemer and Bovy [35] GNL (generalized nested logit) Including the allocation parameter(m, α nm ) Prato and bekhor [34], wen and Koppelman [44] MNW Xu et al. [47] PSW (path-size weibit) Castillo et al. [52], kitthamkesorn and Chen [53] Mixed logit MMNL (mixed multinomial logit) Factor analytic specification Ramming [51], Prato and Bekhor [34], Alizadeh et al. [54], Lee et al. [55] important to determine the choice set by the different travel behaviour for individuals. Likewise, travellers consist of their own considered choice set of routes from information and experience. ey set for their own choice set to choose the proper route of the travel. In comparison between the cognitive and modelling process, the constructing set of choice is crucial for interpreting route choice behaviour. e cognitive process and modelling process for route choice behaviour is shown in Figure 2 Determining the choice set in the route choice model is essential because it affects the prediction accuracy in modelling results [41]. Since it is important to know the routes considered in the network, this study employs actual travel data to derive the size of the consideration set and the individual choice set. Travellers identify the choice set for their travel by travelling the known routes and determining the alternative routes. e travellers recognise the optimal path between specific OD pairs according to their individual experiences, and the individuals choose the observed route from their optimal choice sets. us, all observed choice sets could be the optimal paths experienced by individuals for the OD pairs. Travellers repeat creating and determining a route from the set of choices by considering their specific situations.
e route searching algorithm is developed to generate a choice set for individuals using TTB and risk preference. is algorithm generates a set of considered paths using the CDF of travel times. A set of individual paths is determined according to the number of paths specified in advance. It is necessary to generate the choice set with an appropriate size to estimate the choice probability. ere are experienced paths that could be used to determine the proper size of the choice set, making it possible to know the exact path for each traveller. Also, the single alternative chosen by a traveller is one of the experienced paths. Fiorenzo-Catalano [48] mentioned the importance of determining the choice set by  Journal of Advanced Transportation considering the researchers' perspectives because there are differences between travellers' perspectives and researchers' perspectives [48]. Since the researchers do not know individual travellers' choice sets, some assumptions are required in the choice set generation model. Two sets identify the appropriate choice set in the route choice model, i.e., the consideration set and the individual choice set. e consideration set includes the paths that most travellers are likely to choose. Besides, individual choice sets have the proper size for individual travellers to make their route choices. e k-α-reliable shortest path searching algorithm for generating an individual choice set is consisted of eight steps, as shown below. First, the observed travel time is extracted from the database for a specific OD pair. Next, confidence level, α, is specified for the travel time reliability to achieve network performance with the value of 0.9 or more, as suggested in the previous study [16,49]. en, it is necessary to calculate the travel time distribution in the network and the TTB to derive the reliability level, α l (TTB T ij (α)). Next, the travel time distribution is formed for each route TTB (TTB T ij k (α)). From this process, the reliability level, α l , is calculated according to the individual travel time distribution from the individual TTB (TTB T ijl (α)). Finally, the choice set for an individual is derived by calculating TTB according to reliability level, α l . e algorithm for searching choice set includes the procedure for probabilistic reliable path searching algorithm for k-α-reliable shortest paths (PRPSA-KαRSP). (Algorithm 1) Figure 3 illustrates an example to understand the differences in travel time budgets.
ere are five alternatives to choose the proper route for the traveller. Some travellers choose the dominant route A among the alternatives due to the fastest mean travel time; on the contrary, the other travellers are willing to choose route B for the reason of reliable travel route. In addition, the travellers varied with the formations of different choice sets considering travel experiences with reliability level, α l . e above algorithm is revised to generate the individual route choice set considering the observed route travels. Generating individual choice sets among various OD pairs should be repeated to model travellers' heterogeneity. Due to the different characteristics of individuals' observed choice set, it is possible to implement and derive the different perceived choice sets using the algorithm above. Even though the same travellers are on the other OD pairs, they would have different choice sets between the OD pairs due to their different experiences. e models are compared to the other models to evaluate each model's accuracy by developing the route choice model based on the choice set generation models.

Route Choice Model.
ere are various choice models to deal with the overlapping problems and cognitive process in the models. We explored which types of models are suitable for using data types and behavioural differences.
ere are overlapping problems in the route choice model, so it is necessary to propose an appropriate form. Also, a model that incorporated the heterogeneity of the travellers' route choice behaviour was suggested. ere are various types of models, such as MNL [33], PSL and PSCL [35,41], GNL [34], MMNL [45,46], and MPSCL, based on the three kinds of choice set generation models. We compare those types of choice models considering data type and goodnessof-fit indexes.
Researchers tried to develop the improved model form in the overlapping problem. ey developed the path-size logit model (PSL) for the improved MNL model, considering the degree of overlapping links. Bovy et al. proposed the improved path-size logit model [35]. Since there is no satisfactory derivation based on theoretical arguments, it is necessary to employ the correction terms. e model  (4) and (5): where V is the A n by K matrix of variables, β is the column vector of K unknown parameters for variables, PSC in is the A n by one vector of path-size correction term, L i is the length of the travelled route of alternative i, l a is the overlapped link a, and δ aj is the binary variable if the link a exists in route L i , 1, otherwise 0. Moreover, researchers have proposed a mixed logit model to overcome the limitations of the logit model by adding error terms in the equation to account for the correlation among routes [39]. Since travellers' perceived routes are correlated, the error term is added to illustrate the relationship based on the topology of paths. e error term is divided into two parts in the model. One part represents correlation and heterogeneity, and the other part describes i.i.d (independently identically differentiated) extreme value. e equation of MPSCL is presented as P n (i) � Λ(i|ξ) � exp μ X in β + F in Tξ + ln PSC in J∈C n exp μ X jn β + F jn Tξ + ln PSC jn , where X is C n by K matrix of variables, β is the column vector of K unknown parameters for variables, F in Tξ is C n by one vector of error terms, F is the C n by M factor loading matrix, T is M by M lower triangular matrix of unknown parameters, ζ is M by one vector of i.i.d standard normal variables as unobservable factors, ] is M by M lower triangular matrix of unknown parameters, and Γ(k | ζ) is the probability of chosen route k with given ζ.

Data Descriptions.
A case study was performed to apply the proposed methodology to solve the HKαRSP problem. e actual travelled data were used on the road network in the Daegu metropolitan area in South Korea. e actual path travel data were constructed by processing the information collected by the roadside equipment (RSE) installed on the intersections between arterial roads. Information of vehicular travel was collected using telecommunications between the RSE device installed on the road and the on-board unit (OBU) device installed in vehicles by a dedicated short-range communication (DSRC) device.
Step 1. Choosing the OD pair to observe the path (i, j) Step 2. Setting the confidence level, α, for the satisfaction of the level of service, i.e.,α � 0. 9 Step 3. Building the distribution of travel time (T ij ) for travel from origin i to destination j, and calculating the TTB (TTB T ij (α)), concerning the confidence level, α Step 4. Building the distribution of travel time for the k th path (T ij k ), ∀k ∈ (1, . . . , U), for each observed travel from origin i to destination j Step 5. Building the distribution of travel time for individual l (T ijl ) for each traveller from origin i to destination j Step 6. Evaluating the reliability level, α l , for each traveller l,  Journal of Advanced Transportation e DSRC was a useful technique for collecting traffic information, such as the number of vehicles passing by a specific location. e data were more accurate than the GPS data used in previous research. However, since it collected point data, a conversion process was required to track the OBU ID of an individual vehicle observed from the RSE to convert the data into individual route data. e model included a process of generating routes to track an individual's chosen routes. It used the route data with the high frequency for a specific OD pair to model an individual's perceived travel time. From a brief analysis of the data, basic statistics and study area are shown in Table 2.

Data Processing and Missing Correction.
Since the DSRC data was a type of point-based data observed at an intersection of the arterial roads, it was necessary to convert them into route data. e process for tracking the travellers with the same vehicle ID (OBU ID) was conducted to identify each route. e model was constructed based on methods of classifying and generating route data by tracking individual vehicles.
It was necessary to identify the individual vehicles to change from point data to route data. e observed-time variable was used to construct this process. If the observed times for individuals on RSE were arranged in order, it was possible to produce the individuals' route data. e link travel time was calculated while moving from node to node, and it included checking whether the path was configured using the link travel time. To generate the route travel time for specific OD pairs, it is necessary to produce the route travel data from point observation data. When the link travel time was excessive from a certain marginal value (divided by 10 minutes), it was divided into different travels [50]. We also scattered the plot using the observed travel time to separate the route travel, including about 98% of travels in 10 minutes. e route data process has presented the step by step to generate each travellers' route travel, sequentially listing the data observed at the point (see Figure 4(a)). However, it was impossible to confirm whether the link between the two nodes is connected or not. ere were the following three types of missing data.
(1) Missing data between nodes on an arterial road by straightway, (2) missing data between nodes for the type of road with the uninterrupted flow, and (3) observation of one node on two different observations. It was necessary to define the links between the nodes to ensure whether they were related links. If the produced route data were the case of missing data, it was necessary to identify the target nodes or links. With the missing correction method, the route data were connected with the other node. is process was performed using all of the missing data. e developed algorithm performed the missing correction procedures (see Figure 4(b)).

Structure of Route Choice Model.
e specified model has required the actual data to generate individual choice sets based on the distribution of perceived travel times. e individual choice set was a set of paths that incorporated the travellers' heterogeneity. e choice set generation model determines the individual choice sets based on the different travel experiences. e route choice model that incorporated the heterogeneous choice set generation model was used to compare the travel behaviours.
ere were more than 30 thousands of possible OD pairs among nodes. It was necessary to choose the feasible data for analysis of route choice behaviour. Since some OD pairs were too close or far away to analyse the travel behaviours, available OD pairs were selected, having more than twenty thousand observed trips and proper distances within 5 km to 25 km between OD pairs. e 76 OD pairs were chosen for the analysis to describe the heterogeneous travel behaviours. From the observations, 40 thousands travellers having frequent observed trips were selected for the final analysis. As mentioned before, it was important to determine the appropriate set to be considered from the thousands of alternatives. A methodology was established for choosing the choice set to be considered using actual travel data. e consideration choice sets should include all of the possible choice sets for most of the travellers. According to the assumptions presented above, we determined the possible number of consideration set and the individual's number of the choice set (K) using the observed data. Since the use of all observed paths was against the assumption, the size of the consideration set was determined to 16 observed paths considering 90% of the coverage probability as consideration choice set. It is necessary to determine how many travelled paths were chosen in the choice set for individuals from the observed data. To determine the alternative K for each individual, the 80% observed routes for each individual were calculated on average 3.12 routes except for observed at once, and the number of individual choices set was determined as four paths in the model. e developed model used the actual travel data to analyse the route choice model. e NLOGIT 6.0 program, which is generally used for econometric analysis, was used to analyse the route choice model in this study. e MNL model was developed for estimating the parameters in the choice model with maximum likelihood estimation (MLE) methods. Generally, the more explanatory variables make a better goodness-of-fit index, but the correlated variables decrease the accuracy of parameter estimation. Even though there are many other kinds of variables from the raw data, it is necessary to analyse the correlation among variables to identify the effects of parameters appropriately. e explanatory variables were compared to whether the variables improve the goodness-of-fit or multicollinearity, and Pearson correlation analysis was employed to choose the appropriate variables. e model was developed to compare the relative size of variables between alternatives in the model without alternative specific constants (ASCs). It was necessary to retain the dummy variables to avoid biasedness [51]. Since travellers tended to consider more travel attributes than an immanent attribute of alternatives in route choice, the additional variables were needed instead of ASCs in the model. e variables were used to analyse the route 8 Journal of Advanced Transportation choice model using the DSRC data, i.e., travel time, buffer time, distance, ratio of uninterrupted flow road, tolls, and number of bridges. e final model was established with the several chosen variables of the following equation: where V k is the utility function for alternative k, β i are the parameters, μ ij k is the mean travel time for alternative k from origin i to destination j, BT ij k is the buffer travel time for alternative k from origin i to destination j, DIST ij k is the distance travelled for alternative k from origin i to destination j, UNINT ij k is the ratio of uninterrupted flow for alternative k from origin i to destination j, TOLL ij k is the toll for alternative k from origin i to destination j, and BRIDGE ij k is the number of bridges for alternative k from origin i to destination j.

Heterogeneous Route Choice Models.
e model was determined by evaluating the data, modelling structure, and goodness-of-fit index among the various other models, i.e., MNL, PSL, PSCL, MMNL, and MPSCL.
is research employed the MPSCL model reflecting the overlapping links and considering the traveller's heterogeneity. e MPSCL model is necessary to analyse the route choice behaviour considered the route overlapping, which has a significant impact on the model's estimation, and the model had a much improved ρ 2 compared to the other models. e result showed the route choice model based on the different choice set generation model. e results of the model comparison are presented in Table 3. e proposed model provided the most precise prediction of a route's choice probability using choice set generation with traveller heterogeneity. Due to the coincidence of consideration set generation and path-size correction term, the model had better model fitness indexes. Consideration of identified choice set for travellers was adopted in the MPSCL model. e model had a better accuracy of prediction for route choice probability in HKαRSP model than the KαRSP and KSP model. e estimated model parameters had the appropriate value in the model and drew the significance at 1% level for most models. e parameters represented the variables' variations; in other words, the variables had a different effect on the choice for individual travellers, which is modelled by the random parameters. e mean and standard deviation parameters of travel time affected the model in the MPSCL  ere was a tendency for less preference to use of toll road for travel in the urban area. Travellers tended to avoid crossing the bridge in the model due to traffic congestion. Reflecting the traveller heterogeneity in the mixed logit model made the accuracy of estimations. is was due to the consistency of the structure for a choice set generation model and route choice model. Also, we evaluated the best fit for the MPSCL model with HKαRSP choice set generation model. e model had better model fitness indexes which resulted from the coincidence of consideration set generation and path-size correction term.
Many studies have recently been conducted to provide a new concept of transportation services such as smart mobility, mobility-as-a-service (MaaS), and an autonomous vehicle.
ese studies focused on identifying individual preferences and providing more convenient service by combining various travel modes suitable for those preferences. From this perspective, analysing route choice behaviour based on individual travel experience would be an important process in introducing new transportation services. e results derived through this study were judged to establish a more efficient transportation operation strategy by providing information on the reliable route for an individuals' preference. e provision of transportation services should provide faster information from individuals' experiences, and such information makes the entire system operate efficiently.

Model Validation.
We validated the prediction accuracy for the route choice probability using the estimated parameters.
ere are differences according to the distance between OD pairs, and it is necessary to divide with the three categories based on the distance (short/medium/long distance).
e prediction results were calculated by the observed travel attributes for each OD pair considering types of

Conclusion
In this study, the distributional characteristics were employed to model the uncertainty concerning the travel time for individual travellers. We used the concept of TTB and the probability that travellers would arrive at their destinations on time.
e definition of risk preference was introduced according to the difference between the TTB considered by individual travellers and the TTB presented in the network.
ere was a process for generating an individual choice set based on the accumulated experience of individuals. e process of route choice was performed to consider a different choice set for each traveller. e reliability level, α l , generated a path set by the cumulative travel time distribution for each path. We constructed a model for generating the choice set for individual travellers to incorporate the traveller's heterogeneity.
e results obtained from actual path travel data showed that most travellers might consider the dominant path and select alternative paths similar to it if one dominant path exists. Also, travellers chose reliable paths to ensure ontime arrivals by the generated choice set. e travellers were more sensitive to travel distance than travel time in the level of service attributes. e coefficients of travel time were in the range from −0.3121 to 0.0077, and the coefficients of travel distance were in the range from −1.1117 to −0.5840 in the level of service attributes. e travellers tended to have preferences for the use of uninterrupted flow and bridges, and they preferred not to use toll roads. e coefficients of the ratio of uninterrupted flow were in the range from −0.1099 to 4.7544, the estimation result of toll roads was in the range from −0.0959 to −0.0342, and the parameters of bridges were in the range from −1.1636 to −0.0508. e model had a better accuracy of prediction for route choice probability in the HKαRSP model than the KαRSP and KSP models. We derived better prediction according to the different travel distances. e results are applicable to transportation planning and traffic management by clarifying the choice set considered in the existing network. Moreover, it was possible to establish a strategy for providing route information using individuals' behavioural characteristics concerning transportation operation. Depending on the individual's risk preference, a different set of paths was considered, and a set of paths was established to provide information that is tailored to the individual reliability level, α l . is study contributes to increasing the efficiency of traffic operation and planning according to individuals' route attributes.
ere is further research from the additional improvements in modelling. e choice set generation model derives the appropriate number of sets as a necessary process for constructing the route choice model. It is necessary to compute travel time distribution following the time-dependent model to compare the differences in the choice sets. Also, the methodology for estimating the route travel time can be developed based on the difference between an individual's actual travel time on a given route and the estimated route travel time from the link travel time distribution. Finally, this research can extend the stochastic user equilibrium model according to travellers' risk preferences using the route choice model, such as the fuzzy traffic assignment model.

Data Availability
e data used in this research were provided by the Trlab research programme conducted at the Seoul National University, Seoul, Republic of Korea. e data used to support the findings of the study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare that there are no conflicts of interest regarding the publication of this paper.