Improving Location Prediction by Exploring Spatial-Temporal-Social

As there is great differences of movement patterns and social correlation between weekdays and weekends, we propose a fallback social-temporal-hierarchicMarkovmodel (FSTHM) to predict individual’s future location.The division of weekdays and weekends is used to decompose the original state of traditional Markov model into two different states and distinguish the difference of the strength of social ties on weekdays and weekends. Except for the time division, the distribution of the visit time for each state is also considered to improve the predictive performance. In addition, in order to best suit the characteristics of Markov model, we introduce the modified cross-sample entropy to quantify the similarities between the individual and his friends. The experiments based on real location-based social network show the FSTHM model gives a 9% improvement over the Markov model and 2% improvement over the social Markov models which use cosine similarity or mutual information to measure the social correlation.


Introduction
The last few years have witnessed a considerable increase in the number of mobile devices along with the use of wireless communication and a rapid development of location-based social networks, which collects a large amount of individuals' movement data.For example, "high-resolution GPS mobility data", "coarse-grained mobile phone data" and "social and geographical context obtained from online location-based social networks" are parallel relation.Recently, the researches on individuals' movement behavior not only concern the mobility regularity, but also explore how to use individuals' historical context to predict the future location.The ability of exactly predicting users' future location could provide more natural and customized services for location-based applications.Generally, individual's movement data consist of a series of geographic coordinates points stamped with time.However, with the development of online locationbased social networks (LBNSs), they provide location related services that allow users to "check-in" at geographical locations and share such experiences with their friends.Millions of "check-in" records in LBSNs contain rich information of social and geographical context and provide a unique opportunity for researchers to study user's movement behaviors from a social perspective [1].
There are three major graphs in LBSNs: a user-location graph, a location-location graph, and a user-user graph [2].The user-location graph indicates the user's visit history, to some extent, which implies the user's interests and temporal information of the visited places.The locationlocation graph shows the transition information between the locations.The user-user graph represents a social structure made up of individuals and their friends, which could help us judging how close they are and what type their friends are from their visit history.Therefore, the availability of multiple information sources presents various views to study a user's mobile behavior from spatial-temporal, socialtemporal, social-spatial, and spatial-temporal-social aspects.
From spatial-temporal aspect, existing location prediction methods could be divided into three categories.
(1) Temporal-Based Location Prediction Methods.For example, most frequent daily model, most frequent hourly model, and most frequent daily hourly model.This kind of location prediction method considers individual's movements to be a series of isolated physical position with a given time-stamp.The time information is usually segmented under uniform time intervals.The length of time interval would affect the predictive performance.If the number of visits is greater than one within a specified time interval, then temporalbased location prediction method could not completely grasp the user's all possible behaviors.Besides, user's activities are not isolated and there are potential patterns hidden behind the transitions between locations.Hence, except for the temporal information of individuals' movements, the usage of the spatial information could make better predictive performance.
(2) Spatial-Based Location Prediction Method.This sort of method mainly considers the spatial mobility regularity in individual's movements and neglects the specific time where they visited.Various spatial-based predicting techniques are used for the position prediction, for example, first-order Markov model [3], variable-length Markov model [4], hidden Markov model [5], Bayesian network [6], neural network, state predictor [7], methods based on the association rules mining [8], or movement patterns abstraction [9].
(3) Spatial-Temporal Location Prediction Method.Spatialbased location prediction methods define user's movement behaviors as sequential patterns or association rules that frequently appear among location sequences.The temporal information could help us identify the best matches from the patterns which have the highest similarity.For example, a user would go to lunch at midday but go home in the afternoon from the same location (office).
Previous researches have proved expanding location information by encoding temporal information within the spatial information could improve the location prediction performance.At present, time information is likely to be used in the following 3 ways.
(1) An Order Indicator.Temporal information is used to connect the visits chronologically for generating location sequence and then the time information is discarded.
(2) Time Stamp.The relative time for each location and the time interval between successive locations are recorded to describe individual's visit habits.Lei et al. proposed a spatialtemporal trajectory model (STT) to capture movement behaviors of objects [10].STT is represented as a probabilistic suffix tree with both spatial and temporal information of movements, which contains not only transition probabilities among frequent regions but also transition interval vector and representative time of next movement region for each tree node.
(3) Time Stamp and Hierarchy.Time hierarchy refers to the containment relationship among the time division with different length, for example, year, month, week, day, hour, and minute.Day and hour have shown comparatively higher influence to individual's movement regularity than year, month, and week.In literate [11], Gao et al. considered the temporal information of the visit happening at a given location could be decomposed into "daily" and "hourly" information and the user's visits at the specific location follow Gaussian distribution over the day of the week and the hour of the day.
As observed in [12], human movement exhibits strong temporal cyclic patterns in terms of the hour of the day and the day of the week.Previous work has leveraged these patterns as features under supervised learning to improve location prediction performance.However, there are large differences of the activity locations and interests between weekdays and weekends.On weekdays, an individual's life is relatively regular, but on weekends, he usually participates in some social activity.For example, user A usually leaves his home in the morning to go to his office on workdays, while on weekends he visits his parents from home in the morning.Literate [13] plotted a heat map of the average instantaneous entropies according to the local time of the week and this map shows the levels and the possible time of unpredictability mostly have differences between weekdays and weekends.Cho et al. [12] and so forth also found the work days show strong correlation with each other, but less correlation with the weekend days, and vice versa.Therefore, dividing the week into weekdays and weekends is very reasonable and that could make a better understanding of individuals' movements.
In recent years, the popularity of location-based and preference-aware recommender systems provide users more opportunities to leverage similar users' experiences to extend individual knowledge and retrieve the information matching their tastes with minimal efforts [14].Therefore, we could obtain better predictive performance by virtue of the movement data of their friends or similar users who share similar interests, locations, and travel sequences.Cho et al. [12] developed a model of human mobility that combines periodic short range movements with long-distance travel that is influenced by social network ties.The experiments showed that social relationships can explain about 10% to 30% of all human movement, while periodic behavior explains 50% to 70%.Although the effects of social ties are considered, the movement data of their friends are neglected.Gao et al. [1] proposed a social-historical model that integrates the social and historical effects to explore user's check-in behavior on LBSNs.Their model captured the property of user's checkin history and the strength of social ties between the user and his friends.Existing location prediction methods which considered social relationships usually applied the movement data of all his friends in every prediction.However, Eagle et al. presented the social connections concentrated on colleagues' traditional behaviors at work during the daytime and the proximity in the evening and on weekends it is representative of behaviors outside the work environment [15].Therefore, time information not only could reflect user's activity habits, but also help us to know the type of the activity and partition the friends into several groups.
To the purpose of understanding how it is possible to improve mobility prediction by exploiting the influence of temporal information on user's historical movement and social ties, we propose a fallback social-temporal-hierarchic Markov model (FSTHM) to predict the future location.We first decompose time into two time slices-weekdays and weekends, and each state of the traditional Markov model corresponds to two different states, one for each time slice.Then given the latest visit location and the next visit time, the probability of the next visit at given location is calculated by considering the spatial transition information and the relative daily time stamp.In addition, we use the modified cross-sample entropy to quantify the correlation between two mobility sequences and calculate the different similarities on weekdays and weekends.The experiments on real LBSNs dataset show our FSTHM gives a 9% improvement over the Markov model and 2% improvement over the social Markov models which use cosine similarity or mutual information to measure the social correlation.

Exploring User's Behavior from
Spatial-Temporal Aspects where  +1 =  ,week() indicates the ( + 1)th visit at state  ,week() ,   =  ,week(  ) indicates the th visit happened at state  ,week(  ) , week(t) indicates the discrete time slices of time  which is an element of set week = {weekdays, weekends}, and ( +1 ) = () indicates the ( + 1)th visit happens at time .Note that the variable  here is an absolute time indicating the time stamp of the next visit.
We decompose the time information  into two parts: week() and (), where () indicates the daily information abstracted in one day cycle.The location of candidate state  ,week() with the highest probability would be the prediction result of the ( + 1)th visit location.Using Bayes' rule, the probability in We consider (( +1 ) = () |   =  ,week(  ) ,  +1 =  ,week() ) = (( +1 ) = () |  +1 =  ,week() ) under the assumption that the current visit time is just associated with the current visit location but not with the last visit information [11].
(1) ( +1 =  ,week() |   =  ,week(  ) ). ( +1 =  ,week() |   =  ,week(  ) ) is the probability of the next visit at the state  ,week() observing that user's a series of visit history and the last state.According to user's visit history, the transition probability from state  ,week(  ) to  ,week() is defined as ( +1 =  ,week() |   =  ,week(  ) ) = (the number of the visits which happened at location   on week() after visited location   on week(  ))/(the number of the times visited at location   on week(  )).
Note that the probability ( +1 =  ,week() |   =  ,week(  ) ) in THM is equal to the probability in TM.
(2) (( +1 ) = () |  +1 =  ,week() ).(( +1 ) = () |  +1 =  ,week() ) indicates the probability of the next visit happening at time ( +1 ) = () given the next visit happening at location   on time week(t).Generally, the visits of a certain location usually focus on several time intervals.For example, user A often arrives at the work place at around 9 or 14 o' clock.We assume the user's visits of a specific location approximately follow mixture Gaussian distribution (GMM) which is the same as literate [10].Each component of GMM represents a temporal cluster and the parameters of the component's probability density functions could be extracted by an expectation-maximization (EM) algorithm.

Fallback Temporal-Hierarchic Markov Model.
With the abundance of the user's historical movement data, the temporal-hierarchic Markov model will be more robust.As the time information is projected down to weekdays and weekends, the new state pointing to the existing locations would be generated, which makes no predictive results.Therefore, we present a fallback temporal-hierarchic Markov model (FTHM) to resolve this issue.If no result could be gained from a state, we would transform the current state to the opposite state which corresponds to the same location but different time slice.For example, the state  1,weekdays is transformed into  1,weekends , and  1,weekends is transformed into  1,weekdays .If the opposite state exists, the probability ( +1 =  ,week() |   =  ,∼week(  ) , ( +1 ) = ()) is used to predict, where ∼week() is the opposite timeslice.Otherwise, the prediction by FTHM is fail.
In conclusion, the predictive probability of the next check-in location   ( +1 ) =   at time  +1 by TM, THM, and FTHM model are defined as (4)

Location Prediction Algorithm by Exploring Spatial-Temporal-Social Ties
The literates have confirmed the interdependence of human movement and social ties could help us to improve the accuracy of location prediction and the division of time into weekdays and weekends could partition the friends into several groups.Therefore, this paper would consider the similarities between the user and his friends on weekdays and weekends, respectively.Previous studies apply the cosine similarity [1] or mutual information [16] to quantify the strength of social ties, which only considers the different distributions of individuals' historical locations and neglects the transition information between location pairs.Entropy is an indicator of complexity of nonlinear time series and has been used in several kinds of applications [17].This paper uses the modified cross-sample entropy to quantify the correlation between the movements of different users.The cross-sample entropy is an effective technique for analyzing the degree of asynchrony between two related time series [18].Greater synchrony means the existence of some similar pattern of the two time series.Because the fallback temporal-hierarchic Markov model is used as an elementary predictor, the similarity of historical location sequence between individuals should be taken into full account.
Thus taking the weekday similarity between the user and his friend for example, we present Algorithm 1 to describe the detailed calculation steps of modified cross-sample entropy MCS, where  1 's historical location sequence on weekdays before time   is  =   1 ,⟨ 1 ,  ⟩,weekdays = { 1 ,  2 , . . .,   1 },  2 's location sequence on weekdays is  =   2 ,⟨ 1 ,  ⟩,weekdays = {V 1 , V 2 , . . ., V  2 }, the embedding dimension is m (the length of vectors to be compared), and the similarity tolerance is r (the tolerance for accepting matches).The calculation steps of weekend similarities are similar with the weekday similarities.

Experiments
4.1.Dataset.We consider a dataset of online location-based social network-Gowalla [17]-to capture human mobility.Gowalla contains 196,591 nodes, 950,327 edges, and 6,442,890 check-ins between Feb. 2009 and Oct. 2010.In our experiments, we select 199 users who have at least 1000 check-ins, 50 to 300 locations, and spend more than 200 days on Gowalla.We obtain 39,101 unique geographical check-in locations as the location vocabulary and 196 average distinct locations per user.

Evaluation Metrics.
To measure the predictive performance of different mobility models, we use the following evaluation metrics.Given a user u's observed location sequence,  = (  ( 1 ),   ( 2 ), . . .,   (  )), and the location sequence that is predicted by model.We separate the location sequence of each user into 10 parts, and each part has approximately equal check-in count.Let the time stamp at the end of each part be  = { 1 ,  2 , . . .,  10 }.We predict the check-in location at each part for the user, with his historical check-ins before that time as observed context.In addition, we put at least 10% of check-ins in the training set, so the comparisons of prediction results start from  2 .
(1) Prediction Accuracy (PA).The prediction accuracy for user  at part   is defined as PA(  ) = ∑  −1 <  ≤  (  (  ) =  *  (  ))/(check-in times between  −1 and   ), where I() is the Boolean indicator function that returns to 1 if its argument evaluate to true and returns to 0 otherwise.
(2) Prediction Failure.Indeed, except for the prediction accuracy, the percentage of no prediction results is also a key point in the prediction quality.If the user visits at a new state, surely there will be no useful states to predict.So we define predictive failure as the proportion of no prediction results.

Baseline Models.
To evaluate our fallback socialtemporal-Hierarchic Markov model and other prediction model, we choose four baseline models, with detailed descriptions below:  = (  ( 1 ),   ( 2 ), . . .,   (  )) is the set of visiting history and   ( −1 ) and   (  ) are individual's last visit location and next visit location.

Predicting Mobility by Historical
Movements.We compare the performance of the TM, THM, and FTHM model with the performance of baselines by prediction accuracy.
Figure 1 shows the comparison results at 9 parts.MFD performs the worst with the average accuracy of 15%, while FTHM obtains the best performance after T 3 with an average accuracy of 30%, followed by THM.
The prediction accuracy of all temporal-based location prediction models (MFD, MFH, and MFHD) decreases greatly as time goes by.This is because as time goes by, the models cannot distinguish which check-in locations are more important at corresponding time slices in the long history.However, our models and Markov model are not affected by the length of the history as much as temporal-based location prediction methods.
Our models and Markov model perform stably and even an increasing trend in accuracy.However, the difference between Markov and TM model is smaller.This is due to the fine spatial resolution of Gowalla: the number of checkin locations on both weekdays and weekends is relatively small.As FTHM model fully considers the time stamp and time hierarchy, it performs the best and gives a 5% relative improvement over Markov model, which suggests the importance of temporal information.Figures 2 and 3 show the performance of various social models by considering the prediction accuracy and prediction failure.We can see FSTHM model performs best with an average accuracy of 33% and an average failure of 13%.FSTHM model provides considerable improvement over FTHM model-about a 4% relative improvement for the prediction accuracy and about a 4.5% relative improvement for the prediction failure.The social FTHM models based on cosine similarity and mutual information make similar prediction performance and perform worse than FTHM-MCS model.That is because the modified cross-sample entropy can be more effective in measuring two users' mobility similarity.Moreover, as the division of user's social ties on weekdays and weekends, the FSTHM model gives a 1% improvement over FTHM-MCS model for both prediction accuracy and prediction failure.

Adjusting the Weight between Historical and Social Ties.
To investigate the contribution of social ties and historical ties in affecting user's behavior, we increase the parameter  from 0 to 1 with an increment step of 0.1 and observe the prediction performance at each .We only show the prediction accuracy at parts T 4 , T 7 , and T 9 in Figure 4, since the similar performance can be observed at other parts.When  = 0, FSTHM model just considers social ties.Its performance is always worst, suggesting that considering social information only is not enough to capture the individual's behaviors.By increasing , the performance first increases to the peak and then decreases.When  = 1, FSTHM model converts to the FTHM model, which only considers the historical movements.Its performance is not the best, which indicates the social ties are also important and beneficial to the location prediction.

Conclusions
In this paper, a fallback social-temporal-hierarchic Markov model (FSTHM) is proposed to predict the future location.Firstly, we divide the time into two different discrete time slice, weekdays and weekends, and the state of original Markov is assigned as two different states.Secondly, the distributions of the visit times at each state are captured to describe the individual's movement habit and a Bayes' rule is used to predict the next state according to the temporal and spatial information.Finally, we introduce the modified cross-sample entropy to measure the mobility similarities between two different users on both weekdays and weekends.The FSTHM model integrates the historical and social ties to obtain the final prediction result.The experiments based on Gowalla dataset show the FSTHM model gives a 9% improvement over the traditional Markov model.In the future, we will consider the influence of the real-time social ties.

Figure 3 :
Figure 3: The prediction failure comparison of social models.

Figure 4 :
Figure 4: The performance changes with .
Due to the difference of user's mobility regularity between weekdays and weekends, time is decomposed into two different discrete time slices: weekdays and weekends.Thus, each state from the original Markov model will be represented by two different states in the new Markov model (temporal Markov model-TM) and the new states set is  = { 1,weekdays ,  1,weekends , . . .,  ,weekdays ,  ,weekends, }.If we take user's home for example, which is state  1 in original Markov model, it will be represented by the states  1,weekdays ,  1,weekends .or take 40 minutes to get to his home H.According to above description, the TM model is built as  W,weekdays →  R,weekdays / H,weekdays .The transition probabilities are defined as the frequency between location W to R and S and the result for each prediction is the same if the model does not update in time.However, when considering the possible visit time, we will find user's next location is R if we estimate A's next visit time is 12:10 or 5:10 and he is more likely to go home at about 17:40.Therefore, based on the temporal Markov model, each state will be associated with the possible visit times and the temporal-Hierarchic Markov model (THM) could be represented as  W,weekdays →  R,weekdays (12:10, 5:10)/ H,weekdays (5:40).Given user's temporal-Hierarchic Markov model, a context of the latest visit location and the time of the next visit, the location prediction problem can be described as finding the probability 2.1.Online Location-Based Social Networks.For a dataset of the online location-based social network, we define  = { 1 ,  2 , ...,   } as the set of users and  = { 1 ,  2 , ...,   } as the set of locations where  and  are the numbers of users and locations, respectively.Each check-in action is represented as a tuple ⟨  ,   ,   ⟩ ∈ (  ), indicating user   ∈  checks in at location   ∈  at time   , where (  ) is user   's observed check-in set.Let   () ∈  denote the geographic location of user  at time ,  ,⟨ 1 ,  ⟩ = (  ( 1 ),   ( 2 ), ...,   (  )) (  (  ) ∈  and  1 ≤   ≤   ) be the observed historical location sequence of  between time  1 and   .()denotesu'ssocialfriends and let  ,⟨ 1 ,  ⟩ = {   ,⟨ 1 ,  ⟩ |   ∈ ()} be the observed location sequences of u's friends between time  1 and   .2.2.Temporal Markov Model.Markov model is a probabilistic automation in which states represent individual's historical locations and the transition matrix describes the probabilities of particular transitions between states.Given a series of historical visits, the locations  = { 1 ,  2 , ...,   } correspond to the states  = { 1 ,  2 , ...,   } of Markov model and the movements between locations could be used to calculate the transition matrix between states.2.3.Temporal-Hierarchic Markov Model.Except for the discrete time slices, this paper also considers continuous time constraints when individual visits at the locations.For example, on weekdays, user A usually walks 10 minutes to restaurant R for lunch from his office W at 12 o' clock.After getting off work at 17:00, he may continue to get dinner at R