A Novel Contextual Information Recommendation Model and Its Application in e-Commerce Customer Satisfaction Management



Introduction
With the development of mobile commerce, the Internet of Things, social networking, and ubiquitous computing, the boundaries of the supply chain have been greatly expanded. We have entered a business information era of "big data," which includes user data, resource data, and transaction data [1]. Because people differ in age, gender, occupation, education, background, and interests, what they need from the same information and which aspects interest them also differ [2,3]. Currently, most e-commerce sites either return the same results to every user or predict user interest with low accuracy. They face the problem that "though people access massive amounts of information, they have difficulty obtaining useful information," which harms customer satisfaction [4][5][6].
There are many approaches to enhancing customer satisfaction, such as better product quality and appropriate customer service. A personalized information recommender system can meet these individual needs. It provides information services either in response to an explicit user request or proactively, by analyzing users' personality and habits to offer information they may require. Thus, personalized information recommender systems are gradually becoming an effective means of easing "information overload" and, eventually, of satisfying customers [7,8]. Currently, recommender systems mainly use content-based algorithms [9], collaborative filtering-based algorithms [10], and hybrid algorithms [11]. Among these methods, the collaborative filtering (CF) recommendation algorithm performs well and has lower implementation and maintenance costs. Thus, CF has attracted much research and many practical applications [12][13][14]. But as research and applications have deepened, problems have emerged in existing methods, such as poor accuracy and reliability in acquiring the information needs of individual users and low adaptability of information matching [15,16]. Also, in the current business environment of social networks, users' interests and needs change frequently with contexts related to time and space [5]. However, some recommender systems focus on the relationship between "users and items" while giving less consideration to the contextual environment (such as time, location, surrounding people, mood, activity status, and network conditions). In many application scenarios, relying solely on the "user-item" dual relationship cannot generate accurate recommendations. Some systems focus only on the recommendation generation process and ignore the process of extracting the user's interest. That is, they assume that plentiful explicit user interest information, including contextual information, is already available. In practical applications, this type of contextual user interest is often difficult to obtain, or only a very small amount can be obtained, which makes current systems hard to put into practice [17,18].
The issues discussed above show that users' personalized information needs are context-sensitive during information acquisition, which is a major cause of these problems. A context is sensitive when it has a stronger impact on user interest. It is necessary to provide information services that accurately and efficiently match the user's current context and meet user preferences; we call this contextual information recommendation [19]. Scholars have already introduced context into recommender systems [20][21][22][23] and have begun to study the definition, identification, and modeling of context. But contextual recommendation still has many open problems, such as the extraction of contextual user preferences and the sparsity of high-dimensional data [5,24]. Meanwhile, the literature mostly studies the influence of a single context factor on the recommender system [25,26]; in fact, various context factors affect the recommender system, and considering a variety of context factors can yield better recommendation results [27,28]. On the other hand, obtaining user needs and providing personalized service is in fact a process of helping people make decisions. Cognitive ability refers to the human brain's capacity to process, store, and extract information. There is a close link between acquiring user needs and individualized decision-making [14,29]. Distributed cognition theory tells us that various types of context information affect user decision-making. Therefore, to address these problems in contextual recommendation, this paper proposes a novel contextual information recommendation model based on distributed cognition theory. Firstly, to address the problem of user interest extraction under complex contexts and to extract more comprehensive and accurate user interest, distributed cognition theory is introduced into the modeling of user interest. Introducing distributed cognition theory emphasizes the cognitive differences among environmental factors and how these differentiated contexts affect each level of the recommender, rather than giving every context factor equal weight in its influence on user interest. Secondly, on the basis of the extracted sensitive contexts, this paper proposes a novel collaborative filtering recommendation algorithm that integrates context similarity. It combines user similarity with context similarity over the sensitive contexts to complete the recommendation task. Experiments on an e-commerce data set and a benchmark data set indicate that the new model outperforms the compared methods.
The remainder of the paper is organized as follows. Section 2 discusses the theoretical background and related research on recommender systems. Section 3 describes user interest and the modeling of its information structure within context. Section 4 proposes and illustrates the contextual information recommendation model based on distributed cognition theory, including the user interest extraction algorithm and a new collaborative filtering recommendation algorithm. In Section 5, we discuss the results of our empirical experiments on the Moviepilot-mp.mood data set and the e-commerce data set. We then summarize the main characteristics and advantages of the proposed model. Finally, we offer concluding remarks and describe future directions of our research.

Theoretical Background and Related Work
2.1. Collaborative Filtering Recommendation Method. A recommender system based on collaborative filtering relies on user ratings to build a user-item rating matrix. The method uses statistical techniques to find neighbor users who have the same or similar interest preferences (e.g., similar ratings or similar purchases across commodities). It then uses the neighbors' ratings of items to predict the target user's ratings for new items. Finally, it sorts the items in descending order of predicted rating and recommends the top N items to the target user (top-N recommendation) [2]. Another advantage is that it places no special requirements on the recommended objects and can handle unstructured complex objects, such as music and movies. Meanwhile, modeling the relationships between users requires a large amount of historical data on user behavior, an area that intersects with social network research. Thus, there is a rich research foundation and broad prospects. Early research on collaborative filtering includes the Tapestry system, GroupLens, and the Jester system [2]. Current techniques are divided into user-based, item-based, and model-based collaborative filtering [5]. Recently, it has become more evident that user interests and needs change with the context of time and space, whereas traditional collaborative filtering does not consider the effects of context factors on the user's preferences [3].
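The user-based variant of this procedure can be illustrated with a minimal sketch. It assumes cosine similarity over co-rated items and a similarity-weighted average for prediction; the function names, the toy ratings, and the parameters `k` and `n` are illustrative choices, not taken from any of the cited systems.

```python
import math

def cosine(a, b):
    """Cosine similarity restricted to the items both users have rated."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    dot = sum(a[i] * b[i] for i in common)
    norm_a = math.sqrt(sum(a[i] ** 2 for i in common))
    norm_b = math.sqrt(sum(b[i] ** 2 for i in common))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def top_n(ratings, target, k=2, n=2):
    """Return the n unseen items with the highest predicted rating for target."""
    # 1. Find the k most similar neighbor users.
    neighbors = sorted(
        ((cosine(ratings[target], r), u) for u, r in ratings.items() if u != target),
        reverse=True,
    )[:k]
    # 2. Predict ratings for unseen items as a similarity-weighted average.
    num, den = {}, {}
    for sim, user in neighbors:
        if sim <= 0:
            continue
        for item, rating in ratings[user].items():
            if item in ratings[target]:
                continue
            num[item] = num.get(item, 0.0) + sim * rating
            den[item] = den.get(item, 0.0) + sim
    predicted = {item: num[item] / den[item] for item in num}
    # 3. Sort in descending order of predicted rating (ties broken by item id).
    return sorted(predicted, key=lambda i: (-predicted[i], i))[:n]

# Hypothetical toy data: user -> {item: rating}.
ratings = {
    "alice": {"a": 5, "b": 3},
    "bob": {"a": 5, "b": 3, "c": 4, "d": 2},
    "carol": {"a": 1, "b": 5, "c": 2, "d": 1},
}
recommended = top_n(ratings, "alice", k=2, n=2)
```

In this toy example, "bob" rates the shared items exactly as "alice" does, so his ratings dominate the prediction and item "c" is ranked ahead of item "d".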

2.2. Distributed Cognition Theory. Distributed cognition theory concerns not only cognition distributed within the individual but also, with more emphasis, cognition distributed across a variety of external factors (environment, media, culture, society, time, etc.). The theory describes the distributed and differentiated cognition of various external elements, which are important to cognitive activity as a whole. In the broad sense, cognitive psychology includes structural psychology, psychologism, and information processing [28]. In the 1990s, Hutchins proposed the concept of "distributed cognition," in which the nature of cognition is distributed. Cognitive phenomena include not only the cognitive activities that occur in the minds of individuals but also the external environment, media, culture, society, time, and so forth. In follow-up studies, Hutchins's team has been committed to applying distributed cognition theory, using it as a research framework or theoretical basis for analyzing human-computer interaction (HCI) and computer supported cooperative work (CSCW) [30,31].
From the related work, we can draw three conclusions. (1) Existing user interest extraction techniques are mainly based on traditional machine learning and data mining, with little regard for the impact of context on users' information needs. That is, they ignore the inherent relevance between the level of demand for information content and user interest under different contexts. (2) Some user interest extraction techniques do consider context, but they consider only the effect of different instances (morning, afternoon, etc.) of a single context type (such as time) on users' interests, while ignoring the distributed and differentiated impact of the various context types on user needs in a recommendation service. Consequently, little research has detected and quantified the sensitive contexts and the priority order of their impact on user interest.
(3) Existing recommendation algorithms do not adequately consider the impact of contextual factors on the recommendation process. Although some do, most fail to integrate context effectively with existing methods, and they mostly suffer from low prediction accuracy or poor coverage. It is therefore difficult for companies to put them into practice, and the contribution of this paper is to address this problem.

Description of User Interest and Information Structure Modeling
Part of the user interest contexts is described as follows.

User Context Information
(1) Regional culture: regions differ not only in total online retail sales, per capita GDP, per capita consumption expenditure, network penetration, and so forth, but also in the cultural backgrounds and lifestyles of their users; moreover, some consumer behaviors have local characteristics. For instance, "Shanghai culture," "Beijing culture," and "Guangzhou culture" all affect individual consumer behavior. To facilitate quantitative analysis, this paper characterizes culture as a geographical feature point. Different regions may have similar or different shopping interests due to local customs or cultural themes.
(2) Gender: there is a great difference in interests between men and women. Women are generally more interested in cosmetics, garments, and accessories, while men are more interested in digital electronics products.
(3) Age: people of different ages are interested in different commodity groups. Young people generally focus on fashion, while older people tend to focus on practicality, and interests shift gradually with age. For example, after students leave campus and enter society, their interests change.
(4) Marriage: marital status also makes a user's interest drift. Unmarried people determine their interests by individual needs. Married people direct their interests toward the whole family, and after marriage a person pays more attention to children's goods.
(5) Education: people with different educational backgrounds have different requirements for commodities. People with a higher educational background have a more comprehensive understanding of online shopping and of their requirements.
(6) Income: income directly affects the quality that users require of the products they are interested in. People with high incomes generally have high quality requirements, and as personal income increases, a person pays more attention to the quality of goods.

User Behavior Information.
A user's interest is often reflected in their behavior, and behavior shows a certain tendency when the user is interested in something. The user's needs and interests are recorded in their behavior information. Therefore, we can deduce the true interest of the user by analyzing searching behavior, browsing behavior, purchase records, and other information. In this paper, user behavior information includes the following aspects.
(1) User's Search Keywords. To a certain extent, search keywords reflect the user's current interest precisely and show that the user has a strong desire to obtain the concept. But it is difficult for users to describe their interest completely and clearly, so the keywords need to be adjusted to match the concepts in the knowledge base in order to describe the user's interest more accurately.
(2) User's Purchase Records. Commodities that a user has purchased reflect the user's previous interest; this interest may be long-term or only temporary. Using knowledge rules, we can identify the user's long-term interest themes or the topics that influence the user's interest in the near term.
(3) User's History Browsing Behavior. A user's browsing behavior also reflects, to some extent, the user's current interest. It may include the subject of the browsed page, the number of visits to a product category, the residence time on a page, adding to favorites, dragging the scroll bar, bookmarking, copying, clicking on links, and so on.

Modeling
Each uhb is a behavior variable expressed by one or more nonnegative real numbers, such as search keywords, the topic of a page of interest, the time spent browsing the page, adding a bookmark, saving the page, dragging the scroll bar, and clicking on a link; together these compose the behavior vector UBH = {Keywords, Topics, Times, Book, Save page, Scroll, Links}.
Definition 5. The context information set of user history behavior, UHBC, is the collection of user history behavior information under contexts, where $c_k \in C$ is a multidimensional instance of user context. Each uhbc is a behavior variable expressed by one or more nonnegative real numbers, such as the goods a user purchased at a certain time (Product), the purchase time (BTime), and the purchase intention (Intention), which compose the behavior vector UHBC = {Product, BTime, Intention}.
Each interest value ui is a real number. UI is defined as the set of interests in different services under different contexts. A larger value of ui indicates a higher preference, where the context is an instance of the user's combined context.
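The behavior vectors and the interest set defined above map naturally onto simple record types. The sketch below is one possible encoding: the field names mirror the components listed in the text, while the class names, the field types, the unit of `times`, and the representation of a context instance as a tuple of (type, instance) pairs are assumptions made only for illustration.

```python
from dataclasses import dataclass, field

@dataclass
class BehaviorVector:
    """UBH: browsing-side behavior variables named in the text."""
    keywords: list = field(default_factory=list)
    topics: list = field(default_factory=list)
    times: float = 0.0   # browsing time; the unit (seconds) is an assumption
    book: int = 0        # bookmarks added
    save_page: int = 0
    scroll: int = 0
    links: int = 0

@dataclass(frozen=True)
class ContextualPurchase:
    """UHBC: one purchase record together with the context it occurred in."""
    product: str
    btime: str
    intention: str
    context: tuple       # e.g. (("time", "weekend"), ("device", "phone"))

class InterestSet:
    """UI: interest values keyed by (user, service, context instance)."""

    def __init__(self):
        self._values = {}

    def set(self, user, service, context, value):
        self._values[(user, service, context)] = float(value)

    def preferred(self, user, context):
        """Service with the largest interest value for this user and context."""
        candidates = {
            s: v for (u, s, c), v in self._values.items()
            if u == user and c == context
        }
        return max(candidates, key=candidates.get) if candidates else None

# Hypothetical usage: a larger value indicates a higher preference.
weekend_phone = (("time", "weekend"), ("device", "phone"))
ui = InterestSet()
ui.set("u1", "s1", weekend_phone, 0.3)
ui.set("u1", "s2", weekend_phone, 0.9)
best = ui.preferred("u1", weekend_phone)
```

Keying the interest set by the full context instance keeps interests for the same user and service separate across contexts, which is the property the later steps rely on.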

Contextual Information Recommendation Model Based on Distributed Cognition Theory
This paper proposes a contextual information recommendation model based on distributed cognition theory. The basic idea, inspired by distributed cognition theory, is to address the existing problems by studying hierarchical user needs, the distributed cognition of context-sensitive information, and differential calculation, so as to extract user preferences step by step and precisely. Firstly, because a user's preference is affected to some extent by internal and external contexts, the model performs distributed cognition of the context factors and analyzes the differential influence on user preferences of the various effective context types and their instances. The model then extracts the user preferences based on the cognition of context-sensitivity. Secondly, the model introduces user context information into the collaborative filtering recommendation process and calculates the similarity among the contexts in the context-sensitive set. It then constructs the set of contexts similar to the target user's current context and designs a novel collaborative filtering recommendation algorithm that integrates contextual similarity. The detailed model is shown in Figure 1.
Step 1. Calculate the average value of the behavior variable over a specific instance of a context type and call it the single-dimension context of user historical behavior (uhbc), where the average is taken over the user historical behavior contexts that contain that context instance.

Extraction Algorithm of User Interest Based on Distributed Cognition Theory
Step 2. Calculate the variance of the behavior variable of user $u_i$ for service $s_j$ under the single-dimension context $C_k$, and obtain the concept set of sensitive contextual cognition $(C_{\mathrm{sensitivity}})_{ij}$. In the variance, the expectation term is the expected value of the behavior of $u_i$ for $s_j$ under the context $C_k$, and the probability term is the probability of interaction between $u_i$ and $s_j$ under a given context instance of $C_k$ (relative to the other instances of $C_k$).
Step 3. Calculate the impact extent of each sensitive context type $C_k$ to detect the distributed cognition of the sensitive contexts, where the number of sensitive context types does not exceed the total number of context types.
The specific process of the algorithm is divided into the following steps.
Step 1. Use the DCUIE algorithm to extract the context-sensitive set $(C_{\mathrm{sensitivity}})_{ij}$, and then obtain the preference data table $(\mathrm{UI}_{ij})^{c}$ of the target user ($u$) from the selected table of key attributes, user behaviors, and user ratings. CFRCS then constructs the two-dimensional context-service preference matrix.
Step 2. Use the method in [5] to calculate the similarity among the contexts in $(C_{\mathrm{sensitivity}})_{ij}$. The calculation runs over the set of services that user $u$ has rated under both context $c_x$ and context $c_y$ ($c_x, c_y \in (C_{\mathrm{sensitivity}})_{ij}$). The rating of $u$ for service $s$ under context $c_x$ is expressed as $r_{(u,s,c_x)}$, and the average rating of $u$ under context $c_x$ is expressed as $\bar{r}_{u,c_x}$.
Step 3. Calculate the similarity among users under the various contexts to construct the user similarity matrix. This similarity matrix is three-dimensional, and each element $\mathrm{Sim}(u, v, c)$ represents the cosine similarity between user $u$ and user $v$ under context $c$.
Step 5. According to the nearest neighbor set of the target user and the multidimensional user interest $(\mathrm{UI}_{ij})^{c}$ under the sensitive contexts, generate the recommendation: the top-N services with the maximum preference are returned to the target user $u$.
User $u$ has a prediction score $p_{u,s}$ for service $s$. $p_{u,s}$ can be obtained by calculating a weighted average of the scores for service $s$ given by the users in the nearest neighbor set of $u$. The specific formula is as follows:
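The final prediction step can be sketched in a few lines. Because the exact weighting is not shown above, the sketch assumes that each neighbor's score is weighted by that neighbor's similarity to the target user and that the weights are normalized by their sum; the function names and the toy data are hypothetical.

```python
def predict(neighbors, ratings, service):
    """Similarity-weighted average of the neighbors' scores for one service.

    neighbors: {user: similarity to the target user under the current context}
    ratings:   {user: {service: score}}
    Returns None when no neighbor has scored the service.
    """
    num = den = 0.0
    for user, sim in neighbors.items():
        score = ratings.get(user, {}).get(service)
        if score is None or sim <= 0:
            continue
        num += sim * score
        den += sim
    return num / den if den else None

def recommend(neighbors, ratings, seen, n):
    """Top-n services, not yet used by the target user, by predicted score."""
    candidates = {s for u in neighbors for s in ratings.get(u, {})} - set(seen)
    scored = {s: predict(neighbors, ratings, s) for s in candidates}
    scored = {s: p for s, p in scored.items() if p is not None}
    return sorted(scored, key=lambda s: (-scored[s], s))[:n]

# Hypothetical neighbor set and scores for a target user in one context.
neighbors = {"v": 0.8, "w": 0.2}
ratings = {"v": {"s1": 4, "s2": 2}, "w": {"s1": 2, "s3": 5}}
score_s1 = predict(neighbors, ratings, "s1")
top = recommend(neighbors, ratings, seen={"s2"}, n=2)
```

Normalizing by the sum of the similarities keeps the prediction on the same scale as the original scores; other normalizations (for example, mean-centered scores) are equally plausible readings of "weighted average."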

Experiment Results and Discussion
Experimental hardware and software environment: the computers have an Intel Dual Core 2.8 GHz CPU, 4 GB of RAM, and the Windows 7 operating system. The development environment is JDK 1.7, Eclipse 3.7, and Matlab 7, with a MySQL 5.5 database. The data comprise a benchmark data set and a real-world e-commerce data set. The e-commerce data set was obtained from a B2C platform and is described as follows:

Description of Data Set and Evaluation
(1) User data set: 500 users, whose attributes include user identity, gender, age, occupation, personality, education, income, and so forth. (2) Network services data set: 100 kinds of services, whose attributes include service ID, service price, quality of service, type of service, and so forth. (3) Context data set: 5 kinds of context, namely, time, device, geography, emotion, and purpose. Time is divided into quarter, month, week, and day, with weekdays distinguished from weekends; geography is divided into north, east, south, northeast, central, northwest, and southwest; device is divided into PDA, mobile phone, and computer; emotion is divided into negative emotions, positive emotions, and other emotions; purpose is divided into self-use and gifts. (4) Context data set of user historical behavior: 8600 records (collected over 30 days; the user behavior variable takes the time spent on the same service of interest as the key factor). (5) User behavior data set: a user-service behavior matrix (500 × 100) composed of the values 0 and 1.
A service that the user has used is marked as 1, and a service that the user has not used is marked as 0. The evaluation criteria are P@N and the mean absolute error (MAE). P@N ranks services by user interest and recommends the top-N services, which are then compared with the top-N services in the test data set:
$$P@N = \frac{\text{Top-}N}{N},$$
where Top-N is the number of top-N services that appear in both the recommended service set and the test data set.
MAE measures prediction accuracy by calculating the deviation between the predicted value and the actual value. A smaller deviation indicates higher accuracy.
Suppose that the set of predicted scores is $(p_1, p_2, \ldots, p_n)$ and the corresponding set of actual ratings is $(q_1, q_2, \ldots, q_n)$; then
$$\mathrm{MAE} = \frac{1}{n} \sum_{i=1}^{n} |p_i - q_i|.$$
Furthermore, the evaluation also uses DOA (degree of agreement).
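The three evaluation measures named above can be sketched as follows. P@N and MAE follow their usual definitions. DOA is written in its common pairwise form, the share of (test item, unseen item) pairs in which the test item is ranked at least as high, and it compares predicted scores rather than list positions, which is an assumption of this sketch. All function names and sample values are illustrative.

```python
def precision_at_n(recommended, relevant, n):
    """Share of the top-n recommended services that also occur in `relevant`."""
    hits = len(set(recommended[:n]) & set(relevant))
    return hits / n

def mae(predicted, actual):
    """Mean absolute deviation between predicted and actual scores."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    return sum(abs(p - q) for p, q in zip(predicted, actual)) / len(predicted)

def doa(scores, test_items, unseen_items):
    """Degree of agreement for one user.

    Counts the (test, unseen) pairs where the test item's predicted score is
    at least the unseen item's score, divided by the number of pairs.
    """
    pairs = len(test_items) * len(unseen_items)
    if pairs == 0:
        return None
    ordered = sum(
        1 for t in test_items for w in unseen_items if scores[t] >= scores[w]
    )
    return ordered / pairs

# Hypothetical sample values.
p_at_2 = precision_at_n(["a", "b", "c"], ["b", "c", "d"], 2)
error = mae([4, 3, 5], [5, 3, 3])
agreement = doa({"a": 0.9, "b": 0.2, "x": 0.5, "y": 0.1}, ["a", "b"], ["x", "y"])
```

Averaging `doa` over all users gives the overall figure used in the comparison tables; users for whom it returns `None` would need to be skipped in that average.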
Here, predict rank denotes the predicted position of an item in the recommended list. The DOA value of a random prediction is about 50%, while DOA is 100% when all of the predictions are sorted correctly. This paper adopts the DOA averaged over all users as the overall evaluation measure. Figure 2 shows that, when the context-sensitivity threshold is 0.6, the overall accuracy of DCUIE is better than that of the random service selection method without personalized services. This indicates that introducing a user interest extraction method based on the distributed cognition model is meaningful. Figure 3 shows the P@N comparison between MCUIE and DCUIE when the threshold is 1.2. With P@N as the evaluation criterion, DCUIE outperforms MCUIE. This indicates that discriminating and quantifying the impact of the various contexts and their instances on user preference helps improve the extraction accuracy of contextual user preferences. Meanwhile, the value of the threshold affects the accuracy of DCUIE. To some extent, this is because a user's preference for certain services is affected more by the contexts of location and activity and less by time, device, and surrounding people, and vice versa. That is, not all contexts can be treated as sensitive, nor can they be considered to have equal weight. The analysis results verify the practicality and validity of the theories and methods established in this paper.

Experimental Results Analysis of User Interest Extraction
Experiments with DCUIE on the e-commerce data set show that users' interest is affected by context to a certain extent. Different context factors can lead to differences in user interest topics; for example, users with different cultural backgrounds have different lifestyles, and some consumer behaviors have local characteristics. Someone may "like" a certain kind of entertainment, but his interest value for this service may be low under the combined context instance of "office" and "work." The results also show that contexts such as age, gender, income, geography, time, and purpose are the more important factors influencing consumers' interests. For example, the rules indicate that those who focus on "brand" are mostly high-income young men, although their proportion of online shopping is not high. Those most concerned about "cost" are highly educated people at the middle-income level, especially married women. Regional contexts, such as per capita GDP, network penetration rate (including mobile and Internet), and per capita consumption expenditure, also affect consumer interest and shopping behavior. Examples of interest rules are shown in Table 1, where some of the rules have been merged.

Experimental Results Analysis of Contextual Recommendation.
This paper selects the optimal context-sensitivity threshold, based on distributed cognition theory, in order to improve the accuracy of contextual recommendation by filtering out noise contexts. To this end, the e-commerce data set is divided into several training-testing data pairs, and DOA is compared under different threshold values, as shown in Table 2. The results show that the threshold affects the ranking accuracy of CFRCS on the different testing data sets. When the threshold is 0, all contexts are considered sensitive, and CFRCS produces poorer ranking results. When the threshold is 1.2, CFRCS achieves the best DOA. However, when the threshold increases beyond 1.2, the number of sensitive contexts is reduced and DOA falls again. On the other hand, the choice of data pair also has a considerable effect on DOA. On the whole, the 80%-20% split is the best data pair for CFRCS.
This paper then varies the number of neighbors on the e-commerce data set and analyzes the association between CFRCS and context influence. Table 2 shows that the accuracy of CFRCS is best when the context-sensitivity threshold is 1.2. Therefore, the threshold is set to 1.2 to compare the MAE of CFRCS, the collaborative recommendation algorithm based on context prefiltering (CCF), and the traditional collaborative filtering algorithm that does not consider context (TCF). The MAE values are averages obtained from three training sets and are plotted on the ordinate in Figure 4. The results show that CFRCS has better recommendation quality than CCF and TCF for different numbers of neighbors and that it achieves its best recommendation quality with 30 neighbors. Thus, with an appropriately selected context-sensitivity threshold, CFRCS achieves better recommendation quality. The method makes the measurement of context similarity more flexible so that it can adapt to dynamic context changes, which in turn change service needs.
Overall, the experimental validation shows that DCUIE helps contextual recommendation achieve higher accuracy than traditional methods. Therefore, based on context-sensitivity, more sophisticated and focused user similarity measures can be integrated into the collaborative filtering recommendation algorithm.
To further verify the score prediction performance of CFRCS, this paper runs numerical experiments on the Moviepilot-mp.mood movie rating data set and selects kNN-100, W-SlopeOne, and CCF for comparison. CFRCS first extracts the context-sensitive set, such as age, occupation, mood, time, and location, by DCUIE and then calls the CFRCS algorithm to make recommendations. CFRCS obtains the maximum DOA value on three different data sets compared with the other three recommendation algorithms, as shown in Table 3. It also obtains the minimum MAE value compared with the other two algorithms. Specifically, at the setting of 600, CFRCS improves on traditional user-based collaborative filtering (TCF) and context pretreatment-based collaborative filtering (CCF) by 10.57% and 18.62%, respectively, as shown in Figure 5. Therefore, CFRCS can significantly improve the quality of recommender systems and is more suitable for personalized recommendation in future mobile business environments.

Conclusion
Personalized information recommendation is an effective means of easing the "Internet information overload" problem and improving customer satisfaction, and it has both theoretical significance and practical value. This paper builds a novel contextual information recommendation model based on distributed cognition theory after modeling the information structure of user interest. Firstly, it proposes a user interest extraction method based on distributed cognition theory. After studying hierarchical user needs and the distributed and differentiated contexts, the paper analyzes the impact of various factors on interest extraction and obtains precise interest step by step. Secondly, it proposes a method for calculating the similarity of contexts and of contextual users, which handles user interest on the basis of differentiated context-sensitivity values, and it uses this method to improve the collaborative filtering algorithm. The results show that the model can effectively mine user behavior data in context. It also identifies how strongly different contexts and their instances affect users' interests and reduces the negative impact of noise contexts, thereby improving the accuracy of the contextual information recommendation service. Future work mainly includes studying the dependency relationships among different types of context and their impact on user interest, alleviating the sparsity of multidimensional contextual user interest, and addressing contextual recommendation under user interest drift.

Figure 3: Comparison of experimental results between MCUIE and DCUIE.

Figure 4: MAE of three algorithms in an e-commerce data set.

Figure 5: MAE of three algorithms on the standard data set.
The user ($u$) has the following features on the Internet: the user registers on the site and has a unique access account; provides personal background information; and accesses the website to browse, purchase, and perform other operations. The set of all users is defined as $U = \{u_1, u_2, \ldots, u_n\}$, where $n$ represents the number of users. Any user $u_i = \{a_j \mid j \in [1, p]\}$ is defined as a set of user attributes, where $p$ represents the number of types of user attributes. The recommended service set ($S$) is defined as $S = \{s_j \mid j \in [1, m]\}$, where $m$ represents the number of services, and $s_j = \{b_k \mid k \in [1, q]\}$ is defined as a collection of service attributes, where $q$ represents the number of types of service attributes. A context object is any object associated with the recommendation process, including user attributes and external factors. A nonempty set of properties $C = C_1 \times C_2 \times \cdots \times C_t$ is used to describe the characteristics of the object. This set is expressed as the Cartesian product of the single-dimension context types $C_k$, where $C_k = \{c_k^1, c_k^2, \ldots, c_k^z\}$, $k \in [1, t]$, and $z$ is the number of specific instances of a context type.
User Interest Based on Distributed Cognition Theory. After analyzing distributed cognition theory and the user interest model discussed previously, this section first defines the concept set of context-sensitive cognition, detects context-sensitivity, and quantifies the contexts by using distributed cognition theory. Secondly, the algorithm takes user historical behavior data under a single-dimension context as the data source in order to extract single-dimension contextual user interest. Finally, the algorithm integrates the distributed influence factors to extract multidimensional contextual user interest based on context-sensitive cognition.
$(C_{\mathrm{sensitivity}})_{ij}$ represents the set of context types that have a significant impact on users' interests, and a context-sensitivity threshold decides membership in this set. The impact factor of a sensitive context type $C_k$ represents its degree of influence, that is, the extent to which the interest of user $u_i$ in service $s_j$ is affected by $C_k$. $(\mathrm{ui}_{ij})^{c_k^l}$ represents the interest of $u_i$ in $s_j$ under the single-dimension context instance $c_k^l$. $(\mathrm{UI}_{ij})^{c}$ represents the interest based on context-sensitive cognition, namely, the interest of $u_i$ in $s_j$ under the multidimensional context instance $c$. Detection of context-sensitive cognition applies distributed cognition to the various contexts that may influence the extraction of user interest. It takes the user historical behavior data under a single-dimension context $C_k$ as the data source in order to analyze how user interest fluctuates under the influence of this context. If the measured fluctuation (the variance of the behavior variable) is not smaller than the threshold value, then the interest of user $u_i$ in service $s_j$ is affected by $C_k$; otherwise, it is not significantly affected by $C_k$. The distributed cognition theory-based algorithm for user interest elicitation (DCUIE) is described as follows: input: the user, service, and context sets; output: $(\mathrm{UI}_{ij})^{c}$ and a rule set.

Figure 1: Contextual information recommendation model based on distributed cognition theory.
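The detection rule described above, which keeps a context type when the fluctuation of behavior across its instances reaches the threshold, can be sketched as follows. The sketch assumes a probability-weighted variance over the instances of each context type, with the probability of an instance estimated from interaction counts, and it assumes that impact factors are the retained variances normalized to sum to 1. Both are assumptions of this illustration, as are the function names and the sample data.

```python
def weighted_variance(values, probs):
    """Variance of `values` under the discrete distribution `probs`."""
    mean = sum(p * v for v, p in zip(values, probs))
    return sum(p * (v - mean) ** 2 for v, p in zip(values, probs))

def sensitive_contexts(behavior, threshold):
    """Detect sensitive context types for one (user, service) pair.

    behavior: {context_type: {instance: (mean_behavior_value, interaction_count)}}
    Returns {context_type: impact_factor} for the types whose variance is not
    smaller than `threshold`; the impact factors sum to 1 when any are kept.
    """
    kept = {}
    for ctype, instances in behavior.items():
        total = sum(count for _, count in instances.values())
        if total == 0:
            continue  # no interactions recorded under this context type
        values = [mean for mean, _ in instances.values()]
        probs = [count / total for _, count in instances.values()]
        variance = weighted_variance(values, probs)
        if variance >= threshold:
            kept[ctype] = variance
    norm = sum(kept.values())
    if norm == 0:
        # All retained variances are zero: fall back to equal weights.
        return {ctype: 1 / len(kept) for ctype in kept}
    return {ctype: variance / norm for ctype, variance in kept.items()}

# Hypothetical data: average staying time and interaction count per instance.
behavior = {
    "time": {"weekday": (2.0, 5), "weekend": (6.0, 5)},
    "device": {"phone": (4.0, 5), "computer": (4.2, 5)},
}
strict = sensitive_contexts(behavior, threshold=1.2)
loose = sensitive_contexts(behavior, threshold=0.0)
```

With a threshold of 1.2 only "time" is kept, because staying time barely changes across devices; with a threshold of 0 every context type is treated as sensitive, which corresponds to the setting that the experiments report as performing worse.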
The single-dimension user interest is obtained by the K-means algorithm and takes values in a bounded range:
$$(\mathrm{ui}_{ij})^{c_k^l} = K\text{-Means}\left(\mathrm{uhbc}_{ij}^{c_k^l}\right).$$
Next, calculate the multidimensional user interest based on sensitive-context cognition: the algorithm combines the single-dimension interests $(\mathrm{ui}_{ij})^{c_k^l}$ under the action of the impact factors to obtain $(\mathrm{UI}_{ij})^{c}$. Finally, according to the context-sensitive set $(C_{\mathrm{sensitivity}})_{ij}$ and $(\mathrm{UI}_{ij})^{c}$, the algorithm uses C5.0 for rule extraction and interest analysis, combining the key attributes of the selected user characteristics with the user behavior data.
Step 4. Calculate the single-dimension user interest. The algorithm takes the single-dimension context of user historical behavior (uhbc) as the data source and computes the single-dimension user interest (ui).
4.2. Collaborative Filtering Recommendation Algorithm Incorporating Context Similarity. After the extraction of users' interest, this section describes the inputs, outputs, and detailed process of the collaborative filtering recommendation algorithm incorporating context similarity (CFRCS): input: the three-dimensional "User-Service-Context" model, the target user ($u$) and current context ($c$), and the recommendation service set ($S$); output: the top-N services in the service set ($S$) that the user prefers most under the current context.
In the cosine similarity, the two vectors are the feature vectors of attributes and behaviors of user $u$ and user $v$, respectively, under the same context $c$.
Step 4. Create the nearest neighbor sets of users. A set of nearest neighbors is built for every user in each context. The nearest neighbor set of user $u$ is $N_u = \{N_{u,c_1}, N_{u,c_2}, \ldots, N_{u,c_k}\}$, where $N_{u,c_x}$ is the set of nearest neighbors of user $u$ under the context $c_x$, the size of each set lies between 1 and $\mathrm{Num}(U)$, $\mathrm{Num}(U)$ is the total number of users, and $c_1, c_2, \ldots, c_k$ indicates that the number of contexts is $k$. Users found to be similar through the context similarity and user similarity calculations under context $c_x$ are also incorporated into the set of nearest neighbors. The number of nearest neighbors under each context may be the same or different; for example, it may depend on the scale of the context.
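The per-context neighbor construction in Steps 3 and 4 can be sketched as follows. The sketch assumes each user is represented, within each context, by a numeric feature vector of attributes and behaviors, and that the `k` users with the highest positive cosine similarity form the neighbor set for that context. The function names, the value of `k`, and the toy vectors are illustrative.

```python
import math

def cosine(x, y):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y) if norm_x and norm_y else 0.0

def nearest_neighbors(features, target, k):
    """Build one nearest neighbor list per context for the target user.

    features: {context: {user: feature_vector}}
    Returns {context: [neighbor, ...]} with at most k neighbors per context,
    ordered by decreasing similarity; contexts without the target are skipped.
    """
    result = {}
    for context, users in features.items():
        if target not in users:
            continue
        sims = [
            (cosine(users[target], vec), user)
            for user, vec in users.items()
            if user != target
        ]
        sims.sort(key=lambda pair: (-pair[0], pair[1]))
        result[context] = [user for sim, user in sims[:k] if sim > 0]
    return result

# Hypothetical feature vectors under two contexts.
features = {
    "weekend": {"u": [1, 0, 1], "v": [1, 0, 1], "w": [0, 1, 0]},
    "weekday": {"u": [1, 1, 0], "v": [0, 0, 1], "w": [1, 1, 0]},
}
neighbor_sets = nearest_neighbors(features, "u", k=1)
```

In this toy data the nearest neighbor of "u" is "v" on weekends and "w" on weekdays, which illustrates why a separate neighbor set per context can differ from a single context-free neighbor set.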
Analysis of the training set and testing set shows that users usually rate at least 100 movies and that the target user and other users share no fewer than 10 commonly rated films. Therefore, we select the users who have rated no fewer than 100 films in the training set, which gives 10,051 users, and then randomly select 3,000 of them as the final training set users. After data preprocessing, the training set contains 3,000 users with 1,211,291 rating records on 21,333 movies; the testing data set contains 160 users with 19,506 rating records on 3,396 movies. A score is an integer between 0 and 100 in increments of 5, where 100 means that the user likes the movie most. Since a set of commonly rated movies must be found before the similarity between two users can be calculated, the experiment sets the minimum number of co-rated movies to 10.
$NW_{u_i} = I - (L_{u_i} \cup T_{u_i})$ is the set of items that appear in neither the training set nor the testing set of $u_i$, where $L_{u_i}$ is the set of items scored by $u_i$ in the training set and $T_{u_i}$ is the set of items scored by $u_i$ in the testing set. Consider
$$\mathrm{DOA}_{u_i} = \frac{\sum_{(j \in T_{u_i},\, k \in NW_{u_i})} \mathrm{check\_order}_{u_i}(I_j, I_k)}{|T_{u_i}| \times |NW_{u_i}|},$$
where $\mathrm{check\_order}_{u_i}(I_j, I_k)$ indicates whether the two items are ranked in the correct order.
User Interest Extraction. The DCUIE validation experiment uses the e-commerce data set. The data is divided into two parts: the data of the first 24 days (80%) is used as the training set to extract user interest, and the data of the last 6 days (20%) is used as the testing set to evaluate the accuracy of user interest extraction. The steps are as follows.