Cross-Border E-Commerce Intelligent Information Recommendation System Based on Deep Learning

In order to improve the effect of cross-border e-commerce intelligent information recommendation, this paper applies deep learning to the intelligent information processing and intelligent recommendation of e-commerce and proposes an improved version of the topic model to solve the problem of feature extraction of the text of the recommendation system. In order to deal with translation problems, this paper proposes an end-to-end sequence-to-sequence learning method. In addition, this study uses the long tail theory to excavate the mass commodities in the niche and recommends these products to users as suggestions. Finally, this paper proposes a niche product recommendation algorithm based on the graph search strategy based on the graph model. The experiment shows that the cross-border e-commerce intelligent information recommendation system based on deep learning proposed in this paper has a good recommendation effect and meets the recommendation needs of cross-border e-commerce.


Introduction
is has stimulated offline entities to a certain extent and has formed an integrated online and offline sales strategy. With the increasing scale of online business platforms, recommendation systems have also risen [1] and have become one of the backbone technologies in various fields today. It is also mainly from the perspective of "people-oriented" to make the originally large and complex assembly line more intelligent, which greatly facilitates human beings. erefore, it is favored by many scholars at home and abroad, and more and more researchers invest in this field [2].
Nowadays, traditional foreign trade is in a state of insufficient growth and continued sluggishness. However, cross-border e-commerce has sprung up and has experienced a "blowout" growth, which has become an important driving force for industrial upgrading and new growth in foreign trade under the new normal of the economy [3].
Banks and other financial institutions jointly established a series of norms and mechanisms based on the collection, processing, credit rating, credit supervision, and rewards and punishments related to credit information, thereby effectively enhancing the trust relationship and creditworthiness between transaction entities. Optimize the credit environment of cross-border e-commerce e system's data analysis center converts it into indicators in some way, obtains the credit rating of the transaction subject through system analysis, and provides a basis for system supervision, rewards, and punishments.
is study applies deep learning to the cross-border e-commerce intelligent information recommendation system to overcome the geographical problems of cross-border e-commerce and improve the recommendation effect of cross-border e-commerce.

Related Work
e rule-based recommendation algorithm is mainly based on customer-defined rules for information association and information mining and then recommends and analyzes customers through existing rules. e advantage of this algorithm is simple and fast [4]. e recommendation based on content filtering is to make recommendations by comparing the customer profile with existing resources. e key consideration of this recommended technique is the calculation of similarity. Because most customers are unable to accurately analyze and accurately express their needs, therefore, the rule-based recommendation algorithm has big flaws. e content-based recommendation technology compares the customer's feature description file with the resource file and searches for similar word features. Typical systems include Person-alWebWatcher [5], WebMate [6], and WebACE [7]. e recommendation algorithms of collaborative filtering technology include user-based collaborative filtering. Among them, user-based collaborative filtering is realized by analyzing the similarity of user characteristics and behaviors. e item-based collaborative filtering method analyzes the similarity relationship between items and then provides users with similar items. e model-based collaborative filtering is based on recommendations which are based on the relationship between existing item features and user features [8].
Hybrid filtration technology: since the content based have their own advantages and disadvantages, in many practical applications, these two different recommendation algorithms are combined for recommendation, which is a hybrid recommendation algorithm [9]. Huang et al. [10] use the Bayesian probability framework to combine content-based features and collaborative filtering features for recommendation calculations and achieve better results. Liang and Qin [11] combine the two recommendation techniques in a weighted form, thereby increasing the diversity of recommendation results. Liu et al. [12] combined model-based collaborative filtering and memorybased model for recommendation and then recommended products effectively. Chen [13] applies the ranking learning algorithm to the hybrid recommendation algorithm for model fusion. Generally speaking, such an algorithm structure combines the advantages of content-based recommendation algorithm, but increases the complexity of the algorithm and the running time cost. Recommendation algorithms play a huge role in all areas of human life and work. However, although there are various differences in its algorithms, it must be admitted that each algorithm has its own advantages and applicable characteristics. ere is still no algorithm that is close to perfect because various algorithms have their unavoidable defects to a certain extent.
erefore, how to effectively analyze and integrate these algorithm resources so as to combine their strengths and avoid weaknesses as much as possible and then combine their advantages for recommendation has become one of the optimization directions of recommendation algorithms.
He [14] proposed a novel interpretation interface, which focuses on the integration of extracting characteristic emotions and static attributes from product reviews to help users explore and understand the product space more effectively and then learn more from other customers' experiences and explain the product preferences of target users well. Xu et al. [15] proposed a multitask learning interpretable recommendation algorithm. e algorithm uses tensor decomposition to integrate the user preference modeling in the recommendation task and the modeling of the opinion content in the interpretation task. It can not only predict the target user's preferences, for items can also provide explanations about the characteristics of specific items. Wei et al. [16] propose a recommendation method that uses tags as features and explain to the user why the recommended movie is related to him based on the features. At the same time, user research experiments have been conducted, and the results show that providing feature-based explanations for the recommendation results can help improve the effectiveness of the recommendation results. Hosseini et al. [17] also verified through user research experiments that feature-based explanations are closely related to user satisfaction and trust in recommendations. Yu [18] proposed a general framework based on graph regularization. e framework first extracts aspects from reviews, then models the user-item-aspect relationship as a tripartite graph, which can be used while performing Top-N recommendations, and provides various explanations. Sukrat and Papasratorn [19] developed a phrase-level sentiment analysis toolkit Sentiers1 that can extract user sentiment and product aspects from text reviews on a large scale. is toolkit can not only extract the "aspect-view-emotion" triples from the comments received by the item, for example, extract the "pixel-high-positive" and "carton-obvious-negative" three-tuples from the comments on the mobile phone tuple but also detect the emotion of the aspect word based on the context. For example, "pixel" and "high" indicate positive emotions, while "noise" and "high" indicate negative emotions. In addition, since the program constructs a dictionary of "aspect-view-emotion" triples, the toolkit can also detect the triples contained in the review text. Based on the Sentiers toolkit, Fedirko et al. [20] designed an explicit factor model and proposed to use the "aspect-view"the performance of the item in all aspects.

Intelligent Information Recommendation Algorithm Based on Deep Learning
is paper proposes an improved topic model to solve the problem of feature extraction in the text of the recommendation system. e model is shown in Figure 1, where shaded circles represent observed variables and unshaded circles represent hidden variables. User u comments d u,i ∈ D on item i. ese generated sentences usually focus on a topic, either from the user's preference or from the characteristics of the item. When the topic model is used as a text processor, very good performance can be obtained in the recommendation system. e main reason is that the topic model can simulate the process of manually writing comments, generate simulated comments based on user preferences and item characteristics, filter out some irrelevant content, and enhance the representation of users and items.
Convolutional neural network (CNN) is shown in Figure 2. Different from processing image data because text is a characteristic of one-dimensional data, CNN can only use one-dimensional convolution kernel h × d for convolution operation, that is, d is the dimension of a fixed word vector. Multiple convolution kernels with different values of h can obtain multiple feature maps, where h refers to convolution with h words as a window   When a convolutional neural network is used as a text processor, the model has good performance in text classification and sentiment classification. e main reasons are as follows. e dimension of word vector can be kept within d, which avoids the problem of data sparseness and feature dimension disasters and also reduces the amount of model parameters. Convolutional neural networks can extract h-ary syntactic features. Convolutional neural networks can obtain more effective feature items in the feature by using the pooling layer and filter out certain noise to improve the accurate expression of features. e second layer is the convolutional layer, which contains m convolution kernels K ∈ R h×d . e feature formula extracted by the jth neuron from the text is as follows: where * is the convolution operation, ReLU is the nonlinear activation function, and b j is the bias. e text feature extracted by the jth element using the sliding window of t is z 1 , z 2 , . . . , z n−t+1 j . e third layer is the maximum pooling layer, which can reduce dimensionality and capture the most important features. e formula is (2) e model is shown in Figure 3.
In the recommendation system, the text processor based on the convolutional neural network is adopted by many researchers.
e experimental results prove that the text features it extracts can better express users and items, thereby improving the recommendation performance. e research focus in natural language processing is how to extract rich features from sequence data. For example, in tasks such as translation systems, question answering systems, dialogue systems, and sentiment analysis, recurrent neural networks (RNN) are often used to encode or decode text. Among them, the most common recurrent neural networks for solving sequence problems are the long and short-term memory recurrent neural network. is section will take LSTM as an example to illustrate the process of recurrent neural network processing text.
LSTM proposes to encode sentences according to the order of words. In addition to maintaining the sequence information of the text, it can also solve the problem of longdistance dependence. In addition, when LSTM considers the current word information, it also considers the historical code output by the last recurrent neural network unit. Specifically, LSTM has three gate mechanisms: input gate, forget gate, and output gate. e model is shown in Figure 4.
When coding word x t , the unit calculation process is shown in the following formula: Recurrent neural networks are commonly used to deal with sequence problems. However, due to the limitation of memory, the long-distance dependence of the cyclic neural network is still lost. Because the attention mechanism is not restricted by distance, it has become a popular technique for dealing with sequence problems. e text processor based on self-attention first converts the input matrix X into query matrix Q, key matrix K, and value matrix V, as shown in Figure 5. e calculation formula is as follows: Next, the algorithm calculates the dot product of query and key, divides the result by �� d k , and finally applies the softmax function to obtain the weight of value. e formula is as follows: Because the target vector matrix Attention(Q;, K, V) calculated by this self-attention mechanism is obtained by point operation, that is, there is interaction between two words, therefore, using it as the coding matrix of the sentence itself, it can capture the dependency between any two words. e self-attention text processor can extract the dependency relationship between words at any distance, and the execution efficiency is higher than that of the text processors of the convolutional neural network and the recurrent neural network.
As we all know, only part of the words or part of the comments in the comment-based recommendation is useful.
ere have been many works using the attention mechanism in conjunction with text processors of convolutional neural networks or cyclic neural networks to capture the usefulness of the word level or sentence level, but there is also a small amount of work that fully utilizes the attention mechanism to achieve this goal. e MPCN model is shown in Figure 6. MPCN uses the co-attention mechanism to deeply interact with user and item comments and captures the word-level and comment-level attention distribution. Specifically, the model first takes the encoded comment x ∈ R l×d as input 4 Computational Intelligence and Neuroscience and uses the co-attention network to first extract the comment-level co-attention. Among them, l is the length of the comment and d is the size of the word embedding. e formula is as follows: where i and j represent the ith and jth comments, respectively, and F(.) is the feedforward neural network, M ∈ R d×d , and S � s ij ∈ R l×l . Next, the algorithm calculates comments x i and x j and considers the representations x i and x j of the comment-level co-attention; the formula is as follows: where max col and max row are the maximum pooling operations performed on the columns and rows of the matrix S, respectively. e Gumbel function can obtain the one-hot encoded vector, that is, the useful comments are selected. Computational Intelligence and Neuroscience en, the word-level co-attention is processed similarly. e algorithm takes x i and x j as input and outputs comments x i and x j that consider the word-level co-attention. e formula is as follows: where avg col and avg row are the average pooling operations performed on the columns and rows of the matrix w, respectively. MPCN has greatly improved its performance on 24 publicly accessible datasets.
In order to deal with the translation problem, an endto-end sequence-to-sequence learning method is proposed. e model of this method is shown in Figure 7, which consists of two recurrent neural networks (RNN). One acts as an encoder to process the input sequence 4 , and the other acts as a decoder (sentence generator) to generate a sequence y 1 ⟶ y 2 ⟶ y 3 . e specific steps are as follows: (1) e algorithm uses the word embedding model to encode the input sequence and then enters the encoded words into the RNN one by one. e formula is as follows:   Figure 7: seq2seq model diagram. 6 Computational Intelligence and Neuroscience where x t is the input of the current unit, h t−1 is the output of the previous hidden unit, and h t is the output of the current unit. (2) e algorithm uses the output of the encoder as the first hidden state h 0 of the decoder and uses the decoder to generate words one by one: (3) e algorithm uses the classifier to predict each word. e number of classifications is the size of the vocabulary. e formula is as follows: e application of the seq2seq model has been widely used in other fields by translation systems, such as question answering systems, dialogue systems, and recommendation systems. Although LSTM and GRU are proposed to improve the dependence of words in sentences, the long-distance dependence is still not ideal. In order to solve this problem, the seq2seq joint attention mechanism is used to achieve better translation. e model is shown in Figure 8. e specific steps are as follows: (1) e algorithm uses the word embedding model to encode the input sequence and then uses RNN to process the encoded words one by one. e formula is as follows: (2) e algorithm takes the output of the encoder as the first hidden state h 0 of the decoder and uses the decoder to generate words one by one: (3) e algorithm uses the interaction between the hidden state of the decoder and the hidden state of the encoder to find the attention weight. e formula is as follows: (4) e algorithm uses the attention weight to weight the hidden layer of the encoder as a context vector. e formula is as follows: (5) e algorithm concatenates the context vector and the hidden state of the decoder as the output of the predicted word and then inputs the result to the classifier to predict each word: seq2seq combined with attention has better effect. e RNN unit of the seq2seq encoder and decoder can use a long short-term memory network (LSTM) or a gated recurrent unit (GRU), and the decoder is commonly used in recommendation systems as a generator of interpretable sentences. is study uses hierarchical GRU as a translator to generate natural language sentences, and the generated sentences can explain the recommended results well. e formula for multihead attention is as follows: where Concat (.) is the splicing operation. In a multihead decoder, K and V come from the output of the encoder and Q comes from the output of the upper layer of the decoder. Since attention is input in parallel, there is no position information between words, so this article proposes the position code of each word. e calculation formula is as follows: where pos is the position of the word, i is the dimension of the word, and d model is the dimension of the position code. e final output of the decoder can predict the probability of each word through a linear transformation function. e formula is as follows: e encoder and decoder models based on the selfattention mechanism further improve the effect of machine translation. e translation model is similar to seq2seq, in which the decoder can also be used as a sentence generator ... ...

Cross-Border e-Commerce Intelligent Information Recommendation System Based on Deep Learning
e cross-border e-commerce intelligent information recommendation system based on deep learning is shown in Figure 9. e model is divided into 3 layers. e first layer is the embedding layer, which initially expresses users and points of interest through the TransR model. e second layer is the relational reasoning layer, which forms a multilayer relational reasoning through random walks on the knowledge graph, calculates the information attenuation after each walk, and uses a deep neural network to perform a new vector mapping. Each mapping of a vector is a performance of relational information. Each mapping is concatenated to become the final vector representation of the user or point of interest, and the user set and the point of interest set are divided. e third layer is the output layer, which uses the vector representation of the new user's points of interest and recommends top-N points of interest to the user based on the collaborative filtering algorithm.
e new model constructed in this study links the status quo of various businesses in reality and changes the starting point of the previous recommendation system. We use the long tail theory to mine the mass commodities in the niche,   Computational Intelligence and Neuroscience recommend these products to users as suggestions, and combine the graph model to propose a niche commodity recommendation algorithm based on the graph search strategy. e specific framework is shown in Figure 10.
On the basis of the above analysis, the model proposed in this paper is verified. First of all, this article conducts simulation research on the effect of intelligent information processing and builds the system model of this article through simulation software to simulate the intelligent information processing of cross-border e-commerce. e results obtained are shown in Table 1.
From the above research, it can be seen that the method proposed in this paper can have a good effect. On this basis, the recommendation effect test is carried out, and the results shown in Table 2 are obtained. rough the above experiments, it can be seen that the cross-border e-commerce intelligent information recommendation system based on deep learning proposed in this paper has a good recommendation effect and meets the recommendation needs of cross-border e-commerce.

Conclusion
With the continuous expansion of the scale of online platforms, online trading has become one of the indispensable activities on people's daily lives in the new era. is has also stimulated offline entities to a certain extent, forming an integrated online and offline sales strategy. With the increasing scale of online business platforms, recommendation systems have also risen, and they have become one of the backbone technologies in various fields today. It is also mainly from the perspective of ""people-oriented"" to make the originally large and complex assembly line more intelligent, which greatly facilitates human beings. is paper applies deep learning to the cross-border e-commerce intelligent information recommendation system to overcome the geographical problems of cross-border e-commerce and improve the recommendation effect of crossborder e-commerce. e experiment shows that the crossborder e-commerce intelligent information recommendation system based on deep learning proposed in this paper has a good recommendation effect and meets the recommendation needs of cross-border e-commerce.

Data Availability
e labeled dataset used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest
e author declares no conflicts of interest.