A Personalized Travel Route Recommendation Model Using Deep Learning in Scenic Spots Intelligent Service Robots
A Personalized Travel Route Recommendation Model Using Deep Learning in Scenic Spots Intelligent...
Tang, Qili
2022-04-21 00:00:00
Hindawi Journal of Robotics Volume 2022, Article ID 3851506, 8 pages https://doi.org/10.1155/2022/3851506 Research Article A Personalized Travel Route Recommendation Model Using Deep Learning in Scenic Spots Intelligent Service Robots Qili Tang School of Economics and Management, Aba Teachers University, Aba, Sichuan 623002, China Correspondence should be addressed to Qili Tang; 20109637@abtu.edu.cn Received 8 February 2022; Accepted 19 March 2022; Published 21 April 2022 Academic Editor: Shan Zhong Copyright © 2022 Qili Tang. is is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. is paper proposes a personalized tourist interest demand recommendation model based on deep neural network. Firstly, the basic information data and comment text data of tourism service items are obtained by crawling the relevant website data. Furthermore, word segmentation and word vector transformation are carried out through Jieba word segmentation tool and Skip- gram model, the semantic information between di‹erent data is deeply characterized, and the problem of very high vector sparsity is solved. en, the corresponding features are obtained by using the feature extraction ability of DNN’s in-depth learning. On this basis, the user’s score on tourism service items is predicted through the model until a personalized recommendation list is generated. Finally, through simulation experiments, the recommendation accuracy and average reciprocal ranking of the proposed algorithm model and the other two algorithms in three di‹erent databases are compared and analyzed. e results show that the overall performance of the proposed algorithm is better than the other two comparison algorithms. human intention. However, this method does not divide the 1. Introduction user’s access sequence into di‹erent interest segment se- Service robot refers to an autonomous or semiautonomous quences according to time, so the recommendation accuracy robot [1] that completes useful service activities instead of is low. For the dynamic traveling repairman problem and human beings, but does not engage in production work. Its dynamic vehicle routing problem, Reference [12] combined role is to replace service personnel and provide services with time-varying requirements, sent the available robots to required by human beings. Service robot contains many the nearest service request location, sent multiple robots for scienti—c knowledge including mechanical engineering, each service request arriving in the system, and proposed a automation, computer science, and control engineering new model-free operation strategy independent of load [2–4]. With the continuous development of arti—cial intel- factor. However, this method does not have model algorithm ligence, service robots are gradually moving towards intel- and cannot be applied to more complex situations. In order ligence [5, 6]. Due to the increasingly obvious trend of to realize long-term autonomous operation, Reference [13] proposed a modular general software framework for intel- personalized tourism service selection by users, the research on intelligent tourism service robot has become a hot spot in ligent mobile robot, which can use complex human voice intelligent service robot [7, 8], which is innovative and commands to interact with humans. However, this method forward-looking in the —eld of robot application in tourism only considers the home service robot. Reference [14] in- industry. troduced local attention and nonlinear attention to capture At present, the traditional method of providing services local and global project information at the same time. On by service personnel cannot meet people’s personalized this basis, a nonlinear attention similarity model (NASM) needs [9, 10]. Reference [11] developed a robot partner for was proposed for project-based collaborative —ltering information support and proposed a new method that can through local attention embedding. However, the algorithm Ÿexibly recommend all kinds of information according to cannot accurately capture human high-order sequence 2 Journal of Robotics behavior, and it is difficult to realize complex recommen- applied in various occasions and achieve good results, as well dation. Reference [15] proposed a personalized robot service as tourism service recommendation [18]. Although collab- system centered on robot thinking, which can migrate with orative filtering technology has many advantages, such as the user’s geospatial movement at any time, so that it can good processing of unstructured data and high degree of continue to grow with the user. However, this method does personalization and automation of recommendation, col- not consider the environmental factors of users, and the laborative filtering technology also has the problem of data personalized growth cycle is long. Reference [16] constructs sparsity [19]. Generally, in the application of recommen- the intelligent robot control system based on the principle of dation system, the data scored by users are insignificant for human-computer interaction and designs the corresponding the overall data, which will lead to some problems. In this model-based control algorithm to identify the dynamic case, the use of collaborative filtering technology often model of the robot. However, this method is difficult to cannot achieve good results. In other words, for collabo- obtain the prior distribution, and it is difficult to characterize rative filtering technology, if an item has less scores, it is the high-dimensional semantics of users. In Reference [17], difficult to recommend it to other users. In addition, if a user aiming at the path-planning problem of hospital service scores very little, it will be difficult to get some recom- robot in drug delivery, medical insurance order and other mendations. It is difficult to achieve excellent results by services, based on the automatic control robot with visual using traditional collaborative filtering technology. recognition ability, combined with the three-dimensional +erefore, this paper proposes a tourism service rec- reconstructed image and the route area shunting method ommendation model based on deep learning to solve the using edge calculation, an image edge detection algorithm related problems. +e proposed model is mainly divided into based on three-dimensional features is proposed. However, four modules: data preprocessing, construction of depth this method is only suitable for special people in specific prediction model, network training, and final recommen- areas and does not have the characteristics of in-depth dation list generation. +e principle and function of each personalized service. module are shown in Figure 1: Based on the above analysis, aiming at the problem of (1) Data acquisition and preprocessing—this module personalized travel route recommendation of intelligent mainly obtains the basic information data and service robot in the scenic spot, a personalized tourist in- comment text data of tourism service items by terest demand recommendation model based on deep crawling the relevant website data and then pre- learning using word embedding technology is proposed. +e processes these data. basic idea is to (1) reduce the sparsity of data vector and (2) Construction of prediction model—this module uses improve the recommendation accuracy of the algorithm by deep learning technology to predict users’ scores on preprocessing the original data and (2) build a depth pre- tourism service items. diction model to deeply mine the relationship between users and scenic spots. Compared with the traditional service (3) Training network—the module uses sample data to robot travel route recommendation method, the contribu- train the model network, mine the potential rela- tions of the proposed method are as follows: tionship between users and tourism service items, and learn the interaction between users and tourism (1) +e Skip-gram model in the word2vec word em- service items, so as to obtain a predictable model. bedding method is used to transform the word vector (4) Generate personalized recommendation list—the of data, and the effective extraction of topic feature main function of this module is to test the experi- vector, geographic factor feature vector, and user mental data. +e module inputs the experimental access feature vector is realized. data into the trained model. +e model predicts (2) +e proposed model uses the deep neural network to users’ scores on tourism service items, sorts them transform the recommendation of tourists’ interests according to the score, and finally generates a per- and needs into the task of binary classification, which sonalized recommendation list for each user to improves the ability of extracting the features of the complete the recommendation. original data and effectively enhances the perfor- mance of predicting users’ ratings. 2.2. Data Acquisition and Preprocessing. +e goal of data +e rest of this paper is organized as follows: the second acquisition mainly includes three aspects: self-built database, part introduces the personalized tourist interest demand data from Foursquare, and data from Tokyo [20, 21]. +e recommendation model based on deep neural network; the self-built database adopts the data set established by col- third part compares with the existing recommendation lecting a large amount of travel information from the model to realize the feasibility and optimization of the MaFengWo, with about 390000 user travel access records. method proposed in this paper; the fourth part is the Foursquare’s data includes the long-term (about 10 months) conclusion of this paper. check-in data collected in New York from May 2013 to March 2014, filtering out users with less than 8 access 2. Proposed Model records and locations with less than 8 visits. +ere are 1158 users and 5092 locations for the experiment, with a total of 2.1.OverallFramework. Collaborative filtering technology is the most used in the recommendation system, which can be 257221 check-in records in the Foursquare data set. +e data Journal of Robotics 3 Table 1: Recommended data set of tourism series. START Database Self-built Foursquare Tokyo Number of users 9328 1158 2095 Data collection Number of locations 1643 5092 8246 Number of check-in records 392031 257221 605893 Data Data cleaning preprocessing vectors on a word list. +e data can be expressed as Generate word embedding vector feature vectors using the frequency and TF-IDF value of words. A typical representation is One-Hot Build a predictive model network Encoding. However, this method essentially uses a vector containing only one 1 and the others are 0 to Construction of uniquely represent words, and its dimension is the Factorization machine model function prediction models number of words in the whole vocabulary list. However, the vocabulary list used in this represen- Train the constructed network model tation method is very large, so the dimension of feature vectors represented by this method is very Generate recommendation list large, which eventually leads to the problem of very high vector sparsity. Secondly, such methods cannot express the relationship between words and cannot Personalized recommendation results well reflect the deep semantic information between words and text. Here, the word embedding method is END used for word vector conversion [24]. Different from the traditional lexical feature expression methods, the Figure 1: Implementation flow of the proposed recommendation word embedding method can represent words by model. dense real number vectors in low dimensional space. +is cannot only represent words in vector form but also calculate the effective distance between two words from Tokyo are similar to the data from Foursquare, and the and describe the semantic information between same filtering operation is adopted for the data. Finally, there words. It is a very effective method to process text were 2095 users and 8246 locations for the experiment, with information. At present, word2vec method is the most a total of 605893 check-in records. widely used word embedding technology. Word2vec +e specific statistics of the three data sets are shown in model is divided into Skip-gram model and CBOW Table 1. model [25]. Skip-gram model predicts the generation probability of context vocabulary through target vo- 2.3. Vector Representation of Text. Preprocessing the data in cabulary, while CBOW model uses context vocabulary the above three databases mainly includes the following to predict the generation probability of target vo- steps: cabulary. +is paper mainly uses Skip-gram model for word vector representation, and its basic structure is (1) Chinese word segmentation technology is used for shown in Figure 2. word segmentation. In this paper, Jieba word seg- mentation tool is used to segment data [22]. Jieba As can be seen from Figure 2, Skip-Gram model is a word segmentation tool adopts the Chinese word neural network model, including input layer, hidden layer, segmentation algorithm of NShort, which is the and output layer. Firstly, the vocabulary is transformed into python implementation of the algorithm. It has the One-Hot encoding form and input into the input layer and characteristics of simple principle, easy to under- then calculated in the hidden layer. +e output layer outputs stand, low model resource occupation, and easy the probability of the target context vocabulary. When the training. In addition, NShort Chinese word seg- model training is completed, the weight from the input layer mentation algorithm has excellent efficiency in large- to the hidden layer can be used to represent the target word scale word segmentation application scenarios and is vocabulary. +is is because in the weights at this time, only widely used in various commercial fields. Moreover, the weights at the position of “1” in the one-hot encoder are the model supports incremental expansion. It is the activated, and the number of these weights is the same as the mainstream algorithm of Chinese word segmenta- number of hidden layer nodes so that the vector composed tion and is used by most search engine companies. of these weights can represent the target vocabulary. +e (2) Convert the divided data into word vectors. As a position of 1 of one-hot encoder of different words is dif- ferent, so the target word is uniquely mapped into a low- method of text feature representation, vocabulary is expressed as a feature vector, which is called word dimensional dense vector. +e kip-Gram model predicts the probability of target vector [23]. It is a common expression of word vectors to represent text information with feature vocabulary context vocabulary using the following formula (1): 4 Journal of Robotics calculation efficiency is often very low. +erefore, the existing models often use hierarchical softmax for efficient Q (t) Output calculation. Combined with hierarchical softmax, formula (1) is expanded into formula (2): k v v 1− z z ( ) i i T v T v P[S(χ)|χ] � δ v x · 1 − δ v x , (2) χ i−1 χ i−1 a∈S(χ) i�2 Intput where, k represents the path length of the context vocab- ulary in the output hierarchy tree. v represents the context- sensitive vocabulary of the target vocabulary χ. v represents the output vector of the target vocabulary. x represents the i−1 output word vector at the corresponding level under a projection context word. z represents the logistic output indicator variable, when v v v T v z � 0, it is expressed as P(z |v , x ) � δ(v x ), and Q (t+2) Q (t+1) Q (t-1) Q (t-2) i i i−1 χ i−1 v v v T v when z � 1, it is expressed as P(z |v , x ) � 1 − δ(v x ). i i χ i−1 χ i−1 Figure 2: Basic architecture of Skip-gram model. In the process of word embedding model training, some negative samples are often added to improve the training speed and improve the quality of the obtained word vector. P[S(χ) | χ] � P(v | χ), (1) At this time, the objective function of Skip-gram model a∈S(χ) training can be expressed as follows: where, S(χ) represents the context vocabulary of the target vocabulary χ. When using softmax to calculate P(v | χ), the χ T χ T G � y · logδv x + 1 − y · log1 − δv . v v v χ χ (3) χ∈C χ∈S(x) v∈ χ ∪M { } x , v , and y represent parameter vector, context word the order information of words can be well-preserved in the v χ embedding vector, and logistic indicator variable, respec- generated word vector matrix K , which is of great help 1: m tively, in which the sampled vocabulary set of vocabulary χ is and a good advantage for further processing. represented by M . In deep learning tasks based on neural +is method is also used to process the comment in- χ networks, the word embedding vector generated by Skip- formation of tourism service items. All comment texts gram model can be used as good input data. +is method is obtained from tourism service item v are also integrated into used to map word vectors. a single document z and then the document is trans- 1: m For the comment information that has been divided formed into word vector matrix K by the above method. 1: m into words, the ultimate goal is to express the comment Information other than the comments of users and information as a word vector matrix and input it into the tourism service items is called other information, which neural network. In order to achieve this goal, all com- includes the basic information of users and tourism service ments written by user v, that is, user comments, are items. +e age in the user’s basic information is normalized integrated into a single document and recorded as z . by x � x/120, the gender is directly normalized to the real 1: m Document z consists of m words in total, as shown in value of [0, 1], and the occupation and city are directly 1: m equation (4). converted to the word embedding vector. In addition to the comment text, the user’s historical evaluation information v v v v v K � Ψ