Heterogeneous graph-based joint representation learning for users and POIs in location-based social network
Introduction
With the prevalence of mobile devices and the fast development of wireless networks, location-based service such as Foursquare1 and Gowalla2 are widely used worldwide. Unlike general social networks, the users in these location-based services can share the focused geographic entities (i.e., point of interest) they just visit and the related activities to their friends in terms of check-in. Such social network that combines social network and check-in service is called location-based social network (LBSN) (Zheng, 2011). As of 2017, Foursquare had accumulated over 12 billion Foursquare check-ins worldwide.3 That is, these LBSN services play an increasingly important role in people's daily life. Nowadays, many efforts are dedicated to exploit the huge number of check-in records to improve LBNS services. One of the findings is that the learning latent representations of user and POI can be used to enhance POI recommendation (Wang, Tan, Zhang & You, 2016; Pálovics et al., 2017) and social link prediction (Martínez, Berzal & Talavera, 2017; Wu, Zhang & Ren, 2017).
Besides check-in records made by the massive users (Zhao, Cong, Yuan & Zhu, 2015; Zhang & Chow, 2015; Bayrak & Polat, 2014), there are various contextual factors available in LBSN (Valverde-Rebaza, Roche, Poncelet & de Andrade Lopes, 2018), such as geographical influence (Li, Cong, Li, Pham & Krishnaswamy, 2015; Liu, Wei, Sun & Miao, 2014; Cranshaw, Toch, Hong, Kittur & Sadeh, 2010; Scellato, Noulas & Mascolo, 2011), social connections (Bayrak & Polat, 2014; Li, Ge, Hong & Zhu, 2016; Tang, Hu, Gao & Liu, 2013; Valverde-Rebaza, Roche, Poncelet & de Andrade Lopes, 2016), temporal information (Yuan, Cong, Ma, Sun & Thalmann, 2013; Zhao, Zhao, Yang, Lyu & King, 2016). Most of the existing user and POI representation learning methods (Li et al., 2015; Zhao, Fan, Wen & Chang, 2018; Feng et al., 2015; Xie et al., 2016) for LBSN only utilize some of these factors, resulting a partial encoding for the users and POIs towards the understanding of user spatial behavior and their social relationship. For instance, (Zhao, Fan et al., 2018) models chronological check-in sequence and social information without geographical information. (Li et al., 2015; Feng et al., 2015; Xie et al., 2016) exclude social connections for representation learning. And the techniques presented in Li et al. (2015), Zhao, Fan et al. (2018) and Feng et al. (2015) only exploit check-in information, which can not handle the cold-start problem. Although these methods have delivered effective representation learning for the users and POIs in LBSN in terms of POI recommendation and social link prediction, an important problem is not well considered, i.e., how to jointly model all these contextual factors to derive the user and POI representations. In other words, encoding users and POIs in LBSN by jointly considering these contextual factors would result in a more comprehensive understanding of user spatial behavior and their social relationship, leading to better LBSN services, such as POI recommendation and social link prediction.
To this end, in this paper, we propose a heterogeneous LBSN graph-based joint representation learning framework, named as UP2VEC (namely User and POI toVector). UP2VEC takes the users and POIs, check-in records and the auxiliary social network of a LBSN as input, and generates latent representations for each user and POI as output. In UP2VEC, we first build a heterogeneous graph to include the users and POIs, and the interactions between them. This heterogeneous graph could easily facilitate the incorporation of various contextual factors into the node-to-node relatedness measure. Specifically, there are three kinds of node-to-node relatedness, namely user-POI, POI-POI and user-user. We derive the user-POI relatedness by considering both the check-in records of this user-POI pair and the geographical distance between them. The similar procedure is also applied for POI-POI relatedness measure by considering the POI to POI sequential patterns and their geographical distance. As to user-user relations, we calculate their relatedness in terms of both common check-in behaviors and the geographical distance. To model the temporal information, we further consider the time point of the POI visit when building the heterogeneous graph. Inspired by recent progress in network embedding (Zhao, Zhao et al., 2016; Zhao, Fan et al., 2018), we then adopt the effective network embedding technique (i.e., Node2Vec) to learn the latent representations for each user and POI based on the heterogeneous graph. Note that the geographical influence is modeled in all three node-to-node relations. This offers us a straightforward strategy to accommodate the cold-start users and POIs for representation learning.
We evaluate the proposed UP2VEC over two real-world datasets. The experimental results demonstrate that UP2VEC substantially outperforms the existing state-of-the-art techniques in terms of POI recommendation and social link prediction. We further show that UP2VEC provides superior ability in handling cold-start users and POIs. To sum up, the main contributions of this paper are as follows:
- •
We propose a novel framework for representation learning in LBSN By building a heterogeneous LBSN graph. This heterogeneous graph enables a joint modeling of various contextual factors including geographical influence, social relationship and temporal information.
- •
We devise a simple yet effective method to estimate three kinds of node-to-node relatedness, which jointly consider various contextual factors to better reflect the user spatial behavior and their social relationships.
- •
Through extensive experiments on two publicly available datasets, we show that the proposed UP2VEC can achieve significantly improvement in POI recommendation and social link prediction.
The rest of this paper is organized as follows. Section 2 presents literature review for related works in POI recommendation, link prediction and network embedding. Section 3 presents key definitions and problem formulation of node representation learning, POI recommendation and link prediction. Section 4 details the proposed framework and key modules of the framework. Section 5 presents experimental evaluation in terms of POI recommendation, link prediction and parameter sensitivity analysis. Section 6 presents conclusion and future research directions.
Section snippets
Related work
Our work is mainly related to the research studies of POI recommendation, link prediction and network embedding. Therefore, we review the related literature in this section.
Problem formulation
To ease the understanding of the proposed UP2VEC, in this section, we first define the key concepts used in this paper. The problem formulation for POI recommendation and social link prediction are also presented here.
Definition 1 (Point of Interest (POI)) A POI l is defined as a unique geographical entity (e.g., a restaurant or a cinema) whose geographical coordinate is available. The collection of POIs available in a LBSN is denoted as L.
Definition 2 (Check-in) A check-in is a triplet which indicates that
Proposed framework
We need to learn the representations for LBSN users and POIs (including cold-start users and POIs) from the check-ins, social connections, and auxiliary geographical and temporal information. Therefore, we need to figure out a proper way to integrate them into a unified learning model. In LBSN, there are two main entities available, namely users and POIs. The users are connected by social connections, while POIs are connected in the check-in sequences. Users and POIs are also connected via the
Performance evaluation
In this section, we conduct extensive experiments over two real-world datasets to evaluate the efficacy of the proposed UP2VEC framework in terms of three relevant tasks: POI recommendation, cold-start POI recommendation and link prediction. Then, we also present the study of parameter sensitivity for UP2VEC.
Conclusion
In this paper, we have proposed a heterogeneous graph-based joint representation learning framework (UP2VEC) for users and POIs in LBSN. The learned representations of users and POIs are task independent. Therefore, they can be directly used as input for various LBSN tasks, such as POI recommendation and social link prediction. UP2VEC jointly models social relationships, geographical influence, and temporal information through constructing a user-POI heterogeneous graph. Besides that, it
Acknowledgments
This work was supported by the National Key R&D Program of China (No.2016YFB0801303, 2016QY01W0105), the National Natural Science Foundation of China (No.U1636219, U1804263, 61872278, U1736214), and the Plan for Scientific Innovation Talent of Henan Province (No. 184200510018).
References (52)
- et al.
Mining user similarity based on location history
- et al.
Location-aware online learning for top-k recommendation
Pervasive and Mobile Computing
(2017) - et al.
The role of location and social strength for friendship prediction in location-based social networks
Information Processing and Management
(2018) - et al.
A balanced modularity maximization link prediction model in social networks
Information Processing and Management
(2017) - et al.
Find me if you can: Improving geographical prediction with social and spatial proximity
- et al.
Contextual feature analysis to improve link prediction for location based social networks
- et al.
A tutorial on network embeddings
- et al.
Fused matrix factorization with geographical and social influence in location-based social networks
- et al.
Friendship and mobility: User movement in location-based social networks
- et al.
Bridging the gap between physical location and online social networks
Personalized ranking metric embedding for next new poi recommendation
gSCorr: Modeling geo-social correlations for new check-ins on location-based social networks
Node2Vec: Scalable feature learning for networks
Inductive representation learning on large graphs
Advances in Neural Information Processing Systems
Neural collaborative filtering
Point-of-Interest recommendations: Learning potential check-ins from friends
Towards social user profiling: Unified and discriminative influence model for inferring home locations
Rank-GeoFM: A ranking based geographical factorization method for point of interest recommendation
GeoMF: Joint geographical modeling and matrix factorization for point-of-interest recommendation
Exploiting geographical neighborhood characteristics for location recommendation
Recommender systems with social regularization
A survey of link prediction in complex networks
ACM Computing Surveys
Distributed representations of words and phrases and their compositionality
Advances in Neural Information Processing Systems
A random walk around the city: New venue recommendation in location-based social networks
DeepWalk: Online learning of social representations
Towards integrating real-world spatiotemporal data with social networks
Cited by (59)
Towards human-like perception: Learning structural causal model in heterogeneous graph
2024, Information Processing and ManagementMeta-learning adaptation network for few-shot link prediction in heterogeneous social networks
2023, Information Processing and ManagementImproving the spatial–temporal aware attention network with dynamic trajectory graph learning for next Point-Of-Interest recommendation
2023, Information Processing and ManagementHeterogeneous question answering community detection based on graph neural network
2023, Information SciencesGLAE: A graph-learnable auto-encoder for single-cell RNA-seq analysis
2023, Information SciencesCitation Excerpt :Learning and exploiting unknown relations has become an essential issue for scRNA-seq-based cell clustering. Graph neural networks (GNNs) can make good use of the sample relation graph by passing messages over it to aggregate information and have achieved good results in many fields, such as recommendation systems [18–21], social networks [22,23], fake news detection [24], disease-gene association identification [25] and biomedical [26]. Thus, some methods such as scGNN [27] introduce GNN into the field of scRNA-seq analysis.
A multiview graph collaborative filtering by incorporating homogeneous and heterogeneous signals
2022, Information Processing and ManagementCitation Excerpt :However, associations among heterogeneous attributes can also distinguish different interests of users, which help extend the semantic representation of users and items. By considering geographical influence, social relationship and temporal information in location-based social networks, Qiao et al. (2020) designed a heterogeneous graph to learn the representations of users and points of interests for making social link prediction. An attribute graph can be used to model the association of different attributes and help learn a rich attribute embedding of nodes.