Abstract
People are keen to share their geospatial locations to access social activities or services via mobile internet, which provides a new perspective for us to understand human mobility. However, the collection of individual mobility data arouses great concern about privacy among the public. Specifically, anonymized mobility records can be used to imply the routine behavior of individuals such that people can be re-identified even if they share footprints with different accounts or platforms. In this context, this study explored the probability of re-identifying anonymized users with advanced deep learning techniques, only leveraging their trajectories collected during a long time period. Such a user re-identification task realizes the identification of anonymous users by mining the characteristics of anonymous trajectories. Prevailing methods adopt deep sequential learning models, such as recurrent neural networks (RNNs), to capture the inherent similarity between any two trajectories, replacing classic statistical models. Despite that, RNN-like models usually fail in learning effective knowledge from longer sequences, such as one’s visited locations in one week. To this end, we propose a novel model based on the Siamese Transformer network. The entire model comprises a discriminant module and retrieval module. The discriminant module uses the Transformer model to detect the characteristics of the trajectory and employs an improved attention mechanism to achieve similarity measurement between trajectories. The retrieval module helps the model deal with the matching between the anonymous trajectory and user by constructing a mapping relationship between users and locations. Extensive experiments on four real-world location-based social network datasets demonstrated that our method outperforms existing methods.
Similar content being viewed by others
Data Availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Korah PI, Cobbinah PB, Nunbogu AM, Gyogluu S (2017) Spatial plans and urban development trajectory in Kumasi, Ghana. GeoJournal 82(6):1113–1134
Xu Y, González MC (2017) Collective benefits in traffic during mega events via the use of information technologies. J R Soc Interface 14(129):20161041
Xu Y, Çolak S, Kara EC, Moura SJ, González MC (2018) Planning for electric vehicle needs by coupling charging profiles with urban mobility. Nature Energy 3:484–493
Martí P, Serrano-Estrada L, Nolasco-Cirugeda A (2019) Social media data: challenges, opportunities and limitations in urban studies. Comput Environ Urban Syst 74:161–174
Qian T, Liu B, Nguyen QVH, Yin H (2019) Spatiotemporal representation learning for translation-based poi recommendation. ACM Trans Inf Syst (TOIS) 37(2):1–24
Lim KH, Chan J, Leckie C, Karunasekera S (2018) Personalized trip recommendation for tourists based on user interests, points of interest visit durations and visit recency. Knowl Inf Syst 54:375–406
Sánchez P, Bellogín A (2022) Point-of-interest recommender systems based on location-based social networks: a survey from an experimental perspective. ACM Comput Surv (CSUR) 54(11s):1–37
Messaoud K, Yahiaoui I, Verroust-Blondet A, Nashashibi F (2020) Attention based vehicle trajectory prediction. IEEE Trans Intell Veh 6(1):175–185
Rudenko A, Palmieri L, Herman M, Kitani KM, Gavrila DM, Arras KO (2020) Human motion trajectory prediction: a survey. Int J Rob Res 39(8):895–935
Huang Y, Du J, Yang Z, Zhou Z, Zhang L, Chen H (2022) A survey on trajectory-prediction methods for autonomous driving. IEEE Trans Intell Veh 7(3):652–674
Gao Q, Zhou F, Zhang K, Trajcevski G, Luo X, Zhang F (2017) Identifying human mobility via trajectory embeddings. IJCAI 17:1689–1695
Zhou F, Gao Q, Trajcevski G, Zhang K, Zhong T, Zhang F (2018) Trajectory-user linking via variational autoencoder. In: IJCAI, pp 3212–3218
Inferring demographics from human trajectories and geographical context (2019) Comput Environ Urban Syst 77:101368
Li M, Gao S, Lu F, Zhang H (2019) Reconstruction of human movement trajectories from large-scale low-frequency mobile phone data. Comput Environ Urban Syst 77:101346
Digital footprints (2018) Using wifi probe and locational data to analyze human mobility trajectories in cities. Comput Environ Urban Syst 72:4–12
Ludewig M, Jannach D (2018) Evaluation of session-based recommendation algorithms. User Model User-Adap Inter 28:331–390
Chen W, Wang W, Yin H, Zhao L, Zhou X (2023) Hful: a hybrid framework for user account linkage across location-aware social networks. The VLDB Journal 32(1):1–22
Gong NZ, Liu B (2018) Attribute inference attacks in online social networks. ACM Trans Priv Secur (TOPS) 21(1):1–30
Rossi L, Musolesi M (2014) It’s the way you check-in: identifying users in location-based social networks. In: Proceedings of the second ACM conference on online social networks, pp 215–226
Han X, Wang L, Xu L, Zhang S (2016) Social media account linkage using user-generated geo-location data. In: 2016 IEEE Conference on intelligence and security informatics (ISI). IEEE, pp 157–162
Riederer C, Kim Y, Chaintreau A, Korula N, Lattanzi S (2016) Linking users across domains with location data: theory and validation. In: Proceedings of the 25th international conference on world wide web, pp 707–719
Croitoru F-A, Hondru V, Ionescu RT, Shah M (2023) Diffusion models in vision: a survey. IEEE Trans Pattern Anal, Mach Intell
Yu Sang WC-LZ, Xie Zhenping (2023) Tulrn: trajectory user linking on road networks. In: World Wide Web, pp 1949–1965
Qiu J, Yao R, Zhou Y, Wang P, Zhang Y, Zhu H (2023) Visible and infrared object tracking via convolution-transformer network with joint multimodal feature learning. IEEE Geosci Remote Sens Lett 20:1–5
Liset Vázquez Romaguera J-FC, Alley S, Kadoury S (2023) Conditional-based transformer network with learnable queries for 4d deformation forecasting and tracking. IEEE Trans Med Imaging 42(6):1603–1618
Zhang S, Wang S, Wang X, Zhang S, Miao H, Zhu J (2023) Machine learning and knowledge discovery in databases. Cham, Springer Nature Switzerland, pp 418–434
Liwei Deng YZ-SL, Sun Hao, Zheng K (2023) S2tul: A semi-supervised framework for trajectory-user linking. In: In Proceedings of the sixteenth ACM international conference on web search and data mining (WSDM ’23), New York, USA, pp 375–383
Md FTI, Mahmood Tareq, Naznin M (2022) Mtul: a novel approach for multi-trajectory user linking. In: Proceedings of the ninth international conference on networking, systems and security authors, pp 1–7
Güting RH, Behr T, Xu J (2010) Efficient k-nearest neighbor search on moving object trajectories. The VLDB Journal 19(5):687–714
Bedagkar-Gala A, Shah SK (2014) A survey of approaches and trends in person re-identification. Image Vis Comput 32(4):270–286
Wang G, Lai J, Huang P, Xie X (2019) Spatial-temporal person re-identification. Proc of the AAAI Conf Artif Intell 33(01):8933–8940
Ge Y, Li Z, Zhao H, Yin G, Yi S, Wang X et al (2018) Fd-gan: pose-guided feature distilling gan for robust person re-identification. Adv Neural Inf Process Syst 31
Tu Z, Xu F, Li Y, Zhang P, Jin D (2018) A new privacy breach: user trajectory recovery from aggregated mobility data. IEEE/ACM Trans Netw 26(3):1446–1459
De Mulder Y, Danezis G, Batina L, Preneel B (2008) Identification via location-profiling in gsm networks. Sci 22:22
H. Wang, C. Gao, Y. Li, G. Wang, D. Jin, and J. Sun, (2018) De-anonymization of mobility trajectories: dissecting the gaps between theory and practice. In: The 25th annual network & distributed system security symposium (NDSS’18)
Fang L, Cheng X, Yang L, Wang H (2018) Location privacy in mobile big data: user identifiability via habitat region representation. J Commun Inf Netw 3(3):31–38
Nascimento JC, Figueiredo MA, Marques JS (2009) Trajectory classification using switched dynamical hidden markov models. IEEE Trans Image Process 19(5):1338–1348
May Petry L, Leite Da Silva C, Esuli A, Renso C, Bogorny V (2020) Marc: a robust method for multiple-aspect trajectory classification via space, time, and semantic embeddings. Int J Geogr Inf Sci 34(7):428–1450
Choi D-W, Pei J, Heinis T (2017) Efficient mining of regional movement patterns in semantic trajectories. Proceedings of the VLDB Endowment 10(13):2073–2084
Hung C-C, Peng W-C, Lee W-C (2015) Clustering and aggregating clues of trajectories for mining trajectory patterns and routes. The VLDB Journal 24(2):169–192
Dai J, Yang B, Guo C, Ding Z (2015) Personalized route recommendation using big trajectory data. In: 2015 IEEE 31st international conference on data engineering. IEEE, pp 543–554
Cui G, Luo J, Wang X (2018) Personalized travel route recommendation using collaborative filtering based on gps trajectories. Int J Digit Earth 11(3):284–307
Zheng Y, Xu X, Qi L (2019) Deep cnn-assisted personalized recommendation over big data for mobile wireless networks. Wirel Commun Mob Comput 2019
Yabe T, Tsubouchi K, Sekimoto Y, Ukkusuri SV (2022) Early warning of covid-19 hotspots using human mobility and web search query data. Comput Environ Urban Syst 92:101747
Vlachos M, Kollios G, Gunopulos D (2002) Discovering similar multidimensional trajectories. In: Proceedings 18th international conference on data engineering. IEEE pp 673–684
Berndt DJ, Clifford J (1994) Using dynamic time warping to find patterns in time series. KDD workshop, vol 10(16) Seattle. WA, USA, pp 359–370
Chen L, Özsu MT, Oria V (2005) Robust and fast similarity search for moving object trajectories. In: Proceedings of the 2005 ACM SIGMOD international conference on Management of data pp 491–502
Naini FM, Unnikrishnan J, Thiran P, Vetterli M (2015) Where you are is who you are: user identification by matching statistics. IEEE Trans Inf Forensics Secur 11(2):358–372
Ma CY, Yau DK, Yip NK, Rao NS (2013) Privacy vulnerability of published anonymous mobility traces. IEEE/ACM Trans Networking 21(03):720–733
Xie H, Qin Z, Li GY, Juang B-H (2021) Deep learning enabled semantic communication systems. IEEE Trans Signal Process 69:2663–2675
Schlechtriemen J, Wirthmueller F, Wedel A, Breuel G, Kuhnert K-D (2015) When will it change the lane? a probabilistic regression approach for rarely occurring events. In: 2015 IEEE Intelligent vehicles symposium (IV). IEEE, pp 1373–1379
Tomar RS, Verma S (2011) Trajectory predictions of lane changing vehicles using svm. Int J Veh Saf 5(4):45–355
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process 26
Sun T, Wang F, Zhang Z, Wu L, Xu Y (2022) Human mobility identification byădeep behavior relevant location representation. Database systems for advanced applications. Springer International Publishing, Cham, pp 439–454
Gao Q, Zhang F, Yao F, Li A, Mei L, Zhou F (2020) Adversarial mobility learning for human trajectory classification. IEEE Access 8:20 563–20 576
Cai Z, Xiong Z, Xu H, Wang P, Li W, Pan Y (2021) Generative adversarial networks: a survey toward private and secure applications. ACM Comput Surv (CSUR) 54(6):1–38
Tan Z, Chen J, Kang Q, Zhou M, Abusorrah A, Sedraoui K (2021) Dynamic embedding projection-gated convolutional neural networks for text classification. IEEE Trans Neural Netw Learn Syst 33(3):973–982
Xu S, Shijia E, Xiang Y (2020) Enhanced attentive convolutional neural networks for sentence pair modeling. Expert Syst Appl 151:113384
Wang H, Li Y, Wang G, Jin D (2021) Linking multiple user identities of multiple services from massive mobility traces. ACM Trans Intell Syst Technol (TIST) 12(4):1–28
Feng HW-ZYCZYLJ, Zhang M, Jin D (2022) User identity linkage via co-attentive neural network from heterogeneous mobility data. IEEE Trans Knowl Data Eng 34(11):954–968
Gao Qiang, Zhou Fan, Zhong Ting, Trajcevski Goce, Yang Xin, Li Tianrui (2022) Contextual spatio-temporal graph representation learning for reinforced human mobility mining. Inf Sci 606:230–249
Chen C, Zhang W, Bo Y, Chen C (2023) SAMLink: a mobility signature augmentation model for trajectory-user linking. Neural Comput & Applic 35:24 473–24 491
Chen W, Li S, Huang C, Yu Y, Jiang Y, Dong J (2022) Mutual Distillation Learning Network for Trajectory-User Linking. In: Proceedings of the thirty-first international joint conference on artificial intelligence, IJCAI-22, Lud De Raedt, International Joint Conferences on Artificial Intelligence Organization, 7 pp 1973–1979, Main Track
Bromley J, Bentz JW, Bottou L, Guyon I, LeCun Y, Moore C, Säckinger E, Shah R (1993) Signature verification using a siamese time delay neural network. Intern J Pattern Recognit Artif Intell 7(4):669–688
Huang K, Wei K, Li F, Yang C, Gui W (2023) Lstm-mpc: a deep learning based predictive control method for multimode process control. IEEE Trans Ind Electron 70(11):11 544–11 554
Feng J, Zhang M, Wang H, Yang Z, Zhang C, Li Y, Jin D (2019) Dplink: user identity linkage via deep neural network from heterogeneous mobility data. In: The world wide web conference, pp 459–469
Song C, Qu Z, Blumm N, Barabási A-L (2010) Limits of predictability in human mobility. Science 327(5968):1018–1021
Park C, Han J, Yu H (2020) Deep multiplex graph infomax: Attentive multiplex network embedding using global information. Knowl-Based Syst 197:105861
Acknowledgements
This work was jointly supported by the National Science Foundation of China (62102258), the Shanghai Pujiang Program (21PJ1407300), the Shanghai Municipal Science and Technology Major Project (2021SHZDZX0102), and the National Social Science Foundation Major Project of China (21 &ZD200). Thanks Yan Zhang for his contributions to the work.
Author information
Authors and Affiliations
Contributions
Bin Wang: Methodology, Writing Original Draft; Mingming Zhang: Data Curation, Software, Methodology, Visualization; Peng Ding: Conceptualization, Formal Analysis; Tao Yang: Project Administration; Yaohui Jin: Supervision, Investigation; Yanyan Xu: Methodology, Supervision, Visualization.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Ethical and informed consent for data used
We used publicly available datasets and there are no ethical concerns.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, B., Zhang, M., Ding, P. et al. User re-identification via human mobility trajectories with siamese transformer networks. Appl Intell 54, 815–834 (2024). https://doi.org/10.1007/s10489-023-05234-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-023-05234-8