Research Article
DOI: 10.1145/3488560.3498438

RLMob: Deep Reinforcement Learning for Successive Mobility Prediction

Published: 15 February 2022

Abstract

Human mobility prediction is an important task in spatiotemporal sequential data mining and urban computing. Despite extensive work on mining human mobility behavior, little attention has been paid to the problem of successive mobility prediction. State-of-the-art methods for human mobility prediction are mainly based on supervised learning, which faces four key challenges in achieving higher predictability and adapting well to successive prediction: 1) it cannot handle optimization targets that are discrete-continuous hybrids and non-differentiable (in our work, we assume a user's demands are always multi-targeted and can be modeled as a discrete-continuous hybrid function); 2) it cannot flexibly alter the recommendation strategy as user needs change in real scenarios; 3) it suffers from error propagation and exposure bias when predicting multiple points successively; 4) it cannot interactively explore a user's potential interests that do not appear in the history. Where previous methods struggle with these difficulties, reinforcement learning (RL) is an intuitive answer, and we innovatively introduce RL to the successive prediction task. In this paper, we formulate the problem as a Markov Decision Process and propose a framework, RLMob, to solve it. A simulated environment is carefully designed, and an actor-critic framework with an instance of Proximal Policy Optimization (PPO) is applied to handle the large state space of our setting. Experiments show that on this task, the performance of our approach is consistently superior to that of the compared approaches.
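The abstract's core recipe — cast successive next-location prediction as a Markov Decision Process and train an actor-critic agent with PPO — can be illustrated with a minimal sketch. Everything here is an illustrative assumption, not the paper's actual implementation: the class name `MobilityEnv`, the 0/1 accuracy reward, the fixed history window, and the clipping constant are all hypothetical stand-ins.

```python
# Hedged sketch: an MDP view of successive mobility prediction,
# plus PPO's clipped surrogate objective for one sample.
# All names and design choices are illustrative assumptions.

class MobilityEnv:
    """Toy simulated environment: the state is the recent location
    history, the action is the predicted next location, and the
    reward is 1 if the prediction matches the ground truth else 0."""

    def __init__(self, trajectory, history_len=3):
        self.trajectory = trajectory      # ground-truth location sequence
        self.history_len = history_len
        self.t = history_len

    def reset(self):
        # Initial state: the first `history_len` visited locations.
        self.t = self.history_len
        return tuple(self.trajectory[:self.history_len])

    def step(self, action):
        # Compare the agent's predicted location to the true next one.
        true_next = self.trajectory[self.t]
        reward = 1.0 if action == true_next else 0.0
        self.t += 1
        done = self.t >= len(self.trajectory)
        # Next state: a sliding window over the trajectory so far.
        state = tuple(self.trajectory[self.t - self.history_len:self.t])
        return state, reward, done


def ppo_clip_loss(ratio, advantage, eps=0.2):
    """PPO clipped surrogate objective (to be maximized) for one
    sample, where `ratio` is pi_new(a|s) / pi_old(a|s)."""
    clipped = max(1.0 - eps, min(ratio, 1.0 + eps))
    return min(ratio * advantage, clipped * advantage)
```

Because the reward (prediction accuracy over successive steps) is discrete and non-differentiable, it cannot be optimized directly by supervised gradients, which is exactly the gap the RL formulation fills.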

Supplementary Material

MP4 File (WSDM22-fp349.mp4)
This short presentation video walks through the RLMob paper, from the introduction, problem statement, and MDP formulation through the methodology and experiments. It covers only the basic ideas behind the paper; please see the paper itself for technical details.


Cited By

  • (2024) Predicting Human Mobility via Self-Supervised Disentanglement Learning. IEEE Transactions on Knowledge and Data Engineering 36(5): 2126-2141. DOI: 10.1109/TKDE.2023.3317175. Online publication date: May 2024.
  • (2024) Inferring Individual Human Mobility From Sparse Check-in Data: A Temporal-Context-Aware Approach. IEEE Transactions on Computational Social Systems 11(1): 600-611. DOI: 10.1109/TCSS.2022.3231601. Online publication date: February 2024.
  • (2023) Hierarchical Transformer with Spatio-temporal Context Aggregation for Next Point-of-interest Recommendation. ACM Transactions on Information Systems 42(2): 1-30. DOI: 10.1145/3597930. Online publication date: 27 September 2023.


    Published In

    WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
    February 2022
    1690 pages
    ISBN:9781450391320
    DOI:10.1145/3488560
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. deep reinforcement learning
    2. human mobility prediction
    3. sequential data mining
    4. successive mobility prediction

    Qualifiers

    • Research-article

    Funding Sources

    • National Key Research and Development Program of China

    Conference

    WSDM '22

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Article Metrics

    • Downloads (last 12 months): 35
    • Downloads (last 6 weeks): 6
    Reflects downloads up to 13 Feb 2025
