Skip to main content

Estimating Visited Stores Through Positive-Unlabeled Learning

  • Conference paper
  • First Online:
Database Systems for Advanced Applications (DASFAA 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14856))

Included in the following conference series:

  • 699 Accesses

Abstract

This paper addresses the problem of visited store estimation, which estimates the stores where a given user visited from GPS data. Because of the inherent measurement errors in GPS and the presence of multiple stores within the error range, accurately identifying the visited stores is challenging. A simple baseline estimation approach associates GPS data with check-in logs and learns the features during the user’s stay. This approach relies mainly on check-in logs, i.e., positive data, preventing precision (false positive rate) evaluations. Therefore, we propose a visited store estimation model that considers both precision and recall. We use the stores existing in the error ranges of GPS data as unlabeled data. Our proposed model is trained by incorporating unlabeled data and using the features we design. We introduce a new metric, namely the category-aware PUF score, which is an appropriate indicator for estimating precision in our problem setting. We conduct experiments on real-world data, and the results demonstrate that our proposed model achieves high recall and category-aware PUF scores.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Such kinds of situations, where the feedback from the user is incomplete, are quite generic among real-world mobile apps. For example, a positive example/label of a user checking in to a store can be obtained seamlessly (without extra user-app interaction) after the user makes payment using a mobile payment app at the store. There is, however, no way to tell whether the user has not visited the store (unless the user is asked explicitly, which is impractical from the user-app interaction perspective, as it does not provide a pleasant experience of using the app).

  2. 2.

    Recall that this is the first work considering GPS errors for the visited store estimation problem. No existing visited store estimation methods can deal with sparse and/or GPS errors, so they are not available as competitors.

  3. 3.

    https://github.com/pulearn/pulearn.

References

  1. Amagata, D., Arai, Y., Fujita, S., Hara, T.: Learned k-NN distance estimation. In: SIGSPATIAL, pp. 1–4 (2022)

    Google Scholar 

  2. Amagata, D., Hara, T.: Identifying the most interactive object in spatial databases. In: ICDE, pp. 1286–1297 (2019)

    Google Scholar 

  3. Bekker, J., Davis, J.: Learning from positive and unlabeled data: a survey. Mach. Learn. 109(4), 719–760 (2020)

    Article  MathSciNet  Google Scholar 

  4. Cao, X., Cong, G., Jensen, C.S.: Mining significant semantic locations from GPS data. PVLDB 3(1–2), 1009–1020 (2010)

    Google Scholar 

  5. Elkan, C., Noto, K.: Learning classifiers from only positive and unlabeled data. In: SIGKDD, pp. 213–220 (2008)

    Google Scholar 

  6. Feng, J.,et al.: Deepmove: predicting human mobility with attentional recurrent networks. In: WWW, pp. 1459–1468 (2018)

    Google Scholar 

  7. Huang, Z., Ma, J., Dong, Y., Foutz, N.Z., Li, J.: Empowering next poi recommendation with multi-relational modeling. In: SIGIR, pp. 2034–2038 (2022)

    Google Scholar 

  8. Ke, G., et al.: LightGBM: a highly efficient gradient boosting decision tree. In: NIPS. vol. 30 (2017)

    Google Scholar 

  9. Lee, W.S., Liu, B.: Learning with positive and unlabeled examples using weighted logistic regression. In: ICML. vol. 3, pp. 448–455 (2003)

    Google Scholar 

  10. Mordelet, F., Vert, J.P.: A bagging SVM to learn from positive and unlabeled examples. Pattern Recogn. Lett. 37, 201–209 (2014)

    Article  Google Scholar 

  11. Nishio, S., Amagata, D., Hara, T.: Lamps: location-aware moving top-k pub/sub. IEEE Trans. Knowl. Data Eng. 34(1), 352–364 (2022)

    Article  Google Scholar 

  12. Sánchez, P., Bellogín, A.: Point-of-interest recommender systems based on location-based social networks: a survey from an experimental perspective. ACM Comput. Survey 54(11), 1–37 (2022)

    Article  Google Scholar 

  13. Tsuruoka, S., Amagata, D., Nishio, S., Hara, T.: Distributed spatial-keyword knn monitoring for location-aware pub/sub. In: SIGSPATIAL, pp. 111–114 (2020)

    Google Scholar 

  14. Wu, F., Li, Z.: Where did you go: Personalized annotation of mobility records. In: CIKM, pp. 589–598 (2016)

    Google Scholar 

  15. Xue, A.Y., Zhang, R., Zheng, Y., Xie, X., Huang, J., Xu, Z.: Destination prediction by sub-trajectory synthesis and privacy protection against such prediction. In: ICDE, pp. 254–265 (2013)

    Google Scholar 

  16. Yang, D., Zhang, D., Zheng, V.W., Yu, Z.: Modeling user activity preference by leveraging user spatial temporal characteristics in LBSNs. IEEE Trans. Syst. Man, Cybern.: Syst. 45(1), 129–142 (2014)

    Article  Google Scholar 

  17. Yi, J., Lei, Q., Gifford, W.M., Liu, J., Yan, J., Zhou, B.: Fast unsupervised location category inference from highly inaccurate mobility data. In: SDM, pp. 55–63 (2019)

    Google Scholar 

  18. Zhao, P., et al.: Where to go next: A spatio-temporal gated network for next poi recommendation. IEEE Trans. Knowl. Data Eng. (2020)

    Google Scholar 

  19. Zheng, Y.: Trajectory data mining: an overview. ACM Trans. Intell. Syst. Technol. 6(3), 1–41 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ryo Shirai .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shirai, R., Imai, R., Liew, S.P., Amagata, D., Takahashi, T., Hara, T. (2024). Estimating Visited Stores Through Positive-Unlabeled Learning. In: Onizuka, M., et al. Database Systems for Advanced Applications. DASFAA 2024. Lecture Notes in Computer Science, vol 14856. Springer, Singapore. https://doi.org/10.1007/978-981-97-5575-2_28

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-5575-2_28

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-5574-5

  • Online ISBN: 978-981-97-5575-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics