Estimating Visited Stores Through Positive-Unlabeled Learning

Shirai, Ryo; Imai, Ryo; Liew, Seng Pei; Amagata, Daichi; Takahashi, Tsubasa; Hara, Takahiro

doi:10.1007/978-981-97-5575-2_28

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14856)

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

Abstract

This paper addresses the problem of visited store estimation: identifying, from GPS data, the stores that a given user visited. Because GPS has inherent measurement errors and multiple stores often lie within the error range, accurately identifying the visited stores is challenging. A simple baseline associates GPS data with check-in logs and learns features of the user’s stay. This baseline relies mainly on check-in logs, i.e., positive data, so its precision (how many of its estimates are false positives) cannot be evaluated. We therefore propose a visited store estimation model that considers both precision and recall. We treat the stores within the error ranges of GPS data as unlabeled data, and train the model on these unlabeled data together with features we design. We also introduce a new metric, the category-aware PUF score, as an indicator for estimating precision in our problem setting. Experiments on real-world data demonstrate that the proposed model achieves high recall and high category-aware PUF scores.
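To make the data setup in the abstract concrete, the following is a minimal sketch of how positive and unlabeled examples might be assembled from one GPS fix. It is not the authors' implementation: the function names (`haversine_m`, `label_candidates`), the tuple layouts, and the use of a circular error range measured with the haversine distance are all assumptions made for illustration.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def label_candidates(gps_fix, stores, checked_in_ids):
    """Return (store_id, label) for every store inside the GPS error range.

    gps_fix        -- hypothetical layout: (lat, lon, error_radius_in_metres)
    stores         -- hypothetical layout: iterable of (store_id, lat, lon)
    checked_in_ids -- set of store ids with an observed check-in

    Label 1 means positive (check-in observed). Label 0 means unlabeled,
    which is NOT the same as a confirmed negative.
    """
    lat, lon, error_m = gps_fix
    labeled = []
    for store_id, s_lat, s_lon in stores:
        if haversine_m(lat, lon, s_lat, s_lon) <= error_m:
            labeled.append((store_id, 1 if store_id in checked_in_ids else 0))
    return labeled


# Made-up example data: one GPS fix with a 50 m error radius.
example_fix = (35.0, 135.0, 50.0)
example_stores = [
    ("cafe", 35.0001, 135.0),         # roughly 11 m away
    ("bookshop", 35.0003, 135.0002),  # roughly 38 m away
    ("pharmacy", 35.01, 135.0),       # roughly 1.1 km away
]
example_labels = label_candidates(example_fix, example_stores, {"cafe"})
```

In this made-up example the cafe becomes a positive, the bookshop becomes unlabeled, and the pharmacy is dropped because it lies outside the error range. The key point is that label 0 only records the absence of a check-in, which is why a positive-unlabeled formulation is needed rather than ordinary binary classification.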

Notes

1.
Situations like this, where user feedback is incomplete, are common among real-world mobile apps. For example, a positive label (a user checking in to a store) can be obtained seamlessly, without extra user-app interaction, after the user pays at the store with a mobile payment app. There is, however, no way to tell that the user has not visited a store unless the user is asked explicitly, which is impractical because it degrades the experience of using the app.
2.
Recall that this is the first work to consider GPS errors in the visited store estimation problem. No existing visited store estimation method can deal with sparse GPS data and/or GPS errors, so none is available as a competitor.
3.
https://github.com/pulearn/pulearn.
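For readers unfamiliar with positive-unlabeled learning, here is a minimal sketch of one classical idea from the literature, the label-frequency correction of Elkan and Noto (2008). It is offered only as background and is not claimed to be the model used in this paper, nor does it use the pulearn API. It assumes that labeled positives are selected completely at random from all positives, and that some classifier (not shown) has already produced scores g(x) estimating the probability that an example is labeled. The function names are hypothetical.

```python
def estimate_label_frequency(holdout_positive_scores):
    """Estimate c = P(labeled | positive) as the mean classifier score
    over held-out examples that are known (labeled) positives."""
    if not holdout_positive_scores:
        raise ValueError("need at least one held-out positive score")
    return sum(holdout_positive_scores) / len(holdout_positive_scores)


def calibrate(scores, c):
    """Convert P(labeled | x) into an estimate of P(positive | x) = g(x) / c,
    clipped to 1.0 because the ratio can exceed 1 for noisy scores."""
    if not 0.0 < c <= 1.0:
        raise ValueError("label frequency c must be in (0, 1]")
    return [min(1.0, s / c) for s in scores]


# Made-up scores from a hypothetical labeled-vs-unlabeled classifier.
c_hat = estimate_label_frequency([0.4, 0.5, 0.6])
positive_probs = calibrate([0.1, 0.25, 0.9], c_hat)
```

With these made-up numbers the estimated label frequency is 0.5, so each score is doubled and then clipped at 1.0. The sketch illustrates why unlabeled examples can still be informative: under the stated assumption, a labeled-versus-unlabeled classifier differs from the desired positive-versus-negative one only by the constant factor c.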

References

Amagata, D., Arai, Y., Fujita, S., Hara, T.: Learned k-NN distance estimation. In: SIGSPATIAL, pp. 1–4 (2022)
Amagata, D., Hara, T.: Identifying the most interactive object in spatial databases. In: ICDE, pp. 1286–1297 (2019)
Bekker, J., Davis, J.: Learning from positive and unlabeled data: a survey. Mach. Learn. 109(4), 719–760 (2020)
Cao, X., Cong, G., Jensen, C.S.: Mining significant semantic locations from GPS data. PVLDB 3(1–2), 1009–1020 (2010)
Elkan, C., Noto, K.: Learning classifiers from only positive and unlabeled data. In: SIGKDD, pp. 213–220 (2008)
Feng, J., et al.: DeepMove: predicting human mobility with attentional recurrent networks. In: WWW, pp. 1459–1468 (2018)
Huang, Z., Ma, J., Dong, Y., Foutz, N.Z., Li, J.: Empowering next POI recommendation with multi-relational modeling. In: SIGIR, pp. 2034–2038 (2022)
Ke, G., et al.: LightGBM: a highly efficient gradient boosting decision tree. In: NIPS, vol. 30 (2017)
Lee, W.S., Liu, B.: Learning with positive and unlabeled examples using weighted logistic regression. In: ICML, vol. 3, pp. 448–455 (2003)
Mordelet, F., Vert, J.P.: A bagging SVM to learn from positive and unlabeled examples. Pattern Recogn. Lett. 37, 201–209 (2014)
Nishio, S., Amagata, D., Hara, T.: LAMPS: location-aware moving top-k pub/sub. IEEE Trans. Knowl. Data Eng. 34(1), 352–364 (2022)
Sánchez, P., Bellogín, A.: Point-of-interest recommender systems based on location-based social networks: a survey from an experimental perspective. ACM Comput. Surv. 54(11), 1–37 (2022)
Tsuruoka, S., Amagata, D., Nishio, S., Hara, T.: Distributed spatial-keyword kNN monitoring for location-aware pub/sub. In: SIGSPATIAL, pp. 111–114 (2020)
Wu, F., Li, Z.: Where did you go: personalized annotation of mobility records. In: CIKM, pp. 589–598 (2016)
Xue, A.Y., Zhang, R., Zheng, Y., Xie, X., Huang, J., Xu, Z.: Destination prediction by sub-trajectory synthesis and privacy protection against such prediction. In: ICDE, pp. 254–265 (2013)
Yang, D., Zhang, D., Zheng, V.W., Yu, Z.: Modeling user activity preference by leveraging user spatial temporal characteristics in LBSNs. IEEE Trans. Syst. Man Cybern.: Syst. 45(1), 129–142 (2014)
Yi, J., Lei, Q., Gifford, W.M., Liu, J., Yan, J., Zhou, B.: Fast unsupervised location category inference from highly inaccurate mobility data. In: SDM, pp. 55–63 (2019)
Zhao, P., et al.: Where to go next: a spatio-temporal gated network for next POI recommendation. IEEE Trans. Knowl. Data Eng. (2020)
Zheng, Y.: Trajectory data mining: an overview. ACM Trans. Intell. Syst. Technol. 6(3), 1–41 (2015)

Author information

Authors and Affiliations

Osaka University, Suita, Japan
Ryo Shirai, Daichi Amagata & Takahiro Hara
LY Corporation, Chiyoda City, Japan
Ryo Imai, Seng Pei Liew & Tsubasa Takahashi

Corresponding author

Correspondence to Ryo Shirai.

Editor information

Editors and Affiliations

Osaka University, Suita, Osaka, Japan
Makoto Onizuka
KAIST, Daejeon, Korea (Republic of)
Jae-Gil Lee
Beihang University, Beijing, China
Yongxin Tong
Osaka University, Osaka, Japan
Chuan Xiao
Nagoya University, Nagoya, Japan
Yoshiharu Ishikawa
University of Grenoble Alpes, Saint-Martin d’Hères, France
Sihem Amer-Yahia
University of Michigan, Ann Arbor, MI, USA
H. V. Jagadish
Nagoya University, Nagoya, Japan
Kejing Lu

About this paper

Cite this paper

Shirai, R., Imai, R., Liew, S.P., Amagata, D., Takahashi, T., Hara, T. (2024). Estimating Visited Stores Through Positive-Unlabeled Learning. In: Onizuka, M., et al. Database Systems for Advanced Applications. DASFAA 2024. Lecture Notes in Computer Science, vol 14856. Springer, Singapore. https://doi.org/10.1007/978-981-97-5575-2_28

DOI: https://doi.org/10.1007/978-981-97-5575-2_28
Published: 02 September 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5574-5
Online ISBN: 978-981-97-5575-2
eBook Packages: Computer Science, Computer Science (R0)
