Skip to main content
Log in

Spatial and semantical label inference for social media

A cross-network data fusion approach

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Exploring the spatial and semantical knowledge from messages in social media offers us an opportunity to get a deeper understanding about the mobility and activity of users, which can be leveraged to improve the service quality of online applications like recommender systems. In this paper, we investigate the problem of the spatial and semantical label inference, where the challenges come from three aspects: diverse heterogeneous information, uncertainty of individual mobility, and large-scale sparse data. We address the challenges by exploring two types of data fusion, the fusion of heterogeneous social networks and the fusion of heterogeneous features. We build a 4-dimensional tensor, called spatial–temporal semantical tensor (STST), to model the individual mobility and activity by fusing two heterogeneous social networks, a social media network and a location-based social network (LBSN). To address the challenge arising from diverse heterogeneous information and the uncertainty of individual mobility, we construct three types of heterogeneous features and fuse them with STST by exploring their interdependency relationships. Particularly, a spatial tendency feature is constructed to constrain the inference of individual mobility and reduce the uncertainty. To deal with large-scale sparse data, we propose a parallel contextual tensor factorization (PCTF) to concurrently factorize STST. Finally, we integrate these components into an inference framework, called spatial and semantical label inference SSLI. The results of extensive experiments conducted on real datasets and synthetic datasets verify the effectiveness and efficiency of SSLI.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  1. Bao J, Zheng Y, Mokbel MF (2012) Location-based and preference-aware recommendation using sparse geo-social networking data. In: GIS ’12 Proceedings of the 20th international conference on advances in Geographic Information systems pp 199–208

  2. Casella G, Berger RL (2002) Statistical inference, vol 2. Duxbury Pacific Grove, Pacific Grove, CA

    MATH  Google Scholar 

  3. Cho E, Myers SA, Leskovec J (2011) Friendship and mobility: user movement in location-based social networks. In: KDD ’11 Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining pp 1082–1090

  4. Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30

    MathSciNet  MATH  Google Scholar 

  5. Dunn OJ (1980) Multiple comparisons among means. J Am Stat Assoc 56:52–64

    Article  MathSciNet  MATH  Google Scholar 

  6. Gonzalez MC, Hidalgo CA, Barabasi AL (2008) Understanding individual human mobility patterns. Nature 453:479–482

    Article  Google Scholar 

  7. Guerzhoy M, Hertzmann A (2014) Learning latent factor models of travel data for travel prediction and analysis. In: Canadian conference on artificial intelligence

  8. Hao Q, Cai R, Wang C, Xiao R, Yang JM, Pang Y, Zhang L (2010) Equip tourists with knowledge mined from travelogues. In: WWW ’10 Proceedings of the 19th international conference on World Wide Web pp 401–410

  9. Hong L, Ahmed A, Gurumurthy S, Smola AJ, Tsioutsiouliklis K (2012) Discovering geographical topics in the twitter stream. In: WWW ’12 Proceedings of the 21th international conference on World Wide Web pp 769–778

  10. Karamshuk D, Noulas A, Scellato S, Nicosia V, Mascolo C (2013) Geo-spotting: Mining online location-based services for optimal retail store placement. In: KDD ’13 Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining pp 793–801

  11. Kong X, Zhang J, Philip YS (2013) Inferring anchor links across multiple heterogeneous social networks. In: CIKM ’13 Proceedings of the 22nd ACM international conference on information and Knowledge Management pp 1289–1294

  12. Lian D, Zhao C, Xie X, Sun G, Chen E, Rui Y (2014) Geomf: Joint geographical modeling and matrix factorization for point-of-interest recommendation.In: SigKDD ’14 Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining

  13. Ma Y (2016) Source code. http://pan.baidu.com/s/1qY2GHWS

  14. Paterek A (2007) Improving regularized singular value decomposition for collaborative filtering. In: Proceedings of KDD cup and workshop pp 5–8

  15. Ronald IL, Davenport JM (1980) Approximations of the critical region of the fbietkan statistic. Commun Stat Theory Methods 9(6):571–595

    Article  MATH  Google Scholar 

  16. Shang J, Zheng Y, Tong W, Chang E, Yu Y (2014) Inferring gas consumption and pollution emission of vehicles throughout a city. In: KDD ’14 Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining

  17. Song C, Koren T, Barabasi AL (2010) Modelling the scaling properties of human mobility. Nat Phys 6:818–823

    Article  Google Scholar 

  18. Song C, Qu Z, Blumm N, Barabasi A (2010) Limits of predictability in human mobility. Science 327:1018–1021

    Article  MathSciNet  MATH  Google Scholar 

  19. Stephen-Boyd lV Convex optimiation

  20. Wang C, Wang J, Xie X, Ma WY (2007) Mining geographic knowledge using location aware topic model. In: GIR ’07 Proceedings of the 4th ACM workshop on Geographical information retrieval pp 65–70

  21. Wang Y, Yuan NJ, Lian D, Xu L, Xie X, Chen E, Rui Y (2014) Regularity and conformity: Location prediction using heterogeneous mobility data. In: SigKDD ’14 Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining

  22. Wang Y, Zheng Y, Xue Y (2014) Travel time estimation of a path using sparse trajectories. In: KDD ’14 Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining

  23. Webb B (2006) Netflix update: Try this at home. http://sifter.org/simon/journal/20061211.html

  24. Yuan Q, Cong G, Ma Z, Sun A, Thalmann NM (2013) Who, where, when and what: discover spatio-temporal topics for twitter users. In: KDD ’13 Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining pp. 605–613

  25. Zhang J, Kong X, Philip YS (2013) Predicting social links for new users across aligned heterogeneous social networks. In: ICDM ’13 Proceedings of the 13th International conference on data Mining pp 1289–1294

  26. Zhang J, Kong X, Philip YS (2014) Transferring heterogeneous links across location-based social networks. In: WSDM ’14 Proceedings of the 7th ACM international conference on Web search and data Mining pp 303–312

  27. Zhang J, Philip YS, Zhou Z (2014) Meta-path based multi-network collective link prediction. In: KDD ’14 Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining pp 1286–1295

  28. Zheng VW, Cao B, Zheng Y, Xie X, Yang Q (2010) Collaborative filtering meets mobile recommendation: a user-centered approach. In: AAAI 10

  29. Zheng VW, Zheng Y, Xie X, Yang Q (2010) Collaborative location and activity recommendations with gps history data. In: WWW ’10 Proceedings of the 19th international conference on World Wide Web

  30. Zheng Y (2015) Methodologies for cross-domain data fusion: an overview. IEEE Trans. Big Data 1(1):16–34

    Article  Google Scholar 

  31. Zheng Y, Liu T, Wang Y, Zhu Y, Liu Y, Chang E (2014) Diagnosing new york city’s noises with ubiquitous data. In: Ubicomp ’14 Proceedings of the 2014 ACM international joint conference on Pervasive and Ubiquitous Computing pp 247–256

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Yuchi Ma or Ning Yang.

Additional information

Y. Ma and N. Yang: These authors contributed equally to this study and share first authorship.

This work is supported by National Science Foundation of China through Grant 61173099, the Basic Research Program of Sichuan Province with Grant 2014JY0220, and NSF through Grants CNS-1115234, DBI-0960443, OISE-1129076, IIS-1526499, CNS-1626432, and NSFC 61672313.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ma, Y., Yang, N., Zhang, L. et al. Spatial and semantical label inference for social media. Knowl Inf Syst 53, 153–177 (2017). https://doi.org/10.1007/s10115-017-1036-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-017-1036-2

Keywords

Navigation