Abstract
With the development of web 2.0 technology, people can not only have access to information on the internet but can express their opinions, engage in on-line discussion and interact within the network’s platform. By analyzing user comment from the same region, we can understand the implied region features and trending topics in that region. Region features can be categorized as an event or topic therefore it can be labeled based on the user’s comment.
In this paper, we propose the discovery of similar topics based on semantics and level or extent of attention focusing on the user’s comment data. Semantics represents the user’s comment while level of attention represents the amount of user’s comment on a news topic, therefore, semantics and the level of attention reveals the user’s comment behavior. This paper uses the LDA and K-means clustering algorithm to analyze similar topics in a region and proposes methods to determine region features. By analyzing the region features and the similar region topics, the labeled region topics can be used for advertisement, improve business strategies, and as a reference for regional administration and planning which has a practical significance.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Sina network. http://news.sina.com.cn/zt
Datatang. http://www.dataatang.com/data/19300
Zheng, Y., Yuan, N.J., Xie, X.: Discovering functional groups of an area (2015). http://www.freepatentsonline.com/y2015/0363700.html
Yuan, Y., Raubal, M.: Analyzing the distribution of human activity space from mobile phone usage: an individual and urban-oriented study. Int. J. GIS 30(8), 1594–1621 (2016). doi:10.1080/13658816.2016.1143555
Zhong, C., Huang, X., Arisona, S.M., Schmitt, G., Batty, M.: Inferring building functions from a probabilistic model using public transportation data. Comput. Environ. Urban Syst. 48, 124–137 (2014). http://www.sciencedirect.com/science/article/pii/S0198971514000854
Yin, Z., Cao, L., Han, J., Zhai, C., Huang, T.: Geographical topic discovery and comparison. In: Proceedings of the 20th International Conference on World Wide Web (WWW 2011), pp. 247–256. ACM, New York (2011). http://dx.doi.org/10.1145/1963405.1963443
Liu, F., Janssens, D., Cui, J., et al.: Building a validation measure for activity-based transportation models based on mobile phone, data. Expert Syst. Appl. 41, 6174–6189 (2014). http://www.sciencedirect.com/science/article/pii/S0957417414002036
Liu, J., Huang, Z., Chen, L., Shen, H.T., Yan, Z.: Discovering areas of interest with geo-tagged images and check-ins. In: Proceedings of the 20th ACM International Conference on Multimedia, MM 2012 (2012). http://doi.acm.org/10.1145/2393347.2393429
Jiang, S., Ferreira Jr., J., Gonzalez, M.C.: Discovering urban spatial-temporal structure from human activity patterns. In: Proceedings of the ACM SIGKDD International Workshop on Urban Computing (UrbComp 2012), pp. 95-102. ACM, New York (2012). http://dx.doi.org/10.1145/2346496.2346512
Ferrari, L., Rosi, A., Mamei, M., Zambonelli, F.: Extracting urban patterns from location-based social networks. In: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks (LBSN 20011), pp. 9–16. ACM, New York (2011). http://dx.doi.org/10.1145/2063212.2063226
Eisenstein, J., O’Connor, B., Smith, N.A., Xing, E.P.: A latent variable model for geographic lexical variation. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP 2010), pp. 1277–1287. Association for Computational Linguistics, Stroudsburg (2010)
Hao, Q., Cai, R., Wang, C., Xiao, R., Yang, J.-M., Pang, Y., Zhang, L.: Equip tourists with knowledge mined from travelogues. In: Proceedings of the 19th International Conference on World Wide Web (WWW 2010), pp. 401–410. ACM, New York (2010). http://dx.doi.org/10.1145/1772690.1772732
Sizov, S.: GeoFolk: latent spatial semantics in web 2.0 social media. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining (WSDM 2010), pp. 281–290. (2010). http://dx.doi.org/10.1145/1718487.1718522
Wakamiya, S., Lee, R., Sumiya, K.: Crowd-sourced urban life monitoring: urban area characterization based crowd behavioral patterns from Twitter. In: Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication (ICUIMC 2012), Article 26, pp. 1–9. ACM, New York (2012). http://dx.doi.org/10.1145/2184751.2184784
Wang, C., Wang, J., Xie, X., Ma, W.-Y.: Mining geographic knowledge using location aware topic model. In: Proceedings of the 4th ACM Workshop on Geographical Information Retrieval (GIR 2007), pp. 65–70. ACM, New York (2007). http://dx.doi.org/10.1145/1316948.1316967
Qi, G., Li, X., Li, S., Pan, G., Wang, Z., Zhang, D.: Measuring social functions of city regions from large-scale taxi behaviors. In: WIP of PERCOM 2011: Work in Progress Workshop (2011). http://hal.archives-ouvertes.fr/hal-01301930
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sun, H., Esho, O., Liu, J., Pang, L. (2016). Discovering Region Features Based on User’s Comments. In: Li, Y., Xiang, G., Lin, H., Wang, M. (eds) Social Media Processing. SMP 2016. Communications in Computer and Information Science, vol 669. Springer, Singapore. https://doi.org/10.1007/978-981-10-2993-6_17
Download citation
DOI: https://doi.org/10.1007/978-981-10-2993-6_17
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2992-9
Online ISBN: 978-981-10-2993-6
eBook Packages: Computer ScienceComputer Science (R0)