Abstract
The spread of microblogging services, such as Twitter, has made it possible to extract geographical characteristics such as keywords specific to a geographical region, with fine granularity. The results of content analysis of microblogging services are easily affected by users who post excessive messages. In addition, because geographical granularity of users’ interests differs, it is preferable to support multiple levels of granularity for usability. Thus, we propose a ranking method of location-dependent keywords based on a term frequency-inverse document frequency method to extract geographical characteristics. In our method, ranking scores are weighted by diversity of information sources so that the effect of loud users is mitigated. Multiple zoom levels of geographical areas are supported by approximation while databases at only several zoom levels are maintained. We evaluated our ranking method with a real dataset from Twitter and showed its effectiveness. We also describe a prototype implementation of a system using our ranking method.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arakawa, Y., Tagashira, S., Fukuda, A.: Extraction of Location Dependent Words from Twitter Logs. IPSJ SIG Technical Reports 2010-MBL-55(10), 1–6 (2010) (in Japanese)
Bernstein, M., Suh, B., Hong, L., Chen, J., Kairam, S., Chi, E.: Eddi: Interactive Topic-based Browsing of Social Status Streams. In: UIST 2010: 23rd Annual ACM Symposium on User Interface Software and Technology, pp. 303–312 (2010)
Chen, J., Nairn, R., Nelson, L., Bernstein, M., Chi, E.: Short and Tweet: Experiments on Recommending Content from Information Streams. In: CHI 2010: 28th International Conference on Human Factors in Computing Systems, pp. 1185–1194 (2010)
Cranshaw, J., Toch, E., Hong, J., Kittur, A., Sadeh, N.: Bridging the Gap between Physical Location and Online Social Networks. In: UbiComp 2010: 12th ACM International Conference on Ubiquitous Computing, pp. 119–128 (2010)
Google Inc.: Google MAPs JavaScript API V3 (2009), http://code.google.com/intl/en/apis/maps/documentation/javascript/ (retrieved October 2011)
Järvelin, K., Kekäläinen, J.: Cumulated Gain-based Evaluation of IR Techniques. ACM Transactions on Information Systems 20(4), 422–446 (2002)
Mei, Q., Liu, C., Su, H., Zhai, C.: A Probabilistic Approach to Spatiotemporal Theme Pattern Mining on Weblogs. In: WWW 2006: 15th International Conference on World Wide Web, pp. 533–542 (2006)
Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian Approach to Filtering Junk E-mail. In: AAAI 1998 Workshop on Learning for Text Categorization (1998)
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors. In: WWW 2010: 19th International Conference on World Wide Web, pp. 851–860 (2010)
Salton, G., Buckley, C.: Term-Weighting Approaches in Automatic Text Retrieval. Information Processing & Management 24(5), 513–523 (1988)
Sankaranarayanan, J., Samet, H., Teitler, B., Lieberman, M., Sperling, J.: Twitterstand: News in Tweets. In: ACM SIGSPATIAL GIS 2009: 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 42–51 (2009)
Toch, E., Cranshaw, J., Drielsma, P., Tsai, J., Kelley, P., Springfield, J., Cranor, L., Hong, J., Sadeh, N.: Empirical Models of Privacy in Location Sharing. In: UbiComp 2010: 12th ACM International Conference on Ubiquitous Computing, pp. 129–138 (2010)
Tumasjan, A., Sprenger, T., Sandner, P., Welpe, I.: Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment. In: ICWSM 2010: 4th International AAAI Conference on Weblogs and Social Media, pp. 178–185 (2010)
Twitter Inc.: Streaming API | Twitter Developers (2010), https://dev.twitter.com/docs/streaming-api (retrieved October 2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ikeda, S., Kami, N., Yoshikawa, T. (2013). Ranking Location-Dependent Keywords to Extract Geographical Characteristics from Microblogs. In: Cordeiro, J., Krempels, KH. (eds) Web Information Systems and Technologies. WEBIST 2012. Lecture Notes in Business Information Processing, vol 140. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36608-6_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-36608-6_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36607-9
Online ISBN: 978-3-642-36608-6
eBook Packages: Computer ScienceComputer Science (R0)