skip to main content
research-article

Geographic information from georeferenced social media data

Published:01 July 2011Publication History
Skip Abstract Section

Abstract

In the last few years, we are witnessing an emerging class of communication and information platforms some call social awareness streams (SAS) [15]. Available from social media services such as Facebook, Twitter, FourSquare, Flickr, and others, these hugely popular platforms allow participants to post streams of lightweight content items, from short status messages to links, pictures, and videos, in a highly connected social environment. Many of these items are associated with location coordinates in the form of latitude and longitude, or with a business or venue that is in turn associated with a precise location. The number of "geotagged" items is likely to grow with the number of people using geo-enabled devices to access and produce SAS data. The vast amounts of SAS data offer unique opportunities for understanding local communities and people's attitudes, attention, and interest in them. Robust methods for learning from SAS data about geographies and local communities, using methods from Artificial Intelligence, Information Retrieval and Natural Language Processing, can greatly improve the state of geographic information retrieval. Such contributions, from better modelling of geographic areas, to improved knowledge about these areas and how they are used by individuals and communities, have begun to surface in the last few years, and are summarized in this article. The structure of this work borrows from Lynch [13], who referred to five elements that make up an individual's perception of city: districts, landmarks, paths, nodes, and edges. This article borrows from Lynch in the context of social media and SAS, proposing the four elements that make up the geographic information that can be derived from social media about a city or area: districts, landmarks, paths, and activities. This article presents a simple model for geographic SAS data, and then considers the four social media elements, or main types of applications of SAS data to geographic information systems. These applications include boundary definition and detection (district); computation of attractions (landmarks); derivation and recommendation of paths; and evaluation of activities, interests and temporal trends.

References

  1. R. Ahas, A. Aasa, S. Silm, and M. Tiru. Mobile positioning data in tourism studies and monitoring: Case study in tartu, estonia. Information and Communication Technologies in Tourism 2007, pages 119--128, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  2. S. Ahern, M. Naaman, R. Nair, and J. H.-I. Yang. World explorer: visualizing aggregate data from unstructured text in geo-referenced collections. In JCDL '07: Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries, pages 1--10, New York, NY, USA, 2007. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. D. Choudhury, M. Feldman, S. Amer-Yahia, N. Golbandi, R. Lempel, and G. Yu. Automatic construction of travel itineraries using social breadcrumbs. In Proceedings of the 21st ACM conference on Hypertext and hypermedia, pages 35--44, Toronto, Ontario, Canada, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the world's photos. In WWW '09: Proceeding of the 18th international conference on World Wide Web, New York, NY, USA, 2009. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. E. Gilbert, K. Karahalios, and C. Sandvig. The network in the garden: an empirical analysis of social media in rural life. In CHI '08: Proceeding of the twenty-sixth annual SIGCHI conference on Human factors in computing systems, pages 1603--1612, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. F. Girardin, F. Calabrese, F. D. Fiore, C. Ratti, and J. Blat. Digital footprinting: Uncovering tourists with user-generated content. IEEE Pervasive Computing, 7(4):36--43, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. C. Gonzalez, C. A. Hidalgo, and A.-L. Barabasi. Understanding individual human mobility patterns. Nature, 453(7196):779--782, 06 2008.Google ScholarGoogle ScholarCross RefCross Ref
  8. B. Hecht, L. Hong, B. Suh, and E. H. Chi. Tweets from justin bieber's heart: the dynamics of the location field in user profiles. In Proceedings of the 2011 annual conference on Human factors in computing systems, CHI '11, pages 237--246, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. B. J. Hecht and D. Gergle. On the "localness" of user-generated content. In Proceedings of the 2010 ACM conference on Computer supported cooperative work, CSCW '10, pages 229--232, New York, NY, USA, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. L. Hollenstein and R. S. Purves. Exploring place through user-generated content: Using flickr tags to describe city cores. Spatial Information Science, 1(1):21--48, 2010.Google ScholarGoogle Scholar
  11. J. Jacobs. The death and life of great American cities. Random House, 1961.Google ScholarGoogle Scholar
  12. L. S. Kennedy and M. Naaman. Generating diverse and representative image search results for landmarks. In WWW '08: Proceeding of the 17th international conference on World Wide Web, pages 297--306, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. K. Lynch. The image of the city. Publications of the Joint Center for Urban Studies. Technology Press, Cambridge, 1960.Google ScholarGoogle Scholar
  14. M. Naaman, H. Becker, and L. Gravano. Hip and trendy: Characterizing emerging trends on twitter. Journal of the American Society for Information Science and Technology, 62(5):902--918, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Naaman, J. Boase, and C.-H. Lai. Is it really about me? message content in social awareness streams. In CSCW '10: Proceedings of the 2010 ACM Conference on Computer Supported Cooperative Work, pages 189--192, New York, NY, USA, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Nagarajan, K. Gomadam, A. P. Sheth, A. Ranabahu, R. Mutharaju, and A. Jadhav. Spatio-temporal-thematic analysis of citizen sensor data: Challenges and experiences. In Web Information Systems Engineering - WISE 2009, volume 5802 of Lecture Notes in Computer Science, pages 539--553. Springer Berlin/Heidelberg, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T. Sakaki, M. Okazaki, and Y. Matsuo. Earthquake shakes Twitter users: Real-time event detection by social sensors. In WWW '10: Proceedings of the 19th International Conference on World Wide Web, pages 851--860, New York, NY, USA, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Scellato, C. Mascolo, M. Musolesi, and V. Latora. Distance matters: Geo-social metrics for online social networks. In WOSN '10: Proceedings of the 3rd Workshop on Online Social Networks, Boston, MA, USA, June 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. Schockaert and M. De Cock. Neighborhood restrictions in geographic ir. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '07, pages 167--174, New York, NY, USA, 2007. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. K. Starbird, L. Palen, A. L. Hughes, and S. Vieweg. Chatter on the red: What hazards threat reveals about the social life of microblogged information. In CSCW '10: Proceedings of the 2010 ACM Conference on Computer Supported Cooperative Work, pages 241--250, New York, NY, USA, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Y. Takhteyev, A. Gruzd, and B. Wellman. Geography of Twitter networks. Social Networks, To Appear. To Appear.Google ScholarGoogle Scholar
  22. S. Vieweg, A. L. Hughes, K. Starbird, and L. Palen. Microblogging during two natural hazards events: what twitter may contribute to situational awareness. In CHI '10: Proceedings of the 28th international conference on Human factors in computing systems, pages 1079--1088, New York, NY, USA, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. S. Yardi and d. boyd. Tweeting from the town square: Measuring geographic local networks. In ICWSM '10: Proceedings of the International AAAI Conference on Weblogs and Social Media. AAAI, 2010.Google ScholarGoogle Scholar

Index Terms

  1. Geographic information from georeferenced social media data

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in

              Full Access

              • Published in

                cover image SIGSPATIAL Special
                SIGSPATIAL Special  Volume 3, Issue 2
                July 2011
                76 pages
                EISSN:1946-7729
                DOI:10.1145/2047296
                Issue’s Table of Contents

                Copyright © 2011 Author

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 1 July 2011

                Check for updates

                Qualifiers

                • research-article

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader