ABSTRACT
Over the past couple of years, social networks such as Twitter and Facebook have become the primary source for consuming information on the Internet. One of the main differentiators of this content from traditional information sources available on the Web is that these social networks surface individuals' perspectives. When social media users post and share updates with friends and followers, some of those short fragments of text contain a link and a personal comment about the web page, image, or video. We are interested in mining the text around those links to better understand what people are saying about the object they refer to. The salient keywords captured from the crowd are rich metadata that we can use to augment a web page. This metadata can support many applications, such as ranking signals, query augmentation, indexing, and organizing and categorizing content. In this paper, we present a technique called social signatures that, given a link to a web page, pulls the most important keywords from the social chatter around it, producing a high-level representation of the web page from a social media perspective. Our findings indicate that the content of social signatures differs from the content of the web page itself and therefore provides new insights. This difference becomes more prominent as the number of link shares increases. To showcase our work, we present the results of processing a dataset that contains around 1 billion unique URLs shared on Twitter and Facebook over a two-month period. We also provide data points that shed some light on the dynamics of content sharing in social media.
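To make the idea of a social signature concrete, the following is a minimal sketch and not the authors' actual pipeline. It assumes that "most important keywords" can be approximated by counting, for each shared URL, how many posts mention each non-stopword term. The function name `social_signatures`, the tiny stopword list, and the regular expressions are all hypothetical choices made for illustration; the abstract does not specify the scoring method.

```python
import re
from collections import Counter, defaultdict

# Assumption: links are plain http(s) URLs delimited by whitespace.
URL_RE = re.compile(r"https?://\S+")
# Assumption: a term is a lowercase word of at least two characters.
TOKEN_RE = re.compile(r"[a-z][a-z0-9']+")
# Illustrative stopword list only; a real system would use a fuller one.
STOPWORDS = {"the", "of", "and", "to", "in", "on", "for", "is", "this", "that", "it"}


def social_signatures(posts, top_k=5):
    """Map each shared URL to the top_k terms found in posts that link to it.

    Scoring is plain post frequency (each term counts once per post),
    which is an assumption of this sketch, not the paper's method.
    """
    term_counts = defaultdict(Counter)
    for post in posts:
        urls = URL_RE.findall(post)
        if not urls:
            continue  # posts without a link carry no signal for any page
        # Remove the links so URL fragments are not counted as keywords.
        text = URL_RE.sub(" ", post).lower()
        terms = {t for t in TOKEN_RE.findall(text) if t not in STOPWORDS}
        for url in set(urls):
            term_counts[url].update(terms)
    return {
        url: [term for term, _ in counts.most_common(top_k)]
        for url, counts in term_counts.items()
    }


if __name__ == "__main__":
    example_posts = [
        "Great analysis of the election results http://example.com/a",
        "election analysis worth reading http://example.com/a",
        "http://example.com/a the election explained",
        "cute cat video http://example.com/b",
    ]
    for url, keywords in social_signatures(example_posts, top_k=2).items():
        print(url, keywords)
```

Counting each term once per post keeps a single repetitive post from dominating the signature. As more posts share the same link, the counts reflect more of the crowd and less of any individual author.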
Index Terms
- The World Conversation: Web Page Metadata Generation From Social Sources