research-article

Are Some Tweets More Interesting Than Others? #HardQuestion

Authors:
Omar Alonso

Microsoft Corporation

Microsoft Corporation
View Profile

,
Catherine C. Marshall

Microsoft Corporation

Microsoft Corporation
View Profile

,
Marc Najork

Microsoft Corporation

Microsoft Corporation
View Profile

HCIR '13: Proceedings of the Symposium on Human-Computer Interaction and Information RetrievalOctober 2013Article No.: 2Pages 1–10https://doi.org/10.1145/2528394.2528396

Published:03 October 2013Publication History

HCIR '13: Proceedings of the Symposium on Human-Computer Interaction and Information Retrieval

Pages 1–10

ABSTRACT

Twitter has evolved into a significant communication nexus, coupling personal and highly contextual utterances with local news, memes, celebrity gossip, headlines, and other microblogging subgenres. If we take Twitter as a large and varied dynamic collection, how can we predict which tweets will be interesting to a broad audience in advance of lagging social indicators of interest such as retweets? The telegraphic form of tweets, coupled with the subjective notion of interestingness, makes it difficult for human judges to agree on which tweets are indeed interesting.

In this paper, we address two questions: Can we develop a reliable strategy that results in high-quality labels for a collection of tweets, and can we use this labeled collection to predict a tweet's interestingness? To answer the first question, we performed a series of studies using crowdsourcing to reach a diverse set of workers who served as a proxy for an audience with variable interests and perspectives. This method allowed us to explore different labeling strategies, including varying the judges, the labels they applied, the datasets, and other aspects of the task. To address the second question, we used crowdsourcing to assemble a set of tweets rated as interesting or not; we scored these tweets using textual and contextual features; and we used these scores as inputs to a binary classifier. We were able to achieve moderate agreement (κ = 0.52) between the best classifier and the human assessments, a figure which reflects the challenges of the judgment task.

References

Alonso, O., Carson, C., Gerster, D., Ji, X., and Nabar, S. U. Detecting Uninteresting Content in Text Streams. In CSE (July 2010), 39--42.Google Scholar
André, P., Bernstein, M. S., and Luther, K. Who gives a tweet?: evaluating microblog content value. In CSCW (2012), 471--474. Google ScholarDigital Library
Aroyo, L., and Welty, C. Harnessing disagreement in crowdsourcing a relation extraction gold standard. Tech. Rep. RC25371 (WAT1304-058), IBM Research, 2013.Google Scholar
Bowker, G. C., and Star, S. L. Sorting Things Out: Classification and Its Consequences. MIT Press, 1999. Google ScholarDigital Library
Boyd, D., Golder, S., and Lotan, G. Tweet, tweet, retweet: Conversational aspects of retweeting on twitter. In HICSS (2010), 1--10. Google ScholarDigital Library
Choudhary, A. N., Hendrix, W., Lee, K., Palsetia, D., and Liao, W.-K. Social media evolution of the Egyptian revolution. CACM 55, 5 (2012), 74--80. Google ScholarDigital Library
Colton, S., and Bundy, A. On the notion of interestingness in automated mathematical creativity. In AISB (1999), 82--91.Google Scholar
De Choudhury, M., Diakopoulos, N., and Naaman, M. Unfolding the event landscape on twitter: classification and exploration of user categories. In CSCW (2012), 241--244. Google ScholarDigital Library
Duan, Y., Jiang, L., Qin, T., Zhou, M., and Shum, H.-Y. An empirical study on learning to rank of tweets. In COLING (2010), 295--303. Google ScholarDigital Library
Erdelez, S. Information encountering: a conceptual framework for accidental information discovery. In ISIC (1997), 412--421. Google ScholarDigital Library
Fleiss, J. L. Measuring nominal scale agreement among many raters. Psychol.l Bull. 76, 5 (1971), 378--382.Google Scholar
Gaffney, D. #iranelection: quantifying online activism. In WebSci10 (2010).Google Scholar
Gayo-Avello, D. Nepotistic relationships in twitter and their impact on rank prestige algorithms. IPM (2013), 1250--1280.Google Scholar
Hughes, A. L., and Palen, L. Twitter adoption and use in mass convergence and emergency events. IJEM 6, 3/4 (2009), 248--260.Google ScholarCross Ref
Hurlock, J., and Wilson, M. L. Searching twitter: Separating the tweet from the chaff. In ICWSM (2011).Google Scholar
Krippendorff, K. Content Analysis, 2nd ed. Sage Publications, 2003.Google Scholar
Lempel, R., and Moran, S. Salsa: the stochastic approach for link-structure analysis. TOIS 19, 2 (2001), 131--160. Google ScholarDigital Library
Lewis, P. Reading the riots: Investigating England's summer of disorder. The Guardian, September 5, 2011.Google Scholar
Lin, T., Oren, Etzioni, and Fogarty, J. Identifying interesting assertions from the web. In CIKM (2009), 1787--1790. Google ScholarDigital Library
Marcus, A., Bernstein, M., Badar, O., Karger, D., Madden, S., and Miller, R. Twitinfo: aggregating and visualizing microblogs for event exploration. In CHI (2011), 227--236. Google ScholarDigital Library
Metzler, D., and Cai, C. USC/ISI at TREC 2011: Microblog track. In TREC (2011).Google Scholar
Momeni, E., Tao, K., Haslhofer, B., and Houben, G.-J. Identification of useful user comments in social media: A case study on Flickr commons. In JCDL (2013), 1--10. Google ScholarDigital Library
Morris, M. R., Counts, S., Roseway, A., Hoff, A., and Schwarz, J. Tweeting is believing?: understanding microblog credibility perceptions. In CSCW (2012), 441--450. Google ScholarDigital Library
Naaman, M., Becker, H., and Gravano, L. Hip and trendy: characterizing emerging trends on twitter. JASIST 62, 5 (2011), 902--918. Google ScholarDigital Library
Poblete, B., Garcia, R., Mendoza, M., and Jaimes, A. Do all birds tweet the same? characterizing twitter around the world. In CIKM (2011), 1025--1030. Google ScholarDigital Library
Robertson, S. E., Walker, S., Jones, S., Hancock-Beaulieu, M., and Gatford, M. Okapi at TREC-3. In TREC (1994).Google Scholar
Sakaki, T., Okazaki, M., and Matsuo, Y. Earthquake shakes twitter users: real-time event detection by social sensors. In WWW (2010), 851--860. Google ScholarDigital Library
Silvia, P. J. What is interesting? exploring the appraisal structure of interest. Emotion 5 (2005), 89--102.Google ScholarCross Ref
Strauss, A., and Corbin, J. Basics of Qualitative Research. Sage Publications, 1998.Google Scholar
Uysal, I., and Croft, W. B. User oriented tweet ranking: a filtering approach to microblogs. In CIKM (2011), 2261--2264. Google ScholarDigital Library
Yardi, S., Romero, D., Schoenebeck, G., and Boyd, D. Detecting spam in a twitter network. First Monday 15, 1 (2010).Google Scholar
Zhao, D., and Rosson, M. B. How and why people twitter: the role that micro-blogging plays in informal communication at work. In GROUP (2009), 243--252. Google ScholarDigital Library
Zubiaga, A., Spina, D., Fresno, V., and Martínez, R. Classifying trending topics: a typology of conversation triggers on twitter. In CIKM (2011), 2461--2464. Google ScholarDigital Library

Index Terms

Are Some Tweets More Interesting Than Others? #HardQuestion
1. Computing methodologies
  1. Artificial intelligence
    1. Philosophical/theoretical foundations of artificial intelligence
      1. Cognitive science
2. Human-centered computing
  1. Human computer interaction (HCI)

Recommendations

Inferring the Interesting Tweets in Your Network
CGC '13: Proceedings of the 2013 International Conference on Cloud and Green Computing

As the demand for quick, live and relevant information increases, more people look to microblogging sites, such as Twitter, as a source of content. Retweeting acts as a filter of useful information for users with more interesting information likely to ...
Read More
Analyzing and predicting viral tweets
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web

Twitter and other microblogging services have become indispensable sources of information in today's web. Understanding the main factors that make certain pieces of information spread quickly in these platforms can be decisive for the analysis of ...
Read More
Analysis of Tweets Related to Cyberbullying: Exploring Information Diffusion and Advice Available for Cyberbullying Victims

The use of Twitter, especially by teenagers and young people, has raised the issue of cyberbullying. There is a lack of research into what types of advice and support are available in tweets for cyberbullying victims, and into the features influencing ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

HCIR '13: Proceedings of the Symposium on Human-Computer Interaction and Information Retrieval
October 2013
52 pages
ISBN:9781450325707
DOI:10.1145/2528394

Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 3 October 2013
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Twitter
crowdsourcing
interestingness
label quality
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 28
  Total Citations
  View Citations
- 325
  Total Downloads
- Downloads (Last 12 months)10
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Are Some Tweets More Interesting Than Others? #HardQuestion

HCIR '13: Proceedings of the Symposium on Human-Computer Interaction and Information Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Inferring the Interesting Tweets in Your Network

Analyzing and predicting viral tweets

Analysis of Tweets Related to Cyberbullying: Exploring Information Diffusion and Advice Available for Cyberbullying Victims

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Are Some Tweets More Interesting Than Others? #HardQuestion

HCIR '13: Proceedings of the Symposium on Human-Computer Interaction and Information Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Inferring the Interesting Tweets in Your Network

Analyzing and predicting viral tweets

Analysis of Tweets Related to Cyberbullying: Exploring Information Diffusion and Advice Available for Cyberbullying Victims

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media