skip to main content
10.1145/3106426.3106541acmconferencesArticle/Chapter ViewAbstractPublication PageswiConference Proceedingsconference-collections
research-article

Twitter for marijuana infodemiology

Published: 23 August 2017 Publication History

Abstract

Today online social networks seem to be good tools to quickly monitor what is going on with the population, since they provide environments where users can freely share large amounts of information related to their own lives. Due to well known limitations of surveys, this novel kind of data can be used to get additional real time insights from people to understand their actual behavior related to drug use. The aim of this work is to make use of text messages (tweets) and relationships between Chilean Twitter users to predict marijuana use among them. To do this we collected Twitter accounts using a location-based criteria, and built a set of features based on tweets they made and ego centric network metrics. To get tweet-based features, tweets were filtered using marijuana-related keywords and a set of 1000 tweets were manually labeled to train algorithms capable of predicting marijuana use in tweets. In addition, a sentiment classifier of tweets was developed using the TASS corpus. Then, we made a survey to get real marijuana use labels related to accounts and these labels were used to train supervised machine learning algorithms. The marijuana use per user classifier had precision, recall and F-measure results close to 0.7, implying significant predictive power of the selected variables. We obtained a model capable of predicting marijuana use of Twitter users and estimating their opinion about marijuana. This information can be used as an efficient (fast and low cost) tool for marijuana surveillance, and support decision making about drug policies.

References

[1]
Mir M. Ali, Aliaksandr Amialchuk, and Debra S. Dwyer. 2011. The social contagion effect of marijuana use among adolescents. PLoS ONE 6 (2011), 1--6.
[2]
J. W. Ayers, B. M. Althouse, and M. Dredze. 2014. Could behavioral medicine lead the web data revolution? JAMA 311, 14 (2014), 1399--1400.
[3]
Jorge A Balazs and Juan D Velásquez. 2016. Opinion mining and information fusion: A survey. Information Fusion 27 (2016), 95--110.
[4]
Megan R. Brett. 2012. Topic modeling: a basic introduction. Journal of Digital Humanities 2, 1 (2012), 12--16.
[5]
E. P. Cabrera-Nguyen, P. Cavazos-Rehg, M. Krauss, L. J. Bierut, and M. A. Moreno. 2016. Young adults' exposure to alcohol- and marijuana-related content on Twitter. J Stud Alcohol Drugs 77, 2 (2016), 349--353.
[6]
P. A. Cavazos-Rehg, M. Krauss, S. L. Fisher, P. Salyer, R. A. Grucza, and L. J. Bierut. 2015. Twitter chatter about marijuana. J Adolesc Health 56, 2 (2015), 139--145.
[7]
P. A. Cavazos-Rehg, S. J. Sowles, M. J. Krauss, V. Agbonavbare, R. Grucza, and L. Bierut. 2016. A content analysis of tweets about high-potency marijuana. Drug Alcohol Depend 166 (2016), 100--108.
[8]
Michael Chary, Nicholas Genes, Andrew McKenzie, and Alex F Manini. 2013. Leveraging social networks for toxicovigilance. Journal of Medical Toxicology 9, 2 (2013), 184--191.
[9]
Stephanie H. Cook, José A. Bauermeister, Deborah Gordon-Messer, and Marc A. Zimmerman. 2013. Online network influences on emerging adults' alcohol and drug use. Journal of Youth and Adolescence 42, 11 (2013), 1674--1686.
[10]
Hongying Dai and Jianqiang Hao. 2017. Mining social media data for opinion polarities about electronic cigarettes. Tobacco Control 26, 2 (2017), 175--180. arXiv:http://tobaccocontrol.bmj.com/content/26/2/175.full.pdf
[11]
R. Daniulaityte, L. Chen, F. R. Lamy, R. G. Carlson, K. Thirunarayan, and A. Sheth. 2016. "When 'bad' is 'good'": identifying personal communication and sentiment in drug-related tweets. JMIR Public Health Surveill 2, 2 (2016), e162.
[12]
R. Daniulaityte, R. W. Nahhas, S. Wijeratne, R. G. Carlson, F. R. Lamy, S. S. Martins, E. W. Boyer, G. A. Smith, and A. Sheth. 2015. "Time for dabs": analyzing Twitter data on marijuana concentrates across the U.S. Drug Alcohol Depend 155 (2015), 307--311.
[13]
Luis Dujovne and Juan D. Velásquez. 2009. Design and implementation of a methodology for identifying website keyobjects. Knowledge-Based and Intelligent Information and Engineering Systems (2009), 301--308.
[14]
Susan T Ennett, Karl E Bauman, Andrea Hussong, Robert Faris, Vangie A Foshee, Li Cai, and Robert H DuRant. 2006. The peer context of adolescent substance use: Findings from social network analysis. Journal of Research on Adolescence 16, 2 (2006), 159--186.
[15]
G. Eysenbach. 2011. Infodemiology and infoveillance tracking online health information and cyberbehavior for public health. Am J Prev Med 40, 5 Suppl 2 (2011), S154--158.
[16]
A. Gupta and P. Anand. 2015. Focused web crawlers and its approaches. In 2015 International Conference on Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE). 619--622.
[17]
Kimberly Kobus and David B. Henry. 2009. Interplay of network position and peer substance use in early adolescent cigarette, alcohol, and marijuana use. The Journal of Early Adolescence (2009).
[18]
Haewoon Kwak, Changhyun Lee, Hosung Park, and Sue Moon. 2010. What is Twitter, a social network or a news media. In Proceedings of the 19th International Conference on World Wide Web. USA, 591--600.
[19]
F. R. Lamy, R. Daniulaityte, A. Sheth, R. W. Nahhas, S. S. Martins, E. W. Boyer, and R. G. Carlson. 2016. "Those edibles hit hard": exploration of Twitter data on cannabis edibles in the U.S. Drug Alcohol Depend 164 (2016), 64--70.
[20]
Dong Nguyen, Rilana Gravel, Dolf Trieschnigg, and Theo Meder. 2013. "How old do you think I am?" A study of language and age in Twitter. (2013). https://www.aaai.org/ocs/index.php/ICWSM/ICWSM13/paper/view/5984
[21]
Michael J. Paul and Mark Dredze. 2011. You are what you tweet: analyzing Twitter for public health. ICWSM 20 (2011), 265--272.
[22]
SENDA. 2015. Eleventh National Study of Drugs in National Population. SENDA, Santiago.
[23]
Lukas Shutler, Lewis S. Nelson, Ian Portelli, Courtney Blachford, and Jeanmarie Perrone. 2015. Drug use in the Twittersphere: a qualitative contextual analysis of tweets about prescription drugs. Journal of Addictive Diseases 34, 4 (2015), 303--310. 26364675.
[24]
Gino Slanzi, Gaspar Pizarro, and Juan D Velásquez. 2017. Biometric information fusion for web user navigation and preferences analysis: an overview. Information Fusion 38 (2017), 12--21.
[25]
Sarah A. Stoddard, Jose A. Bauermeister, Deborah Gordon-Messer, Michelle Johns, and Marc A. Zimmerman. 2012. Permissive norms and young adults' alcohol and marijuana use: The role of online communities. Journal of Studies on Alcohol and Drugs 73, 6 (2012), 968--975.
[26]
Q. Tian, J. Lagisetty, and B. Li. 2016. Finding needles of interested tweets in the haystack of Twitter network. In 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). 447--452.
[27]
Julio Villena-Román, Sara Lana-Serrano, Eugenio Martínez-Cámara, and José González-Cristóbal. 2013. TASS - workshop on sentiment analysis at SEPLN. Procesamiento del Lenguaje Natural 50, 0 (2013), 37--44.

Cited By

View all
  • (2023)Understanding and preventing the advertisement and sale of illicit drugs to young people through social media: A multidisciplinary scoping reviewDrug and Alcohol Review10.1111/dar.1371643:1(56-74)Online publication date: 31-Jul-2023
  • (2021)Systematic literature review of sentiment analysis in the Spanish languageData Technologies and Applications10.1108/DTA-09-2020-020055:4(461-479)Online publication date: 16-Feb-2021
  • (2020)Knowledge Extraction from Twitter Towards Infectious Diseases in SpanishTechnologies and Innovation10.1007/978-3-030-62015-8_4(43-57)Online publication date: 28-Oct-2020

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WI '17: Proceedings of the International Conference on Web Intelligence
August 2017
1284 pages
ISBN:9781450349512
DOI:10.1145/3106426
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. marijuana
  2. opinion mining
  3. social network analysis
  4. text mining
  5. web content mining

Qualifiers

  • Research-article

Conference

WI '17
Sponsor:

Acceptance Rates

WI '17 Paper Acceptance Rate 118 of 178 submissions, 66%;
Overall Acceptance Rate 118 of 178 submissions, 66%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Understanding and preventing the advertisement and sale of illicit drugs to young people through social media: A multidisciplinary scoping reviewDrug and Alcohol Review10.1111/dar.1371643:1(56-74)Online publication date: 31-Jul-2023
  • (2021)Systematic literature review of sentiment analysis in the Spanish languageData Technologies and Applications10.1108/DTA-09-2020-020055:4(461-479)Online publication date: 16-Feb-2021
  • (2020)Knowledge Extraction from Twitter Towards Infectious Diseases in SpanishTechnologies and Innovation10.1007/978-3-030-62015-8_4(43-57)Online publication date: 28-Oct-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media