skip to main content
10.1145/3648188.3675139acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
research-article

Unveiling Coyote Ads: Detecting Human Smuggling Advertisements on Social Media

Published: 10 September 2024 Publication History

Abstract

Human smuggling is a grave social issue that carries several negative ramifications for many countries across the globe. Human smugglers operating on the United States-Mexico border, known as ‘coyotes’, are increasingly relying on online social media to advertise their services, so automatically detecting such advertisements related to this illicit activity is crucial for devising effective countermeasures. However, identifying human-smuggling advertisements is not straightforward as they are often disguised as innocent travel or tourism-related adverts in regional languages. Moreover, there are no readily available datasets that can train models to recognize such adverts automatically. This paper addresses both these issues as follows. First, we built a novel dataset comprising both coyote ads and legitimate travel/tourism adverts by leveraging different sources, including the Web, NGOs, and regional connections in countries with high human-smuggling activities. Specifically, our dataset contained an equal number (1000 images plus 500 videos) of coyote ads and legitimate travel ads, all of which contained embedded textual information. Using this dataset, we trained and assessed several state-of-the-art baseline models, including GPT-4, Gemini, and CLIP. For the image ads, the highest F1 score achieved by a model was 0.92, and for the video ads, the highest achieved F1 score was 0.86. We also conducted an in-depth analysis of the model performances to gain comprehensive insights into their respective strengths and weaknesses.

References

[1]
Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, 2023. Gpt-4 technical report. arXiv preprint arXiv:2303.08774 (2023).
[2]
Luigi Achilli. 2018. The human smuggling industry: Nuances and complexities. (2018).
[3]
J Ainsley. 2022. Migrant border crossings in Fiscal Year 2022 topped 2.76 million, breaking previous record. NBCNews. com. https://www. nbcnews. com/politics/immigration/migrant-border-crossings-fiscal-year-2022-topped-276-million-breaking-rcna53517 (2022).
[4]
Georgios A Antonopoulos, Gabriele Baratto, Andrea Di Nicola, Parisa Diba, Elisa Martini, Georgios Papanicolaou, and Fiamma Terenghi. 2020. Technology in human smuggling and trafficking: case studies from Italy and the United Kingdom. Springer Nature.
[5]
Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lučić, and Cordelia Schmid. 2021. Vivit: A video vision transformer. In Proceedings of the IEEE/CVF international conference on computer vision. 6836–6846.
[6]
Karen Aroian. 1993. Mental health risks and problems encountered by illegal immigrants. Issues in Mental Health Nursing 14, 4 (1993), 379–397.
[7]
Gedas Bertasius, Heng Wang, and Lorenzo Torresani. 2021. Is space-time attention all you need for video understanding?. In ICML, Vol. 2. 4.
[8]
Victoria Betton, Rohan Borschmann, Mary Docherty, Stephen Coleman, Mark Brown, and Claire Henderson. 2015. The role of social media in reducing stigma and discrimination. The British Journal of Psychiatry 206, 6 (2015), 443–444.
[9]
Samantha Bradshaw and Philip N Howard. 2018. The global organization of social media disinformation campaigns. Journal of International Affairs 71, 1.5 (2018), 23–32.
[10]
Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, 2020. Language models are few-shot learners. Advances in neural information processing systems 33 (2020), 1877–1901.
[11]
CBP. 2021. ’Coyote’ Found Guilty of Smuggling. https://www.cbp.gov/coyote-criminal.
[12]
CBP. 2022. CBP Launches Digital Ad Campaign “Say No to the Coyote” to Warn Migrants About Smuggler Lies. https://www.cbp.gov/newsroom/national-media-release/cbp-launches-digital-ad-campaign-say-no-coyote-warn-migrants-about.
[13]
CBP. 2022. CBP Releases November 2022 Monthly Operational Update. https://www.cbp.gov/newsroom/national-media-release/cbp-releases-november-2022-monthly-operational-update.
[14]
CBP. 2022. Coyote Criminal. https://www.cbp.gov/coyote-criminal.
[15]
CBP. 2023. CBP Enforcement Statistics Fiscal Year 2023. https://www.cbp.gov/newsroom/stats/cbp-enforcement-statistics.
[16]
CBP. 2023. Southwest Land Border Encounters. https://www.cbp.gov/newsroom/stats/southwest-land-border-encounters.
[17]
Claudia Cervantes-Soon. 2014. The US-Mexico border-crossing Chicana researcher: Theory in the flesh and the politics of identity in critical ethnography. Journal of Latino/Latin American Studies 6, 2 (2014), 97–112.
[18]
Justin Akers Chacón. 2021. The border crossed US: The case for opening the US-Mexico border. Haymarket Books.
[19]
Lesley Chiou and Catherine Tucker. 2018. Fake news and advertising on social media: A study of the anti-vaccination movement. Technical Report. National Bureau of Economic Research.
[20]
Barry R Chiswick. 1988. Illegal immigration and immigration control. Journal of Economic Perspectives 2, 3 (1988), 101–115.
[21]
Wayne A Cornelius. 2001. Death at the border: Efficacy and unintended consequences of US immigration control policy. Population and development review 27, 4 (2001), 661–685.
[22]
CRIME and JUSTICE NEWS. 2019. How ‘Coyotes’ Smuggle Migrants Across U.S. Border. https://thecrimereport.org/2019/03/07/how-coyotes-smuggle-migrants-across-u-s-border/.
[23]
Lawrence A DeLuca, Marylyn M McEwen, and Samuel M Keim. 2010. United States–Mexico border crossing: Experiences and risk perceptions of undocumented male immigrants. Journal of Immigrant and Minority Health 12 (2010), 113–123.
[24]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR abs/1810.04805 (2018). arxiv:1810.04805http://arxiv.org/abs/1810.04805
[25]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).
[26]
Tim Draws, Alisa Rieger, Oana Inel, Ujwal Gadiraju, and Nava Tintarev. 2021. A checklist to combat cognitive biases in crowdsourcing. In Proceedings of the AAAI conference on human computation and crowdsourcing, Vol. 9. 48–59.
[27]
Franck Düvell. 2005. Illegal immigration in Europe. Springer.
[28]
Thomas J Espenshade. 1995. Using INS border apprehension data to measure the flow of undocumented migrants crossing the US-Mexico frontier. International Migration Review 29, 2 (1995), 545–565.
[29]
Ysabel Gerrard. 2018. Beyond the hashtag: Circumventing content moderation on social media. New Media & Society 20, 12 (2018), 4492–4511.
[30]
John Gramlich. 2023. Monthly encounters with migrants at U.S.-Mexico border remain near record highs. https://www.pewresearch.org/short-reads/2023/01/13/monthly-encounters-with-migrants-at-u-s-mexico-border-remain-near-record-highs/.
[31]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
[32]
Jonathan Herzig, Paweł Krzysztof Nowak, Thomas Müller, Francesco Piccinno, and Julian Martin Eisenschlos. 2020. TaPas: Weakly supervised table parsing via pre-training. arXiv preprint arXiv:2004.02349 (2020).
[33]
Josiah McC Heyman. 2001. Class and classification at the US-Mexico border. Human organization 60, 2 (2001), 128–140.
[34]
Seth M Holmes. 2013. “Is it worth risking your life?”: Ethnography, risk and death on the US–Mexico border. Social Science & Medicine 99 (2013), 153–161.
[35]
Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, and William El Sayed. 2023. Mistral 7B. arxiv:2310.06825 [cs.CL]
[36]
Satwik Ram Kodandaram. 2021. Improving the Performance of Neural Networks. (2021).
[37]
Satwik Ram Kodandaram, Mohan Sunkara, Sampath Jayarathna, and Vikas Ashok. 2023. Detecting Deceptive Dark-Pattern Web Advertisements for Blind Screen-Reader Users. Journal of Imaging 9, 11 (2023), 239.
[38]
David Kyle and Rey Koslowski. 2011. Global human smuggling: Comparative perspectives. JHU Press.
[39]
David Kyle and Zai Liang. 2001. Migration merchants: human smuggling from Ecuador and China. (2001).
[40]
Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, and Kai-Wei Chang. 2019. Visualbert: A simple and performant baseline for vision and language. arXiv preprint arXiv:1908.03557 (2019).
[41]
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision. 10012–10022.
[42]
Eileen M Luna-Firebaugh. 2002. The border crossed us: Border crossing issues of the indigenous peoples of the Americas. Wicazo Sa Review 17, 1 (2002), 159–181.
[43]
Douglas S Massey and Chiara Capoferro. 2004. Measuring undocumented migration. International Migration Review 38, 3 (2004), 1075–1102.
[44]
Douglas S Massey, Jorge Durand, and Karen A Pren. 2014. Explaining undocumented migration to the US. International Migration Review 48, 4 (2014), 1028–1061.
[45]
DeAnne K Hilfinger Messias, Marylyn Morris McEwen, and Joyceen S Boyle. 2015. Undocumented migration in the United States: An overview of historical and current policy contexts. Nursing outlook 63, 1 (2015), 60–67.
[46]
Ana Raquel Minian. 2018. Undocumented lives: the untold story of Mexican migration. Harvard University Press.
[47]
Federico Monti, Fabrizio Frasca, Davide Eynard, Damon Mannion, and Michael M Bronstein. 2019. Fake news detection on social media using geometric deep learning. arXiv preprint arXiv:1902.06673 (2019).
[48]
Zewdie Mossie and Jenq-Haur Wang. 2020. Vulnerable community identification using hate speech detection on social media. Information Processing & Management 57, 3 (2020), 102087.
[49]
Marzieh Mozafari, Reza Farahbakhsh, and Noel Crespi. 2020. A BERT-based transfer learning approach for hate speech detection in online social media. In Complex Networks and Their Applications VIII: Volume 1 Proceedings of the Eighth International Conference on Complex Networks and Their Applications COMPLEX NETWORKS 2019 8. Springer, 928–940.
[50]
Nanlir Sallau Mullah and Wan Mohd Nazmee Wan Zainon. 2021. Advances in machine learning algorithms for hate speech detection in social media: a review. IEEE Access 9 (2021), 88364–88376.
[51]
Sarah Myers West. 2018. Censored, suspended, shadowbanned: User interpretations of content moderation on social media platforms. New Media & Society 20, 11 (2018), 4366–4383.
[52]
Chandan Nandy. 2005. Illegal Immigration from Bangladesh to India: The Emerging Conflicts. (2005).
[53]
Nilufer Narli. 2006. Human trafficking and smuggling: the process, the actors and the victim profile. na.
[54]
Pia M Orrenius 2001. Illegal immigration and enforcement along the US-Mexico border: An overview. Economic and Financial Review-Federal Reserve Bank of Dallas1 (2001), 2–11.
[55]
Tony Payan. 2016. The Three US-Mexico Border Wars: Drugs, Immigration, and Homeland Security: Drugs, Immigration, and Homeland Security. ABC-CLIO.
[56]
Malford T Pillow, Laura Hopson, Michael Bond, Daniel Cabrera, Leigh Patterson, David Pearson, Harsh Sule, Felix Ankel, Madonna Fernández-Frackelton, Ronald V Hall, 2014. Social media guidelines and best practices: recommendations from the Council of Residency Directors Social Media Task Force. Western Journal of Emergency Medicine 15, 1 (2014), 26.
[57]
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning. PMLR, 8748–8763.
[58]
Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, Ilya Sutskever, 2019. Language models are unsupervised multitask learners. OpenAI blog 1, 8 (2019), 9.
[59]
Monalika Padma Reddy and A Deeksha. 2021. Improving the accuracy of Neural Networks through Ensemble Techniques. (2021).
[60]
Machel Reid, Nikolay Savinov, Denis Teplyashin, Dmitry Lepikhin, Timothy Lillicrap, Jean-baptiste Alayrac, Radu Soricut, Angeliki Lazaridou, Orhan Firat, Julian Schrittwieser, 2024. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context. arXiv preprint arXiv:2403.05530 (2024).
[61]
Kunal Relia, Zhengyi Li, Stephanie H Cook, and Rumi Chunara. 2019. Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 US cities. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 13. 417–427.
[62]
J Roberto, Wen-Yee Lee, and Sandra I Campos-Díaz. 2009. Soil-borne polycyclic aromatic hydrocarbons in El Paso, Texas: analysis of a potential problem in the United States/Mexico border region. Journal of Hazardous Materials 163, 2-3 (2009), 946–958.
[63]
Fernando Romero. 2008. Hyperborder: the contemporary US? Mexico border and it’s future. Princeton Architectural Press.
[64]
Somya Ranjan Sahoo and Brij B Gupta. 2021. Multiple features based approach for automatic fake news detection on social networks using deep learning. Applied Soft Computing 100 (2021), 106983.
[65]
Gabriella Sanchez. 2014. Human smuggling and border crossings. Routledge.
[66]
Gabriella E Sanchez. 2017. ‘This Time I Am Going to Cross!’: Fighting Entrapment Processes Through the Provision of Human Smuggling Services on the US–Mexico Border. Springer.
[67]
Mohit Singhal, Chen Ling, Pujan Paudel, Poojitha Thota, Nihal Kumarswamy, Gianluca Stringhini, and Shirin Nilizadeh. 2023. SoK: Content moderation in social media, from guidelines to enforcement, and research to practice. In 2023 IEEE 8th European Symposium on Security and Privacy (EuroS&P). IEEE, 868–895.
[68]
Jeremy Slack and Howard Campbell. 2016. On Narco-coyotaje: Illicit Regimes and Their Impacts on the US–Mexico Border. Antipode 48, 5 (2016), 1380–1399.
[69]
Jeremy Slack and Daniel E Martínez. 2018. What makes a good human smuggler? The differences between satisfaction with and recommendation of coyotes on the US-Mexico border. The Annals of the American Academy of Political and Social Science 676, 1 (2018), 152–173.
[70]
Ray Smith. 2007. An overview of the Tesseract OCR engine. In Ninth international conference on document analysis and recognition (ICDAR 2007), Vol. 2. IEEE, 629–633.
[71]
Kelly Stahl. 2018. Fake news detection in social media. California State University Stanislaus 6 (2018), 4–15.
[72]
Victor Suarez-Lledo and Javier Alvarez-Galvez. 2021. Prevalence of health misinformation on social media: systematic review. Journal of medical Internet research 23, 1 (2021), e17187.
[73]
Eugenio Tacchini, Gabriele Ballarin, Marco L Della Vedova, Stefano Moret, and Luca De Alfaro. 2017. Some like it hoax: Automated fake news detection in social networks. arXiv preprint arXiv:1704.07506 (2017).
[74]
Gemini Team, Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M Dai, Anja Hauth, 2023. Gemini: a family of highly capable multimodal models. arXiv preprint arXiv:2312.11805 (2023).
[75]
Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, 2024. Gemma: Open models based on gemini research and technology. arXiv preprint arXiv:2403.08295 (2024).
[76]
techtransparencyproject. 2022. Facebook Marketplace, WhatsApp Storefronts, and TikTok Videos: How Coyotes Get Creative. https://www.techtransparencyproject.org/articles/facebook-marketplace-whatsapp-storefronts-and-tiktok-videos-how-coyotes-get-creative.
[77]
Zhan Tong, Yibing Song, Jue Wang, and Limin Wang. 2022. Videomae: Masked autoencoders are data-efficient learners for self-supervised video pre-training. Advances in neural information processing systems 35 (2022), 10078–10093.
[78]
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, 2023. Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288 (2023).
[79]
Pablo Vila. 2003. Processes of identification on the US-Mexico border. The Social Science Journal 40, 4 (2003), 607–625.
[80]
Hilde AM Voorveld, Guda Van Noort, Daniël G Muntinga, and Fred Bronner. 2018. Engagement with social media and social media advertising: The differentiating role of platform type. Journal of advertising 47, 1 (2018), 38–54.
[81]
Shuo Yang, Kai Shu, Suhang Wang, Renjie Gu, Fan Wu, and Huan Liu. 2019. Unsupervised fake news detection on social media: A generative approach. In Proceedings of the AAAI conference on artificial intelligence, Vol. 33. 5644–5651.
[82]
Chisato Yoshida. 1993. The global welfare of illegal immigration: a note. Indian Economic Review (1993), 111–115.
[83]
Sheldon X Zhang. 2007. Smuggling and trafficking in human beings. All roads lead to (2007).

Index Terms

  1. Unveiling Coyote Ads: Detecting Human Smuggling Advertisements on Social Media

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      HT '24: Proceedings of the 35th ACM Conference on Hypertext and Social Media
      September 2024
      415 pages
      ISBN:9798400705953
      DOI:10.1145/3648188
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 10 September 2024

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Coyote Adverts
      2. Dataset
      3. Human Smuggling
      4. Image Ads
      5. Video Ads

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      • Commonwealth Cyber Initiative

      Conference

      HT '24
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 378 of 1,158 submissions, 33%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 122
        Total Downloads
      • Downloads (Last 12 months)122
      • Downloads (Last 6 weeks)10
      Reflects downloads up to 17 Feb 2025

      Other Metrics

      Citations

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media