Clickbait Detection with Style-Aware Title Modeling and Co-attention

Wu, Chuhan; Wu, Fangzhao; Qi, Tao; Huang, Yongfeng

doi:10.1007/978-3-030-63031-7_31

Chuhan Wu¹⁴,
Fangzhao Wu¹⁵,
Tao Qi¹⁴ &
…
Yongfeng Huang¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12522))

Included in the following conference series:

China National Conference on Chinese Computational Linguistics

928 Accesses
2 Citations

Abstract

Clickbait is a form of web content designed to attract attention and entice users to click on specific hyperlinks. The detection of clickbaits is an important task for online platforms to improve the quality of web content and the satisfaction of users. Clickbait detection is typically formed as a binary classification task based on the title and body of a webpage, and existing methods are mainly based on the content of title and the relevance between title and body. However, these methods ignore the stylistic patterns of titles, which can provide important clues on identifying clickbaits. In addition, they do not consider the interactions between the contexts within title and body, which are very important for measuring their relevance for clickbait detection. In this paper, we propose a clickbait detection approach with style-aware title modeling and co-attention. Specifically, we use Transformers to learn content representations of title and body, and respectively compute two content-based clickbait scores for title and body based on their representations. In addition, we propose to use a character-level Transformer to learn a style-aware title representation by capturing the stylistic patterns of title, and we compute a title stylistic score based on this representation. Besides, we propose to use a co-attention network to model the relatedness between the contexts within title and body, and further enhance their representations by encoding the interaction information. We compute a title-body matching score based on the representations of title and body enhanced by their interactions. The final clickbait score is predicted by a weighted summation of the aforementioned four kinds of scores. Extensive experiments on two benchmark datasets show that our approach can effectively improve the performance of clickbait detection and consistently outperform many baseline methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://www.clickbait-challenge.org/.
2.
http://www.fakenewschallenge.org/.
3.
Most results of baselines are taken from [9], except the result of Siamese Net on the Clickbait Challenge dataset since it is quite unsatisfactory. We report the results using our implementation instead.

References

Agrawal, A.: Clickbait detection using deep learning. In: 2016 2nd International Conference on Next Generation Computing Technologies (NGCT), pp. 268–272. IEEE (2016)
Google Scholar
Anand, A., Chakraborty, T., Park, N.: We used neural networks to detect clickbaits: you won’t believe what happened next! In: Jose, J.M., et al. (eds.) ECIR 2017. LNCS, vol. 10193, pp. 541–547. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-56608-5_46
Biyani, P., Tsioutsiouliklis, K., Blackmer, J.: “8 amazing secrets for getting more clicks”: Detecting clickbaits in news streams using article informality. In: AAAI (2016)
Google Scholar
Bourgonje, P., Schneider, J.M., Rehm, G.: From clickbait to fake news detection: an approach based on detecting the stance of headlines to articles. In: Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, pp. 84–89 (2017)
Google Scholar
Cao, X., Le, T., et al.: Machine learning based detection of clickbait posts in social media. arXiv preprint arXiv:1710.01977 (2017)
Chakraborty, A., Paranjape, B., Kakarla, S., Ganguly, N.: Stop clickbait: detecting and preventing clickbaits in online news media. In: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 9–16. IEEE (2016)
Google Scholar
Chen, Y., Conroy, N.J., Rubin, V.L.: Misleading online content: recognizing clickbait as “false news”. In: Proceedings of the 2015 ACM on Workshop on Multimodal Deception Detection, pp. 15–19 (2015)
Google Scholar
Dimpas, P.K., Po, R.V., Sabellano, M.J.: Filipino and english clickbait detection using a long short term memory recurrent neural network. In: IALP, pp. 276–280. IEEE (2017)
Google Scholar
Dong, M., Yao, L., Wang, X., Benatallah, B., Huang, C.: Similarity-aware deep attentive model for clickbait detection. In: Yang, Q., Zhou, Z.-H., Gong, Z., Zhang, M.-L., Huang, S.-J. (eds.) PAKDD 2019. LNCS (LNAI), vol. 11440, pp. 56–69. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-16145-3_5
Chapter Google Scholar
Fu, J., Liang, L., Zhou, X., Zheng, J.: A convolutional neural network for clickbait detection. In: 2017 4th International Conference on Information Science and Control Engineering (ICISCE), pp. 6–10. IEEE (2017)
Google Scholar
Geçkil, A., Müngen, A.A., Gündogan, E., Kaya, M.: A clickbait detection method on news sites. In: 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 932–937. IEEE (2018)
Google Scholar
Glenski, M., Ayton, E., Arendt, D., Volkova, S.: Fishing for clickbaits in social images and texts with linguistically-infused neural network models. arXiv preprint arXiv:1710.06390 (2017)
Huang, P.S., He, X., Gao, J., Deng, L., Acero, A., Heck, L.: Learning deep structured semantic models for web search using clickthrough data. In: CIKM, pp. 2333–2338 (2013)
Google Scholar
Indurthi, V., Oota, S.R.: Clickbait detection using word embeddings. arXiv preprint arXiv:1710.02861 (2017)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Kumar, V., Khattar, D., Gairola, S., Kumar Lal, Y., Varma, V.: Identifying clickbait: a multi-strategy approach using neural networks. In: SIGIR, pp. 1225–1228 (2018)
Google Scholar
Okura, S., Tagami, Y., Ono, S., Tajima, A.: Embedding-based news recommendation for millions of users. In: KDD, pp. 1933–1942 (2017)
Google Scholar
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: EMNLP, pp. 1532–1543 (2014)
Google Scholar
Potthast, M., Köpsel, S., Stein, B., Hagen, M.: Clickbait detection. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 810–817. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_72
Chapter Google Scholar
Rendle, S., Krichene, W., Zhang, L., Anderson, J.: Neural collaborative filtering vs. matrix factorization revisited. arXiv preprint arXiv:2005.09683 (2020)
Shen, Y., He, X., Gao, J., Deng, L., Mesnil, G.: A latent semantic model with convolutional-pooling structure for information retrieval. In: CIKM, pp. 101–110 (2014)
Google Scholar
Srivastava, N., Hinton, G.E., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. JMLR 15(1), 1929–1958 (2014)
MathSciNet MATH Google Scholar
Thomas, P.: Clickbait identification using neural networks. arXiv preprint arXiv:1710.08721 (2017)
Vaswani, A., et al.: Attention is all you need. In: NIPS, pp. 5998–6008 (2017)
Google Scholar
Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E.: Hierarchical attention networks for document classification. In: NAACL-HLT, pp. 1480–1489 (2016)
Google Scholar
Zheng, H.T., Chen, J.Y., Yao, X., Sangaiah, A.K., Jiang, Y., Zhao, C.Z.: Clickbait convolutional neural network. Symmetry 10(5), 138 (2018)
Article Google Scholar
Zhou, Y.: Clickbait detection in tweets using self-attentive network. arXiv preprint arXiv:1710.05364 (2017)

Download references

Acknowledgements

Supported by the National Key Research and Development Program of China under Grant No. 2018YFC1604002, the National Natural Science Foundation of China under Grant Nos. U1936208, U1936216, U1836204 and U1705261.

Author information

Authors and Affiliations

Department of Electronic Engineering & BNRist, Tsinghua University, Beijing, 100084, China
Chuhan Wu, Tao Qi & Yongfeng Huang
Microsoft Research Asia, Beijing, 100080, China
Fangzhao Wu

Authors

Chuhan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Fangzhao Wu
View author publications
You can also search for this author in PubMed Google Scholar
Tao Qi
View author publications
You can also search for this author in PubMed Google Scholar
Yongfeng Huang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chuhan Wu .

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Maosong Sun
Peking University, Beijing, China
Sujian Li
Westlake University, Hangzhou, China
Yue Zhang
Tsinghua University, Beijing, China
Yang Liu
Chinese Academy of Sciences, Beijing, China
Shizhu He
Beijing Language and Culture University, Beijing, China
Gaoqi Rao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wu, C., Wu, F., Qi, T., Huang, Y. (2020). Clickbait Detection with Style-Aware Title Modeling and Co-attention. In: Sun, M., Li, S., Zhang, Y., Liu, Y., He, S., Rao, G. (eds) Chinese Computational Linguistics. CCL 2020. Lecture Notes in Computer Science(), vol 12522. Springer, Cham. https://doi.org/10.1007/978-3-030-63031-7_31

Download citation

DOI: https://doi.org/10.1007/978-3-030-63031-7_31
Published: 12 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63030-0
Online ISBN: 978-3-030-63031-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics