Abstract
Clickbait is a form of web content designed to attract attention and entice users to click on specific hyperlinks. The detection of clickbaits is an important task for online platforms to improve the quality of web content and the satisfaction of users. Clickbait detection is typically formed as a binary classification task based on the title and body of a webpage, and existing methods are mainly based on the content of title and the relevance between title and body. However, these methods ignore the stylistic patterns of titles, which can provide important clues on identifying clickbaits. In addition, they do not consider the interactions between the contexts within title and body, which are very important for measuring their relevance for clickbait detection. In this paper, we propose a clickbait detection approach with style-aware title modeling and co-attention. Specifically, we use Transformers to learn content representations of title and body, and respectively compute two content-based clickbait scores for title and body based on their representations. In addition, we propose to use a character-level Transformer to learn a style-aware title representation by capturing the stylistic patterns of title, and we compute a title stylistic score based on this representation. Besides, we propose to use a co-attention network to model the relatedness between the contexts within title and body, and further enhance their representations by encoding the interaction information. We compute a title-body matching score based on the representations of title and body enhanced by their interactions. The final clickbait score is predicted by a weighted summation of the aforementioned four kinds of scores. Extensive experiments on two benchmark datasets show that our approach can effectively improve the performance of clickbait detection and consistently outperform many baseline methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
Most results of baselines are taken from [9], except the result of Siamese Net on the Clickbait Challenge dataset since it is quite unsatisfactory. We report the results using our implementation instead.
References
Agrawal, A.: Clickbait detection using deep learning. In: 2016 2nd International Conference on Next Generation Computing Technologies (NGCT), pp. 268–272. IEEE (2016)
Anand, A., Chakraborty, T., Park, N.: We used neural networks to detect clickbaits: you won’t believe what happened next! In: Jose, J.M., et al. (eds.) ECIR 2017. LNCS, vol. 10193, pp. 541–547. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-56608-5_46
Biyani, P., Tsioutsiouliklis, K., Blackmer, J.: “8 amazing secrets for getting more clicks”: Detecting clickbaits in news streams using article informality. In: AAAI (2016)
Bourgonje, P., Schneider, J.M., Rehm, G.: From clickbait to fake news detection: an approach based on detecting the stance of headlines to articles. In: Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism, pp. 84–89 (2017)
Cao, X., Le, T., et al.: Machine learning based detection of clickbait posts in social media. arXiv preprint arXiv:1710.01977 (2017)
Chakraborty, A., Paranjape, B., Kakarla, S., Ganguly, N.: Stop clickbait: detecting and preventing clickbaits in online news media. In: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 9–16. IEEE (2016)
Chen, Y., Conroy, N.J., Rubin, V.L.: Misleading online content: recognizing clickbait as “false news”. In: Proceedings of the 2015 ACM on Workshop on Multimodal Deception Detection, pp. 15–19 (2015)
Dimpas, P.K., Po, R.V., Sabellano, M.J.: Filipino and english clickbait detection using a long short term memory recurrent neural network. In: IALP, pp. 276–280. IEEE (2017)
Dong, M., Yao, L., Wang, X., Benatallah, B., Huang, C.: Similarity-aware deep attentive model for clickbait detection. In: Yang, Q., Zhou, Z.-H., Gong, Z., Zhang, M.-L., Huang, S.-J. (eds.) PAKDD 2019. LNCS (LNAI), vol. 11440, pp. 56–69. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-16145-3_5
Fu, J., Liang, L., Zhou, X., Zheng, J.: A convolutional neural network for clickbait detection. In: 2017 4th International Conference on Information Science and Control Engineering (ICISCE), pp. 6–10. IEEE (2017)
Geçkil, A., Müngen, A.A., Gündogan, E., Kaya, M.: A clickbait detection method on news sites. In: 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 932–937. IEEE (2018)
Glenski, M., Ayton, E., Arendt, D., Volkova, S.: Fishing for clickbaits in social images and texts with linguistically-infused neural network models. arXiv preprint arXiv:1710.06390 (2017)
Huang, P.S., He, X., Gao, J., Deng, L., Acero, A., Heck, L.: Learning deep structured semantic models for web search using clickthrough data. In: CIKM, pp. 2333–2338 (2013)
Indurthi, V., Oota, S.R.: Clickbait detection using word embeddings. arXiv preprint arXiv:1710.02861 (2017)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Kumar, V., Khattar, D., Gairola, S., Kumar Lal, Y., Varma, V.: Identifying clickbait: a multi-strategy approach using neural networks. In: SIGIR, pp. 1225–1228 (2018)
Okura, S., Tagami, Y., Ono, S., Tajima, A.: Embedding-based news recommendation for millions of users. In: KDD, pp. 1933–1942 (2017)
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: EMNLP, pp. 1532–1543 (2014)
Potthast, M., Köpsel, S., Stein, B., Hagen, M.: Clickbait detection. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 810–817. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_72
Rendle, S., Krichene, W., Zhang, L., Anderson, J.: Neural collaborative filtering vs. matrix factorization revisited. arXiv preprint arXiv:2005.09683 (2020)
Shen, Y., He, X., Gao, J., Deng, L., Mesnil, G.: A latent semantic model with convolutional-pooling structure for information retrieval. In: CIKM, pp. 101–110 (2014)
Srivastava, N., Hinton, G.E., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. JMLR 15(1), 1929–1958 (2014)
Thomas, P.: Clickbait identification using neural networks. arXiv preprint arXiv:1710.08721 (2017)
Vaswani, A., et al.: Attention is all you need. In: NIPS, pp. 5998–6008 (2017)
Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E.: Hierarchical attention networks for document classification. In: NAACL-HLT, pp. 1480–1489 (2016)
Zheng, H.T., Chen, J.Y., Yao, X., Sangaiah, A.K., Jiang, Y., Zhao, C.Z.: Clickbait convolutional neural network. Symmetry 10(5), 138 (2018)
Zhou, Y.: Clickbait detection in tweets using self-attentive network. arXiv preprint arXiv:1710.05364 (2017)
Acknowledgements
Supported by the National Key Research and Development Program of China under Grant No. 2018YFC1604002, the National Natural Science Foundation of China under Grant Nos. U1936208, U1936216, U1836204 and U1705261.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Wu, C., Wu, F., Qi, T., Huang, Y. (2020). Clickbait Detection with Style-Aware Title Modeling and Co-attention. In: Sun, M., Li, S., Zhang, Y., Liu, Y., He, S., Rao, G. (eds) Chinese Computational Linguistics. CCL 2020. Lecture Notes in Computer Science(), vol 12522. Springer, Cham. https://doi.org/10.1007/978-3-030-63031-7_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-63031-7_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63030-0
Online ISBN: 978-3-030-63031-7
eBook Packages: Computer ScienceComputer Science (R0)