Abstract
The classification of movie genres from their synopses has attracted the attention of many researchers. Indeed, synopses are a source of relevant information that contributes to determinate movie genre. The automation of this classification process is very useful in several applications, such recommendation systems. Moreover, movies can belong simultaneously to several genres (drama, action, comedy, horror), which reflects a typical problem of multi-label classification (MLC). In this article, we use a powerful representation of film synthesis via a document integration technique Doc2vec in the multi-label classification context. The technique used in our experience is One Vs All, which is a transformation approach; it creates a model for each label through a kernel classifier. We have chosen to use three different classifiers: logistic regression, SVM and ANN. The results of our experimental study show that the best accuracies are obtained using ANN model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aubaid, A., Mishra, A.: A rule-based approach to embedding techniques for text document classification. Appl. Sci. 10(11), 4009 (2020). https://doi.org/10.3390/app10114009
Balikas, G., Amini, M.R.: Multi-label, multi-class classification using polylingual embeddings. In: 38th European Conference on Information Retrieval ECIR, Archives-Ouvertes (HAL), Italy (2016)
Brezeale, D., Cook, D.J.: Using closed captions and visual features to classify movies by genre. In: Poster session of the 7th International Workshop on Multimedia Data Mining (MDM/KDD 2006), USA (2006)
Boutell, M.R., Luo, J., Shen, X., Brown, C.M.: Learning multi-label scene classification. Pattern Recogn. 37(9), 1757–1771 (2004)
Clare, a., King, R.D.: Knowledge discovery in multi-label phenotype data. In: European Conference on Principles of Data Mining and Knowledge Discovery, pp. 42–53. Springer, Heidelberg (2001)
Hong, H.-Z., Hwang, J.G.: Multimodal PLSA for movie genre classification. In: International Workshop on Multiple Classifier Systems, pp. 159–167. Springer, Heidelberg (2015)
Katakis, I., Tsoumakas, G., Vlahavas, I.: Multilabel text classification for automated tag suggestion. In: Proceedings of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2008, vol. 18, pp. 75–83. Google Scholar (2008)
Ouchiha. L.: Classification supervisée de documents étude comparative. Maitrise en sciences et technologies de l’information, Université dé Québec, Outaouais (2016)
Lenc, L., Kral, P.: Word embeddings for multi-label document classification. In: Proceedings of Recent Advances in Natural Language Processing, 4–6 September, Varna, Bulgaria, pp. 431–437 (2017)
Lee, Y.-B., Myaeng, S.-H.: Text genre classification with genre-revealing and subject-revealing features. In: Proceedings of the 25th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 145–150. ACM, Finland (2002)
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st International Conference on Proceding Machine Learning research (PMLR), China, vol. 32, no. 2, pp. 1188–1196 (2014)
Liu, W., Wen, B., Gao, S., Zheng, J., Zheng, Y.: A multi-label text classification model based on ELMo and attention. MATEC Web Conf. 309, 03015 (2020)
Lo, H.-Y., Wang, J.-C., Wang, H.-M., Lin, S.-D.: Cost-sensitive multi-label learning for audio tag annotation and retrieval. IEEE Trans. Multimedia 13(3), 518–529 (2011)
Madjarov, G., Kocev, D., Gjorgjevikj, D., Dzeroski, S.: An extensive experimental comparison of methods for multi-label learning. Pattern Recogn. 45(9), 3084–3104 (2012)
Makarenkov, V., Rokach, L., Shapira, B.: Language Models with GloVe Word Embeddings. Researchgate (2016). https://www.researchgate.net/publication/309037295, Accessed 29 Sept 2020
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: Proceedings of the International Conference on Learning Representations (ICLR), Scottsdale (2013)
Ozonat, K., Young, D.: Towards a universal marketplace over the web: Statistical multi-label classification of service provider forms with simulated annealing. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1295–1304. ACM, France (2009)
Portolese, G., Feltrim, V.D.: On the use of synopsis-based features for film genre classification. In: Conference XV Encontro Nacional de Inteligência Artificial e Computacional (ENIAC), pp. 892–902. SBC, Brazil (2018)
Rasheed, Z., Shah, M.: Movie genre classification by exploiting audio-visual features of previews. In: 16th International Conference on Pattern Recognition, pp. 11–15. IEEE Computer Society Press, Canada (2002)
Rifkin, R., Klautau, A.: In defense of one-vs-all classification. J. Mach. Learn. Res. 5, 101–141 (2004)
Robert, E., Singer, S.Y.: BoosTexter: a boosting-based system for text categorization. In: Machine Learning, vol 39, pp. 135–168. Springer, Yang (2000)
Shi, D.: A Study on Neural Network Language Modeling. Researchgate (2017). https://www.researchgate.net/publication/319271962, Accessed 25 Sept 2020
Snoek, C.G., Worring, M., Van Gemert, J.C., Geusebroek, J.-M., Smeulders, A.W.: The challenge problem for automated detection of 101 semantic concepts in multimedia. In: Proceedings of the 14th Annual ACM International Conference on Multimedia, pp. 421–430. ACM, USA (2006)
Yaghoobzadeh, Y., Kann, K., Schutze, H.: Evaluating word embeddings in multi-label classification using fine-grained name typing. In: Proceedings of the 3rd Workshop on Representation Learning for NLP, pp. 101–106. Association for Computational Linguistics, Australia (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Guehria, S., Belleili, H., Azizi, N., Belhaouari, S.B. (2021). “One vs All” Classifier Analysis for Multi-label Movie Genre Classification Using Document Embedding. In: Abraham, A., Piuri, V., Gandhi, N., Siarry, P., Kaklauskas, A., Madureira, A. (eds) Intelligent Systems Design and Applications. ISDA 2020. Advances in Intelligent Systems and Computing, vol 1351. Springer, Cham. https://doi.org/10.1007/978-3-030-71187-0_44
Download citation
DOI: https://doi.org/10.1007/978-3-030-71187-0_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-71186-3
Online ISBN: 978-3-030-71187-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)