
Hyper-parameters Tuning of Artificial Neural Networks: An Application in the Field of Recommender Systems

  • Conference paper
  • In: New Trends in Database and Information Systems (ADBIS 2022)

Abstract

In this work, we carry out hyper-parameter tuning of a Machine Learning (ML) Recommender System (RS) that utilizes an Artificial Neural Network (ANN), called CATA++. We tune the activation function, the weight initialization and the number of training epochs of CATA++ in order to improve both training and recommendation performance. During the experiments, a variety of state-of-the-art activation functions were tested: ReLU, LeakyReLU, ELU, SineReLU, GELU, Mish, Swish and Flatten-T Swish. Additionally, various weight initializers were tested, such as Xavier/Glorot, Orthogonal, He and LeCun. Moreover, we ran experiments with different numbers of training epochs, ranging from 10 to 150. We used data from CiteULike and the AMiner Citation Network. The recorded metrics (Recall, nDCG) indicate that hyper-parameter tuning can notably reduce the necessary training time, while recommendation performance is significantly improved (by up to +44.2% in Recall).
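As a concrete illustration of the tuning procedure summarized in the abstract, the following Python/Keras snippet runs a grid search over activation functions, weight initializers and epoch counts on a small dense autoencoder. It is only a minimal sketch: the layer sizes, the synthetic interaction data and the restriction to activations and initializers with built-in Keras string identifiers are assumptions made for illustration, not the authors' CATA++ implementation (see note 2 for the actual code).

import itertools
import numpy as np
from tensorflow import keras

def build_autoencoder(input_dim, activation, initializer):
    # Plain dense autoencoder used as a stand-in for the tuned ANN.
    inputs = keras.Input(shape=(input_dim,))
    x = keras.layers.Dense(128, activation=activation,
                           kernel_initializer=initializer)(inputs)
    code = keras.layers.Dense(32, activation=activation,
                              kernel_initializer=initializer)(x)
    x = keras.layers.Dense(128, activation=activation,
                           kernel_initializer=initializer)(code)
    outputs = keras.layers.Dense(input_dim, activation="sigmoid")(x)
    model = keras.Model(inputs, outputs)
    model.compile(optimizer="adam", loss="binary_crossentropy")
    return model

# Grid limited to built-in Keras identifiers; SineReLU, Mish and
# Flatten-T Swish from the paper would need custom definitions.
activations = ["relu", "elu", "gelu", "swish"]
initializers = ["glorot_uniform", "he_normal", "orthogonal", "lecun_normal"]
epoch_counts = [10, 50, 150]  # illustrative points within the 10-150 range

# Synthetic binary user-item interaction matrix as placeholder input.
rng = np.random.default_rng(0)
X = (rng.random((500, 300)) < 0.05).astype("float32")

results = {}
for act, init, epochs in itertools.product(activations, initializers, epoch_counts):
    model = build_autoencoder(X.shape[1], act, init)
    history = model.fit(X, X, epochs=epochs, batch_size=64, verbose=0)
    results[(act, init, epochs)] = history.history["loss"][-1]

best = min(results, key=results.get)
print("best (activation, initializer, epochs):", best)

In the paper itself the configurations are compared by Recall and nDCG of the produced recommendations rather than by reconstruction loss; the loop structure over the three hyper-parameters is what this sketch is meant to convey.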


Notes

  1. https://keras.io/api/layers/initializers/
  2. Code available at https://www.github.com/jianlin-cheng/CATA
  3. https://www.aminer.cn/citation
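Two of the activation functions listed in the abstract, SineReLU and Flatten-T Swish, are not part of the built-in Keras catalog referenced in note 1, so experiments with them would need custom definitions. The sketch below follows the formulas given in references [9] and [13]; the default epsilon and T values are typical choices assumed here, not values reported by the authors.

import tensorflow as tf

def sine_relu(x, epsilon=0.0025):
    # SineReLU [9]: identity for positive inputs,
    # epsilon * (sin(x) - cos(x)) for non-positive inputs.
    return tf.where(x > 0.0, x, epsilon * (tf.sin(x) - tf.cos(x)))

def flatten_t_swish(x, t=-0.20):
    # Flatten-T Swish [13]: x * sigmoid(x) shifted by a threshold T for
    # non-negative inputs, and the constant T for negative inputs.
    return tf.where(x >= 0.0, x * tf.sigmoid(x) + t, t * tf.ones_like(x))

Either function can then be passed directly as a layer activation, e.g. keras.layers.Dense(128, activation=sine_relu).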

References

  1. Alfarhood, M., Cheng, J.: CATA++: a collaborative dual attentive autoencoder method for recommending scientific articles. IEEE Access 8, 183633–183648 (2020)

  2. Tang, J., et al.: ArnetMiner: extraction and mining of academic social networks. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 990–998 (2008)

  3. Stergiopoulos, V., Tsianaka, T., Tousidou, E.: AMiner citation-data preprocessing for recommender systems on scientific publications. In: Proceedings of the 25th Pan-Hellenic Conference on Informatics, pp. 23–27. ACM (2021)

  4. Stergiopoulos, V., Vassilakopoulos, M., Tousidou, E., Corral, A.: An application of ANN hyper-parameters tuning in the field of recommender systems. Technical report, Data Structuring & Engineering Laboratory, University of Thessaly, Volos, Greece (2022). https://faculty.e-ce.uth.gr/mvasilako/techrep2022.pdf

  5. Nair, V., Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning, pp. 807–814, Haifa (2010)

  6. Pedamonti, D.: Comparison of non-linear activation functions for deep neural networks on MNIST classification task. arXiv:1804.02763 (2018)

  7. Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the 30th International Conference on Machine Learning (2013)

  8. Clevert, D., Unterthiner, T., Hochreiter, S.: Fast and accurate deep network learning by exponential linear units (ELUs). arXiv:1511.07289 and ICLR (Poster) (2016)

  9. Rodrigues, W.: SineReLU - an alternative to the ReLU activation function (2018). https://wilder-rodrigues.medium.com/sinerelu-an-alternative-to-the-relu-activation-function-e46a6199997d. Accessed 5 Mar 2022

  10. Hendrycks, D., Gimpel, K.: Gaussian error linear units (GELUs). arXiv:1606.08415 (2020)

  11. Misra, D.: Mish: a self regularized non-monotonic neural activation function. arXiv:1908.08681 (2019)

  12. Ramachandran, P., Zoph, B., Le, Q.: Searching for activation functions. arXiv:1710.05941 (2017) and ICLR 2018 (Workshop) (2018)

  13. Chieng, H., Wahid, N., Pauline, O., Perla, S.: Flatten-T Swish: a thresholded ReLU-Swish-like activation function for deep learning. Int. J. Adv. Intell. Inform. 4(2), 76–86 (2018)

  14. Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics, Chia Laguna Resort, Sardinia, Italy, JMLR W&CP, vol. 9 (2010)

  15. He, K., et al.: Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In: IEEE International Conference on Computer Vision, pp. 1026–1034 (2015)

  16. Saxe, A.M., McClelland, J.L., Ganguli, S.: Exact solutions to the nonlinear dynamics of learning in deep linear neural networks. arXiv:1312.6120 and ICLR 2014 (2014)

  17. LeCun, Y.A., Bottou, L., Orr, G.B., Müller, K.-R.: Efficient BackProp. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) Neural Networks: Tricks of the Trade. LNCS, vol. 7700, pp. 9–48. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35289-8_3

  18. Eger, S., Youssef, P., Gurevych, I.: Is it time to swish? Comparing deep learning activation functions across NLP tasks. In: EMNLP 2018, pp. 4415–4424 (2018). arXiv:1901.02671

  19. Kumar, S.K.: On weight initialization in deep neural networks. arXiv:1704.08863 (2017)


Acknowledgements

The work of M. Vassilakopoulos and A. Corral was funded by the MINECO research project [TIN2017-83964-R] and the Junta de Andalucía research project [P20_00809].


Corresponding author

Correspondence to Vaios Stergiopoulos.


Copyright information

© 2022 Springer Nature Switzerland AG

About this paper


Cite this paper

Stergiopoulos, V., Vassilakopoulos, M., Tousidou, E., Corral, A. (2022). Hyper-parameters Tuning of Artificial Neural Networks: An Application in the Field of Recommender Systems. In: Chiusano, S., et al. New Trends in Database and Information Systems. ADBIS 2022. Communications in Computer and Information Science, vol 1652. Springer, Cham. https://doi.org/10.1007/978-3-031-15743-1_25


  • DOI: https://doi.org/10.1007/978-3-031-15743-1_25


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-15742-4

  • Online ISBN: 978-3-031-15743-1

  • eBook Packages: Computer Science, Computer Science (R0)
