Learning Cross-modal Representations with Multi-relations for Image Captioning Topics: Deep Learning and Neural Networks; Image and Video Analysis and Understanding; Natural Language Processing In Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods ICPRAM - Volume 1, 346-353, 2022