skip to main content
10.1145/3607541acmconferencesBook PagePublication PagesmmConference Proceedingsconference-collections
McGE '23: Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice
ACM2023 Proceeding
  • General Chairs:
  • Cheng Jin,
  • Liang He,
  • Mingli Song,
  • Rui Wang
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
MM '23: The 31st ACM International Conference on Multimedia Ottawa ON Canada 29 October 2023
ISBN:
979-8-4007-0278-5
Published:
29 October 2023
Sponsors:
Next Conference
October 28 - November 1, 2024
Melbourne , VIC , Australia
Bibliometrics
Skip Abstract Section
Abstract

It is our great pleasure to welcome you to the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice - McGE 2023.

We believe that this workshop will provide a valuable platform for researchers and practitioners to discuss and exchange ideas on the latest advancements, challenges, and opportunities in the rapidly evolving field of multimedia content generation.

Skip Table Of Content Section
SESSION: Session 1: Multimedia Content Evaluation: New Methods and Practice
research-article
Automatic Image Aesthetic Assessment for Human-designed Digital Images

Recently, with the ever-growing scale of aesthetic assessment data, researchers have the image aesthetic assessment (IAA) task. Meanwhile, as technology developing, there are more and more human-designed digital images through software like Photoshop on ...

research-article
Open Access
Multimedia Cognition and Evaluation in Open Environments

Within the past decade, a plethora of emerging multimedia applications and services has catalyzed the production of an enormous quantity of multimedia data. This data-driven epoch has significantly propelled the trajectory of advanced research in ...

research-article
Open Access
How Art-like are AI-generated Images? An Exploratory Study

Assessing the artness or artistic quality of AI-generated images continues to be a challenge within the realm of image generation. Most existing metrics cannot be used to perform instance-level and reference-free artness evaluation. This paper presents ...

research-article
Exploring Anchor-Free Approach for Reading Chinese Characters

Scene text spotting has achieved an impressive performance over recent years. Currently, most text localization methods are designed with the text line instance. We argue that building a character-level spotting network is more suited to recognize the ...

research-article
Semi-supervised Learning with Easy Labeled Data via Impartial Labeled Set Extension

Traditional Semi-supervised Learning (SSL) methods usually assume that the labeled data is independent and identically distributed (i.i.d.) from the underlying distribution. However, several relevant researches have revealed that i.i.d. assumption may ...

research-article
EMID: An Emotional Aligned Dataset in Audio-Visual Modality

In this paper, we propose Emotionally paired Music and Image Dataset (EMID), a novel dataset designed for the emotional matching of music and images, to facilitate auditory-visual cross-modal tasks such as generation and retrieval. Unlike existing ...

research-article
2CET-GAN: Pixel-Level GAN Model for Human Facial Expression Transfer

Recent studies have used GANs to transfer expressions between human faces. However, existing models have some flaws, such as relying on emotion labels, lacking continuous expressions, and fail- ing to capture the expression details. To address these ...

research-article
Open Access
4DSR-GCN: 4D Video Point Cloud Upsampling using Graph Convolutional Networks

Time varying sequences of 3D point clouds, or 4D point clouds, are now being acquired at an increasing pace in several applications (personal avatar representation, LiDAR in autonomous or assisted driving). In many cases, such volume of data is ...

SESSION: Session 2: Multimedia Content Generation
research-article
Taming Vector-Wise Quantization for Wide-Range Image Blending with Smooth Transition

Wide-range image blending is a novel image processing technique that merges two different images into a panorama with a transition region. Conventional image inpainting and outpainting methods have been used to complete this task, but always create ...

research-article
TCGIS: Text and Contour Guided Controllable Image Synthesis

Recently, text-to-image synthesis (T2I) has received extensive attention with encouraging results. However, the research still has the following challenges: 1) the quality of the synthesized images cannot be effectively guaranteed; 2) the human ...

research-article
Emotionally Enhanced Talking Face Generation

Several works have developed end-to-end pipelines for generating lip-synced talking faces with real-world applications, such as teaching and language translation in videos. However, these prior works fail to create realistic-looking videos since they ...

research-article
Human Pose Recommendation and Professionalization

Thanks to the proliferation of smartphones, taking photos is a breeze. Embarrassingly, we often find it difficult to strike a proper pose due to a lack of professional photography knowledge or guidance. The resulting photos are less than satisfactory. ...

research-article
Open Access
Alleviating Training Bias with Less Cost via Multi-expert De-biasing Method in Scene Graph Generation

Scene graph generation (SGG) methods have suffered from a severe training bias towards frequent (head) predicate classes. Recent works owe it to the long-tailed distribution of predicates and alleviate the long-tailed problem to conduct de-biasing. ...

research-article
Open Access
Multi-View Predicate Recognition for Solving Semantic Ambiguity Problem in Scene Graph Generation

Recent works on Scene Graph Generation (SGG) have been concentrating on solving the problem of long-tailed distribution. While these methods are making significant improvements on the tail predicate categories, they sacrifice the performance of the head ...

research-article
Nonword-to-Image Generation Considering Perceptual Association of Phonetically Similar Words

Text-to-Image (T2I) generation has long been a popular field of multimedia processing. Recent advances in large-scale vision and language pretraining have brought a number of models capable of very high-quality T2I generation. However, they are reported ...

research-article
Language Guidance Generation Using Aesthetic Attribute Comparison for Human Photography and AIGC

With the proliferation of mobile photography technology, leading mobile phone manufacturers are racing to enhance the shooting capabilities of their equipment and the photo beautification algorithm of their software. However, the development of ...

research-article
Responsive Listening Head Synthesis with 3DMM and Dual-Stream Prediction Network

In a conversation, it is crucial for the listener to provide appropriate reactions to the speaker, as the dialogue becomes challenging to sustain without the listener's involvement. Consequently, responsive listening head synthesis has become an ...

Contributors
  • Zhejiang University

Index Terms

  1. Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice
          Index terms have been assigned to the content through auto-classification.

          Recommendations