- Sponsor:
- sigmm
It is our great pleasure to welcome you to the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice - McGE 2023.
We believe that this workshop will provide a valuable platform for researchers and practitioners to discuss and exchange ideas on the latest advancements, challenges, and opportunities in the rapidly evolving field of multimedia content generation.
Proceeding Downloads
Automatic Image Aesthetic Assessment for Human-designed Digital Images
Recently, with the ever-growing scale of aesthetic assessment data, researchers have the image aesthetic assessment (IAA) task. Meanwhile, as technology developing, there are more and more human-designed digital images through software like Photoshop on ...
Multimedia Cognition and Evaluation in Open Environments
Within the past decade, a plethora of emerging multimedia applications and services has catalyzed the production of an enormous quantity of multimedia data. This data-driven epoch has significantly propelled the trajectory of advanced research in ...
How Art-like are AI-generated Images? An Exploratory Study
Assessing the artness or artistic quality of AI-generated images continues to be a challenge within the realm of image generation. Most existing metrics cannot be used to perform instance-level and reference-free artness evaluation. This paper presents ...
Exploring Anchor-Free Approach for Reading Chinese Characters
Scene text spotting has achieved an impressive performance over recent years. Currently, most text localization methods are designed with the text line instance. We argue that building a character-level spotting network is more suited to recognize the ...
Semi-supervised Learning with Easy Labeled Data via Impartial Labeled Set Extension
Traditional Semi-supervised Learning (SSL) methods usually assume that the labeled data is independent and identically distributed (i.i.d.) from the underlying distribution. However, several relevant researches have revealed that i.i.d. assumption may ...
EMID: An Emotional Aligned Dataset in Audio-Visual Modality
In this paper, we propose Emotionally paired Music and Image Dataset (EMID), a novel dataset designed for the emotional matching of music and images, to facilitate auditory-visual cross-modal tasks such as generation and retrieval. Unlike existing ...
2CET-GAN: Pixel-Level GAN Model for Human Facial Expression Transfer
Recent studies have used GANs to transfer expressions between human faces. However, existing models have some flaws, such as relying on emotion labels, lacking continuous expressions, and fail- ing to capture the expression details. To address these ...
4DSR-GCN: 4D Video Point Cloud Upsampling using Graph Convolutional Networks
Time varying sequences of 3D point clouds, or 4D point clouds, are now being acquired at an increasing pace in several applications (personal avatar representation, LiDAR in autonomous or assisted driving). In many cases, such volume of data is ...
Taming Vector-Wise Quantization for Wide-Range Image Blending with Smooth Transition
Wide-range image blending is a novel image processing technique that merges two different images into a panorama with a transition region. Conventional image inpainting and outpainting methods have been used to complete this task, but always create ...
TCGIS: Text and Contour Guided Controllable Image Synthesis
Recently, text-to-image synthesis (T2I) has received extensive attention with encouraging results. However, the research still has the following challenges: 1) the quality of the synthesized images cannot be effectively guaranteed; 2) the human ...
Emotionally Enhanced Talking Face Generation
Several works have developed end-to-end pipelines for generating lip-synced talking faces with real-world applications, such as teaching and language translation in videos. However, these prior works fail to create realistic-looking videos since they ...
Human Pose Recommendation and Professionalization
Thanks to the proliferation of smartphones, taking photos is a breeze. Embarrassingly, we often find it difficult to strike a proper pose due to a lack of professional photography knowledge or guidance. The resulting photos are less than satisfactory. ...
Alleviating Training Bias with Less Cost via Multi-expert De-biasing Method in Scene Graph Generation
Scene graph generation (SGG) methods have suffered from a severe training bias towards frequent (head) predicate classes. Recent works owe it to the long-tailed distribution of predicates and alleviate the long-tailed problem to conduct de-biasing. ...
Multi-View Predicate Recognition for Solving Semantic Ambiguity Problem in Scene Graph Generation
Recent works on Scene Graph Generation (SGG) have been concentrating on solving the problem of long-tailed distribution. While these methods are making significant improvements on the tail predicate categories, they sacrifice the performance of the head ...
Nonword-to-Image Generation Considering Perceptual Association of Phonetically Similar Words
Text-to-Image (T2I) generation has long been a popular field of multimedia processing. Recent advances in large-scale vision and language pretraining have brought a number of models capable of very high-quality T2I generation. However, they are reported ...
Language Guidance Generation Using Aesthetic Attribute Comparison for Human Photography and AIGC
With the proliferation of mobile photography technology, leading mobile phone manufacturers are racing to enhance the shooting capabilities of their equipment and the photo beautification algorithm of their software. However, the development of ...
Responsive Listening Head Synthesis with 3DMM and Dual-Stream Prediction Network
In a conversation, it is crucial for the listener to provide appropriate reactions to the speaker, as the dialogue becomes challenging to sustain without the listener's involvement. Consequently, responsive listening head synthesis has become an ...
Index Terms
- Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice