skip to main content
10.1145/3475723acmconferencesBook PagePublication PagesmmConference Proceedingsconference-collections
HUMA'21: Proceedings of the 2nd International Workshop on Human-centric Multimedia Analysis
ACM2021 Proceeding
  • General Chairs:
  • Wu Liu,
  • Junbo Guo,
  • John Smith,
  • Program Chairs:
  • Xinchen Liu,
  • Dingwen Zhang,
  • Wenbing Huang
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
MM '21: ACM Multimedia Conference Virtual Event China 20 October 2021
ISBN:
978-1-4503-8671-5
Published:
25 November 2021
Sponsors:
Recommend ACM DL
ALREADY A SUBSCRIBER?SIGN IN

Reflects downloads up to 19 Feb 2025Bibliometrics
Skip Abstract Section
Abstract

It is our great pleasure to welcome you to the 2nd Human-centric Human Analysis Workshop (HUMA'21), which is co-located with ACM Multimedia 2021 in Chengdu, China. This workshop is concentrated on the human-centric multimedia analysis, which is one of the fundamental problems in multimedia understanding. It is a very challenging problem that involves multiple tasks such as face detection and recognition, human pose estimation, human action detection, person tracking, and so on. Today, ubiquitous multimedia sensors and large-scale computing infrastructures are producing a wide variety of big multimodality data for human-centric analysis, which provides rich knowledge to tackle these challenges. Researchers have strived to push the limits of human-centric multimedia analysis in various applications, such as intelligent surveillance, retailing, fashion design, and services. Therefore, the purpose of this workshop is to: 1) bring together the state-of-the-art research on human-centric multimedia analysis; 2) call for a coordinated effort to understand the opportunities and challenges emerging in human-centric multimedia analysis; 3) identify key tasks and evaluate the state-of-the-art methods; 4) showcase innovative methodologies and ideas; 5) introduce interesting real-world human-centric multimedia analysis systems or applications; and 6) propose new real-world datasets and discuss future directions. We solicit original contributions in all fields of human-centric multimedia analysis that explore the multi-modality data to understand the behavior of humans. We believe this workshop can offer a timely collection of research updates to benefit researchers and practitioners in the broad multimedia communities.

Skip Table Of Content Section
SESSION: Invited Talks 1
invited-talk
Modern Learning Methodologies for Co-Saliency Detection

Visual saliency computing aims to imitate the human visual attention mechanism to identify the most prominent or unique areas or objects from a visual scene. It is one of the basic low-level image processing techniques and can be applied to many ...

SESSION: Session 1: Pose, Action, and Interaction
research-article
Open Access
Learning Positional Priors for Pretraining 2D Pose Estimators

The target of 2D human pose estimation is to locate the keypoints of body parts from 2D images. State-of-the-art methods for pose estimation usually construct pixel-wise heatmaps from keypoints as labels for learning neural networks, which are usually ...

research-article
A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric

Temporal Sentence Grounding in Videos (TSGV), \ie, grounding a natural language sentence which indicates complex human activities in a long and untrimmed video sequence, has received unprecedented attentions over the last few years. Although each newly ...

research-article
NLOS Imaging Assisted Navigation for BVI

Assistive navigation techniques support the activities of blind or visually impaired (BVI) people and improve their life quality. However, current navigation systems cannot detect hidden objects that may run out and become obstacles. In this paper, we ...

research-article
Using Feature Interaction among GPS Data for Road Intersection Detection

Road intersection plays a vital role in road network construction, automatic drive, and intelligent transportation systems. Most methods detect road intersections only using geometrical features without spatio-temporal features, leading to insufficient ...

SESSION: Session 2: Technical Demos
research-article
Modeling 3D Objects: Implications for Neuroscience, Behavioral and Medical Studies: A Case Demo

We have designed, developed and adapted 3D objects (3DOs) within the interactive environment for in-lab neuroscience research of motor control and the mirror neuron system (MNS) (Figure 1b; 3D view: https://p3d.in/0B202). The modeled 3DOs are ...

Contributors
  • IBM Thomas J. Watson Research Center
  • JD.com, Inc.
  • Northwestern Polytechnical University
  • Renmin University of China

Index Terms

  1. Proceedings of the 2nd International Workshop on Human-centric Multimedia Analysis
      Index terms have been assigned to the content through auto-classification.

      Recommendations