Proceedings of the 2nd International Workshop on Human-centric Multimedia Analysis

HUMA'21: Proceedings of the 2nd International Workshop on Human-centric Multimedia Analysis

November 2021

2021 Proceeding

General Chairs:
Wu Liu
AI Research of JD.com, China
,
Junbo Guo
State Key Laboratory of Communication Content Cognition, People's Daily Online, China
,
John Smith
IBM Research, USA
,
Program Chairs:
Xinchen Liu
AI Research of JD.com, China
,
Dingwen Zhang
Northwestern Polytechnical University, China
,
Wenbing Huang
Tsinghua University, China

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

MM '21: ACM Multimedia Conference Virtual Event China 20 October 2021

ISBN:

978-1-4503-8671-5

Published:

25 November 2021

Sponsors:

SIGMM

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Abstract

It is our great pleasure to welcome you to the 2nd Human-centric Human Analysis Workshop (HUMA'21), which is co-located with ACM Multimedia 2021 in Chengdu, China. This workshop is concentrated on the human-centric multimedia analysis, which is one of the fundamental problems in multimedia understanding. It is a very challenging problem that involves multiple tasks such as face detection and recognition, human pose estimation, human action detection, person tracking, and so on. Today, ubiquitous multimedia sensors and large-scale computing infrastructures are producing a wide variety of big multimodality data for human-centric analysis, which provides rich knowledge to tackle these challenges. Researchers have strived to push the limits of human-centric multimedia analysis in various applications, such as intelligent surveillance, retailing, fashion design, and services. Therefore, the purpose of this workshop is to: 1) bring together the state-of-the-art research on human-centric multimedia analysis; 2) call for a coordinated effort to understand the opportunities and challenges emerging in human-centric multimedia analysis; 3) identify key tasks and evaluate the state-of-the-art methods; 4) showcase innovative methodologies and ideas; 5) introduce interesting real-world human-centric multimedia analysis systems or applications; and 6) propose new real-world datasets and discuss future directions. We solicit original contributions in all fields of human-centric multimedia analysis that explore the multi-modality data to understand the behavior of humans. We believe this workshop can offer a timely collection of research updates to benefit researchers and practitioners in the broad multimedia communities.

Proceeding Downloads

PDF(Title Page, Copyright, Welcome, Contents, Organization, Sponsors)

PDF(Author Index)

Select All

Export Citations Save to Binder

SESSION: Invited Talks 1

section

Session details: Invited Talks 1

Jingkuan Song

https://doi.org/10.1145/3502610

invited-talk

Modern Learning Methodologies for Co-Saliency Detection

Junwei Han

Page 1https://doi.org/10.1145/3475723.3487886

Visual saliency computing aims to imitate the human visual attention mechanism to identify the most prominent or unique areas or objects from a visual scene. It is one of the basic low-level image processing techniques and can be applied to many ...

SESSION: Session 1: Pose, Action, and Interaction

section

Session details: Session 1: Pose, Action, and Interaction

Xinchen Liu

https://doi.org/10.1145/3502611

research-article

Open Access

Learning Positional Priors for Pretraining 2D Pose Estimators

Pages 3–11https://doi.org/10.1145/3475723.3484252

The target of 2D human pose estimation is to locate the keypoints of body parts from 2D images. State-of-the-art methods for pose estimation usually construct pixel-wise heatmaps from keypoints as labels for learning neural networks, which are usually ...

research-article

A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric

Pages 13–21https://doi.org/10.1145/3475723.3484247

Temporal Sentence Grounding in Videos (TSGV), \ie, grounding a natural language sentence which indicates complex human activities in a long and untrimmed video sequence, has received unprecedented attentions over the last few years. Although each newly ...

research-article

NLOS Imaging Assisted Navigation for BVI

Pages 23–30https://doi.org/10.1145/3475723.3484250

Assistive navigation techniques support the activities of blind or visually impaired (BVI) people and improve their life quality. However, current navigation systems cannot detect hidden objects that may run out and become obstacles. In this paper, we ...

research-article

Using Feature Interaction among GPS Data for Road Intersection Detection

Pages 31–37https://doi.org/10.1145/3475723.3484249

Road intersection plays a vital role in road network construction, automatic drive, and intelligent transportation systems. Most methods detect road intersections only using geometrical features without spatio-temporal features, leading to insufficient ...

SESSION: Session 2: Technical Demos

research-article

Modeling 3D Objects: Implications for Neuroscience, Behavioral and Medical Studies: A Case Demo

Pages 39–42https://doi.org/10.1145/3475723.3484248

We have designed, developed and adapted 3D objects (3DOs) within the interactive environment for in-lab neuroscience research of motor control and the mirror neuron system (MNS) (Figure 1b; 3D view: https://p3d.in/0B202). The modeled 3DOs are ...

Cited By

Contributors

Wu Liu
- Publication Years
- Publication counts0
- Citation count0
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article0
View Full Profile
Junbo Guo
- Publication Years2021 - 2021
- Publication counts2
- Citation count9
- Available for Download2
- Downloads (cumulative)1,224
- Downloads (12 months)352
- Downloads (6 weeks)21
- Average Downloads per Article612
- Average Citation per Article5
View Full Profile
John R Smith
IBM Thomas J. Watson Research Center
- Publication Years1994 - 2023
- Publication counts127
- Citation count3,522
- Available for Download46
- Downloads (cumulative)32,855
- Downloads (12 months)1,744
- Downloads (6 weeks)237
- Average Downloads per Article714
- Average Citation per Article28
View Full Profile
Xinchen Liu
JD.com, Inc.
- Publication Years2019 - 2024
- Publication counts17
- Citation count290
- Available for Download16
- Downloads (cumulative)3,571
- Downloads (12 months)1,436
- Downloads (6 weeks)130
- Average Downloads per Article223
- Average Citation per Article17
View Full Profile
Dingwen Zhang
Northwestern Polytechnical University
- Publication Years2015 - 2024
- Publication counts57
- Citation count945
- Available for Download12
- Downloads (cumulative)2,893
- Downloads (12 months)785
- Downloads (6 weeks)57
- Average Downloads per Article241
- Average Citation per Article17
View Full Profile
Wenbing Huang
Renmin University of China
- Publication Years2014 - 2025
- Publication counts52
- Citation count746
- Available for Download26
- Downloads (cumulative)10,433
- Downloads (12 months)2,176
- Downloads (6 weeks)232
- Average Downloads per Article401
- Average Citation per Article14
View Full Profile

Index Terms

Proceedings of the 2nd International Workshop on Human-centric Multimedia Analysis
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
2. Information systems
  1. Information systems applications

Index terms have been assigned to the content through auto-classification.

Comments

MM

Sections

Proceeding Downloads

Session details: Invited Talks 1

Modern Learning Methodologies for Co-Saliency Detection

Session details: Session 1: Pose, Action, and Interaction

Learning Positional Priors for Pretraining 2D Pose Estimators

A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric

NLOS Imaging Assisted Navigation for BVI

Using Feature Interaction among GPS Data for Road Intersection Detection

Modeling 3D Objects: Implications for Neuroscience, Behavioral and Medical Studies: A Case Demo

Cited By

Index Terms

MobileHealth '12: Proceedings of the 2nd ACM international workshop on Pervasive Wireless Healthcare

WOWMOM '02: Proceedings of the 5th ACM international workshop on Wireless mobile multimedia

ICMR '12: Proceedings of the 2nd ACM International Conference on Multimedia Retrieval

Save to Binder

Sections

Proceeding Downloads

Cited By

Save to Binder

Index Terms

Recommendations

MobileHealth '12: Proceedings of the 2nd ACM international workshop on Pervasive Wireless Healthcare

WOWMOM '02: Proceedings of the 5th ACM international workshop on Wireless mobile multimedia

ICMR '12: Proceedings of the 2nd ACM International Conference on Multimedia Retrieval