demonstration

Video ChatBot: Triggering Live Social Interactions by Automatic Video Commenting

Authors:

Yehao Li,

Ting Yao,

Rui Hu,

Tao Mei,

Yong RuiAuthors Info & Claims

MM '16: Proceedings of the 24th ACM international conference on Multimedia

Pages 757 - 758

https://doi.org/10.1145/2964284.2973835

Published: 01 October 2016 Publication History

Get Access

Abstract

We demonstrate a video chatbot, which can generate human-level emotional comments referring to the videos shared by users and trigger a conversation with users. Our video chatbot performs a large-scale similar video search to find visually similar videos w.r.t. a given video using approximate nearest-neighbor search. Then, the comments associated with the searched similar videos are ranked by learning a deep multi-view embedding space for modeling video content, visual sentiment and textual comments. The top ranked comments are selected as responses to the given video and trigger the succeeding text-based chat between users and the chatbot. The demonstration is conducted on a newly collected dataset with over 102K videos and 10.6M comments. Moreover, our video chatbot has great potential to increase live social interactions.

References

[1]

Y. Pan, T. Mei, T. Yao, H. Li, and Y. Rui. Jointly modeling embedding and translation to bridge video and language. In CVPR, 2016.

Crossref

Google Scholar

[2]

K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.

Google Scholar

[3]

D. Tran, L. D. Bourdev, R. Fergus, L. Torresani, and M. Paluri. C3d: generic features for video analysis. In ICCV, 2015.

Google Scholar

[4]

J. Wang and S. Li. Query-driven iterated neighborhood graph search for large scale indexing. In ACM MM, 2012.

Digital Library

Google Scholar

[5]

T. Yao, T. Mei, and C.-W. Ngo. Learning query and image similarities with ranking canonical correlation analysis. In ICCV, 2015.

Digital Library

Google Scholar

Cited By

View all

Fu FFang SChen WMao Z(2024)Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video CommentingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363333420:4(1-24)Online publication date: 11-Jan-2024
https://dl.acm.org/doi/10.1145/3633334
Jolibois SIto ANose T(2023)Multimodal Expressive Embodied Conversational Agent DesignHCI International 2023 Posters10.1007/978-3-031-35989-7_31(244-249)Online publication date: 9-Jul-2023
https://doi.org/10.1007/978-3-031-35989-7_31
Lee LBraud THosio SHui P(2021)Towards Augmented Reality Driven Human-City Interaction: Current Research on Mobile Headsets and Future ChallengesACM Computing Surveys10.1145/346796354:8(1-38)Online publication date: 4-Oct-2021
https://dl.acm.org/doi/10.1145/3467963

Index Terms

Video ChatBot: Triggering Live Social Interactions by Automatic Video Commenting
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Vision for robotics
2. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Video search

Recommendations

Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding
MM '16: Proceedings of the 24th ACM international conference on Multimedia

Video has become a predominant social media for the booming live interactions. Automatic generation of emotional comments to a video has great potential to significantly increase user engagement in many socio-video applications (e.g., chat bot). ...
See and chat: automatically generating viewer-level comments on images

Image is becoming a predominant medium for social interactions. Automatically expressing opinions on an image, which we refer to as image commenting, has great potential to improve user engagement and thus becomes an emerging yet very challenging ...
Chatbot with Touch and Graphics: An Interaction of Users for Emotional Expression and Turn-taking
CUI '20: Proceedings of the 2nd Conference on Conversational User Interfaces

Use of chatbots for emotional exchange is recently increasing in various domains. However, as existing chatbots have been considered in terms of natural language processing techniques for interaction with text-based chatting, chatbot interaction with ...

Comments

Information & Contributors

Information

Published In

MM '16: Proceedings of the 24th ACM international conference on Multimedia

October 2016

1542 pages

ISBN:9781450336031

DOI:10.1145/2964284

General Chairs:
Alan Hanjalic
Delft University of Technology
,
Cees Snoek
Qualcomm Research Netherlands / University of Amsterdam
,
Marcel Worring
University of Amsterdam
,
Moderator:
Dick Bulterman
CWI / VU University Amsterdam
,
Program Chairs:
Benoit Huet
EURECOM
,
Aisling Kelliher
Virginia Tech
,
Yiannis Kompatsiaris
CERTH-ITI
,
Jin Li
Microsoft

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2016

Check for updates

Author Tags

Qualifiers

Demonstration

Conference

MM '16

Sponsor:

SIGMM

MM '16: ACM Multimedia Conference

October 15 - 19, 2016

Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
430
Total Downloads

Downloads (Last 12 months)11
Downloads (Last 6 weeks)2

Reflects downloads up to 23 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Fu FFang SChen WMao Z(2024)Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video CommentingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363333420:4(1-24)Online publication date: 11-Jan-2024
https://dl.acm.org/doi/10.1145/3633334
Jolibois SIto ANose T(2023)Multimodal Expressive Embodied Conversational Agent DesignHCI International 2023 Posters10.1007/978-3-031-35989-7_31(244-249)Online publication date: 9-Jul-2023
https://doi.org/10.1007/978-3-031-35989-7_31
Lee LBraud THosio SHui P(2021)Towards Augmented Reality Driven Human-City Interaction: Current Research on Mobile Headsets and Future ChallengesACM Computing Surveys10.1145/346796354:8(1-38)Online publication date: 4-Oct-2021
https://dl.acm.org/doi/10.1145/3467963

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding

See and chat: automatically generating viewer-level comments on images

Chatbot with Touch and Graphics: An Interaction of Users for Emotional Expression and Turn-taking

Comments

Information

Published In

Sponsors

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations