skip to main content
10.1145/3123266.3127924acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
demonstration

IBM High-Five: Highlights From Intelligent Video Engine

Published: 19 October 2017 Publication History

Abstract

We introduce a novel multi-modal system for auto-curating golf highlights that fuses information from players' reactions (celebration actions), spectators (crowd cheering), and commentator (tone of the voice and word analysis) to determine the most interesting moments of a game. The start of a highlight is determined with additional metadata (player's name and the hole number), allowing personalized content summarization and retrieval. Our system was demonstrated at Masters 2017, a major golf tournament, generating real-time highlights from four live video streams over four days.

Supplementary Material

PDF (demo48.pdf)
PDF File

References

[1]
Yusuf Aytar, Carl Vondrick, and Antonio Torralba. 2016. SoundNet: Learning Sound Representations from Unlabeled Video. In NIPS.
[2]
Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen Hammer, John Kent, John R. Smith, and Rogerio S. Feris. 2017. Automatic Curation of Golf Highlights using Multimodal Excitement Features. In CVPR Int. Workshop on Sports.
[3]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional net- works for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).

Cited By

View all
  • (2025)DHMDL: Dynamically Hashed Multimodal Deep Learning Framework for Racket Video Summarization Using Audio and Visual MarkersApplied Artificial Intelligence10.1080/08839514.2025.246238239:1Online publication date: 18-Feb-2025
  • (2024)Enhancing Auto-Generated Baseball Highlights via Win Probability and Bias Injection MethodProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642021(1-18)Online publication date: 11-May-2024
  • (2022)2D Gait Skeleton Data Normalization for Quantitative Assessment of Movement Disorders from Freehand Single Camera Video RecordingsSensors10.3390/s2211424522:11(4245)Online publication date: 2-Jun-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '17: Proceedings of the 25th ACM international conference on Multimedia
October 2017
2028 pages
ISBN:9781450349062
DOI:10.1145/3123266
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2017

Check for updates

Author Tags

  1. highlights generation
  2. multimodal video analysis
  3. sport analytics

Qualifiers

  • Demonstration

Conference

MM '17
Sponsor:
MM '17: ACM Multimedia Conference
October 23 - 27, 2017
California, Mountain View, USA

Acceptance Rates

MM '17 Paper Acceptance Rate 189 of 684 submissions, 28%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)1
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)DHMDL: Dynamically Hashed Multimodal Deep Learning Framework for Racket Video Summarization Using Audio and Visual MarkersApplied Artificial Intelligence10.1080/08839514.2025.246238239:1Online publication date: 18-Feb-2025
  • (2024)Enhancing Auto-Generated Baseball Highlights via Win Probability and Bias Injection MethodProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642021(1-18)Online publication date: 11-May-2024
  • (2022)2D Gait Skeleton Data Normalization for Quantitative Assessment of Movement Disorders from Freehand Single Camera Video RecordingsSensors10.3390/s2211424522:11(4245)Online publication date: 2-Jun-2022
  • (2020)A Multi-Stream Recurrent Neural Network for Social Role Detection in Multiparty InteractionsIEEE Journal of Selected Topics in Signal Processing10.1109/JSTSP.2020.299239414:3(554-567)Online publication date: Mar-2020
  • (2019)AI Coach: Deep Human Pose Estimation and Analysis for Personalized Athletic Training AssistanceProceedings of the 27th ACM International Conference on Multimedia10.1145/3343031.3350910(374-382)Online publication date: 15-Oct-2019
  • (2019)AI CoachProceedings of the 27th ACM International Conference on Multimedia10.1145/3343031.3350609(2228-2230)Online publication date: 15-Oct-2019
  • (2019)Automatic Curation of Sports Highlights Using Multimodal Excitement FeaturesIEEE Transactions on Multimedia10.1109/TMM.2018.287604621:5(1147-1160)Online publication date: May-2019
  • (2018)Automatic Cricket Highlight Generation Using Event-Driven and Excitement-Based Features2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW.2018.00233(1881-18818)Online publication date: Jun-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media