skip to main content
10.1145/2567948.2577018acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
demonstration

Me-link: link me to the media -- fusing audio and visual cues for robust and efficient mobile media interaction

Published:07 April 2014Publication History

ABSTRACT

In this demo, we present a scalable mobile video recognition system, named "Me-link," based on progressive fusion of light-weight audio visual features. With our system, users only have to point the mobile camera to the video they are interested in. The system will capture the frames and sounds, then retrieve relevant information immediately. As the users hold the mobile longer, the system progressively aggregates the cues temporally and then returns more accurate results. We also consider the real world noisy environment, where users may not get clear visual or audio signals. In the aggregation step of audio and visual cues, our system automatically detects the available channel for the final rank. On the server side, users can upload the videos with information via website. Besides, we also link the streaming signals so that users can get the real time broadcasting with ``Me-link".

References

  1. H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool. Surf: Speeded up robust features. Computer Vision and Image Understanding, 110(3):346--359, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. O. Dan, J. Feng, and B. Davison. Filtering microblogging messages for social tv. In ACM International Conference Companion on World Wide Web, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. P. W. Ellis, B. Whitman, and A. Porter. Echoprint: An open music identification service. In International Society for Music Information Retrieval Conference, 2011.Google ScholarGoogle Scholar
  4. B. Girod, V. Chandrasekhar, D. M. Chen, N.-M. Cheung, R. Grzeszczuk, Y. Reznik, G. Takacs, S. S. Tsai, and R. Vedantham. Mobile visual search. IEEE Signal Processing Magazine, 28(4):61--76, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  5. P. Li, T. J. Hastie, and K. W. Church. Very sparse random projections. In ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. W. Liu, T. Mei, Y. Zhang, J. Li, and S. Li. Listen, look, and gotcha: instant video search with mobile phones by layered audio-video indexing. In ACM international conference on Multimedia, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Muja and D. G. Lowe. Fast matching of binary features. In Conference on Computer and Robot Vision, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. L.-C. Wang. An industrial-strength audio search algorithm. In International Conference on Music Information Retrieval, 2003.Google ScholarGoogle Scholar

Index Terms

  1. Me-link: link me to the media -- fusing audio and visual cues for robust and efficient mobile media interaction

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WWW '14 Companion: Proceedings of the 23rd International Conference on World Wide Web
      April 2014
      1396 pages
      ISBN:9781450327459
      DOI:10.1145/2567948

      Copyright © 2014 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 7 April 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • demonstration

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader