skip to main content
10.1145/2814940.2814978acmotherconferencesArticle/Chapter ViewAbstractPublication PageshaiConference Proceedingsconference-collections
research-article

Generating Music from an Image

Published: 21 October 2015 Publication History

Abstract

Images can convey emotion just like music. If that's so, then it might be possible that, given an image, one can obtain a music that can produce a similar reaction from the listener/viewer. The challenge lies in how to do that. In this paper, we analyze the image using the HSV color space model and assume that each one of the three components have a relation with basic music elements, like tone, pitch, rhythm and loudness. The image is then scanned from left to right and top to bottom in order to generate a sequence of notes. In the end, the emotional Mean Opinion Score (MOS) is used to evaluate the performance of the proposed method. This work could prove to be a very important contribution to the field of HCI because it can improve the interaction between computers and humans who are visually and/or hearing impaired. In the current work, we only consider two emotions; positive and negative.

References

[1]
Adamson, J. C. Hue, saturation & value : The characteristics of color, 2012. Retrieved May 13, 2015 from The Muser Physics & Physiology of Color, http://www.greatreality.com/color/ColorHVC.htm.
[2]
Bell, C. Art. New York Frederick A. Stokes Company Publishers, 1913.
[3]
Dan-Glauser, E. S., and Scherer, K. R. The geneva affective picture database (gaped): a new 730-picture database focusing on valence and normative significance. Behavior Research Methods 43, 2 (2011), 468--477. Downloaded May 21, 2015, http://www.affective-sciences.org/system/files/webpage/GAPED_2.zip.
[4]
Dictionary.com, 2015. Retrieved May 20, 2015, http://dictionary.reference.com/browse/emotion.
[5]
Kim, T. Emotion classification by acoustic, visual and eeg signals using fuzzy clustering and anfis, 2014.
[6]
Lee, G., Kwon, M., Kavuri, S., and Lee, M. Emotion recognition based on 3d fuzzy visual and eeg features in movie clips. Elsevier: Neurocomputing 144 (2014), 560--568.
[7]
Levitin, D. J. This Is Your Brain on Music: The Science of a Human Obsession. Dutton Penguin Books Ltd, 375 Hudson Street, New York, NY, USA, 2006.
[8]
Levitin, D. J. Dr. daniel j. levitin: Neuroscientist, musician, author, 2015. Retrieved May 20, 2015, http://daniellevitin.com/publicpage/.
[9]
of Encyclopdia Britannica, T. E. Clive Bell (or Arthur Clive Heward Bell). Encyclopdia Britannica, 2014.
[10]
Press, O. U. Oxford dictionaries: Language matters, 2015. Retrieved May 20, 2015, http://www.oxforddictionaries.com/definition/english/emotion.
[11]
Rouzic, M., 2008. Retrieved May 20, 2015, http://photosounder.com/.
[12]
Singh, J. F., 2012. Retrieved May 20, 2015, http://flexibeatz.weebly.com/paint2sound.html.
[13]
White, D., 2011. Retrieved May 20, 2015, http://www.skytopia.com/software/sonicphoto/.
[14]
Yanulevskaya, V., Gemert, J. v., Roth, K., Herbold, A., Sebe, N., and Geusebroek, J. Emotional valence categorization using holistic image features. 15th IEEE International Conference on Image Processing (ICIP) (2008), 101--104.
[15]
Zhang, Q., Jeong, S., and Lee, M. Autonomous emotion development using incremental modified adaptive neuro-fuzzy inference system. Elsevier: Neurocomputing 96 (2012), 33--44.
[16]
Zhang, Q., and Lee, M. Emotion development system by interacting with human eeg and natural scene understanding. Elsevier: Cognitive Systems Research 14 (2012), 37--49.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
HAI '15: Proceedings of the 3rd International Conference on Human-Agent Interaction
October 2015
254 pages
ISBN:9781450335270
DOI:10.1145/2814940
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

  • BESK: Brain Engineering Society of Korea

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. MOS
  2. emotion
  3. gaped
  4. hsv color space model
  5. music elements

Qualifiers

  • Research-article

Conference

HAI 2015
Sponsor:
  • BESK
HAI 2015: The Third International Conference on Human-Agent Interaction
October 21 - 24, 2015
Kyungpook, Daegu, Republic of Korea

Acceptance Rates

Overall Acceptance Rate 121 of 404 submissions, 30%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 201
    Total Downloads
  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media