short-paper

PixelTone: a multimodal interface for image editing

Authors:

Eytan Adar

CHI EA '13: CHI '13 Extended Abstracts on Human Factors in Computing Systems

Pages 2829 - 2830

https://doi.org/10.1145/2468356.2479533

Published: 27 April 2013

Abstract

Photo editing can be a challenging task, and it becomes even more difficult on the small, portable screens of mobile devices that are now frequently used to capture and edit images. To address this problem we present PixelTone, a multimodal photo editing interface that combines speech and direct manipulation. In this video, we demonstrate how our system uses natural language for expressing users' desired changes to an image. We also demonstrate how we combine natural language and touch gestures for creating named references and sketching to localize image operations to specific regions.

Supplementary Material

suppl.mov (vid0138-file3.mp4)

Supplemental video, 90.88 MB

Index Terms

PixelTone: a multimodal interface for image editing
1. Human-centered computing
  1. Human computer interaction (HCI)

Published In

CHI EA '13: CHI '13 Extended Abstracts on Human Factors in Computing Systems

April 2013

3360 pages

ISBN:9781450319522

DOI:10.1145/2468356

General Chair: Wendy E. Mackay (INRIA)

Program Chairs: Stephen Brewster (Glasgow University), Susanne Bødker (University of Aarhus)

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Short-paper

Conference

CHI '13

Sponsor:

SIGCHI

CHI '13: CHI Conference on Human Factors in Computing Systems

April 27 - May 2, 2013

Paris, France

Acceptance Rates

CHI EA '13 Paper Acceptance Rate 630 of 1,963 submissions, 32%;

Overall Acceptance Rate 6,164 of 23,696 submissions, 26%

Bibliometrics

Total Citations: 8
Total Downloads: 236
Downloads (Last 12 months): 5
Downloads (Last 6 weeks): 0

Reflects downloads up to 18 Jan 2025

Cited By

Yang J, Zhang L, Lu H (2024). Referring Image Segmentation With Fine-Grained Semantic Funneling Infusion. IEEE Transactions on Neural Networks and Learning Systems 35:10, 14727-14738. Online publication date: Oct-2024.
https://doi.org/10.1109/TNNLS.2023.3281372
Park J, O'Brien J, Cai C, Morris M, Liang P, Bernstein M (2023). Generative Agents: Interactive Simulacra of Human Behavior. Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, 1-22. Online publication date: 29-Oct-2023.
https://dl.acm.org/doi/10.1145/3586183.3606763
Brachman M, Pan Q, Do H, Dugan C, Chaudhary A, Johnson J, Rai P, Chakraborti T, Gschwind T, Laredo J, Miksovic C, Scotton P, Talamadupula K, Thomas G (2023). Follow the Successful Herd: Towards Explanations for Improved Use and Mental Models of Natural Language Systems. Proceedings of the 28th International Conference on Intelligent User Interfaces, 220-239. Online publication date: 27-Mar-2023.
https://doi.org/10.1145/3581641.3584088
Xu F, Luo B, Zhang C, Xu L, Pu M, Li B (2023). Vision-Aware Language Reasoning for Referring Image Segmentation. Neural Processing Letters 55:8, 11313-11331. Online publication date: 2-Aug-2023.
https://doi.org/10.1007/s11063-023-11377-z
Fraser C, Markel J, Basa N, Dontcheva M, Klemmer S, Guimbretière F, Bernstein M, Reinecke K (2019). ReMap. Adjunct Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology, 96-98. Online publication date: 14-Oct-2019.
https://dl.acm.org/doi/10.1145/3332167.3356884
Chang M, Truong A, Wang O, Agrawala M, Kim J, Brewster S, Fitzpatrick G, Cox A, Kostakos V (2019). How to Design Voice Based Navigation for How-To Videos. Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 1-11. Online publication date: 2-May-2019.
https://dl.acm.org/doi/10.1145/3290605.3300931
Li J, Kim S, Miele J, Agrawala M, Follmer S, Brewster S, Fitzpatrick G, Cox A, Kostakos V (2019). Editing Spatial Layouts through Tactile Templates for People with Visual Impairments. Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 1-11. Online publication date: 2-May-2019.
https://dl.acm.org/doi/10.1145/3290605.3300436
Cheng M, Zheng S, Lin W, Vineet V, Sturgess P, Crook N, Mitra N, Torr P (2014). ImageSpirit. ACM Transactions on Graphics 34:1, 1-11. Online publication date: 29-Dec-2014.
https://dl.acm.org/doi/10.1145/2682628
