research-article

Carpé data: supporting serendipitous data integration in personal information management

Authors:
Max Van Kleek, University of Southampton, Southampton, Hampshire, UK
Daniel A. Smith, University of Southampton, Southampton, Hampshire, UK
Heather S. Packer, University of Southampton, Southampton, Hampshire, UK
Jim Skinner, University of Southampton, Southampton, Hampshire, UK
Nigel R. Shadbolt, University of Southampton, Southampton, Hampshire, UK

CHI '13: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, April 2013, Pages 2339–2348, https://doi.org/10.1145/2470654.2481324

Published: 27 April 2013

ABSTRACT

Humans' information-processing capabilities enable them to opportunistically draw and integrate knowledge from nearly any information source. However, integrating digital, structured data from diverse sources remains difficult because of the heterogeneity problems that arise when separately modelled data are brought together. In this paper, we investigate the feasibility of extending Personal Information Management (PIM) tools to support lightweight, user-driven mixing of previously un-integrated data, with the aim of allowing users to take advantage of the emerging ecosystems of structured data now becoming available. We conducted an exploratory, sequential, mixed-method investigation, starting with two pre-studies of the data integration needs and challenges, respectively, of Web-based data sources. Observations from these pre-studies led to DataPalette, an interface that introduces simple co-reference and group multi-path-selection mechanisms for working with terminologically and structurally heterogeneous data. Our lab study showed that participants readily understood the new interaction mechanisms. During subjective-choice tasks, participants using DataPalette made more carefully justified decisions than with a control set-up, even while weighing a greater number of factors and expending less effort.
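
To make the two mechanisms named in the abstract more concrete, the following is a minimal illustrative sketch and not DataPalette's implementation. It assumes that co-reference can be approximated by merging records that share a user-chosen identifying value, and that multi-path selection means reading one logical attribute through several alternative property paths. All record contents, field names, and function names (`get_path`, `select_multi_path`, `merge_coreferent`) are hypothetical.

```python
# Hypothetical records from two separately modelled sources. The same
# restaurants appear in both, but with different field names (terminological
# heterogeneity) and different nesting (structural heterogeneity).
source_a = [
    {"name": "Cafe Uno", "address": {"city": "Southampton"}, "rating": 4.5},
    {"name": "Bistro Due", "address": {"city": "Winchester"}, "rating": 3.9},
]
source_b = [
    {"title": "Cafe Uno", "location": {"town": "Southampton"}, "price": "low"},
    {"title": "Bistro Due", "location": {"town": "Winchester"}, "price": "high"},
]


def get_path(record, path):
    """Follow a tuple of keys into a nested dict; return None if a step is missing."""
    current = record
    for key in path:
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


def select_multi_path(record, paths):
    """Read one logical attribute via several alternative paths (assumed semantics).

    Returns the value found at the first path that exists in the record.
    """
    for path in paths:
        value = get_path(record, path)
        if value is not None:
            return value
    return None


def merge_coreferent(records, key_paths):
    """Treat records with the same identifying value as one entity (assumed semantics).

    Fields of co-referent records are combined into a single dict per entity.
    """
    merged = {}
    for record in records:
        key = select_multi_path(record, key_paths)
        if key is None:
            continue  # no identifying value, so the record cannot be matched
        merged.setdefault(key, {}).update(record)
    return merged


# Declare that "name" in source A and "title" in source B identify the same thing.
entities = merge_coreferent(source_a + source_b, [("name",), ("title",)])

# Select the city of every entity, whichever structure it was stored under.
cities = {
    key: select_multi_path(entity, [("address", "city"), ("location", "town")])
    for key, entity in entities.items()
}
```

In this sketch each merged entity carries both the rating from the first source and the price from the second, so a user comparing options could weigh both factors at once. Exact string matching on the identifying value is a simplification chosen for brevity.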

Published in
CHI '13: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
April 2013, 3550 pages
ISBN: 9781450318990
DOI: 10.1145/2470654
General Chair: Wendy E. Mackay (INRIA)
Program Chairs: Stephen Brewster (Glasgow University), Susanne Bødker (University of Aarhus)
Copyright © 2013 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
end-user data integration
mash-ups
personal information management
sensemaking with data
Acceptance Rates
CHI '13 Paper Acceptance Rate: 392 of 1,963 submissions, 20%
Overall Acceptance Rate: 6,199 of 26,314 submissions, 24%