ABSTRACT
The information processing capabilities of humans enable them to opportunistically draw and integrate knowledge from nearly any information source. However, the integration of digital, structured data from diverse sources remains difficult, due to problems of heterogeneity that arise when data modelled separately are brought together. In this paper, we present an investigation of the feasibility of extending Personal Information Management (PIM) tools to support lightweight, user-driven mixing of previously un-integrated data, with the objective of allowing users to take advantage of the emerging ecosystems of structured data currently becoming available. In this study, we conducted an exploratory, sequential, mixed-method investigation, starting with two pre-studies of the data integration needs and challenges, respectively, of Web-based data sources. Observations from these pre-studies led to DataPalette, an interface that introduced simple co-reference and group multi-path-selection mechanisms for working with terminologically and structurally heterogeneous data. Our lab study showed that participants readily understood the new interaction mechanisms which were introduced. Participants made more carefully justified decisions, even while weighing a greater number of factors, moreover expending less effort, during subjective-choice tasks when using DataPalette, than with a control set-up.
- Bergman, O., Beyth-marom, R., and Nachmias, R. The user-subjective approach to personal information management systems. JASIST 54 (2003), 872--878. Google ScholarDigital Library
- Bernstein, M., Van Kleek, M., Karger, D., and Schraefel, M. Information scraps: How and why information eludes our personal information management tools. TOIS 26, 4 (2008), 24. Google ScholarDigital Library
- Cai, Y., Dong, X. L., Halevy, A., Liu, J. M., and Madhavan, J. Personal information management with SEMEX. In Proc. SIGMOD '05 (2005), 921--923. Google ScholarDigital Library
- Castano, S., Ferrara, A., and Montanelli, S. Matching ontologies in open networked systems: Techniques and applications. Journal on Data Semantics V (2006), 25--63. Google ScholarDigital Library
- Doan, A., Madhavan, J., Dhamankar, R., Domingos, P., and Halevy, A. Learning to match ontologies on the Semantic Web. VLDB Journal 12, 4 (2003), 303--319. Google ScholarDigital Library
- Dontcheva, M., Drucker, S. M., Salesin, D., and Cohen, M. F. Relations, cards, and search templates: user-guided web data integration and layout. In Proc. UIST '07 (2007), 61--70. Google ScholarDigital Library
- Dumais, S., Cutrell, E., Cadiz, J., Jancke, G., Sarin, R., and Robbins, D. C. Stuff i've seen: a system for personal information retrieval and re-use. In SIGIR '03, ACM (2003), 72--79. Google ScholarDigital Library
- Ennals, R., Brewer, E., Garofalakis, M., Shadle, M., and Gandhi, P. Intel Mash Maker: join the web. SIGMOD Rec. 36, 4 (2007), 27--33. Google ScholarDigital Library
- Euzenat, J. An API for ontology alignment. Proc ISWC '04 (2004), 698--712.Google ScholarDigital Library
- Fagan, J. C. Mashing up Multiple Web Feeds Using Yahoo! Pipes. Computers in Libraries 27, 10 (2007), 10--17.Google Scholar
- Halevy, A., Rajaraman, A., and Ordille, J. Data integration: the teenage years. In Proc. VLDB '06 (2006), 9--16. Google ScholarDigital Library
- Hart, S., and Staveland, L. Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research. Human mental workload 1 (1988), 139--183.Google Scholar
- Huynh, D., and Karger, D. Parallax and companion: Set-based browsing for the data web. In Proc. WWW '09 (2009).Google Scholar
- Huynh, D., Karger, D., and Quan, D. Haystack: A Platform for Creating, Organizing and Visualizing Information Using RDF, 2002.Google Scholar
- Huynh, D., Miller, R., and Karger, D. Potluck: Data mash-up tool for casual users. JWS 6, 4 (2008), 274--282. Google ScholarDigital Library
- Lin, J., Wong, J., Nichols, J., Cypher, A., and Lau, T. A. End-user programming of mashups with vegemite. In Proc. IUI '09, ACM (2009), 97--106. Google ScholarDigital Library
- Shadbolt, N., Berners-Lee, T., and Hall, W. The Semantic Web Revisited. IEEE Intelligent Systems 21, 3 (2006), 96--101. Google ScholarDigital Library
- Suchanek, F., Abiteboul, S., and Senellart, P. PARIS: probabilistic alignment of relations, instances, and schema. Proc. VLDB '11 5, 3 (2011), 157--168. Google ScholarDigital Library
- Wong, J., and Hong, J. I. Making mashups with marmite: towards end-user programming for the web. In Proc. CHI '07, ACM (2007), 1435--1444. Google ScholarDigital Library
Index Terms
- Carpé data: supporting serendipitous data integration in personal information management
Recommendations
Anticipating ageing: Older adults reading their medical records
Highlights- Older adults, who are still active in working life but approaching retirement, differ from other age groups by their health information behaviour.
AbstractIn spite of the general interest in health information behaviour, there is little earlier research on how older adults, who are still active in working life but approaching retirement, differ from other age groups. A survey with ...
For Richer, for Poorer, in Sickness or in Health...: The Long-Term Management of Personal Information
CHI EA '16: Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing SystemsPeople are amassing large personal information stores. These stores present rich opportunities for analysis and use in matters of wealth, health, living and legacy. But these stores also bring with them new challenges for managing information across ...
Towards task-based personal information management evaluations
SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrievalPersonal Information Management (PIM) is a rapidly growing area of research concerned with how people store, manage and refind information. A feature of PIM research is that many systems have been designed to assist users manage and refind information, ...
Comments