ABSTRACT
The "real world" nature of field-based citizen science involves unique data management challenges that distinguish it from projects that involve only Internet-mediated activities. In particular, many data contribution and review practices are often accomplished "offline' via paper or general-purpose software like Excel. This can lead to integration challenges when attempting to implement project-specific ICT with full revision and provenance tracking. In this work, we explore some of the current challenges and opportunities in implementing ICT for managing volunteer monitoring data. Our two main contributions are: a general outline of the workflow tasks common to field-based data collection, and a novel data model for preserving provenance metadata that allows for ongoing data exchange between disparate technical systems and participant skill levels. We conclude with applications for other domains, such as hydrologic forecasting and crisis informatics, as well as directions for future research.
- Cifelli, R., Doesken, N., Kennedy, P., Carey, L. D., Rutledge, S. A., Gimmestad, C., and Depue, T. The community collaborative rain, hail, and snow network: Informal education for scientists and citizens. Bulletin of the American Meteorological Society 86, 8 (2005), 1069--1077.Google ScholarCross Ref
- Federal Geographic Data Committee and others. FGDC-STD-001--1998. Content standard for digital geospatial metadata (1998).Google Scholar
- Fegraus, E. H., Andelman, S., Jones, M. B., and Schildhauer, M. Maximizing the value of ecological data with structured metadata: An introduction to ecological metadata language (EML) and principles for metadata creation. Bulletin of the Ecological Society of America 86, 3 (2005), 158--168.Google ScholarCross Ref
- Firehock, K., and West, J. A brief history of volunteer biological water monitoring using macroinvertebrates. Journal of the North American Benthological Society 14, 1 (1995), 197--202.Google ScholarCross Ref
- Gil, Y., Miles, S., Belhajjame, K., Deus, H., Garijo, D., Klyne, G., Missier, P., Soiland-Reyes, S., and Zednik, S. A Primer for the PROV Provenance Model. W3C, 2012. http://www.w3.org/TR/prov-primer/.Google Scholar
- Halfaker, A., Geiger, R. S., Morgan, J. T., and Riedl, J. The rise and decline of an open collaboration system: How Wikipedia's reaction to popularity is causing its decline. American Behavioral Scientist 57, 5 (2013), 664--688.Google ScholarCross Ref
- Hartung, C., Lerer, A., Anokwa, Y., Tseng, C., Brunette, W., and Borriello, G. Open Data Kit: Tools to build information services for developing regions. In Proceedings of the 4th ACM/IEEE International Conference on Information and Communication Technologies and Development, ACM (2010), 18. Google ScholarDigital Library
- Howe, J. The Rise of Crowdsourcing. Wired Magazine 14, 6 (2006), 1--4.Google Scholar
- Juran, J. M. Quality control handbook. McGraw-Hill, 1962.Google Scholar
- Kelling, S., Yu, J., Gerbracht, J., and Wong, W.-K. Emergent filters: Automated data verification in a large-scale citizen science project. In Proceedings of Workshops at the Seventh International Conference on eScience, IEEE (2011), 20--27. Google ScholarDigital Library
- Kim, S., Mankoff, J., and Paulos, E. Sensr: Evaluating a flexible framework for authoring mobile data-collection tools for citizen science. In Proceedings of the 2013 conference on Computer supported cooperative work, ACM (2013), 1453--1462. Google ScholarDigital Library
- Liu, J., and Ram, S. Who does what: Collaboration patterns in the Wikipedia and their impact on article quality. ACM Transactions on Management Information Systems (TMIS) 2, 2 (2011), 11. Google ScholarDigital Library
- Lukyanenko, R., Parsons, J., and Wiersma, Y. Citizen science 2.0: Data management principles to harness the power of the crowd. In Service-Oriented Perspectives in Design Science Research. Springer, 2011, 465--473. Google ScholarDigital Library
- Newman, G., Graham, J., Crall, A., and Laituri, M. The art and science of multi-scale citizen science support. Ecological Informatics 6, 3 (2011), 217--227.Google ScholarCross Ref
- Okolloh, O. Ushahidi, or 'testimony': Web 2.0 tools for crowdsourcing crisis information. Participatory Learning and Action 59, 1 (2009), 65--70.Google Scholar
- Orlandi, F., and Passant, A. Modelling provenance of DBpedia resources using Wikipedia contributions. Web Semantics: Science, Services and Agents on the World Wide Web 9, 2 (2011), 149--164.Google ScholarCross Ref
- Priedhorsky, R., and Terveen, L. Wiki grows up: Arbitrary data models, access control, and beyond. In Proceedings of the Seventh International Symposium on Wikis and Open Collaboration, ACM (2011), 63--71. Google ScholarDigital Library
- Raddick, J., Lintott, C., Schawinski, K., Thomas, D., Nichol, R., Andreescu, D., Bamford, S., Land, K., Murray, P., Slosar, A., et al. Galaxy Zoo: An experiment in public science participation. In Bulletin of the American Astronomical Society, vol. 39 (2007), 892.Google Scholar
- Ram, S., and Liu, J. Understanding the semantics of data provenance to support active conceptual modeling. In Active conceptual modeling of learning. Springer, 2007, 17--29. Google ScholarDigital Library
- Ribes, D., and Finholt, T. A. Representing community: Knowing users in the face of changing constituencies. In Proceedings of the 2008 ACM conference on Computer supported cooperative work, ACM (2008), 107--116. Google ScholarDigital Library
- Roth, M., and Tan, W.-C. Data integration and data exchange: It's really about time. In Proceedings of the 6th Biennial Conference on Innovative Data Systems Research, CIDR (2013).Google Scholar
- Sheppard, S. A. wq: A modular framework for collecting, storing, and utilizing experiential VGI. In Proceedings of the 1st ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information, ACM (2012), 62--69. Google ScholarDigital Library
- Sheppard, S. A., and Terveen, L. Quality is a verb: The operationalization of data quality in a citizen science community. In Proceedings of the Seventh International Symposium on Wikis and Open Collaboration, ACM (2011), 29--38. Google ScholarDigital Library
- Smith, A. Smartphone adoption and usage. Tech. rep., Pew Internet & American Life Project, Washington, DC, 2011.Google Scholar
- Stvilia, B., Twidale, M. B., Smith, L. C., and Gasser, L. Information quality work organization in Wikipedia. Journal of the American society for information science and technology 59, 6 (2008), 983--1001. Google ScholarDigital Library
- Sullivan, B. L., Wood, C. L., Iliff, M. J., Bonney, R. E., Fink, D., and Kelling, S. eBird: a citizen-based bird observation network in the biological sciences. Biological Conservation 142, 10 (Oct. 2009), 2282--2292.Google ScholarCross Ref
- Vrandečić, D., Ratnakar, V., Krötzsch, M., and Gil, Y. Shortipedia: Aggregating and curating Semantic Web data. Web Semantics: Science, Services and Agents on the World Wide Web 9, 3 (2011), 334--338. Google ScholarDigital Library
- Wang, Z., Dong, H., Kelly, M., Macklin, J. A., Morris, P. J., and Morris, R. A. Filtered-Push: A Map-Reduce platform for collaborative taxonomic data management. In Computer Science and Information Engineering, 2009 WRI World Congress on, vol. 3, IEEE (2009), 731--735. Google ScholarDigital Library
- Wieczorek, J., Bloom, D., Guralnick, R., Blum, S., Döring, M., Giovanni, R., Robertson, T., and Vieglais, D. Darwin Core: An evolving community-developed biodiversity data standard. PLoS One 7, 1 (2012), e29715.Google ScholarCross Ref
- Wiggins, A. Free as in puppies: Compensating for ICT constraints in citizen science. In Proceedings of the 2013 conference on Computer supported cooperative work, ACM (2013), 1469--1480. Google ScholarDigital Library
- Wiggins, A., Bonney, R., Graham, E., Henderson, S., Kelling, S., Littauer, R., LeBuhn, G., Lotts, K., Michener, W., Newman, G., Russell, E., Stevenson, R., and Weltzin, J. Data management guide for public participation in scientific research. DataONE, 2013.Google Scholar
- Wiggins, A., and Crowston, K. From conservation to crowdsourcing: A typology of citizen science. In HICSS'11, IEEE Computer Society (2011), 1--10. Google ScholarDigital Library
- Wiggins, A., Newman, G., Stevenson, R. D., and Crowston, K. Mechanisms for data quality and validation in citizen science. In Proceedings of Workshops at the Seventh International Conference on eScience, IEEE (2011), 14--19. Google ScholarDigital Library
- Wilderman, C. C. Models of community science: Design lessons from the field. In Citizen Science Toolkit Conference (Cornell Laboratory of Ornithology, Ithaca, NY, 2007).Google Scholar
Index Terms
- Capturing quality: retaining provenance for curated volunteer monitoring data
Recommendations
Where the paddle meets the stream: bridging systems design theory and community-based monitoring practice
CSCW Companion '14: Proceedings of the companion publication of the 17th ACM conference on Computer supported cooperative work & social computingMy research is focused on computer support for community-based monitoring, at the intersection of citizen science, volunteered geographic information (VGI), and mobile crowdsourcing. Local-scale volunteer monitoring programs face a particular tension: ...
E-governance: scope and implementation challenges in Bangladesh
ICEGOV '07: Proceedings of the 1st international conference on Theory and practice of electronic governanceWidespread global acceptance of electronic governance bears testimony to the observation that e-governance has rocked the governing systems in the world to a degree. E-governance is the information and communication technology (ICT) enabled system of ...
e-governance: challenges and opportunities
ICEGOV '08: Proceedings of the 2nd international conference on Theory and practice of electronic governanceThe advent of information and communication technology (ICT) together with the diffusion of the technological advances has resulted in the digital revolution and the emergence of the Information age. The revolution has created an enormous impact on ...
Comments