skip to main content
10.1145/2531602.2531689acmconferencesArticle/Chapter ViewAbstractPublication PagescscwConference Proceedingsconference-collections
research-article

Capturing quality: retaining provenance for curated volunteer monitoring data

Published:15 February 2014Publication History

ABSTRACT

The "real world" nature of field-based citizen science involves unique data management challenges that distinguish it from projects that involve only Internet-mediated activities. In particular, many data contribution and review practices are often accomplished "offline' via paper or general-purpose software like Excel. This can lead to integration challenges when attempting to implement project-specific ICT with full revision and provenance tracking. In this work, we explore some of the current challenges and opportunities in implementing ICT for managing volunteer monitoring data. Our two main contributions are: a general outline of the workflow tasks common to field-based data collection, and a novel data model for preserving provenance metadata that allows for ongoing data exchange between disparate technical systems and participant skill levels. We conclude with applications for other domains, such as hydrologic forecasting and crisis informatics, as well as directions for future research.

References

  1. Cifelli, R., Doesken, N., Kennedy, P., Carey, L. D., Rutledge, S. A., Gimmestad, C., and Depue, T. The community collaborative rain, hail, and snow network: Informal education for scientists and citizens. Bulletin of the American Meteorological Society 86, 8 (2005), 1069--1077.Google ScholarGoogle ScholarCross RefCross Ref
  2. Federal Geographic Data Committee and others. FGDC-STD-001--1998. Content standard for digital geospatial metadata (1998).Google ScholarGoogle Scholar
  3. Fegraus, E. H., Andelman, S., Jones, M. B., and Schildhauer, M. Maximizing the value of ecological data with structured metadata: An introduction to ecological metadata language (EML) and principles for metadata creation. Bulletin of the Ecological Society of America 86, 3 (2005), 158--168.Google ScholarGoogle ScholarCross RefCross Ref
  4. Firehock, K., and West, J. A brief history of volunteer biological water monitoring using macroinvertebrates. Journal of the North American Benthological Society 14, 1 (1995), 197--202.Google ScholarGoogle ScholarCross RefCross Ref
  5. Gil, Y., Miles, S., Belhajjame, K., Deus, H., Garijo, D., Klyne, G., Missier, P., Soiland-Reyes, S., and Zednik, S. A Primer for the PROV Provenance Model. W3C, 2012. http://www.w3.org/TR/prov-primer/.Google ScholarGoogle Scholar
  6. Halfaker, A., Geiger, R. S., Morgan, J. T., and Riedl, J. The rise and decline of an open collaboration system: How Wikipedia's reaction to popularity is causing its decline. American Behavioral Scientist 57, 5 (2013), 664--688.Google ScholarGoogle ScholarCross RefCross Ref
  7. Hartung, C., Lerer, A., Anokwa, Y., Tseng, C., Brunette, W., and Borriello, G. Open Data Kit: Tools to build information services for developing regions. In Proceedings of the 4th ACM/IEEE International Conference on Information and Communication Technologies and Development, ACM (2010), 18. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Howe, J. The Rise of Crowdsourcing. Wired Magazine 14, 6 (2006), 1--4.Google ScholarGoogle Scholar
  9. Juran, J. M. Quality control handbook. McGraw-Hill, 1962.Google ScholarGoogle Scholar
  10. Kelling, S., Yu, J., Gerbracht, J., and Wong, W.-K. Emergent filters: Automated data verification in a large-scale citizen science project. In Proceedings of Workshops at the Seventh International Conference on eScience, IEEE (2011), 20--27. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kim, S., Mankoff, J., and Paulos, E. Sensr: Evaluating a flexible framework for authoring mobile data-collection tools for citizen science. In Proceedings of the 2013 conference on Computer supported cooperative work, ACM (2013), 1453--1462. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Liu, J., and Ram, S. Who does what: Collaboration patterns in the Wikipedia and their impact on article quality. ACM Transactions on Management Information Systems (TMIS) 2, 2 (2011), 11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Lukyanenko, R., Parsons, J., and Wiersma, Y. Citizen science 2.0: Data management principles to harness the power of the crowd. In Service-Oriented Perspectives in Design Science Research. Springer, 2011, 465--473. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Newman, G., Graham, J., Crall, A., and Laituri, M. The art and science of multi-scale citizen science support. Ecological Informatics 6, 3 (2011), 217--227.Google ScholarGoogle ScholarCross RefCross Ref
  15. Okolloh, O. Ushahidi, or 'testimony': Web 2.0 tools for crowdsourcing crisis information. Participatory Learning and Action 59, 1 (2009), 65--70.Google ScholarGoogle Scholar
  16. Orlandi, F., and Passant, A. Modelling provenance of DBpedia resources using Wikipedia contributions. Web Semantics: Science, Services and Agents on the World Wide Web 9, 2 (2011), 149--164.Google ScholarGoogle ScholarCross RefCross Ref
  17. Priedhorsky, R., and Terveen, L. Wiki grows up: Arbitrary data models, access control, and beyond. In Proceedings of the Seventh International Symposium on Wikis and Open Collaboration, ACM (2011), 63--71. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Raddick, J., Lintott, C., Schawinski, K., Thomas, D., Nichol, R., Andreescu, D., Bamford, S., Land, K., Murray, P., Slosar, A., et al. Galaxy Zoo: An experiment in public science participation. In Bulletin of the American Astronomical Society, vol. 39 (2007), 892.Google ScholarGoogle Scholar
  19. Ram, S., and Liu, J. Understanding the semantics of data provenance to support active conceptual modeling. In Active conceptual modeling of learning. Springer, 2007, 17--29. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Ribes, D., and Finholt, T. A. Representing community: Knowing users in the face of changing constituencies. In Proceedings of the 2008 ACM conference on Computer supported cooperative work, ACM (2008), 107--116. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Roth, M., and Tan, W.-C. Data integration and data exchange: It's really about time. In Proceedings of the 6th Biennial Conference on Innovative Data Systems Research, CIDR (2013).Google ScholarGoogle Scholar
  22. Sheppard, S. A. wq: A modular framework for collecting, storing, and utilizing experiential VGI. In Proceedings of the 1st ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information, ACM (2012), 62--69. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Sheppard, S. A., and Terveen, L. Quality is a verb: The operationalization of data quality in a citizen science community. In Proceedings of the Seventh International Symposium on Wikis and Open Collaboration, ACM (2011), 29--38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Smith, A. Smartphone adoption and usage. Tech. rep., Pew Internet & American Life Project, Washington, DC, 2011.Google ScholarGoogle Scholar
  25. Stvilia, B., Twidale, M. B., Smith, L. C., and Gasser, L. Information quality work organization in Wikipedia. Journal of the American society for information science and technology 59, 6 (2008), 983--1001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Sullivan, B. L., Wood, C. L., Iliff, M. J., Bonney, R. E., Fink, D., and Kelling, S. eBird: a citizen-based bird observation network in the biological sciences. Biological Conservation 142, 10 (Oct. 2009), 2282--2292.Google ScholarGoogle ScholarCross RefCross Ref
  27. Vrandečić, D., Ratnakar, V., Krötzsch, M., and Gil, Y. Shortipedia: Aggregating and curating Semantic Web data. Web Semantics: Science, Services and Agents on the World Wide Web 9, 3 (2011), 334--338. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Wang, Z., Dong, H., Kelly, M., Macklin, J. A., Morris, P. J., and Morris, R. A. Filtered-Push: A Map-Reduce platform for collaborative taxonomic data management. In Computer Science and Information Engineering, 2009 WRI World Congress on, vol. 3, IEEE (2009), 731--735. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Wieczorek, J., Bloom, D., Guralnick, R., Blum, S., Döring, M., Giovanni, R., Robertson, T., and Vieglais, D. Darwin Core: An evolving community-developed biodiversity data standard. PLoS One 7, 1 (2012), e29715.Google ScholarGoogle ScholarCross RefCross Ref
  30. Wiggins, A. Free as in puppies: Compensating for ICT constraints in citizen science. In Proceedings of the 2013 conference on Computer supported cooperative work, ACM (2013), 1469--1480. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Wiggins, A., Bonney, R., Graham, E., Henderson, S., Kelling, S., Littauer, R., LeBuhn, G., Lotts, K., Michener, W., Newman, G., Russell, E., Stevenson, R., and Weltzin, J. Data management guide for public participation in scientific research. DataONE, 2013.Google ScholarGoogle Scholar
  32. Wiggins, A., and Crowston, K. From conservation to crowdsourcing: A typology of citizen science. In HICSS'11, IEEE Computer Society (2011), 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Wiggins, A., Newman, G., Stevenson, R. D., and Crowston, K. Mechanisms for data quality and validation in citizen science. In Proceedings of Workshops at the Seventh International Conference on eScience, IEEE (2011), 14--19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Wilderman, C. C. Models of community science: Design lessons from the field. In Citizen Science Toolkit Conference (Cornell Laboratory of Ornithology, Ithaca, NY, 2007).Google ScholarGoogle Scholar

Index Terms

  1. Capturing quality: retaining provenance for curated volunteer monitoring data

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CSCW '14: Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
      February 2014
      1600 pages
      ISBN:9781450325400
      DOI:10.1145/2531602

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 15 February 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      CSCW '14 Paper Acceptance Rate134of497submissions,27%Overall Acceptance Rate2,235of8,521submissions,26%

      Upcoming Conference

      CSCW '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader