research-article

Context-Informed Scheduling and Analysis: Improving Accuracy of Mobile Self-Reports

Authors:
Niels van Berkel

The University of Melbourne, Melbourne, Australia

The University of Melbourne, Melbourne, Australia
View Profile

,
Jorge Goncalves

The University of Melbourne, Melbourne, Australia

The University of Melbourne, Melbourne, Australia
View Profile

,
Peter Koval

The University of Melbourne, Melbourne, Australia

The University of Melbourne, Melbourne, Australia
View Profile

,
Simo Hosio

University of Oulu, Oulu, Oulu, Finland

University of Oulu, Oulu, Oulu, Finland
View Profile

,
Tilman Dingler

University of Melbourne, Melbourne, Victoria, Australia

University of Melbourne, Melbourne, Victoria, Australia
View Profile

,
Denzil Ferreira

University of Oulu, Oulu, Finland

University of Oulu, Oulu, Finland
View Profile

,
Vassilis Kostakos

University of Melbourne, Melbourne, Victoria, Australia

University of Melbourne, Melbourne, Victoria, Australia
View Profile

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing SystemsMay 2019Paper No.: 51Pages 1–12https://doi.org/10.1145/3290605.3300281

Published:02 May 2019Publication History

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems

Pages 1–12

ABSTRACT

Mobile self-reports are a popular technique to collect participant labelled data in the wild. While literature has focused on increasing participant compliance to self-report questionnaires, relatively little work has assessed response accuracy. In this paper, we investigate how participant context can affect response accuracy and help identify strategies to improve the accuracy of mobile self-report data. In a 3-week study we collect over 2,500 questionnaires containing both verifiable and non-verifiable questions. We find that response accuracy is higher for questionnaires that arrive when the phone is not in ongoing or very recent use. Furthermore, our results show that long completion times are an indicator of a lower accuracy. Using contextual mechanisms readily available on smartphones, we are able to explain up to 13% of the variance in participant accuracy. We offer actionable recommendations to assist researchers in their future deployments of mobile self-report studies.

References

Sally Andrews, David A. Ellis, Heather Shaw, and Lukasz Piwek. 2015. Beyond Self-Report: Tools to Compare Estimated and Real-World Smartphone Use. PLOS ONE 10, 10 (2015), 1--9.Google ScholarCross Ref
Alan Baddeley. 1992. Working memory. Science 255, 5044 (1992), 556--559.Google ScholarCross Ref
Douglas Bates, Martin Mächler, Ben Bolker, and Steve Walker. 2015. Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software 67, 1 (2015), 1--48.Google ScholarCross Ref
Daniel J. Beal and Howard M. Weiss. 2003. Methods of Ecological Momentary Assessment in Organizational Research. Organizational Research Methods 6, 4 (2003), 440--464.Google ScholarCross Ref
S. L. Beilock and M. S. Decaro. 2007. From poor performance to success under stress: working memory, strategy selection, and mathematical problem solving under pressure. Journal of Experimental Psychology: Learning, Memory, and Cognition 33, 6 (2007), 983--998.Google ScholarCross Ref
Niels van Berkel, Matthias Budde, Senuri Wijenayake, and Jorge Goncalves. 2018. Improving Accuracy in Mobile Human Contributions: An Overview. In Proceedings of the 2018 ACM International Joint Conference and 2018 International Symposium on Pervasive and Ubiquitous Computing and Wearable Computers (UbiComp '18). ACM, New York, NY, USA, 594--599. Google ScholarDigital Library
Niels van Berkel, Denzil Ferreira, and Vassilis Kostakos. 2017. The Experience Sampling Method on Mobile Devices. Comput. Surveys 50, 6, Article 93 (2017), 40 pages. Google ScholarDigital Library
Niels van Berkel, Jorge Goncalves, Simo Hosio, and Vassilis Kostakos. 2017. Gamification of Mobile Experience Sampling Improves Data Quality and Quantity. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 1, 3, Article 107 (2017), 21 pages. Google ScholarDigital Library
Niels van Berkel, Jorge Goncalves, Lauri Lovén, Denzil Ferreira, Simo Hosio, and Vassilis Kostakos. 2019. Effect of Experience Sampling Schedules on Response Rate and Recall Accuracy of Objective SelfReports. International Journal of Human-Computer Studies (2019).Google Scholar
Benjamin M. Bolker, Mollie E. Brooks, Connie J. Clark, Shane W. Geange, John R. Poulsen, M. Henry H. Stevens, and Jada-Simone S. White. 2009. Generalized linear mixed models: a practical guide for ecology and evolution. Trends in Ecology & Evolution 24, 3 (2009), 127--135.Google ScholarCross Ref
Catherine E. Connelly, David Zweig, Jane Webster, and John P. Trougakos. 2011. Knowledge hiding in organizations. Journal of Organizational Behavior 33, 1 (2011), 64--88.Google ScholarCross Ref
S. Consolvo and M. Walker. 2003. Using the experience sampling method to evaluate ubicomp applications. IEEE Pervasive Computing 2, 2 (2003), 24--31. Google ScholarDigital Library
N. Cowan. 2005. Working Memory Capacity. Psychology Press.Google Scholar
Nelson Cowan. 2010. The Magical Mystery Four: How Is Working Memory Capacity Limited, and Why? Current Directions in Psychological Science 19, 1 (2010), 51--57. PMID: 20445769.Google ScholarCross Ref
M. Csikszentmihalyi, R. Larson, and S. Prescott. 1977. The ecology of adolescent activity and experience. Journal of Youth and Adolescence 6, 3 (1977), 281--294.Google ScholarCross Ref
Mihaly Csikszentmihalyi and Reed Larson. 2014. Validity and Reliability of the Experience-Sampling Method. Springer Netherlands, Dordrecht, 35--54.Google Scholar
Fred J. Damerau. 1964. A Technique for Computer Detection and Correction of Spelling Errors. Commun. ACM 7, 3 (1964), 171--176. Google ScholarDigital Library
Tilman Dingler, Albrecht Schmidt, and Tonja Machulla. 2017. Building Cognition-Aware Systems: A Mobile Toolkit for Extracting Time-ofDay Fluctuations of Cognitive Performance. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 1, 3, Article 47 (2017), 15 pages. Google ScholarDigital Library
Carsten Eickhoff and Arjen P. de Vries. 2013. Increasing cheat robustness of crowdsourcing tasks. Information Retrieval 16, 2 (2013), 121--137. Google ScholarDigital Library
R. W. Engle, S. W. Tuholski, J. E. Laughlin, and A. R. Conway. 1999. Working memory, short-term memory, and general fluid intelligence: a latent-variable approach. Journal of Experimental Psychology: General 128, 3 (1999), 309--331.Google ScholarCross Ref
Hossein Falaki, Ratul Mahajan, Srikanth Kandula, Dimitrios Lymberopoulos, Ramesh Govindan, and Deborah Estrin. 2010. Diversity in Smartphone Usage. In Proceedings of the 8th International Conference on Mobile Systems, Applications, and Services (MobiSys '10). ACM, New York, NY, USA, 179--194. Google ScholarDigital Library
R. Frank Falk and Nancy B Miller. 1992. A primer for soft modeling. University of Akron Press.Google Scholar
Joyce Ho and Stephen S. Intille. 2005. Using Context-aware Computing to Reduce the Perceived Burden of Interruptions from Mobile Devices. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '05). ACM, New York, NY, USA, 909--918. Google ScholarDigital Library
Gary Hsieh, Ian Li, Anind Dey, Jodi Forlizzi, and Scott E. Hudson. 2008. Using Visualizations to Increase Compliance in Experience Sampling. In Proceedings of the 10th International Conference on Ubiquitous Computing (UbiComp '08). ACM, New York, NY, USA, 164--167. Google ScholarDigital Library
Ira E. Hyman, S. Matthew Boss, Breanne M. Wise, Kira E. McKenzie, and Jenna M. Caggiano. 2009. Did you see the unicycling clown? Inattentional blindness while walking and talking on a cell phone. Applied Cognitive Psychology 24, 5 (2009), 597--607.Google ScholarCross Ref
M Iida, P. E. Shrout, J.-P Laurenceau, and Niall Bolger. 2012. Using diary methods in psychological research. (2012), 277--305.Google Scholar
Shamsi T. Iqbal and Brian P. Bailey. 2005. Investigating the Effectiveness of Mental Workload As a Predictor of Opportune Moments for Interruption. In CHI '05 Extended Abstracts on Human Factors in Computing Systems (CHI EA '05). ACM, New York, NY, USA, 1489--1492. Google ScholarDigital Library
Chang-Jae Kim, Sang-hyun Hong, Byung-Sam Kim, Joon-Pyo Cheon, Yoonki Lee, Hyun-Jung Koh, and Jaemin Lee. 2008. Comparison of various tests designed to assess the recovery of cognitive and psychomotor function after ambulatory anesthesia. Korean Journal of Anesthesiology 55, 3 (2008), 291--297.Google ScholarCross Ref
Aniket Kittur, Ed H. Chi, and Bongwon Suh. 2008. Crowdsourcing User Studies with Mechanical Turk. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '08). ACM, New York, NY, USA, 453--456. Google ScholarDigital Library
Predrag Klasnja, Beverly L. Harrison, Louis LeGrand, Anthony LaMarca, Jon Froehlich, and Scott E. Hudson. 2008. Using Wearable Sensors and Real Time Inference to Understand Human Recall of Routine Activities. In Proceedings of the 10th International Conference on Ubiquitous Computing (UbiComp '08). ACM, New York, NY, USA, 154--163. Google ScholarDigital Library
Kostadin Kushlev, Jason Proulx, and Elizabeth W. Dunn. 2016. "Silence Your Phones": Smartphone Notifications Increase Inattention and Hyperactivity Symptoms. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16). ACM, New York, NY, USA, 1011--1020. Google ScholarDigital Library
Donald A. Laird. 1925. Relative Performance of College Students as Conditioned by Time of Day and Day of Week. Journal of Experimental Psychology 8, 1 (1925), 50.Google ScholarCross Ref
Reed Larson and Mihaly Csikszentmihalyi. 2014. The Experience Sampling Method. Springer Netherlands, Dordrecht, 21--34.Google Scholar
Neal Lathia, Kiran K. Rachuri, Cecilia Mascolo, and Peter J. Rentfrow. 2013. Contextual Dissonance: Design Bias in Sensor-based Experience Sampling Methods. In Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing (UbiComp '13). ACM, New York, NY, USA, 183--192. Google ScholarDigital Library
V. R. LeBlanc, M. M. McConnell, and S. D. Monteiro. 2015. Predictable chaos: a review of the effects of emotions on attention, memory and decision making. Advances in Health Sciences Education. Theory and Practice 20, 1 (2015), 265--282.Google Scholar
K. O. McCabe, L. Mack, and W. Fleeson. 2012. A guide for data cleaning in experience sampling studies. Guilford Press, New York, NY, US, 321--338.Google Scholar
Abhinav Mehrotra, Jo Vermeulen, Veljko Pejovic, and Mirco Musolesi. 2015. Ask, but Don't Interrupt: The Case for Interruptibility-aware Mobile Experience Sampling. In Adjunct Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2015 ACM International Symposium on Wearable Computers (UbiComp/ISWC'15 Adjunct). ACM, New York, NY, USA, 723--732. Google ScholarDigital Library
M. R. U. Meyer, C. Wu, and S. M. Walsh. 2016. Theoretical Antecedents of Standing at Work: An Experience Sampling Approach Using the Theory of Planned Behavior. AIMS Public Health 3, 4 (2016), 682--701.Google ScholarCross Ref
George A Miller. 1956. The Magical Number Seven, Plus or Minus Two: Some limits on our capacity for processing information. Psychological review 63, 2 (1956), 81.Google Scholar
Minitab. 2014. How to Interpret a Regression Model with Low R-squared and Low P values. https://bit.ly/2otiSw5Google Scholar
Martin Pielot, Tilman Dingler, Jose San Pedro, and Nuria Oliver. 2015. When Attention is Not Scarce - Detecting Boredom from Mobile Phone Usage. In Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing (UbiComp '15). ACM, New York, NY, USA, 825--836. Google ScholarDigital Library
Suzanne Prescott and Mihaly Csikszentmihalyi. 1981. Environmental effects on cognitive and affective states: The experiential time sampling approach. Social Behavior and Personality: an international journal 9, 1 (1981), 23--32.Google Scholar
Robert W. Reeder, Adrienne Porter Felt, Sunny Consolvo, Nathan Malkin, Christopher Thompson, and Serge Egelman. 2018. An Experience Sampling Study of User Reactions to Browser Warnings in the Field. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (CHI '18). ACM, New York, NY, USA, Article 512, 13 pages. Google ScholarDigital Library
Harry T. Reis and Shelly L. Gable. 2000. Event-sampling and other methods for studying everyday experience. Handbook of Research Methods in Social and Personality Psychology (2000), 190--222.Google Scholar
Stephanie Rosenthal, Anind K. Dey, and Manuela Veloso. 2011. Using Decision-Theoretic Experience Sampling to Build Personalized Mobile Phone Interruption Models. In Pervasive Computing, Kent Lyons, Jeffrey Hightower, and Elaine M. Huang (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 170--187. Google ScholarDigital Library
James A. Russell. 1980. A circumplex model of affect. Journal of Personality and Social Psychology 39, 6 (1980), 1161.Google ScholarCross Ref
Ulrich Schimmack. 2003. Affect Measurement in Experience Sampling Research. Journal of Happiness Studies 4, 1 (2003), 79--106.Google ScholarCross Ref
Christina Schmidt, Fabienne Collette, Christian Cajochen, and Philippe Peigneux. 2007. A time to think: Circadian rhythms in human cognition. Cognitive Neuropsychology 24, 7 (2007), 755--789. PMID: 18066734.Google ScholarCross Ref
Christie Napa Scollon, Chu-Kim Prieto, and Ed Diener. 2009. Experience Sampling: Promises and Pitfalls, Strength and Weaknesses. Springer Netherlands, Dordrecht, 157--180.Google Scholar
S. Shiffman, A. A. Stone, and M. R. Hufford. 2008. Ecological Momentary Assessment. Annual Review of Clinical Psychology 4 (2008), 1--32.Google ScholarCross Ref
A. A. Stone, R. C. Kessler, and J. A. Haythornthwaite. 1991. Measuring daily events and experiences: decisions for the researcher. Journal of Personality 59, 3 (1991), 575--607.Google ScholarCross Ref
Khai N. Truong, Thariq Shihipar, and Daniel J. Wigdor. 2014. Slide to X: Unlocking the Potential of Smartphone Unlocking. In Proceedings of the 32Nd Annual ACM Conference on Human Factors in Computing Systems (CHI '14). ACM, New York, NY, USA, 3635--3644. Google ScholarDigital Library
Nash Unsworth, Richard P. Heitz, Josef C. Schrock, and Randall W. Engle. 2005. An automated version of the operation span task. Behavior Research Methods 37, 3 (2005), 498--505.Google ScholarCross Ref
Aku Visuri, Niels van Berkel, Chu Luo, Jorge Goncalves, Denzil Ferreira, and Vassilis Kostakos. 2017. Challenges of quantified-self: encouraging self-reported data logging during recurrent smartphone usage. In Proceedings of the 31st British Computer Society Human Computer Interaction Conference. Google ScholarDigital Library
Aku Visuri, Niels van Berkel, Chu Luo, Jorge Goncalves, Denzil Ferreira, and Vassilis Kostakos. 2017. Predicting Interruptibility for Manual Data Collection: A Cluster-based User Model. In Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI '17). ACM, New York, NY, USA, Article 12, 14 pages. Google ScholarDigital Library
R. West, K. J. Murphy, M. L. Armilio, F. I. Craik, and D. T. Stuss. 2002. Effects of time of day on age differences in working memory. Journal of Gerontology 57, 1 (2002), 3--10.Google ScholarCross Ref
Ladd Wheeler and Harry T. Reis. 1991. Self-Recording of Everyday Life Events: Origins, Types, and Uses. Journal of Personality 59, 3 (1991), 339--354.Google ScholarCross Ref
David L. Woods, Mark M. Kishiyama, E. William Yund, Timothy J. Herron, Ben Edwards, Oren Poliva, Robert F. Hink, and Bruce Reed. 2011. Improving digit span assessment of short-term verbal memory. Journal of Clinical and Experimental Neuropsychology 33, 1 (2011), 101--111.Google ScholarCross Ref
J. C. Cassandra Wright, M. Paul Dietze, A. Paul Agius, Emmanuel Kuntsche, Robin Room, Michael Livingston, Margaret Hellard, and S. C. Megan Lim. 2017. An Ecological Momentary Intervention to Reduce Alcohol Consumption in Young Adults Delivered During Drinking Events: Protocol for a Pilot Randomized Controlled Trial. JMIR Research Protocols 6, 5 (2017), e95.Google ScholarCross Ref
Yulong Yang, Gradeigh D. Clark, Janne Lindqvist, and Antti Oulasvirta. 2016. Free-Form Gesture Authentication in the Wild. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16). ACM, New York, NY, USA, 3722--3735. Google ScholarDigital Library
Zhen Yue, Eden Litt, Carrie J. Cai, Jeff Stern, Kathy K. Baxter, Zhiwei Guan, Nikhil Sharma, and Guangqiang (George) Zhang. 2014. Photographing Information Needs: The Role of Photos in Experience Sampling Method-style Research. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '14). ACM, New York, NY, USA, 1545--1554. Google ScholarDigital Library

Index Terms

Context-Informed Scheduling and Analysis: Improving Accuracy of Mobile Self-Reports
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. Empirical studies in HCI
  2. Ubiquitous and mobile computing
    1. Ubiquitous and mobile computing design and evaluation methods

Recommendations

Gamification of Mobile Experience Sampling Improves Data Quality and Quantity

The Experience Sampling Method is used to capture high-quality in situ data from study participants. This method has become popular in studies involving smartphones, where it is often adapted to motivate participation through the use of gamification ...
Read More
Effect of experience sampling schedules on response rate and recall accuracy of objective self-reports
Research highlights
- We investigate the effect of random, interval, and smartphone-unlock based questionnaires
Abstract
The Experience Sampling Method is widely used to collect human labelled data in the wild. Using this methodology, study participants repeatedly answer a set of questions, constructing a rich overview of the studied phenomena. One of ...
Read More
A large-scale study of daily information needs captured in situ

The goal of this work is to provide a fundamental understanding of the daily information needs of people through a large-scale, in-depth, quantitative investigation. To this end, we have conducted one of the most comprehensive studies of information ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems
May 2019
9077 pages
ISBN:9781450359702
DOI:10.1145/3290605
General Chairs:
Stephen Brewster
University of Glasgow, Scotland, UK
,
Geraldine Fitzpatrick
TU Wien, Austria
,
Program Chairs:
Anna Cox
University College London, UK
,
Vassilis Kostakos
University of Melbourne, Australia
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 May 2019
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
cognition
context
data quality
ecological momentary assessment
ema
esm
experience sampling method
questionnaires
smartphones
working memory
Qualifiers
- research-article
Conference

Acceptance Rates
CHI '19 Paper Acceptance Rate703of2,958submissions,24%Overall Acceptance Rate6,199of26,314submissions,24%
More
Upcoming Conference
CHI '24

Sponsor:

sigchi

CHI Conference on Human Factors in Computing Systems

May 11 - 16, 2024

Honolulu , HI , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 17
  Total Citations
  View Citations
- 653
  Total Downloads
- Downloads (Last 12 months)79
- Downloads (Last 6 weeks)9
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Context-Informed Scheduling and Analysis: Improving Accuracy of Mobile Self-Reports

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Gamification of Mobile Experience Sampling Improves Data Quality and Quantity

Effect of experience sampling schedules on response rate and recall accuracy of objective self-reports

A large-scale study of daily information needs captured in situ