An LSTM Based System for Prediction of Human Activities with Durations

Published: 08 January 2018

Abstract

Human activity prediction is an interesting problem with a wide variety of applications, such as intelligent virtual assistants and contextual marketing. One formulation of this problem is to jointly predict human activities (e.g., eating, commuting) and their associated durations. A deep learning system is proposed here for this problem. Given a sequence of past activities and durations, the system estimates the probabilities of future activities and their durations. Two distinct Long Short-Term Memory (LSTM) networks are developed that cater to different assumptions about the data and achieve different modeling complexities and prediction accuracies. The networks are trained and tested on two real-world datasets, one publicly available and the other collected in a field experiment. Modeling on the segment-level public dataset mitigates the cold-start problem. Experiments indicate that LSTM networks perform significantly better than traditional approaches based on sequence mining or hidden Markov modeling. The ability of LSTM networks to detect long-term correlations in activity data is also demonstrated. The trained models are each less than 500 KB in size and can be deployed to run in real time on a mobile device without any dependency on the cloud. This can help applications such as mobile personal assistants by providing predictive context.
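To make the input and output of such a system concrete, the following is a minimal sketch in plain Python of one LSTM layer that reads a history of (activity, duration) pairs and returns a probability distribution over the next activity and over its duration. It is not the authors' architecture: the activity labels, the three duration bins, the hidden size, the use of two independent softmax heads, and the class name `TinyLSTM` are all assumptions made for illustration, and the weights are random rather than trained, so the resulting probabilities carry no meaning beyond showing the data flow.

```python
import math
import random

# Hypothetical label sets; the abstract does not list the real categories.
ACTIVITIES = ["sleeping", "eating", "commuting", "working"]
DURATION_BINS = ["<15min", "15-60min", ">60min"]
N_IN = len(ACTIVITIES) + len(DURATION_BINS)


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(scores):
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vec):
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def encode(activity, duration_bin):
    """One-hot activity concatenated with one-hot duration bin."""
    x = [0.0] * N_IN
    x[ACTIVITIES.index(activity)] = 1.0
    x[len(ACTIVITIES) + DURATION_BINS.index(duration_bin)] = 1.0
    return x


class TinyLSTM:
    """Single LSTM layer with two softmax heads; weights are random, not trained."""

    def __init__(self, hidden=8, seed=0):
        rng = random.Random(seed)

        def rand(rows, cols):
            return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

        self.hidden = hidden
        gates = ("i", "f", "o", "g")  # input, forget, output, candidate
        # One weight matrix per gate, applied to [input ; previous hidden state].
        self.w = {gate: rand(hidden, N_IN + hidden) for gate in gates}
        self.b = {gate: [0.0] * hidden for gate in gates}
        # Assumed output heads: one for the activity, one for the duration bin.
        self.w_act = rand(len(ACTIVITIES), hidden)
        self.w_dur = rand(len(DURATION_BINS), hidden)

    def step(self, x, h, c):
        z = x + h  # list concatenation of input and previous hidden state
        pre = {k: [a + b for a, b in zip(matvec(self.w[k], z), self.b[k])] for k in self.w}
        i = [sigmoid(v) for v in pre["i"]]
        f = [sigmoid(v) for v in pre["f"]]
        o = [sigmoid(v) for v in pre["o"]]
        g = [math.tanh(v) for v in pre["g"]]
        c_new = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
        h_new = [ov * math.tanh(cv) for ov, cv in zip(o, c_new)]
        return h_new, c_new

    def predict_next(self, history):
        """Run the whole history through the cell, then read both heads."""
        h = [0.0] * self.hidden
        c = [0.0] * self.hidden
        for activity, duration_bin in history:
            h, c = self.step(encode(activity, duration_bin), h, c)
        p_act = softmax(matvec(self.w_act, h))
        p_dur = softmax(matvec(self.w_dur, h))
        return dict(zip(ACTIVITIES, p_act)), dict(zip(DURATION_BINS, p_dur))


history = [("sleeping", ">60min"), ("eating", "15-60min"), ("commuting", "15-60min")]
next_activity, next_duration = TinyLSTM().predict_next(history)
print(next_activity)
print(next_duration)
```

Binning durations turns duration prediction into classification, which is one simple way to obtain "probabilities for durations"; a trained system could equally model the activity and duration jointly or treat duration as a continuous quantity.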

Supplementary Material

krishna (krishna.zip)
Supplemental movie, appendix, image and software files for "An LSTM Based System for Prediction of Human Activities with Durations"

Published In

Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Volume 1, Issue 4
December 2017
1298 pages
EISSN:2474-9567
DOI:10.1145/3178157
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 January 2018
Accepted: 01 October 2017
Revised: 01 August 2017
Received: 01 May 2017
Published in IMWUT Volume 1, Issue 4

Author Tags

  1. Daily Routine Generation
  2. Human Activity Prediction
  3. Long-Short Term Memory Network

Qualifiers

  • Research-article
  • Research
  • Refereed
