Abstract
Evaluation of dressing activities is essential in the assessment of the performance of patients with psycho-motor impairments. However, the current practice of monitoring dressing activity (performed by the patients in front of the therapist) has a number of disadvantages when considering the personal nature of dressing activity as well as inconsistencies between the recorded performance of the activity and performance of the same activity carried out in the patients’ natural environment, such as their home. As such, a system that can evaluate dressing activities automatically and objectively would alleviate some of these issues. However, a number of challenges arise, including difficulties in correctly identifying garments, their position in the body (partially of fully worn) and their position in relation to other garments. To address these challenges, we have developed a novel method based on visual grammars to automatically detect dressing failures and explain the type of failure. Our method is based on the analysis of image sequences of dressing activities and only requires availability of a video recording device. The analysis relies on a novel technique which we call temporal–relational visual grammar; it can reliably recognize temporal dressing failures, while also detecting spatial and relational failures. Our method achieves 91% precision in detecting dressing failures performed by 11 subjects. We explain these results and discuss the challenges encountered during this work.
Similar content being viewed by others
Notes
The grammar does not contain rules describing failures, meaning that if a configuration of a dressing activity cannot be explained by the grammar it is marked as a failure.
References
Afsar P, Cortez P, Santos H (2015) Automatic visual detection of human behavior: a review from 2000 to 2014. Expert Syst Appl 42(20):6936–6956. https://doi.org/10.1016/j.eswa.2015.05.023
Allen JF (1983) Maintaining knowledge about temporal intervals. Commun ACM 26(11):832–843. https://doi.org/10.1145/182.358434
Bahle G, Gruenerbl A, Lukowicz P, Bignotti E, Zeni M, Giunchiglia F (2014) Recognizing hospital care activities with a coat pocket worn smartphone. In: Mobile computing, applications and services (MobiCASE), 2014 6th international conference on, IEEE, pp 175–181
Bay H, Ess A, Tuytelaars T, Gool LJV (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110(3):346–359. https://doi.org/10.1016/j.cviu.2007.09.014
Breiman L (2001) Random forests. Mach Learn 45(1):5–32. https://doi.org/10.1023/A:1010933404324
Brown C, Moore WP, Hemman D, Yunek A (1996) Influence of instrumental activities of daily living assessment method on judgments of independence. Am J Occup Ther 50(3):202–206
Chen C, Zhang D, Sun L, Hariz M, Yuan Y (2012) Does location help daily activity recognition? In: Donnelly M, Paggetti C, Nugent C, Mokhtari M (eds) Impact analysis of solutions for chronic disease prevention and management. ICOST 2012. Lecture notes in computer science, vol 7251. Springer, Berlin, pp 83–90
Chernbumroong S, Cang S, Atkins A, Yu H (2013) Elderly activities recognition and classification for applications in assisted living. Expert Syst Appl 40(5):1662–1674
Chomsky N (2002) Syntactic structures. A Mouton Classic, Mouton de Gruyter, Berlin. http://books.google.com/books?id=SNeHkMXHcd8C
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297. https://doi.org/10.1007/BF00994018
Costagliola G, Deufemia V, Ferrucci F, Gravino C (2002) Using extended positional grammars to develop visual modeling languages. In: Proceedings of the 14th international conference on software engineering and knowledge engineering, SEKE 2002, Ischia, Italy, July 15–19, 2002, ACM, pp 201–208. https://doi.org/10.1145/568760.568795
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR 2005), 20–26 June 2005, San Diego, CA, USA, IEEE Computer Society, pp 886–893. https://doi.org/10.1109/CVPR.2005.177
Edmans J, Lincoln N (1990) The relation between perceptual deficits after stroke and independence in activities of daily living. Br J Occup Ther 53(4):139–142
Edmans J, Towle D, Lincoln NB (1991) The recovery of perceptual problems after stroke and the impact on daily life. Clin Rehabil 5(4):301–309
Ferrucci F, Pacini G, Satta G, Sessa MI, Tortora G, Tucci M, Vitiello G (1996) Symbol-relation grammars: a formalism for graphical languages. Inf Comput 131(1):1–46. https://doi.org/10.1006/inco.1996.0090
Feyereisen P (1999) Disorders of everyday actions in subjects suffering from senile dementia of Alzheimer’s type: an analysis of dressing performance. Neuropsychol Rehabil 9(2):169–188
Foncubierta-Rodríguez A, Müller H, Depeursinge A (2017) From visual words to a visual grammar: using language modelling for image classification. CoRR arXiv:abs/1703.05571
Friedman A, Ron S (2017) Unlocking the power of visual grammar theory: analyzing social media political advertising messages in the 2016 US election. J Vis Lit 36(2):90–103. https://doi.org/10.1080/1051144X.2017.1379758
Girshick RB, Felzenszwalb PF, McAllester DA (2011) Object detection with grammar models. In: Zemel RS, Bartlett PL, Pereira FCN, Weinberger KQ Shawe-Taylor J (eds) NIPS, pp 442–450
Gottfried B (2015) The systematic design of visual languages applied to logical reasoning. J Vis Lang Comput 28:212–225. https://doi.org/10.1016/j.jvlc.2015.02.001
Haralick RM, Shanmugam KS, Dinstein I (1973) Textural features for image classification. IEEE Trans Syst Man Cybern 3(6):610–621. https://doi.org/10.1109/TSMC.1973.4309314
Kong J, Zhang K, Zeng X (2006) Spatial graph grammars for graphical user interfaces. ACM Trans Comput-Hum Interact 13(2):268–307. https://doi.org/10.1145/1165734.1165739
Lakin F (1987) Visual grammars for visual languages. In: Forbus KD, Shrobe HE (eds) Proceedings of the 6th national conference on artificial intelligence. Seattle, WA, July 1987, Morgan Kaufmann, pp 683–688. http://www.aaai.org/Library/AAAI/1987/aaai87-122.php
Leborg C (2006) Visual grammar. Princeton Architectural Press, New York
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Maio CD, Fenza G, Gallo M, Loia V, Parente M (2017a) Time-aware adaptive tweets ranking through deep learning. Future Gener Comput Syst. https://doi.org/10.1016/j.future.2017.07.039
Maio CD, Fenza G, Loia V, Orciuoli F (2017b) Unfolding social content evolution along time and semantics. Future Gener Comput Syst 66:146–159. https://doi.org/10.1016/j.future.2016.05.039
Marriott K, Meyer B (1996) Towards a hierarchy of visual languages. In: Proceedings of the 1996 IEEE symposium on visual languages, Boulder, Colorado, USA, September 3–6, 1996, IEEE Computer Society, pp 196–203. https://doi.org/10.1109/VL.1996.545288
Matic A, Mehta P, Rehg JM, Osmani V, Mayora O (2012) Monitoring dressing activity failures through RFID and video. J Methods Inf Med 51:45–54
Matic A, Mehta P, Rehg JM, Osmani V, Mayora O (2010) Aid-me: Automatic identification of dressing failures through monitoring of patients and activity evaluation. In: Pervasive computing technologies for healthcare (PervasiveHealth), 2010 4th international conference on, IEEE, pp 1–8
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630. https://doi.org/10.1109/TPAMI.2005.188
Mjolsness E (1991) Visual grammars and their neural nets. In: Moody JE, Hanson SJ, Lippmann R (eds) Advances in neural information processing systems, vol 4 (NIPS conference, Denver, Colorado, USA, December 2–5, 1991), Morgan Kaufmann, pp 428–435. http://papers.nips.cc/paper/499-visual-grammars-and-their-neural-nets
Namazi KH, Johnson BD (1992) Dressing independently: a closet modification model for Alzheimer’s disease patients. Am J Alzheimers Dis Other Dement 7(1):22–28
Ojala T, Pietikäinen M, Harwood D (1996) A comparative study of texture measures with classification based on featured distributions. Pattern Recognit 29(1):51–59. https://doi.org/10.1016/0031-3203(95)00067-4
Osmani V (2015) Smartphones in mental health: detecting depressive and manic episodes. IEEE Pervasive Comput 14(3):10–13
Qi S, Huang S, Wei P, Zhu S (2017) Predicting human activities using stochastic grammar. In: IEEE international conference on computer vision, ICCV 2017, Venice, Italy, October 22–29, 2017, IEEE Computer Society, pp 1173–1181. https://doi.org/10.1109/ICCV.2017.132
Rublee E, Rabaud V, Konolige K, Bradski GR (2011) ORB: an efficient alternative to SIFT or SURF. In: Metaxas DN, Quan L, Sanfeliu A, Gool LJV (eds) IEEE International conference on computer vision, ICCV 2011, Barcelona, Spain, November 6–13, 2011, IEEE, pp 2564–2571. https://doi.org/10.1109/ICCV.2011.6126544
Sunderland A, Walker CM, Walker MF (2006) Action errors and dressing disability after stroke: an ecological approach to neuropsychological assessment and intervention. Neuropsycholl Rehabil 16(6):666–683
Vedaldi A, Fulkerson B (2010) Vlfeat: an open and portable library of computer vision algorithms. In: Proceedings of the 18th ACM international conference on multimedia, ACM, New York, NY, USA, MM’10, pp 1469–1472. https://doi.org/10.1145/1873951.1874249
Walker M, Lincoln N (1991) Factors influencing dressing performance after stroke. J Neurol Neurosurg Psychiatry 54(8):699–701
Walker C, Walker M (2001) Dressing ability after stroke: a review of the literature. Br J Occup Ther 64(9):449–454
Walker M, Drummond A, Lincoln N (1996) Evaluation of dressing practice for stroke patients after discharge from hospital: a crossover design study. Clin Rehabil 10(1):23–31
Walker CM, Walker MF, Sunderland A (2003) Dressing after a stroke: a survey of current occupational therapy practice. Br J Occup Ther 66(6):263–268
Wittenburg K, Weitzman L (1996) Relational grammars: theory and practice in a visual language interface for process modeling. In: Workshop on theory of visual languages, Springer, Berlin pp 27–29
Wu YN, Si Z, Gong H, Zhu SC (2010) Learning active basis model for object detection and recognition. Int J Comput Vis 90(2):198–235
Zhu SC, Mumford D (2006) A stochastic grammar of images. Found Trends Comput Graph Vis 2(4):259–362. https://doi.org/10.1561/0600000018
Zhu L, Chen Y, Yuille AL (2009) Unsupervised learning of probabilistic grammar-markov models for object categories. Trans Pattern Anal Mach Intell 31(1):114–128. https://doi.org/10.1109/TPAMI.2008.67
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ruiz, E., Osmani, V., Sucar, L.E. et al. Detecting dressing failures using temporal–relational visual grammars. J Ambient Intell Human Comput 10, 2757–2770 (2019). https://doi.org/10.1007/s12652-018-0975-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-018-0975-0