skip to main content
10.1145/3494322.3494339acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiotConference Proceedingsconference-collections
research-article

Privacy Preserving Workflow Detection for Manufacturing Using Neural Networks based Object Detection

Published:08 March 2022Publication History

ABSTRACT

In this paper, we introduce a detection system for workflow in a manufacturing line using depth images to preserve the privacy of workers. A depth camera sensor is mounted on a ceiling with a top-down angle and pointed to workers below completing a workflow. The system was deployed in a real life industrial process where workers had to work on a metal sheet by completing a sequence of bending steps. In this study, we experimented the effectiveness of using two classification approaches in order to identify the current workstep that workers are doing. The first approach was workflow detection by human activity recognition along with detecting related objects (a tool table, a computer screen and a machine) in the scene using only a depth camera sensor. Because of the similarity between the human body shape during different activities, the results were low and precision was 63.03%. The second approach was workflow detection by object classification and human localisation along with integrating depth camera sensor data with other sensor devices and results were better than the first approach with precision 85.42%. Within this approach, two classification models were created only using data from the Realsense sensor and two more were created including data from the bending machine. Each model has its own benefits in terms of precision, accuracy and performance, and we explain them along with the challenges the system had, in the discussion section. The results are also investigated in details and we present the future plans for the proposed detection system and for the sensors connected.

References

  1. Abdelrahman Ahmad, Michael Haslgrübler, Alois Ferscha, Birgit Ettinger, and Jullius Cho. 2020. Macro workstep detection for assembly manufacturing. In Proceedings of the 13th ACM International Conference on PErvasive Technologies Related to Assistive Environments. 1–6.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Annie I Antón, Julia B Earp, and Jessica D Young. 2010. How internet users’ privacy concerns have evolved since 2002. IEEE Security & Privacy 8, 1 (2010), 21–27.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Shoaib Azam, Aasim Rafique, and Moongu Jeon. 2016. Vehicle pose detection using region based convolutional neural network. In 2016 International Conference on Control, Automation and Information Sciences (ICCAIS). IEEE, 194–198.Google ScholarGoogle ScholarCross RefCross Ref
  4. Matthew Brand, Nuria Oliver, and Alex Pentland. 1997. Coupled hidden markov models for complex action recognition. In Proceedings of IEEE computer society conference on computer vision and pattern recognition. IEEE, 994–999.Google ScholarGoogle ScholarCross RefCross Ref
  5. Vincenzo Carletti, Luca Del Pizzo, Gennaro Percannella, and Mario Vento. 2017. An efficient and effective method for people detection from top-view depth cameras. In 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS). IEEE, 1–6.Google ScholarGoogle ScholarCross RefCross Ref
  6. Raymond H Chan, Chung-Wa Ho, and Mila Nikolova. 2005. Salt-and-pepper noise removal by median-type noise detectors and detail-preserving regularization. IEEE Transactions on image processing 14, 10 (2005), 1479–1485.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. PJ Cheng and SC Lin. 2000. Using neural networks to predict bending angle of sheet metal formed by laser. International Journal of Machine Tools and Manufacture 40, 8(2000), 1185–1197.Google ScholarGoogle ScholarCross RefCross Ref
  8. Tao Cheng, Jochen Teizer, Giovanni C Migliaccio, and Umberto C Gatti. 2013. Automated task-level activity analysis through fusion of real time location sensors and worker’s thoracic posture data. Automation in Construction 29 (2013), 24–39.Google ScholarGoogle ScholarCross RefCross Ref
  9. Matthias Dantone, Juergen Gall, Christian Leistner, and Luc Van Gool. 2013. Human pose estimation using body parts dependent joint regressors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3041–3048.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Marcin Eichner, Manuel Marin-Jimenez, Andrew Zisserman, and Vittorio Ferrari. 2012. 2d articulated human pose estimation and retrieval in (almost) unconstrained still images. International journal of computer vision 99, 2 (2012), 190–214.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Dumitru Erhan, Christian Szegedy, Alexander Toshev, and Dragomir Anguelov. 2014. Scalable object detection using deep neural networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2147–2154.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Pedro F Felzenszwalb, Ross B Girshick, David McAllester, and Deva Ramanan. 2009. Object detection with discriminatively trained part-based models. IEEE transactions on pattern analysis and machine intelligence 32, 9(2009), 1627–1645.Google ScholarGoogle Scholar
  13. Pedro F Felzenszwalb and Daniel P Huttenlocher. 2005. Pictorial structures for object recognition. International journal of computer vision 61, 1 (2005), 55–79.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Markus Funk, Lars Lischke, Sven Mayer, Alireza Sahami Shirazi, and Albrecht Schmidt. 2018. Teach Me How! Interactive Assembly Instructions Using Demonstration and In-Situ Projection. In Assistive Augmentation. Springer, 49–73.Google ScholarGoogle Scholar
  15. Andreas Geiger, Philip Lenz, and Raquel Urtasun. 2012. Are we ready for autonomous driving? the kitti vision benchmark suite. In 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 3354–3361.Google ScholarGoogle ScholarCross RefCross Ref
  16. Ross Girshick. 2015. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision. 1440–1448.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Ross Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 580–587.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Klaus Greff, André Brandão, Stephan Krauß, Didier Stricker, and Esteban Clua. 2012. A comparison between background subtraction algorithms using a consumer depth camera.. In VISAPP (1). 431–436.Google ScholarGoogle Scholar
  19. Zhixin Guo, Wenzhi Liao, Yifan Xiao, Peter Veelaert, and Wilfried Philips. 2019. Deep learning fusion of RGB and depth images for pedestrian detection. In 30th British Machine Vision Conference. 1–13.Google ScholarGoogle Scholar
  20. Sven Hinrichsen, Daniel Riediger, and Alexander Unrau. 2016. Assistance systems in manual assembly. In Proceedings of 6th International conference on Production Engineering and Management, 29 September 2016. 3–14.Google ScholarGoogle Scholar
  21. Philipp Hold, Selim Erol, Gehard Reisinger, and Wilfried Sihn. 2017. Planning and evaluation of digital assistance systems. Procedia Manufacturing 9(2017), 143–150.Google ScholarGoogle ScholarCross RefCross Ref
  22. Ninghang Hu, Gwenn Englebienne, and Ben Kröse. 2013. Posture recognition with a top-view camera. In 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2152–2157.Google ScholarGoogle Scholar
  23. Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, 2017. Speed/accuracy trade-offs for modern convolutional object detectors. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7310–7311.Google ScholarGoogle ScholarCross RefCross Ref
  24. Isto Huvila. 2016. Awkwardness of becoming a boundary object: Mangle and materialities of reports, documentation data, and the archaeological work. The Information Society 32, 4 (2016), 280–297.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Intel. [n.d.]. Stereo Depth – Intel® RealSense™ Depth and Tracking Cameras. Accessed: 2-10-2019.Google ScholarGoogle Scholar
  26. Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167(2015).Google ScholarGoogle Scholar
  27. Ardalan Khosrowpour, Juan Carlos Niebles, and Mani Golparvar-Fard. 2014. Vision-based workface assessment using depth images for activity analysis of interior construction operations. Automation in Construction 48 (2014), 74–87.Google ScholarGoogle ScholarCross RefCross Ref
  28. Jonathan Krause, Michael Stark, Jia Deng, and Li Fei-Fei. 2013. 3d object representations for fine-grained categorization. In Proceedings of the IEEE international conference on computer vision workshops. 554–561.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2017. ImageNet Classification with Deep Convolutional Neural Networks Mark. Commun. ACM 60, 6 (2017), 84–90.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. John Lafferty, Andrew McCallum, and Fernando CN Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. (2001).Google ScholarGoogle Scholar
  31. Oscar D Lara and Miguel A Labrador. 2012. A survey on human activity recognition using wearable sensors. IEEE communications surveys & tutorials 15, 3 (2012), 1192–1209.Google ScholarGoogle Scholar
  32. Shu-Chun Lin, An-Sheng Liu, Tang-Wei Hsu, and Li-Chen Fu. 2015. Representative body points on top-view depth sequences for daily activity recognition. In 2015 IEEE international conference on systems, man, and cybernetics. IEEE, 2968–2973.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In European conference on computer vision. Springer, 740–755.Google ScholarGoogle ScholarCross RefCross Ref
  34. An-Sheng Liu, Zi-Jun Li, Tso-Hsin Yeh, Yu-Huan Yang, and Li-Chen Fu. 2017. Partially transferred convolution neural network with cross-layer inheriting for posture recognition from top-view depth camera. In 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 4139–4143.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In European conference on computer vision. Springer, 21–37.Google ScholarGoogle ScholarCross RefCross Ref
  36. Christian Meurisch and Max Mühlhäuser. 2021. Data Protection in AI Services: A Survey. ACM Computing Surveys (CSUR) 54, 2 (2021), 1–38.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Pardis Emami Naeini, Sruti Bhagavatula, Hana Habib, Martin Degeling, Lujo Bauer, Lorrie Faith Cranor, and Norman Sadeh. 2017. Privacy expectations and preferences in an IoT world. In Thirteenth Symposium on Usable Privacy and Security (SOUPS 2017). 399–412.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Pedro O O Pinheiro, Ronan Collobert, and Piotr Dollár. 2015. Learning to segment object candidates. Advances in neural information processing systems 28 (2015), 1990–1998.Google ScholarGoogle Scholar
  39. Chinh Huu Pham, Quoc Khanh Le, and Thanh Ha Le. 2014. Human action recognition using dynamic time warping and voting algorithm. VNU Journal of Science: Computer Science and Communication Engineering 30, 3(2014).Google ScholarGoogle Scholar
  40. Michael Rauter. 2013. Reliable human detection and tracking in top-view depth images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 529–534.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2016. Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE transactions on pattern analysis and machine intelligence 39, 6(2016), 1137–1149.Google ScholarGoogle Scholar
  42. Ehsan Rezazadeh Azar, Sven Dickinson, and Brenda McCabe. 2013. Server-customer interaction tracker: computer vision–based system to estimate dirt-loading cycles. Journal of Construction Engineering and Management 139, 7(2013), 785–794.Google ScholarGoogle ScholarCross RefCross Ref
  43. Samy Sadeky, Ayoub Al-Hamadiy, Bernd Michaelisy, and Usama Sayed. 2010. Real-time automatic traffic accident recognition using hfg. In 2010 20th International Conference on Pattern Recognition. IEEE, 3348–3351.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Cristian Sminchisescu, Atul Kanaujia, and Dimitris Metaxas. 2006. Conditional models for contextual human motion recognition. Computer Vision and Image Understanding 104, 2-3 (2006), 210–220.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Christian Szegedy, Scott Reed, Dumitru Erhan, Dragomir Anguelov, and Sergey Ioffe. 2014. Scalable, high-quality object detection. arXiv preprint arXiv:1412.1441(2014).Google ScholarGoogle Scholar
  46. Christian Szegedy, Alexander Toshev, and Dumitru Erhan. 2013. Deep neural networks for object detection. Advances in neural information processing systems 26 (2013), 2553–2561.Google ScholarGoogle Scholar
  47. Kentaro Toyama, John Krumm, Barry Brumitt, and Brian Meyers. 1999. Wallflower: Principles and practice of background maintenance. In Proceedings of the seventh IEEE international conference on computer vision, Vol. 1. IEEE, 255–261.Google ScholarGoogle ScholarCross RefCross Ref
  48. Silicon UK. [n.d.]. Tales In Tech History: Microsoft Kinect. Accessed: 05-01-2018.Google ScholarGoogle Scholar
  49. Lu Xia, Chia-Chih Chen, and Jake K Aggarwal. 2012. View invariant human action recognition using histograms of 3d joints. In 2012 IEEE computer society conference on computer vision and pattern recognition workshops. IEEE, 20–27.Google ScholarGoogle Scholar
  50. Wenming Yang, Wang Lu, and Naitong Zhang. 2007. Object extraction combining image partition with motion detection. In 2007 IEEE International Conference on Image Processing, Vol. 3. IEEE, III–337.Google ScholarGoogle ScholarCross RefCross Ref
  51. Jie Yin, Qiang Yang, and Jeffrey Junfeng Pan. 2008. Sensor-based abnormal human-activity detection. IEEE Transactions on Knowledge and Data Engineering 20, 8(2008), 1082–1090.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Privacy Preserving Workflow Detection for Manufacturing Using Neural Networks based Object Detection
              Index terms have been assigned to the content through auto-classification.

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Other conferences
                IoT '21: Proceedings of the 11th International Conference on the Internet of Things
                November 2021
                233 pages
                ISBN:9781450385664
                DOI:10.1145/3494322

                Copyright © 2021 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 8 March 2022

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • research-article
                • Research
                • Refereed limited

                Acceptance Rates

                Overall Acceptance Rate28of84submissions,33%
              • Article Metrics

                • Downloads (Last 12 months)16
                • Downloads (Last 6 weeks)1

                Other Metrics

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader

              HTML Format

              View this article in HTML Format .

              View HTML Format