skip to main content
10.1145/3448823.3448835acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvispConference Proceedingsconference-collections
research-article

Model-Based Interpolation for Continuous Human Silhouette Images by Height-Constraint Assumption

Published: 04 March 2021 Publication History

Abstract

Boundary information is crucial for defining the shape of an object or a person as a first approximation, and is used extensively for quantitative analysis in various fields such as fluids, cell biology, medical images, and human video analysis. In particular, the analysis of human silhouettes is vital in numerous applications. However, when videos are captured at frame rates that are lower than the speeds of the target human movements, the analysis accuracy of the corresponding videos decreases owing to the lack of in-between boundary information of the pair of successive boundaries. A reliable boundary-tracking technique for interpolating in-between human silhouettes is necessary to compensate for such an information loss; however, a suitable method for human video analysis has not yet been developed. To this end, we propose a novel model-based boundary-tracking method based on the height-constraint assumption, in which the positions of the body points are assumed to change insignificantly along the height (or vertical) direction during walking. In the case of videos of humans walking and jogging, the proposed method can perform frame interpolation with substantially higher accuracy than that of existing methods: e.g. the normalized square error of the proposed method is about 1.4~1.9 fold smaller than those of existing methods in the walking case of 7.5 fps input frame rate. Moreover, it is confirmed that, although the height-constraint assumption is less effective at diagonal-view angles than at lateral-view angles, the proposed method with the additional projective transform can perform more accurate interpolation than that with only the height-constraint assumption, even in the case of diagonal-view angles. Furthermore, it is preliminarily revealed that preprocessing of the proposed method leads to improved accuracy of forensic gait-based human identification and visual hull-based 3D reconstruction under low frame rate conditions. It is expected that the proposed method will also be effective for application in other fields, such as action recognition, kinesiology, and sports video analysis.

References

[1]
D.P. Mukherjee and N. Ray (2012). Contour Interpolation using Level-Set Analysis. International Journal of Image and Graphics, 12(1), 1250004.
[2]
J. Yang and F. Stern (2007). A Sharp Interface Method for Two-Phase Flows Interacting with Moving Bodies. In Proceedings of 18th AIAA Computational Fluid Dynamics Conference.
[3]
M.K. Driscol, C. McCann, R. Kopace, T. Homan, J.T. Fourkas, C. Parent, and W. Losert (2012). Cell Shape Dynamics: From Waves to Migration. PLoS Computational Biology, 8(3), e1002392.
[4]
C. Wang, S. Chowdhury, M. Driscoll, C.A. Parent, S.K. Gupta, and W. Losert (2014). The interplay of cell--cell and cell--substrate adhesion in collective cell migration. The Journal of the Royal Society Interface, 11(100), 20140684.
[5]
L. Zhukov, K. Museth, D.E. Breen, R. Whitakery, and A.H. Barr (2003). Level Set Modeling and Segmentation of DT-MRI Brain Data. Journal of Electronic Imaging, 12(1), 125--133.
[6]
J. Hu, H. Yan, and M. Sakalli (1999). Locating head and face boundaries for head--shoulder images. Pattern Recognition, 32(8), 1317--1333.
[7]
W. Wu, C. Qian, S. Yang, Q. Wang, Y. Cai, and Q. Zhou (2018). Look at Boundary: A Boundary-Aware Face Alignment Algorithm. In Proceedings of 2018 IEEE/CVF Computer Vision and Pattern Recognition (CVPR 2018), 2129--2138.
[8]
J. Han and B. Bhanu (2006). Individual Recognition Using Gait Energy Image. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(2), 316--322.
[9]
Y. Makihara, R. Sagawa, Y. Mukaigawa, T. Echigo, and Y. Yagi (2006). Gait Recognition Using a View Transformation Model in the Frequency Domain, Computer Vision -- ECCV 2006. Lecture Notes in Computer Science, vol. 3953. Springer, Berlin, Heidelberg, Germany.
[10]
H. Iwama, D. Muramatsu, Y. Makihara, and Y. Yagi (2013). Gait Verification System for Criminal Investigation. IPSJ Transactions on Computer Vision and Applications, 5, 163--175.
[11]
Y. Makihara, H. Mannami, A. Tsuji, M.A. Hossain, K. Sugiura, A. Mori, and Y. Yagi (2012). The OU-ISIR Gait Database Comprising the Treadmill Dataset. IPSJ Transactions on Computer Vision and Applications, 4, 53--62.
[12]
Y. Iwashita, R. Baba, K. Ogawara, and R. Kurazume (2010). Person Identification from Spatio-temporal 3D Gait. In Proceedings of 2010 International Conference on Emerging Security Technologies, 30--35.
[13]
D. Imoto, K. Kurosawa, K. Tsuchiya, K. Kuroki, M. Hirabayashi, N. Akiba, H. Kakuda, K. Tanabe, and Y. Hawai (2018). A Novel Human Identification Method by Gait using Dynamics of Feature Points and Local Shape Features. In Proceedings of 2018 7th European Workshop on Visual Information Processing (EUVIP), 1--6.
[14]
G. Rogez, I. Rius, J. Martínez-del-Rincón, and C. Orrite (2007). Exploiting Spatio-temporal Constraints for Robust 2D Pose Tracking. Lecture Notes in Computer Science, vol. 4814. Springer, Heidelberg, Berlin, Germany.
[15]
T.-J. Chin, L. Wang, K. Schindler, and D. Suter (2007). Extrapolating Learned Manifolds for Human Activity Recognition. In Proceedings of 2007 IEEE International Conference on Image Processing, 381--384.
[16]
C. Theobalt, J. Carranza, and M.A. Magnor (2003). Enhancing silhouette-based human motion capture with 3D motion fields. In Proceedings of 11th Pacific Conference on Computer Graphics and Applications, 185--193.
[17]
R. Raman, P.K. Sa, B. Majhi, and S. Bakshi (2016). Direction Estimation for Pedestrian Monitoring System in Smart Cities: An HMM Based Approach. IEEE Access, 4, 5788--5808.
[18]
S. Maity, D. Bhattacharjee, and A. Chakrabarti (2017). A Novel Approach for Human Action Recognition from Silhouette Images. IETE Journal of Research, 63(2), 160--171.
[19]
N. Takemura, Y. Makihara, D. Muramatsu, T. Echigo, and Y. Yagi (2018). Multi-view large population gait dataset and its performance evaluation for cross-view gait recognition. IPSJ Transactions on Computer Vision and Applications, 10, 4.
[20]
H. Chao, Y. He, J. Zhang, and J. Feng (2019). GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition. In Proceedings of The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19).
[21]
S. Yu, H. Chen, E.B.G. Reyes, and N. Poh (2017). GaitGAN: Invariant Gait Feature Extraction Using Generative Adversarial Networks. In Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 532--539.
[22]
M.Z. Uddin, D. Muramatsu, N. Takemura, M.A.R. Ahad, and Y. Yagi (2019). Spatio-temporal silhouette sequence reconstruction for gait recognition against occlusion. IPSJ Transactions on Computer Vision and Applications, 11, 9.
[23]
Y. Makihara and Y. Yagi (2010). Earth Mover's Morphing: Topology-Free Shape Morphing Using Cluster-Based EMD Flows, Computer Vision -- ACCV 2010. Lecture Notes in Computer Science, vol. 6495. Springer, Heidelberg, Berlin, Germany.
[24]
N. Akae, Y. Makihara, and Y. Yagi (2011). Gait recognition using periodic temporal super resolution for low frame-rate videos. In Proceedings of 2011 International Joint Conference on Biometrics (IJCB), 1--7.
[25]
A.J. London (2019). Artificial Intelligence and Black-Box Medical Decisions: Accuracy versus Explainability. The Hastings Center Report, 49(1), 15--21.
[26]
P.W. Nutter (2019). Machine Learning Evidence: Admissibility and Weight. Journal of Constitutional Law, 21(3), 919--958.
[27]
K. He, G. Gkiozari, P. Dollár, and R. Girshick (2017). Mask R-CNN. In Proceedings of 2017 IEEE International Conference on Computer Vision (ICCV), 2980--2988.
[28]
K.S. Ray and S. Chakraborty (2019). Object detection by spatio-temporal analysis and tracking of the detected objects in a video with variable background. Journal of Visual Communication and Image Representation, 58, 662--674.
[29]
R. Kurazume (2008). Level Set and Applications, Computer Vision and Image Media (2nd ed.). Advanced Communication Media, Sinjuku, Tokyo, Japan (in Japanese).
[30]
W. Matusik, C. Buehler, R. Raskar, S.J. Gortler, and L. McMillan (2000). Image-based visual hulls. In Proceedings of the 27th annual conference on Computer graphics and interactive techniques (SIGGRAPH '00), 369--374.
[31]
S.N. Sinha and M. Pollefeys (2004). Visual-hull reconstruction from uncalibrated and unsynchronized video streams. In Proceedings of 2nd International Symposium on 3D Data Processing, Visualization and Transmission (3DPVT 2004), 349--356.
[32]
L. Quan and Z. Lan (1999). Linear N-point camera pose determination. IEEE Transactions on Pattern Analysis and Machine Intelligence, 21(8), 774--780.
[33]
K. Forbes (2007). Calibration, Recognition, and Shape from Silhouettes of Stones. Ph.D Thesis, University of Cape Town.
[34]
J.-H. Yoo and M.S. Nixon (2011). Automated Markerless Analysis of Human Gait Motion for Recognition and Classification, ETRI Journal, 33(2), 259--266.
[35]
J. Long, E. Shelhamer, and T. Darrell (2015). Fully convolutional networks for semantic segmentation. In Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[36]
Y. Guo, Y. Liu, T. Georgiou, and M.S. Lew (2018). A review of semantic segmentation using deep neural networks, International Journal of Multimedia Information Retrieval, 7, 87--93.

Cited By

View all
  • (2024)Running Gait Biometrics at a Distance: A Novel Silhouette/3D Running Gait Dataset in Forensic Scenarios2024 International Conference of the Biometrics Special Interest Group (BIOSIG)10.1109/BIOSIG61931.2024.10786737(1-7)Online publication date: 25-Sep-2024
  • (2023)Pre‐set estimation‐based in‐silico silhouette‐based methodology for improving the robustness to viewing direction difference for assisting forensic gait analysisJournal of Forensic Sciences10.1111/1556-4029.1521468:2(470-487)Online publication date: 10-Feb-2023
  • (2022)Biometrics: Going 3DSensors10.3390/s2217636422:17(6364)Online publication date: 24-Aug-2022

Index Terms

  1. Model-Based Interpolation for Continuous Human Silhouette Images by Height-Constraint Assumption

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICVISP 2020: Proceedings of the 2020 4th International Conference on Vision, Image and Signal Processing
    December 2020
    366 pages
    ISBN:9781450389532
    DOI:10.1145/3448823
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 March 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Biometrics
    2. Pedestrian Video Analysis
    3. Silhouette Interpolation

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    Conference

    ICVISP 2020

    Acceptance Rates

    ICVISP 2020 Paper Acceptance Rate 60 of 147 submissions, 41%;
    Overall Acceptance Rate 186 of 424 submissions, 44%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 07 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Running Gait Biometrics at a Distance: A Novel Silhouette/3D Running Gait Dataset in Forensic Scenarios2024 International Conference of the Biometrics Special Interest Group (BIOSIG)10.1109/BIOSIG61931.2024.10786737(1-7)Online publication date: 25-Sep-2024
    • (2023)Pre‐set estimation‐based in‐silico silhouette‐based methodology for improving the robustness to viewing direction difference for assisting forensic gait analysisJournal of Forensic Sciences10.1111/1556-4029.1521468:2(470-487)Online publication date: 10-Feb-2023
    • (2022)Biometrics: Going 3DSensors10.3390/s2217636422:17(6364)Online publication date: 24-Aug-2022

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media