DOI: 10.1145/3595916.3626419
research-article

AniCropify: Image Matting for Anime-Style Illustration

Published: 01 January 2024

ABSTRACT

Deep learning-based image matting methods have emerged in recent years. However, existing methods cannot produce precise mattes for anime-style illustrations because their network parameters are trained primarily on photo-realistic images. In this paper, we introduce a new anime image dataset, Chara-1M, designed for matting. In addition, we propose AniCropify, a new matting method for anime character images. Exploiting the commonalities of representation between anime images and photo-realistic images, AniCropify first converts an anime image into a photo-realistic image. From the converted image, a trimap is generated to identify the human regions. Using this trimap in the matting process yields precise alpha masks for anime images. In our experiments, a quality evaluation of the matting results rated the proposed method highest among the state-of-the-art techniques compared.
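
The abstract does not say how the trimap is derived or which matting network consumes it. As a rough illustration of the trimap idea only, the sketch below assumes a binary character mask is already available and marks a band around its boundary as "unknown" by comparing each pixel with its neighbours (an erosion/dilation rule that is a common convention, not necessarily the authors' procedure). The names `make_trimap`, `composite`, the label values, and the `radius` parameter are hypothetical.

```python
# Hypothetical sketch: the trimap rule below (erode/dilate a binary mask) is a
# common convention and an assumption here, not the procedure from the paper.

FG, UNKNOWN, BG = 255, 128, 0  # conventional trimap labels (assumed)


def _window(mask, y, x, radius):
    """Yield mask values in the square window around (y, x), clipped to the image."""
    h, w = len(mask), len(mask[0])
    for yy in range(max(0, y - radius), min(h, y + radius + 1)):
        for xx in range(max(0, x - radius), min(w, x + radius + 1)):
            yield mask[yy][xx]


def make_trimap(mask, radius=1):
    """Build a trimap from a binary mask (1 = character, 0 = background).

    A pixel is definite foreground if every value in its window is 1
    (erosion), definite background if every value is 0 (complement of
    dilation), and unknown otherwise.
    """
    h, w = len(mask), len(mask[0])
    trimap = [[UNKNOWN] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            values = list(_window(mask, y, x, radius))
            if all(v == 1 for v in values):
                trimap[y][x] = FG
            elif all(v == 0 for v in values):
                trimap[y][x] = BG
    return trimap


def composite(alpha, fg, bg):
    """Blend one pixel with the matting equation I = alpha * F + (1 - alpha) * B."""
    return alpha * fg + (1.0 - alpha) * bg


if __name__ == "__main__":
    row_mask = [[0, 0, 0, 1, 1, 1, 1]]
    print(make_trimap(row_mask, radius=1))
    print(composite(0.5, 200.0, 100.0))
```

A matting model would then estimate alpha only inside the unknown band, which is why the quality of the trimap matters for the final alpha mask.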

      • Published in

        MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in Asia
        December 2023
        745 pages
        ISBN: 9798400702051
        DOI: 10.1145/3595916

        Copyright © 2023 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        Overall Acceptance Rate: 59 of 204 submissions, 29%