Skip to main content

Web Crawler for an Anonymously Processed Information Database

  • Conference paper
  • First Online:
  • 810 Accesses

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 279))

Abstract

The Japanese Act on the Protection of Personal Information (Act No. 57 of 2003) allows companies to deidentify personal data and to provide them to any third party without obtaining the data subject’s consent. Officially, the “Anonymously Processed Information” will eventually improve public convenience and aid big-data business. In this paper, we aim to clarify the landscape of anonymously processed information databases in Japan. Focusing on disclosed statements that specify the production and provisioning processes for anonymously processed information, we have developed an automated Web crawler system that detects such statements using heuristics associated with legal statements. We demonstrate that our crawler system performs very well in terms of processing time and detection accuracy. In addition, the resulting statistics may prove useful to others exploring the landscape of Japanese personal-data business structures.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    (Provision of Anonymously Processed Information) Article 37 An anonymously processed information handling business operator, when providing anonymously processed information (excluding those which it produced itself by processing personal information; hereinafter the same in this Section) to a third party, shall, pursuant to rules of the Personal Information Protection Commission, in advance disclose to the public the categories of personal information contained in anonymously processed information to be provided to a third party and state to the third party explicitly to the effect that the provided information is anonymously processed information.

References

  1. Google Custom Search API https://developers.google.com/custom-search/v1/overview?hl=ja

  2. Act on the Protection of Personal Information (Act No. 57 of 2003, ), officially translated by the Personal Information Protection Commission, Japan, 2016. (https://www.ppc.go.jp/files/pdf/Act_on_the_Protection_of_Personal_Information.pdf)

  3. Report by the Personal Information Protection Commission Secretariat, Anonymously Processed Information – Towards Balanced Promotion of Personal Data Utilization and Consumer Trust, February 2017 (available from https://www.ppc.go.jp/files/pdf/The_PPC_Secretariat_Report_on_Anonymously_Processed_Information.pdf)

  4. General Data Protection Regulation, Regulation (EU) 2016/679 (https://gdpr-info.eu refereed in 2019)

  5. Information Commissioner’s Office (ICO), Anonymisation: managing data protection risk code of practice (2012)

    Google Scholar 

  6. El Emam, K., Arbuckle, L.: Anonymizing Health Data Case Studies and Methods to Get you Started. O’Reilly, Sebastopol (2013)

    Google Scholar 

  7. Domingo-Ferrer, J., Ricci, S., Soria-Comas, J.: Disclosure risk assessment via record linkage by a maximum-knowledge attacker. In: 2015 Thirteenth Annual Conference on Privacy. IEEE, Security and Trust (PST) (2015)

    Google Scholar 

  8. ISO/IEC 20889, Privacy enhancing data de-identification terminology and classification of techniques (2018)

    Google Scholar 

  9. Pyrgelis, A., Troncoso, C., De Cristofaro, E.: Knock Knock, Who’s There? Membership Inference on Aggregate Location Data, NDSS 2018 (2018)

    Google Scholar 

  10. Japan Industrial Standard JIS Q 15001: Personal Information Protection Management System - Requirements (2006)

    Google Scholar 

  11. PrivacyMark Promotion Center of JIPDEC official webpage (https://privacymark.org/)

  12. Tokyo Stock Exchange, TOPIX Sector Indices / TOPIX-17 Series (https://www.jpx.co.jp/english/markets/indices/line-up/files/e_fac_13_sector.pdf)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hiroaki Kikuchi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kikuchi, H., Ono, A., Ito, S., Fujita, M., Yamanaka, T. (2022). Web Crawler for an Anonymously Processed Information Database. In: Barolli, L., Yim, K., Chen, HC. (eds) Innovative Mobile and Internet Services in Ubiquitous Computing. IMIS 2021. Lecture Notes in Networks and Systems, vol 279. Springer, Cham. https://doi.org/10.1007/978-3-030-79728-7_51

Download citation

Publish with us

Policies and ethics