Abstract
The Japanese Act on the Protection of Personal Information (Act No. 57 of 2003) allows companies to deidentify personal data and to provide them to any third party without obtaining the data subject’s consent. Officially, the “Anonymously Processed Information” will eventually improve public convenience and aid big-data business. In this paper, we aim to clarify the landscape of anonymously processed information databases in Japan. Focusing on disclosed statements that specify the production and provisioning processes for anonymously processed information, we have developed an automated Web crawler system that detects such statements using heuristics associated with legal statements. We demonstrate that our crawler system performs very well in terms of processing time and detection accuracy. In addition, the resulting statistics may prove useful to others exploring the landscape of Japanese personal-data business structures.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
(Provision of Anonymously Processed Information) Article 37 An anonymously processed information handling business operator, when providing anonymously processed information (excluding those which it produced itself by processing personal information; hereinafter the same in this Section) to a third party, shall, pursuant to rules of the Personal Information Protection Commission, in advance disclose to the public the categories of personal information contained in anonymously processed information to be provided to a third party and state to the third party explicitly to the effect that the provided information is anonymously processed information.
References
Google Custom Search API https://developers.google.com/custom-search/v1/overview?hl=ja
Act on the Protection of Personal Information (Act No. 57 of 2003, ), officially translated by the Personal Information Protection Commission, Japan, 2016. (https://www.ppc.go.jp/files/pdf/Act_on_the_Protection_of_Personal_Information.pdf)
Report by the Personal Information Protection Commission Secretariat, Anonymously Processed Information – Towards Balanced Promotion of Personal Data Utilization and Consumer Trust, February 2017 (available from https://www.ppc.go.jp/files/pdf/The_PPC_Secretariat_Report_on_Anonymously_Processed_Information.pdf)
General Data Protection Regulation, Regulation (EU) 2016/679 (https://gdpr-info.eu refereed in 2019)
Information Commissioner’s Office (ICO), Anonymisation: managing data protection risk code of practice (2012)
El Emam, K., Arbuckle, L.: Anonymizing Health Data Case Studies and Methods to Get you Started. O’Reilly, Sebastopol (2013)
Domingo-Ferrer, J., Ricci, S., Soria-Comas, J.: Disclosure risk assessment via record linkage by a maximum-knowledge attacker. In: 2015 Thirteenth Annual Conference on Privacy. IEEE, Security and Trust (PST) (2015)
ISO/IEC 20889, Privacy enhancing data de-identification terminology and classification of techniques (2018)
Pyrgelis, A., Troncoso, C., De Cristofaro, E.: Knock Knock, Who’s There? Membership Inference on Aggregate Location Data, NDSS 2018 (2018)
Japan Industrial Standard JIS Q 15001: Personal Information Protection Management System - Requirements (2006)
PrivacyMark Promotion Center of JIPDEC official webpage (https://privacymark.org/)
Tokyo Stock Exchange, TOPIX Sector Indices / TOPIX-17 Series (https://www.jpx.co.jp/english/markets/indices/line-up/files/e_fac_13_sector.pdf)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Kikuchi, H., Ono, A., Ito, S., Fujita, M., Yamanaka, T. (2022). Web Crawler for an Anonymously Processed Information Database. In: Barolli, L., Yim, K., Chen, HC. (eds) Innovative Mobile and Internet Services in Ubiquitous Computing. IMIS 2021. Lecture Notes in Networks and Systems, vol 279. Springer, Cham. https://doi.org/10.1007/978-3-030-79728-7_51
Download citation
DOI: https://doi.org/10.1007/978-3-030-79728-7_51
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-79727-0
Online ISBN: 978-3-030-79728-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)