ABSTRACT
There is a need to persuade public and private entities to share their currently unexposed bio-data banks by preserving ownership and secrecy. The reason is to make available results that can be obtained by massively exploiting the content of such data by modern machine learning approaches. Digital catalogues of data collections are being provided. However, they are not developed to protect private content that may be shared according to privileges assigned by the owners. Here, we present BIOCHAIN, a data-sharing module which will be the basis for a computational platform aimed at performing federated data analysis. The platform is intended to be used by a consortium of private and public institutions in the field of microbiology. BIOCHAIN makes use of blockchain technology to guarantee fairness among entities of the consortium by allowing them to securely share their data.
- Elli Androulaki, Artem Barger, Vita Bortnikov, Christian Cachin, Konstantinos Christidis, Angelo De Caro, David Enyeart, Christopher Ferris, Gennady Laventman, Yacov Manevich, Srinivasan Muralidharan, Chet Murthy, Binh Nguyen, Manish Sethi, Gari Singh, Keith Smith, Alessandro Sorniotti, Chrysoula Stathakopoulou, Marko Vukolic, Sharon Weed Cocco, and Jason Yellick. 2018. Hyperledger fabric: a distributed operating system for permissioned blockchains. In Proceedings of the Thirteenth EuroSys Conference, EuroSys 2018, Porto, Portugal, April 23-26, 2018, Rui Oliveira, Pascal Felber, and Y. Charlie Hu (Eds.). ACM, 30:1–30:15. https://doi.org/10.1145/3190508.3190538Google ScholarDigital Library
- Shaoqi Chen, Bin Duan, Chenyu Zhu, Chen Tang, Shuguang Wang, Yicheng Gao, Shaliu Fu, Lixin Fan, Qiang Yang, and Qi Liu. 2022. Privacy-preserving integration of multiple institutional data for single-cell type identification with scPrivacy. Science China Life Sciences (2022), 1–13.Google Scholar
- Shaoqi Chen, Dongyu Xue, Guohui Chuai, Qiang Yang, and Qi Liu. 2020. FL-QSAR: a federated learning-based QSAR prototype for collaborative drug discovery. Bioinformatics 36, 22-23 (2020), 5492–5498.Google Scholar
- Xu Cheng, Fulong Chen, Dong Xie, Hui Sun, and Cheng Huang. 2020. Design of a secure medical data sharing scheme based on blockchain. Journal of medical systems 44, 2 (2020), 52.Google ScholarDigital Library
- Tom Dedeurwaerdere, Paolo Melindi-Ghidi, and Arianna Broggiato. 2016. Global scientific research commons under the Nagoya Protocol: Towards a collaborative economy model for the sharing of basic research assets. Environmental science & policy 55 (2016), 1–10.Google Scholar
- Pietro Ferrara, Luca Negrini, Vincenzo Arceri, and Agostino Cortesi. 2021. Static analysis for dummies: experiencing LiSA. In SOAP@PLDI 2021: Proceedings of the 10th ACM SIGPLAN International Workshop on the State Of the Art in Program Analysis, Virtual Event, Canada, 22 June, 2021, Lisa Nguyen Quang Do and Caterina Urban (Eds.). ACM, 1–6. https://doi.org/10.1145/3460946.3464316Google ScholarDigital Library
- Loveleen Gaur, Arun Solanki, Samuel Fosso Wamba, and Noor Zaman Jhanjhi. 2021. Advanced AI techniques and applications in bioinformatics. CRC Press.Google Scholar
- Justin Guinney and Julio Saez-Rodriguez. 2018. Alternative models for sharing confidential biomedical data. Nature biotechnology 36, 5 (2018), 391–392.Google Scholar
- Manzour Hernando Hazbón, Leen Rigouts, Marco Schito, Matthew Ezewudo, Takuji Kudo, Takashi Itoh, Moriya Ohkuma, Katalin Kiss, Linhuan Wu, Juncai Ma, 2018. Mycobacterial biomaterials and resources for researchers. Pathogens and disease 76, 4 (2018), fty042.Google Scholar
- Wouter Heyndrickx, Lewis Mervin, Tobias Morawietz, Noé Sturm, Lukas Friedrich, Adam Zalewski, Anastasia Pentina, Lina Humbeck, Martijn Oldenhof, Ritsuya Niwayama, 2022. MELLODDY: cross pharma federated learning at unprecedented scale unlocks benefits in QSAR without compromising proprietary information. (2022).Google Scholar
- Tsung-Ting Kuo, Hyeon-Eui Kim, and Lucila Ohno-Machado. 2017. Blockchain distributed ledger technologies for biomedical and health care applications. Journal of the American Medical Informatics Association 24, 6 (2017), 1211–1220.Google ScholarCross Ref
- Qinbin Li, Zeyi Wen, Zhaomin Wu, Sixu Hu, Naibo Wang, Yuan Li, Xu Liu, and Bingsheng He. 2021. A survey on federated learning systems: vision, hype and reality for data privacy and protection. IEEE Transactions on Knowledge and Data Engineering (2021).Google Scholar
- Tian Li, Anit Kumar Sahu, Ameet Talwalkar, and Virginia Smith. 2020. Federated learning: Challenges, methods, and future directions. IEEE signal processing magazine 37, 3 (2020), 50–60.Google Scholar
- Yu Li, Chao Huang, Lizhong Ding, Zhongxiao Li, Yijie Pan, and Xin Gao. 2019. Deep learning in bioinformatics: Introduction, application, and perspective in the big data era. Methods 166 (2019), 4–21.Google ScholarCross Ref
- Samuel D Okegbile, Jun Cai, and Attahiru S Alfa. 2022. Performance Analysis of Blockchain-Enabled Data-Sharing Scheme in Cloud-Edge Computing-Based IoT Networks. IEEE Internet of Things Journal 9, 21 (2022), 21520–21536.Google ScholarCross Ref
- Luca Olivieri, Fabio Tagliaferro, Vincenzo Arceri, Marco Ruaro, Luca Negrini, Agostino Cortesi, Pietro Ferrara, Fausto Spoto, and Enrico Talin. 2022. Ensuring determinism in blockchain software with GoLiSA: an industrial experience report. In SOAP ’22: 11th ACM SIGPLAN International Workshop on the State Of the Art in Program Analysis, San Diego, CA, USA, 14 June 2022, Laure Gonnord and Laura Titolo (Eds.). ACM, 23–29. https://doi.org/10.1145/3520313.3534658Google ScholarDigital Library
- Henning Perl, Yassene Mohammed, Michael Brenner, and Matthew Smith. 2014. Privacy/performance trade-off in private search on bio-medical data. Future Generation Computer Systems 36 (2014), 441–452.Google ScholarCross Ref
- Gerard Verkley, Giancarlo Perrone, Mery Piña, Amber Hartman Scholz, Jörg Overmann, Aurora Zuzuarregui, Iolanda Perugini, Benedetta Turchetti, Marijke Hendrickx, Glyn Stacey, 2020. New ECCO model documents for Material Deposit and Transfer Agreements in compliance with the Nagoya Protocol. FEMS microbiology letters 367, 5 (2020), fnaa044.Google Scholar
- Qingyong Wang and Yun Zhou. 2022. FedSPL: federated self-paced learning for privacy-preserving disease diagnosis. Briefings in Bioinformatics 23, 1 (2022), bbab498.Google Scholar
- Zhiyuan Wang, Zhiqiang Zheng, Wei Jiang, and Shaojie Tang. 2021. Blockchain-enabled data sharing in supply chains: Model, operationalization, and tutorial. Production and Operations Management 30, 7 (2021), 1965–1985.Google ScholarCross Ref
- Qiang Yang, Yang Liu, Tianjian Chen, and Yongxin Tong. 2019. Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST) 10, 2 (2019), 1–19.Google ScholarDigital Library
- Huadi Zheng, Haibo Hu, and Ziyang Han. 2020. Preserving user privacy for machine learning: local differential privacy or federated machine learning?IEEE Intelligent Systems 35, 4 (2020), 5–14.Google Scholar
Index Terms
- BIOCHAIN: towards a platform for securely sharing microbiological data
Recommendations
Blockchain-Based Research Data Sharing Framework for Incentivizing the Data Owners
Blockchain – ICBC 2018AbstractData sharing practices are much needed to maximize knowledge gain by researchers. However, when and what data should be shared with whom, and how credit should be awarded to the data owner needs to be clearly addressed to create an individual ...
SDSBT: A Secure Multi-party Data Sharing Platform Based on Blockchain and TEE
Cyberspace Safety and SecurityAbstractWith the rise of big data analytics and artificial intelligence, an increasing number of enterprises and individuals are concerned about the security and privacy of the shared data. However, it is still challenging to achieve a data sharing scheme,...
Blockchain aware proxy re-encryption algorithm-based data sharing scheme
AbstractThe blockchain stores transaction data in a distributed shared global ledger. It is challenging to strike a balance between privacy protection and usefulness while sharing data. Moreover, the dynamic adjustment of blockchain data ...
Comments