Abstract
A bitcoin address is required for trading and gives its owner pseudonymity. This pseudonymity is exploited to conduct various illegal activities around the world. To detect and deter illegal transactions, this paper proposes a method of identifying the characteristics of bitcoin addresses related to illegal transactions. We extracted 80 features from bitcoin transactions. Using machine-learning techniques, we categorized addresses involved in illegal activities with approximately 84% accuracy. We also examined which address features most affect classification performance and compared two machine-learning models. Applying majority voting to the classification results of the bitcoin addresses associated with a particular transaction makes it possible to determine which category the transaction belongs to.
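The majority-voting step mentioned at the end of the abstract can be illustrated with a minimal sketch. It assumes each address in a transaction has already been given a predicted category by some classifier. The function name `classify_transaction`, the example category labels, and the rule of returning a default label on ties or empty input are hypothetical choices made for illustration and are not taken from the paper.

```python
from collections import Counter


def classify_transaction(address_labels, default="unknown"):
    """Pick a transaction category by majority vote over address labels.

    address_labels: predicted categories, one per address in the transaction
    (hypothetical input format). Returns `default` when there are no labels
    or when the top two categories tie (an assumed tie-breaking rule).
    """
    if not address_labels:
        return default

    # Count the votes and sort categories from most to least frequent.
    ranked = Counter(address_labels).most_common()
    top_label, top_count = ranked[0]

    # If the runner-up has as many votes as the winner, no majority exists.
    if len(ranked) > 1 and ranked[1][1] == top_count:
        return default
    return top_label


# Hypothetical per-address predictions for a single transaction.
predictions = ["darknet", "darknet", "exchange"]
result = classify_transaction(predictions)
```

With these example predictions, two of the three addresses vote for the same category, so that category is assigned to the transaction. A real pipeline might instead weight votes by classifier confidence, which this sketch does not attempt.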
Acknowledgments
This work was supported by the ICT R&D program of MSIT/IITP [No. 2018-0-00539, Development of Blockchain Transaction Monitoring and Analysis Technology].
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Lee, C., Maharjan, S., Ko, K., Woo, J., Hong, J.WK. (2020). Machine Learning Based Bitcoin Address Classification. In: Zheng, Z., Dai, HN., Fu, X., Chen, B. (eds) Blockchain and Trustworthy Systems. BlockSys 2020. Communications in Computer and Information Science, vol 1267. Springer, Singapore. https://doi.org/10.1007/978-981-15-9213-3_40
DOI: https://doi.org/10.1007/978-981-15-9213-3_40
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-9212-6
Online ISBN: 978-981-15-9213-3
eBook Packages: Computer Science, Computer Science (R0)