Skip to main content

Machine Learning Based Bitcoin Address Classification

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1267))

Abstract

A bitcoin address is required for trading and maintaining pseudonymity for the owner. By exploiting this pseudonymity, various illegal activities are conducted around the world. To detect and deter illegal transactions, this paper proposes a method of identifying the characteristics of bitcoin addresses related to illegal transactions. We extracted 80 features from bitcoin transactions. Using machine-learning techniques, we successfully categorized addresses involved with illegal activities with a \(\sim \)84% accuracy. We also examined the address features most affecting classification performance and compared two machine-learning models. By applying the majority voting to the classification results of bitcoin addresses associated with a particular transaction, it will be possible to determine which category the transaction belongs to.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Nakamoto, S., et al. Bitcoin: a peer-to-peer electronic cash system (2008)

    Google Scholar 

  2. Swan, M.: Blockchain: Blueprint for a New Economy. O’Reilly Media Inc., Newton (2015)

    Google Scholar 

  3. Harvey, C.R.: Bitcoin myths and facts. Available at SSRN 2479670 (2014)

    Google Scholar 

  4. Bitcoin Magazine: Bitcoin magazine: bitcoin news, bitcoin charts, events. https://bitcoinmagazine.com/articles/darknet-markets-cant-live-with-or-without-bitcoin

  5. Chainalysis. Chainalysis: The blockchain analysis company. https://www.chainalysis.com/

  6. Wikipedia: Silk road (marketplace). https://en.wikipedia.org/wiki/Silk_Road_(marketplace)

  7. Wood, G., et al.: Ethereum: a secure decentralised generalised transaction ledger. Ethereum project yellow paper 151(2014), 1–32 (2014)

    Google Scholar 

  8. Monero: Zero to monero. Technical report (2018). https://www.getmonero.org/library/Zero-to-Monero-1-0-0.pdf

  9. Kotsiantis, S.B., Zaharakis, I., Pintelas, P.: Supervised machine learning: a review of classification techniques. Emerging artificial intelligence applications in computer engineering 160, 3–24 (2007)

    Google Scholar 

  10. Brands, S.: Rethinking Public Key Infrastructures and Digital Certificates: Building in Privacy. MIT Press, Cambridge (2000)

    Book  Google Scholar 

  11. Fischer, A.M.: Public key/signature cryptosystem with enhanced digital signature certification, US Patent 4,868,877, September 19 1989

    Google Scholar 

  12. Zambre, D., Shah, A.: Analysis of bitcoin network dataset for fraud. Unpublished Report (2013)

    Google Scholar 

  13. Bitcoin.com: Bitcoin history part 11: the first major loss of coins. https://news.bitcoin.com/bitcoin-history-part-11-the-first-major-loss-of-coins/

  14. Techcrunch: Binance says more than \$40 million in bitcoin stolen in ? Large scale??Hack. https://techcrunch.com/2019/05/07/binance-breach/

  15. Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. Roy. Stat. Soc.: Ser. C (Appl. Stat.) 28(1), 100–108 (1979)

    MATH  Google Scholar 

  16. Toyoda, K., Ohtsuki, T., Mathiopoulos, P.T.: Identification of high yielding investment programs in bitcoin via transactions pattern analysis. In: GLOBECOM 2017–2017 IEEE Global Communications Conference, pp. 1–6. IEEE (2017)

    Google Scholar 

  17. Kanemura, K., Toyoda, K., Ohtsuki, T.: Identification of darknet markets??Bitcoin addresses by voting per-address classification results. In: 2019 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), pp. 154–158. IEEE (2019)

    Google Scholar 

  18. Wikipedia: Darknet market. https://en.m.wikipedia.org/wiki/Darknet_market

  19. Lin, Y., Wu, P., Hsu, C., Tu, I., Liao, S.: An evaluation of bitcoin address classification based on transaction history summarization. In: 2019 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), pp. 302–310 (2019)

    Google Scholar 

  20. Lee, C., Maharjan, S., Ko, K., Hong, J.W.-K.: Toward detecting illegal transactions on bitcoin using machine-learning methods. In: Zheng, Z., Dai, H.-N., Tang, M., Chen, X. (eds.) BlockSys 2019. CCIS, vol. 1156, pp. 520–533. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-2777-7_42

    Chapter  Google Scholar 

  21. WalletExplorer: Walletexplorer: smart bitcoin block explorer. https://www.walletexplorer.com/

  22. BeautifulSoup: Beautiful soup documentation. Technical report. https://www.crummy.com/software/BeautifulSoup/bs4/doc/

  23. Bitcoin.org: Bitcoin core. https://bitcoin.org/en/bitcoin-core/

  24. Bitcoin.org: Bitcoin core json apis. https://bitcoin.org/en/developer-reference#bitcoin-core-apis

  25. Pal, M.: Random forest classifier for remote sensing classification. Int. J. Remote Sens. 26(1), 217–222 (2005)

    Article  Google Scholar 

  26. Zurada, J.M.: Introduction to Artificial Neural Systems, vol. 8. West Publishing Company, St. Paul (1992)

    Google Scholar 

  27. Scikit learn.org: Scikit-learn: machine learning in Python. https://scikit-learn.org/

  28. Tensorflow.org: Tensorflow: an end-to-end open source machine learning platform. https://www.tensorflow.org/?hl=en

  29. Kursa, M.B., Rudnicki, W.R., et al.: Feature selection with the Boruta package. J. Stat. Softw. 36(11), 1–13 (2010)

    Article  Google Scholar 

  30. Davis, J., Goadrich, M.: The relationship between precision-recall and ROC curves. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 233–240. ACM (2006)

    Google Scholar 

  31. Powers, D.M.: Evaluation: from precision, recall and f-measure to ROC, informedness, markedness and correlation (2011)

    Google Scholar 

  32. Goutte, C., Gaussier, E.: A probabilistic interpretation of precision, recall and f-score, with implication for evaluation. In: Losada, D.E., Fernández-Luna, J.M. (eds.) Advances in Information Retrieval. ECIR 2005. LNCS, pp. 345–359. Springer, Heidelberg (2005). https://doi.org/10.1007/978-3-540-31865-1_25

Download references

Acknowledgments

This work was supported by the ICT R&D program of MSIT/IITP. [No.2018-0-00539, Development of Blockchain Transaction Monitoring and Analysis Technology].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chaehyeon Lee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lee, C., Maharjan, S., Ko, K., Woo, J., Hong, J.WK. (2020). Machine Learning Based Bitcoin Address Classification. In: Zheng, Z., Dai, HN., Fu, X., Chen, B. (eds) Blockchain and Trustworthy Systems. BlockSys 2020. Communications in Computer and Information Science, vol 1267. Springer, Singapore. https://doi.org/10.1007/978-981-15-9213-3_40

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-9213-3_40

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-9212-6

  • Online ISBN: 978-981-15-9213-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics