Skip to main content
Log in

Demystifying Ethereum account diversity: observations, models and analysis

  • Research Article
  • Published:
Frontiers of Computer Science Aims and scope Submit manuscript

Abstract

Blockchain platform Ethereum has involved millions of accounts due to its strong potential for providing numerous services based on smart contracts. These massive accounts can be divided into diverse categories, such as miners, tokens, and exchanges, which is termed as account diversity in this paper. The benefit of investigating diversity are multi-fold, including understanding the Ethereum ecosystem deeper and opening the possibility of tracking certain abnormal activities. Unfortunately, the exploration of blockchain account diversity remains scarce. Even the most relevant studies, which focus on the deanonymization of the accounts on Bitcoin, can hardly be applied on Ethereum since their underlying protocols and user idioms are different. To this end, we present the first attempt to demystify the account diversity on Ethereum. The key observation is that different accounts exhibit diverse behavior patterns, leading us to propose the heuristics for classification as the premise. We then raise the coverage rate of classification by the statistical learning model Maximum Likelihood Estimation (MLE). We collect real-world data through extensive efforts to evaluate our proposed method and show its effectiveness. Furthermore, we make an in-depth analysis of the dynamic evolution of the Ethereum ecosystem and uncover the abnormal arbitrage actions. As for the former, we validate two sweeping statements reliably: (1) standalone miners are gradually replaced by the mining pools and cooperative miners; (2) transactions related to the mining pool and exchanges take up a large share of the total transactions. The latter analysis shows that there are a large number of arbitrage transactions transferring the coins from one exchange to another to make a price difference.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Buterin V. A next-generation smart contract and decentralized application platform. White Paper, 2014, 3(37)

  2. Nakamoto S. Bitcoin: a peer-to-peer electronic cash system. Decentralized Business Review, 2008: 2120

  3. Wood G. Ethereum: a secure decentralised generalised transaction ledger. Ethereum Project Yellow Paper, 2014, 151: 1–32

    Google Scholar 

  4. Anzai Y. Pattern recognition and machine learning. Journal of Electronic Imaging, 2007, 16(4): 99–101

    Google Scholar 

  5. Li Y, Liu B. A normalized Levenshtein distance metric. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(6): 1091–1095

    Article  Google Scholar 

  6. Ron D, Shamir A. Quantitative analysis of the full Bitcoin transaction graph. In: Proceedings of the 2013 International Conference on Financial Cryptography and Data Security. 2013, 6–24

  7. Androulaki E, Karame G O, Roeschlin M, Scherer T, Capkun S. Evaluating user privacy in Bitcoin. In: Proceedings of the 2013 International Conference on Financial Cryptography and Data Security. 2013, 34–51

  8. Athey S, Parashkevov I, Sarukkai V, Xia J. Bitcoin pricing, adoption, and usage: theory and evidence. Stanford University Graduate School of Business, 2016: 16–42

  9. Victor F. Address clustering heuristics for ethereum. In: Proceedings of the 24th International Conference on Financial Cryptography and Data Security. 2020, 617–633

  10. Wang M J, Ichijo H, Xiao B. Cryptocurrency address clustering and labeling. 2020, arXiv: 2003.13399

  11. Sun H Y, Ruan N, Liu H Q. Ethereum analysis via node clustering. In: Proceedings of International Conference on Network and System Security. 2019, 114–129

  12. Norvill R, Fiz B, State R, Cullen A. Automated labeling of unknown contracts in Ethereum. In: Proceedings of the 26th International Conference on Computer Communication and Networks. 2017, 1–6

  13. Jourdan M, Blandin S, Wynter L, Deshpande P. Characterizing entities in the Bitcoin blockchain. In: Proceedings of 2018 IEEE International Conference on Data Mining Workshops. 2018, 55–62

  14. Jourdan M, Blandin S, Wynter L, Deshpande P. A probabilistic model of the Bitcoin blockchain. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2019, 2784–2792

  15. Fröwis M, Fuchs A, Böhme R. Detecting token systems on ethereum. In: Proceedings of the 23th International Conference on Financial Cryptography and Data Security. 2019, 93–112

  16. Chen T, Zhu Y, Li Z, Chen J, Li X, Luo X, Lin X, Zhang X. Understanding ethereum via graph analysis. In: Proceedings of IEEE International Conference on Computer Communications. 2018, 1484–1492

  17. Chen T, Li Z, Zhang Y, Luo X, Chen A, Yang K, Hu B, Zhu T, Deng S, Hu T. Dataether: Data exploration framework for ethereum. In: Proceedings of the 39th International Conference on Distributed Computing Systems. 2019, 1369–1380

  18. Guo D C, Dong J Q, Wang K. Graph structure and statistical properties of ethereum transaction relationships. Science China Information Sciences, 2019, 492(1): 58–71

    Article  MathSciNet  Google Scholar 

  19. He N, Wu L, Wang H, Guo Y, Jiang X. Characterizing Code Clones in the Ethereum Smart Contract Ecosystem. 2019, arXiv: 1905.00272

  20. Kiffer L, Levin D, Mislove A. Analyzing Ethereum’s Contract Topology. In: Proceedings of the Internet Measurement Conference. 2018, 494–499

  21. Norvill R, Fiz B, State R, Cullen A. Standardising smart contracts: Automatically inferring ERC standards. In: Proceedings of International Conference on Blockchain and Cryptocurrency. 2019, 192–195

  22. Massimo B, Salvatore C, Tiziana C, Roberto S. Dissecting ponzi schemes on ethereum: Identification, analysis, and impact. Future Generation Computer Systems, 2020, 102(1): 259–277

    Google Scholar 

  23. Chen W, Wu J, Zheng Z, Chen C, Zhou Y. Detecting ponzi schemes on ethereum: Towards healthier blockchain technology. In: Proceedings of the 2018 World Wide Web Conference. 2018, 1409–1418

  24. Christof F T, Mathis S, Radu S. The art of the scam: demystifying honeypots in Ethereum smart contracts. In: Proceedings of the 28th USENIX Security Symposium. 2019, 1591–1607

  25. Ramiro C, Christof F T, Mathis B, Radu S. A data science approach for honeypot detection in ethereum. In: Proceedings of IEEE International Conference on Blockchain and Cryptocurrency. 2020, 1–9

  26. Wu J, Yuan Q, Lin D, You W, Chen W, Chen C, Zheng Z. Who are the phishers? phishing scam detection on ethereum via network embedding. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020: 1–11

  27. Chen W, Wu J, Zheng Z, Chen C, Zhou Y. Market manipulation of Bitcoin: evidence from mining the Mt. Gox transaction network. In: Proceedings of IEEE Conference on Computer Communications. 2019, 964–972

Download references

Acknowledgements

This work was supported by Key-Area Research and Development Program of Guangdong Province (2020B0101090005) and the National Natural Science Foundation of China (Grant No. 62072197).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiang Xiao.

Additional information

Chaofan Wang received his BS degree from Huazhong University of Science and Technology (HUST), China in 2017. He is currently pursuing his MS degree in School of Computer Science and Technology from HUST. His current research interests include blockchain system and data mining.

Xiaohai Dai received his MS degree in School of Computer Science and Technology from Huazhong University of Science and Technology (HUST), China in 2017. He is currently pursuing his PhD degree in School of Computer Science and Technology from HUST. His current research interests include blockchain and distributed system.

Jiang Xiao is currently an associate professor in School of Computer Science and Technology at Huazhong University of Science and Technology (HUST), China. She received her BS degree from HUST in 2009 and the PhD degree from Hong Kong University of Science and Technology (HKUST), China in 2014. She has been engaged in research on blockchain, distributed computing, wireless indoor localization, and smart sensing. Jiang has directed and participated in many research and deelopment projects and grants from funding agencies such as National Natural Science Foundation of China (NSFC), Hong Kong Research Grant Council (RGC), China, Hong Kong Innovation and Technology Commission (ITC), China, and industries like Huawei, Tencent and Intel, and been invited by NSFC in reviewing research projects. Her awards include CCF-Intel Young Faculty Research Program 2017, Hubei Downlight Program 2018, ACM Wuhan Rising Star Award 2019, and Best Paper Awards from IEEE ICPADS/GLOBECOM/GPC.

Chenchen Li received his BS degree from Zhongnan Unversity of Economics and Law, China. He is currently working toward the PhD degree in the CGCL, Huazhong University of science and technology, China. His research interests include block-chain, data mining and finance security.

Ming Wen is now an associate professor at the School of Cyber Science and Engineering, Huazhong University of Science and Technology, China. He received his PhD from the department of computer science and engineering at the Hong Kong University of Science and Technology (HKUST), China. His research interests include program analysis, fault localization & repair, and software security. His research work has been regularly published in top conferences and journals in the research communities of software engineering, including ICSE, FSE, ASE, TSE and EMSE and so on. More information about him can be found at: https://justinwm.github.io.

Bingbing Zhou is currently an associate professor in School of Computer Science and the theme leader for distributed computing applications in the Centre for Distributed and High Performance Computing at the University of Sydney, Australia. He received his BS degree in 1982 from Nanjing Institute of Technology, China and his PhD degree in computer science in 1989 from Australian National University, Australia. His research interests include parallel/distributed computing, cloud/edge computing, parallel algorithms, IoT and bioinformatics. He has a number of publications in leading international journals and conferences. His research has been funded by the Australian Research Council through several Discovery Project grants.

Hai Jin is a Professor of computer science and engineering at Huazhong University of Science and Technology (HUST), China. Jin received his PhD in computer engineering from HUST in 1994. In 1996, he was awarded a German Academic Exchange Service fellowship to visit the Technical University of Chemnitz, Germany. Jin worked at The University of Hong Kong, China between 1998 and 2000, and as a visiting scholar at the University of Southern California, US between 1999 and 2000. He was awarded Excellent Youth Award from the National Science Foundation of China in 2001. Jin is the chief scientist of ChinaGrid, the largest grid computing project in China, and the chief scientists of National 973 Basic Research Program Project of Virtualization Technology of Computing System, and Cloud Security. Jin is a Fellow of the IEEE and a member of the ACM. He has co-authored 15 books and published over 700 research papers. His research interests include computer architecture, virtualization technology, cluster computing and cloud computing, peer-to-peer computing, network storage, and network security.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, C., Dai, X., Xiao, J. et al. Demystifying Ethereum account diversity: observations, models and analysis. Front. Comput. Sci. 16, 164505 (2022). https://doi.org/10.1007/s11704-021-0221-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11704-021-0221-3

Keywords

Navigation