skip to main content
research-article

PU-Detector: A PU Learning-based Framework for Real Money Trading Detection in MMORPG

Published: 13 February 2024 Publication History

Abstract

Massive multiplayer online role-playing games (MMORPG) have been becoming one of the most popular and exciting online games. In recent years, a cheating phenomenon called real money trading (RMT) has arisen and damaged the fantasy world in many ways. RMT is the sale of in-game items, currency, or even characters to earn real money, breaking the balance of the game economy ecosystem and damaging the game experience. Therefore, some studies have emerged to address the problem of RMT detection. However, they cannot well handle the label uncertainty problem in practice, where there are only labeled RMT samples (positive samples) and unlabeled samples, which could either be RMT samples or normal transactions (negative samples). Meanwhile, the trading relationship between RMTers is modeled in a simple way, leading to some normal transactions being falsely classified as RMT. In this article, we propose PU-Detector, a novel framework based on PU learning (learning from positive and unlabeled data) for RMT detection, considering the fact that there are only labeled RMT samples and other unlabeled transactions. We first automatically estimate the likelihood of one transaction being RMT by developing an improved PU learning method and proposing an assessment rule. Sequentially, we use the estimated likelihood as edge weight to construct a trading graph to learn trader representation. Then, with the trader representations and basic trading features, we detect RMT samples by the improved PU learning method. PU-Detector is evaluated on a large-scale real world dataset consisting of 33,809,956 transaction logs generated by 43,217 unique players. Compared with other approaches, it achieves the state-of-the-art performance and demonstrates its advantages in detecting underlying RMT samples.

References

[1]
Saleh Alghamdi and Natalia Beloff. 2015. Virtual currency concept: Its implementation, impacts and legislation. In Science and Information Conference (SAI’15). IEEE, 175–183.
[2]
Teresa Basile, Nicola Di Mauro, Floriana Esposito, Stefano Ferilli, and Antonio Vergari. 2017. Density estimators for positive-unlabeled learning. In International Workshop on New Frontiers in Mining Complex Patterns. Springer, 49–64.
[3]
Jessa Bekker and Jesse Davis. 2018. Estimating the class prior in positive and unlabeled data through decision tree induction. In AAAI Conference on Artificial Intelligence, Vol. 32.
[4]
Jessa Bekker and Jesse Davis. 2020. Learning from positive and unlabeled data: A survey. Mach. Learn. 109, 4 (2020), 719–760.
[5]
Vincent D. Blondel, Jean-Loup Guillaume, Renaud Lambiotte, and Etienne Lefebvre. 2008. Fast unfolding of communities in large networks. J. Stat. Mechan.: Theor. Experim. 2008, 10 (2008), P10008.
[6]
Sneha Chaudhari and Shirish Shevade. 2012. Learning from positive and unlabelled examples using maximum margin clustering. In International Conference on Neural Information Processing. Springer, 465–473.
[7]
Selin Chun, Deajin Choi, Jinyoung Han, Huy Kang Kim, and Taekyoung Kwon. 2018. Unveiling a socio-economic system in a virtual world: A case study of an MMORPG. In World Wide Web Conference. 1929–1938.
[8]
Aaron Clauset, Mark E. J. Newman, and Cristopher Moore. 2004. Finding community structure in very large networks. Phys. Rev. E 70, 6 (2004), 066111.
[9]
Marthinus Du Plessis, Gang Niu, and Masashi Sugiyama. 2015. Convex formulation for learning from positive and unlabeled data. In International Conference on Machine Learning. PMLR, 1386–1394.
[10]
Marthinus C. Du Plessis, Gang Niu, and Masashi Sugiyama. 2014. Analysis of learning from positive and unlabeled data. Adv. Neural Inf. Process. Syst. 27 (2014).
[11]
Marthinus Christoffel Du Plessis and Masashi Sugiyama. 2014. Semi-supervised learning of class balance under class-prior change by distribution matching. Neural Netw. 50 (2014), 110–119.
[12]
Charles Elkan and Keith Noto. 2008. Learning classifiers from only positive and unlabeled data. In 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 213–220.
[13]
Atsushi Fujita, Hiroshi Itsuki, and Hitoshi Matsubara. 2011. Detecting real money traders in MMORPG by using trading network. In AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, Vol. 7. 26–31.
[14]
Hongxiao Gan, Yang Zhang, and Qun Song. 2017. Bayesian belief network for positive unlabeled learning with uncertainty. Pattern Recog. Lett. 90 (2017), 28–35.
[15]
Saurabh Garg, Yifan Wu, Alexander J. Smola, Sivaraman Balakrishnan, and Zachary Lipton. 2021. Mixture proportion estimation and PU learning: A modern approach. Adv. Neural Inf. Process. Syst. 34 (2021).
[16]
Magdalena Gawron and Artur Strzelecki. 2021. Consumers’ adoption and use of E-currencies in virtual markets in the context of an online game. J. Theoret. Appl. Electron. Commerce Res. 16, 5 (2021), 1266–1279.
[17]
Mee Lan Han, Byung Il Kwak, and Huy Kang Kim. 2022. Cheating and detection method in massively multiplayer online role-playing game: Systematic literature review. IEEE Access 10 (2022), 49050–49063.
[18]
Fengxiang He, Tongliang Liu, Geoffrey I. Webb, and Dacheng Tao. 2018. Instance-dependent PU learning by Bayesian optimal relabeling. arXiv preprint arXiv:1808.02180 (2018).
[19]
Cho-Jui Hsieh, Nagarajan Natarajan, and Inderjit Dhillon. 2015. PU learning for matrix completion. In International Conference on Machine Learning. PMLR, 2445–2453.
[20]
Dino Ienco and Ruggero G. Pensa. 2016. Positive and unlabeled learning in categorical data. Neurocomputing 196 (2016), 113–124.
[21]
Dmitry Ivanov. 2020. DEDPUL: Difference-of-estimated-densities-based positive-unlabeled learning. In 19th IEEE International Conference on Machine Learning and Applications (ICMLA’20). IEEE, 782–790.
[22]
Ting Ke, Ling Jing, Hui Lv, Lidong Zhang, and Yaping Hu. 2018. Global and local learning from positive and unlabeled examples. Appl. Intell. 48, 8 (2018), 2373–2392.
[23]
Ting Ke, Hui Lv, Mingjing Sun, and Lidong Zhang. 2018. A biased least squares support vector machine based on Mahalanobis distance for PU learning. Phys. A: Stat. Mechan. Applic. 509 (2018), 422–438.
[24]
Shane Kelly. 2021. Money laundering through virtual worlds of video games: Recommendations for a new approach to AML regulation. Syrac. L. Rev. 71 (2021), 1485.
[25]
Ryuichi Kiryo, Gang Niu, Marthinus C. Du Plessis, and Masashi Sugiyama. 2017. Positive-unlabeled learning with non-negative risk estimator. Adv. Neural Inf. Process. Syst. 30 (2017).
[26]
Hyukmin Kwon, Aziz Mohaisen, Jiyoung Woo, Yongdae Kim, Eunjo Lee, and Huy Kang Kim. 2016. Crime scene reconstruction: Online gold farming network analysis. IEEE Trans. Inf. Forens. Secur. 12, 3 (2016), 544–556.
[27]
Eunjo Lee, Jiyoung Woo, Hyoungshick Kim, and Huy Kang Kim. 2018. No silk road for online gamers! Using social network analysis to unveil black markets in online games. In World Wide Web Conference. 1825–1834.
[28]
Wenbin Li, Xiaokai Chu, Yueyang Su, Di Yao, Shiwei Zhao, Runze Wu, Shize Zhang, Jianrong Tao, Hao Deng, and Jingping Bi. 2022. FingFormer: Contrastive graph-based finger operation transformer for unsupervised mobile game bot detection. In ACM Web Conference. 3367–3375.
[29]
Bing Liu, Yang Dai, Xiaoli Li, Wee Sun Lee, and Philip S. Yu. 2003. Building text classifiers using positive and unlabeled examples. In 3rd IEEE International Conference on Data Mining. IEEE, 179–186.
[30]
Lu Liu and Tao Peng. 2014. Clustering-based method for positive and unlabeled text categorization enhanced by improved TFIDF. J. Inf. Sci. Eng. 30, 5 (2014), 1463–1481.
[31]
Nagarajan Natarajan, Nikhil Rao, and Inderjit Dhillon. 2015. PU matrix completion with graph information. In IEEE 6th International Workshop on Computational Advances in Multi-sensor Adaptive Processing (CAMSAP’15). IEEE, 37–40.
[32]
Mohamed Nazir and Carrie Siu Man Lui. 2017. A survey of research in real-money trading (RMT) in virtual world. International Journal of Virtual Communities and Social Networking (IJVCSN) 9, 1 (2017), 34–53.
[33]
Yuseung Noh, Seonghoon Jeong, and Huy Kang Kim. 2021. Trading behind-the-scene: Analysis of online gold farming network in the auction house system. IEEE Transactions on Games 14, 3 (2021), 423–434.
[34]
Curtis G. Northcutt, Tailin Wu, and Isaac L. Chuang. 2017. Learning with confident examples: Rank pruning for robust classification with noisy labels. arXiv preprint arXiv:1705.01936 (2017).
[35]
Kyung Ho Park, Eunjo Lee, and Huy Kang Kim. 2020. Show me your account: Detecting MMORPG game bot leveraging financial analysis with LSTM. In 20th International Conference on Information Security Applications. Springer, 3–13.
[36]
Kyung Ho Park, Eunjo Lee, and Huy Kang Kim. 2022. Cashflow tracing: Detecting online game bots leveraging financial analysis with recurrent neural networks. In Annual Symposium on Computer-Human Interaction in Play. 189–195.
[37]
Usha Nandini Raghavan, Réka Albert, and Soundar Kumara. 2007. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76, 3 (2007), 036106.
[38]
Harish Ramaswamy, Clayton Scott, and Ambuj Tewari. 2016. Mixture proportion estimation via kernel embeddings of distributions. In International Conference on Machine Learning. PMLR, 2052–2060.
[39]
Clayton Scott. 2015. A rate of convergence for mixture proportion estimation, with application to learning from noisy labels. In Artificial Intelligence and Statistics. PMLR, 838–846.
[40]
Alfred Snay. 2021. Impact of real-world trading into online video games. (2021).
[41]
Jianrong Tao, Jianshi Lin, Shize Zhang, Sha Zhao, Runze Wu, Changjie Fan, and Peng Cui. 2019. MVAN: Multi-view attention networks for real money trading detection in online games. In 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2536–2546.
[42]
Jianrong Tao, Yu Xiong, Shiwei Zhao, Runze Wu, Xudong Shen, Tangjie Lyu, Changjie Fan, Zhipeng Hu, Sha Zhao, and Gang Pan. 2023. Explainable AI for cheating detection and churn prediction in online games. IEEE Transactions on Games 15, 2 (2023), 242–251. DOI:
[43]
Yu Xiong, Jianrong Tao, Shiwei Zhao, Runze Wu, Xudong Shen, Tangjie Lyu, Changjie Fan, Zhipeng Hu, Sha Zhao, and Gang Pan. 2022. Explainable AI for cheating detection and churn prediction in online games. IEEE Trans. Games (2022).
[44]
Jiarong Xu, Yifan Luo, Jianrong Tao, Changjie Fan, Zhou Zhao, and Jiangang Lu. 2020. NGUARD+ an attention-based game bot detection framework via player behavior sequences. ACM Trans. Knowl. Discov. Data 14, 6 (2020), 1–24.
[45]
Sha Zhao, Junwei Fang, Shiwei Zhao, Runze Wu, Jianrong Tao, Shijian Li, and Gang Pan. 2022. T-Detector: A trajectory based pre-trained model for game bot detection in MMORPGs. In IEEE 38th International Conference on Data Engineering (ICDE’22). IEEE, 992–1003.
[46]
Sha Zhao, Julian Ramos, Jianrong Tao, Ziwen Jiang, Shijian Li, Zhaohui Wu, Gang Pan, and Anind K. Dey. 2016. Discovering different kinds of smartphone users through their application usage behaviors. In ACM International Joint Conference on Pervasive and Ubiquitous Computing. 498–509.
[47]
Dengyong Zhou, Olivier Bousquet, Thomas Lal, Jason Weston, and Bernhard Schölkopf. 2003. Learning with local and global consistency. Adv. Neural Inf. Process. Syst. 16 (2003).
[48]
Zhi-Hua Zhou. 2018. A brief introduction to weakly supervised learning. Nat. Sci. Rev. 5 (2018), 44–53.

Cited By

View all
  • (2025)MAHKT: Knowledge tracing with multi-association heterogeneous graph embedding based on knowledge transferKnowledge-Based Systems10.1016/j.knosys.2025.112958310(112958)Online publication date: Feb-2025
  • (2024)The MMO Economist: AI Empowers Robust, Healthy, and Sustainable P2W MMO EconomiesCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648344(443-452)Online publication date: 13-May-2024
  • (2024)EBERT: A lightweight expression-enhanced large-scale pre-trained language model for mathematics educationKnowledge-Based Systems10.1016/j.knosys.2024.112118300(112118)Online publication date: Sep-2024
  • Show More Cited By

Index Terms

  1. PU-Detector: A PU Learning-based Framework for Real Money Trading Detection in MMORPG

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Knowledge Discovery from Data
      ACM Transactions on Knowledge Discovery from Data  Volume 18, Issue 4
      May 2024
      707 pages
      EISSN:1556-472X
      DOI:10.1145/3613622
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 13 February 2024
      Online AM: 29 December 2023
      Accepted: 15 December 2023
      Revised: 30 September 2023
      Received: 10 October 2022
      Published in TKDD Volume 18, Issue 4

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Real money trading detection
      2. positive and unlabeled learning
      3. graph representation learning
      4. community detection
      5. online games

      Qualifiers

      • Research-article

      Funding Sources

      • STI 2030 Major Projects
      • Natural Science Foundation of China
      • Key Research and Development Program of Zhejiang Province

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)203
      • Downloads (Last 6 weeks)33
      Reflects downloads up to 02 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)MAHKT: Knowledge tracing with multi-association heterogeneous graph embedding based on knowledge transferKnowledge-Based Systems10.1016/j.knosys.2025.112958310(112958)Online publication date: Feb-2025
      • (2024)The MMO Economist: AI Empowers Robust, Healthy, and Sustainable P2W MMO EconomiesCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648344(443-452)Online publication date: 13-May-2024
      • (2024)EBERT: A lightweight expression-enhanced large-scale pre-trained language model for mathematics educationKnowledge-Based Systems10.1016/j.knosys.2024.112118300(112118)Online publication date: Sep-2024
      • (2024)scCDCG: Efficient Deep Structural Clustering for Single-Cell RNA-Seq via Deep Cut-Informed Graph EmbeddingDatabase Systems for Advanced Applications10.1007/978-981-97-5575-2_11(172-187)Online publication date: 2-Jul-2024
      • (2024)Graph Contrastive Learning for Dissolved Gas AnalysisAdvanced Data Mining and Applications10.1007/978-981-96-0821-8_12(178-190)Online publication date: 3-Dec-2024

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      Full Text

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media