Abstract:
Digital transformation represents a complex process that entails the complete reconfiguration of an organization to harness the full potential of technology on a grand sc...Show MoreMetadata
Abstract:
Digital transformation represents a complex process that entails the complete reconfiguration of an organization to harness the full potential of technology on a grand scale, aiming to deliver maximum value. To illustrate this transformative journey, we present a real-world case study centered around a company's ambitious pursuit to modernize its manual business text classification processes by embracing automated machine-based methods. During our endeavor, we encountered an imbalanced dataset provided by the business company, with certain categories containing only a few text examples while others had a substantial number of them. To rectify this imbalance, we balanced the dataset by removing excessive text examples, resulting in a more manageable and smaller dataset. To use it, we thoroughly evaluated four multilabel text classification methods: Binary Relevance, Classifier Chains, Label Powerset, and Generative Pre-trained Transformer (GPT) i.e., a large language model developed by OpenAI. The results of our investigation unveiled a success for the GPT-based approach, which significantly outperformed its counterparts across vital performance metrics, including Accuracy, F1-Score, Precision, and Recall. This outcome is particularly noteworthy because it showcases the remarkable effectiveness of GPT-based models in multi-label text classification tasks, even when dealing with relatively small datasets. In contrast, the other methods, Binary Relevance, Classifier Chains, and Label Powerset didn't demonstrate competence and exhibited comparatively less impressive performance levels. Notably, these traditional methods typically rely on larger datasets for training, making them less suitable for scenarios where data availability is limited. These findings underscore the power of GPT-based models in text classification tasks with small datasets, making them valuable assets for businesses in the complex landscape of digital transformation.
Date of Conference: 14-15 November 2023
Date Added to IEEE Xplore: 25 December 2023
ISBN Information: