Authors:
Dongming Jiang
;
Abhishek Shah
;
Stanley Yeung
;
Jessica Zhu
;
Karan Singh
and
George Goldenberg
Affiliation:
CaaStle Inc, U.S.A.
Keyword(s):
Large Language Model, Few-Shot Learning, Zero-Shot Learning, Inference, Knowledge Generation, Multi-Label Classification, Scalability, Fashion Dynamics.
Abstract:
Multi-Label classification is essential in the fashion industry due to the complexity of fashion items, which often have multiple attributes such as style, material, and occasion. Traditional machine-learning approaches face challenges like data imbalance, high dimensionality, and the constant emergence of new styles and labels. To address these issues, we propose a novel approach that leverages Large Language Models (LLMs) by integrating few-shot and zero-shot learning. Our methodology utilizes LLMs to perform few-shot learning on a small, labeled dataset, generating precise descriptions of new fashion classes. These descriptions guide the zero-shot learning process, allowing for the classification of new items and categories with minimal labeled data. We demonstrate this approach using OpenAI’s GPT-4, a state-of-the-art LLM. Experiments on a dataset from CaaStle Inc., containing 2,480 unique styles with multiple labels, show significant improvements in classification performance. F
ew-shot learning enhances the quality of zero-shot classifiers, leading to superior results. GPT-4’s multi-modal capabilities further improve the system’s effectiveness. Our approach provides a scalable, flexible, and accurate solution for fashion classification, adapting to dynamic trends with minimal data requirements, thereby improving operational efficiency and customer experience. Additionally, this method is highly generalizable and can be applied beyond the fashion industry.
(More)